We propose a model-based clustering procedure where each component can take into account cluster-specific mild outliers through a flexible distributional assumption, and a proportion of observations is additionally trimmed. We propose a penalized likelihood approach for estimation and selection of the proportions of mild and gross outliers. A theoretically grounded penalty parameter is then obtained. Simulation studies illustrate the advantages of our procedure over flexible mixtures without trimming, and over trimmed normal mixture models (tclust). We conclude with an original real data example on the identification of the source from illicit drug shipments seized in Italy and Spain. The methodology proposed in this paper has been implemented in R functions which can be downloaded from .

Farcomeni, A., Punzo, A. (2020). Robust model-based clustering with mild and gross outliers. TEST, 29(4), 989-1007 [10.1007/s11749-019-00693-z].

Robust model-based clustering with mild and gross outliers

Alessio Farcomeni
;
2020-01-01

Abstract

We propose a model-based clustering procedure where each component can take into account cluster-specific mild outliers through a flexible distributional assumption, and a proportion of observations is additionally trimmed. We propose a penalized likelihood approach for estimation and selection of the proportions of mild and gross outliers. A theoretically grounded penalty parameter is then obtained. Simulation studies illustrate the advantages of our procedure over flexible mixtures without trimming, and over trimmed normal mixture models (tclust). We conclude with an original real data example on the identification of the source from illicit drug shipments seized in Italy and Spain. The methodology proposed in this paper has been implemented in R functions which can be downloaded from .
2020
Pubblicato
Rilevanza internazionale
Articolo
Esperti anonimi
Settore SECS-S/01 - STATISTICA
English
tclust
Contaminated normal
Penalized likelihood
Farcomeni, A., Punzo, A. (2020). Robust model-based clustering with mild and gross outliers. TEST, 29(4), 989-1007 [10.1007/s11749-019-00693-z].
Farcomeni, A; Punzo, A
Articolo su rivista
File in questo prodotto:
File Dimensione Formato  
mildGross.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 388.64 kB
Formato Adobe PDF
388.64 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/259545
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 19
social impact