This article proposes a procedure to cluster functional observations in a subspace of reduced dimension. The dimension reduction is obtained by constraining the cluster centroids to lie in a subspace that preserves the maximum amount of discriminative information contained in the original data. The model is estimated by penalized least squares to take into account the functional nature of the data. The smoothing is carried out within the clustering, and its amount is adaptively calibrated. A simulation study shows how the combination of these two elements, feature extraction and automatic data-driven smoothing, improves clustering performance by reducing irrelevant and redundant information in the data. The effectiveness of the proposal is demonstrated by an application to a real dataset from a speech recognition problem. Implementation details of the algorithm, together with computer code, are available in the online supplements.

Gattone, S.A., Rocci, R. (2012). Clustering curves on a reduced subspace. Journal of Computational and Graphical Statistics, 21(2), 361-379 [10.1080/10618600.2012.679237].

Clustering curves on a reduced subspace

GATTONE, STEFANO ANTONIO; ROCCI, ROBERTO
2012-01-01

2012
Published
International relevance
Article
Anonymous reviewers
Sector SECS-S/01 - Statistics
English
With ISI Impact Factor
Cluster analysis; Dimension reduction; Functional data analysis; Regularization.
Gattone, Sa; Rocci, R
Journal article
Files in this item:
jcgs.pdf
Access: authorized users only
Description: Main article
License: Publisher's copyright
Size: 772.96 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/76230
Citations
  • PMC ND
  • Scopus 15
  • ISI 12