Abstract
In this paper we propose a novel clustering method for functional data based on the principal curve clustering approach. By this method functional data are approximated using functional principal component analysis (FPCA) and the principal curve clustering is then performed on the principal scores. The proposed method makes use of the nonparametric principal curves to summarize the features of the principal scores extracted from the original functional data, and a probabilistic model combined with Bayesian Information Criterion is employed to automatically and simultaneously find the appropriate number of features, the optimal degree of smoothing and the corresponding cluster members. The simulation studies show that the proposed method outperforms the existing functional clustering approaches considered. The capability of this method is also demonstrated by the applications in the human mortality and fertility data.
Original language | English |
---|---|
Pages (from-to) | (In-Press) |
Journal | Communications in Statistics - Theory and Methods |
Volume | (In-Press) |
Early online date | 15 Jan 2021 |
DOIs | |
Publication status | E-pub ahead of print - 15 Jan 2021 |
Keywords
- Clustering
- functional data analysis
- functional principal component analysis
- principal curves
ASJC Scopus subject areas
- Statistics and Probability