Abstract
In this paper we propose a novel clustering method for functional data based on the principal curve clustering approach. By this method functional data are approximated using functional principal component analysis (FPCA) and the principal curve clustering is then performed on the principal scores. The proposed method makes use of the nonparametric principal curves to summarize the features of the principal scores extracted from the original functional data, and a probabilistic model combined with Bayesian Information Criterion is employed to automatically and simultaneously find the appropriate number of features, the optimal degree of smoothing and the corresponding cluster members. The simulation studies show that the proposed method outperforms the existing functional clustering approaches considered. The capability of this method is also demonstrated by the applications in the human mortality and fertility data.
Original language | English |
---|---|
Pages (from-to) | 7264-7283 |
Number of pages | 20 |
Journal | Communications in Statistics - Theory and Methods |
Volume | 51 |
Issue number | 20 |
Early online date | 15 Jan 2021 |
DOIs | |
Publication status | Published - 18 Oct 2022 |
Bibliographical note
This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License(http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits noncommercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.
Funder
Funding Information: This work was jointly funded by the Institute and Faculty of Actuaries (IFoA) and the College of Science and Engineering of the University of Leicester (UoL) through a PhD studentship.
Keywords
- Clustering
- functional data analysis
- functional principal component analysis
- principal curves
ASJC Scopus subject areas
- Statistics and Probability