Functional data clustering using principal curve methods

  • Ruhao Wu
  • , Bo Wang
  • , Aiping Xu

    Research output: Contribution to journalArticlepeer-review

    21 Citations (Scopus)
    147 Downloads (Pure)

    Abstract

    In this paper we propose a novel clustering method for functional data based on the principal curve clustering approach. By this method functional data are approximated using functional principal component analysis (FPCA) and the principal curve clustering is then performed on the principal scores. The proposed method makes use of the nonparametric principal curves to summarize the features of the principal scores extracted from the original functional data, and a probabilistic model combined with Bayesian Information Criterion is employed to automatically and simultaneously find the appropriate number of features, the optimal degree of smoothing and the corresponding cluster members. The simulation studies show that the proposed method outperforms the existing functional clustering approaches considered. The capability of this method is also demonstrated by the applications in the human mortality and fertility data.
    Original languageEnglish
    Pages (from-to)7264-7283
    Number of pages20
    JournalCommunications in Statistics - Theory and Methods
    Volume51
    Issue number20
    Early online date15 Jan 2021
    DOIs
    Publication statusPublished - 18 Oct 2022

    Bibliographical note

    This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License
    (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits noncommercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.

    Funder


    Funding Information: This work was jointly funded by the Institute and Faculty of Actuaries (IFoA) and the College of Science and Engineering of the University of Leicester (UoL) through a PhD studentship.

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 3 - Good Health and Well-being
      SDG 3 Good Health and Well-being

    Keywords

    • Clustering
    • functional data analysis
    • functional principal component analysis
    • principal curves

    ASJC Scopus subject areas

    • Statistics and Probability

    Fingerprint

    Dive into the research topics of 'Functional data clustering using principal curve methods'. Together they form a unique fingerprint.

    Cite this