An integrated framework for diagnosing process faults with incomplete features

Roozbeh Razavi-Far, Mehrdad Saif, Vasile Palade, Shiladitya Chakrabarti

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)
87 Downloads (Pure)

Abstract

Handling missing values and large-dimensional features are crucial requirements for data-driven fault diagnosis systems. However, most intelligent data-driven diagnostic systems are not able to handle missing data. The presence of high-dimensional feature sets can also further complicate the process of fault diagnosis. This paper aims to devise a missing data imputation unit along with a dimensionality reduction unit in the pre-processing module of the diagnostic system. This paper proposes a novel pooling strategy for missing data imputation (PSMI). This strategy can simplify complex patterns of missingness and incrementally update the pool. The pre-processing module receives incomplete observations, PSMI estimates missing values, and, then, the dimensionality reduction unit transforms completed observations onto a lower-dimensional feature space. These transformed observations are then fed as inputs to the fault classification module for decision making and diagnosis. This diagnostic scheme makes use of various state-of-the-art missing data imputation, dimensionality reduction and classification algorithms. This enables a comprehensive comparison and allows to find the best techniques for the sake of diagnosing faults in the Tennessee Eastman process. The obtained results show the effectiveness of the proposed pooling strategy and indicate that principal component analysis imputation and heteroscedastic discriminant analysis approaches outperform other imputation and dimensionality reduction techniques in this diagnostic application.
Original languageEnglish
Pages (from-to)75-93
Number of pages19
JournalKnowledge and Information Systems
Volume64
Issue number1
Early online date26 Nov 2021
DOIs
Publication statusPublished - Jan 2022

Bibliographical note

The final publication is available at Springer via http://dx.doi.org/ 10.1007/s10115-021-01625-w

Copyright © and Moral Rights are retained by the author(s) and/ or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This item cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder(s). The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders.

Keywords

  • Data analysis
  • Dimensionality reduction
  • Fault diagnosis
  • Heteroscedastic discriminant analysis
  • Missing data imputation
  • Principal component analysis

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Human-Computer Interaction
  • Hardware and Architecture
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'An integrated framework for diagnosing process faults with incomplete features'. Together they form a unique fingerprint.

Cite this