Abstract
Random Forest (RF) is one of the mostly used machine learning techniques in fault detection and diagnosis of industrial systems. However, its implementation suffers from certain drawbacks when considering the correlations between variables. In addition, to perform a fault detection and diagnosis, the classical RF only uses the raw data by the direct use of measured variables. The direct raw data could yield to poor performance due to the data redundancies and noises. Thus, this paper proposes four improved RF methods to overcome the above-mentioned limitations. The developed methods aim to reduce at first the amount of the training data and select the first kernel principal components (KPCs) using different kernel principal component analysis (PCA) based dimensionality reduction schemes. Then, the retained KPCs are fed to the RF classifier for fault diagnosis purposes. Finally, the proposed techniques are applied to a wind energy conversion (WEC) system. Different case studies were investigated in order to illustrate the effectiveness and robustness of the developed techniques compared to the state-of-the-art methods. The obtained results show the low computation time and high diagnosis accuracy of the proposed approaches (an average accuracy of 91%).
| Original language | English |
|---|---|
| Article number | 9253525 |
| Pages (from-to) | 6914-6921 |
| Number of pages | 8 |
| Journal | IEEE Sensors Journal |
| Volume | 21 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 1 Mar 2021 |
| Externally published | Yes |
Keywords
- Kernel principal component analysis (KPCA)
- Random forest (RF)
- fault detection and diagnosis
- hierarchical K-means (H-Kmeans)
- reduced KPCA
- wind energy conversion systems