Abstract
Reliable fault prediction for air handling units (AHUs) is key to correcting errors and eliminating non-optimal operation. Machine learning classification methods combined with data sampling are widely used to forecast such events, which by their nature seldom occur in equipment. The model proposed in this paper uses seven years of data from an air handling unit, including temperature, humidity, CO2 content, and fan speed. This paper contributes to the field of imbalanced classification by proposing a novel data undersampling algorithm that improves classification results in the presence of imbalanced and missing data. Moreover, the paper compares several oversampling methods, undersampling methods, probability calibration, and machine learning methods. It then reports on the proposed final model (the proposed undersampling Algorithm 1, Tomek Links, and Logistic Regression), which forecasts the relatively rare imperfect heat recovery events in an air handling unit. The precision of the final model was 0.93 on unseen data, a reasonable result given that the class imbalance coincides with missing data sequences.
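As a rough illustration of two ideas named in the abstract, the sketch below shows Tomek Links cleaning and the precision metric on toy data. It is not the paper's Algorithm 1, which the abstract does not describe, and it omits the Logistic Regression step. The function names, the toy points, and the label convention (0 = normal operation, 1 = fault) are assumptions made for this example only.

```python
import math

def nearest_neighbor(points, i):
    """Return the index of the point closest to points[i], excluding i itself."""
    best, best_d = None, math.inf
    for j, p in enumerate(points):
        if j == i:
            continue
        d = math.dist(points[i], p)
        if d < best_d:
            best, best_d = j, d
    return best

def tomek_link_indices(points, labels, majority_label):
    """Return sorted indices of majority-class points that belong to a Tomek link.

    A Tomek link is a pair of points with different labels that are
    each other's nearest neighbor.
    """
    nn = [nearest_neighbor(points, i) for i in range(len(points))]
    to_drop = set()
    for i, j in enumerate(nn):
        if nn[j] == i and labels[i] != labels[j]:
            # Keep the minority point, drop the majority point of the pair.
            to_drop.add(i if labels[i] == majority_label else j)
    return sorted(to_drop)

def precision(y_true, y_pred, positive=1):
    """Precision = TP / (TP + FP); returns 0.0 if nothing was predicted positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if p == positive and t == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if p == positive and t != positive)
    return tp / (tp + fp) if (tp + fp) else 0.0

# Toy data (hypothetical): label 0 = normal (majority), 1 = fault (minority).
points = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.1), (1.0, 1.0), (1.05, 1.0), (3.0, 3.0)]
labels = [0, 0, 0, 0, 1, 1]

drop = tomek_link_indices(points, labels, majority_label=0)
kept = [(p, l) for k, (p, l) in enumerate(zip(points, labels)) if k not in drop]
```

Here the normal-class point at (1.0, 1.0) and the fault-class point at (1.05, 1.0) are mutual nearest neighbors with different labels, so the majority-class member of that pair is removed. In practice a maintained implementation such as `TomekLinks` from the imbalanced-learn package would likely be preferable to this hand-written version.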
Original language | English |
---|---|
Title of host publication | 2022 IEEE 5th International Conference on Big Data and Artificial Intelligence (BDAI) |
Publisher | IEEE |
Pages | 82-86 |
Number of pages | 5 |
Volume | 5 |
ISBN (Electronic) | 978-1-6654-7081-0 |
DOIs | |
Publication status | Published - 29 Aug 2022 |
MoE publication type | A4 Conference publication |
Event | International Conference on Big Data and Artificial Intelligence - Fuzhou, China; Duration: 8 Jul 2022 → 10 Jul 2022; Conference number: 5 |
Conference
Conference | International Conference on Big Data and Artificial Intelligence |
---|---|
Abbreviated title | BDAI |
Country/Territory | China |
City | Fuzhou |
Period | 08/07/2022 → 10/07/2022 |
Keywords
- machine learning
- classification algorithms
- imbalanced data
- data preprocessing