TY - JOUR
T1 - Effective Voting-based Ensemble Learning for Segregated Load Forecasting with Low Sampling Data
AU - Khan, Shahzeb Ahmad
AU - Rehman, Attique Ur
AU - Arshad, Ammar
AU - Alqahtani, Mohammed H.
AU - Mahmoud, Karar
AU - Lehtonen, Matti
N1 - Publisher Copyright:
Authors
PY - 2024/6/13
Y1 - 2024/6/13
N2 - In power system planning and operation, load forecasting is an important task as it helps ensure a reliable and efficient electricity supply. For effective operation of the smart grid, load forecasting is also an important thing to keep balancing dispatch of power, load management, and load shifting. In this regard, this paper aims to propose an accurate load forecasting based on implementing and integrating different load forecasting models using standalone machine learning and ensemble machine learning models, particularly for segregated real-world load data. In the given context, machine learning models namely, k-nearest neighbor, random forest, decision tree, and voting ensemble regression, are used in this study. The time series load data for this research work was acquired from a real-world load database namely, Pecan Street Dataport. For performance evaluation, two statistical error matrices are used, i.e., mean absolute error (MAE) and mean squared error (MSE). For simulation purposes, Python along with different machine-learning libraries was employed. Moreover, for numerical data analysis and visualization, this research work utilizes different packages like NumPy, pandas, and matplotlib. The empirical study presents the comparative performance analysis of machine learning models for load forecasting utilizing low sampling load data, both at aggregated as well as at segregated levels. Standalone and ensemble machine learning algorithms yield very good forecasting results, and this research has revealed that machine learning models trained on segregated data exhibit superior performance compared to those trained on aggregated data. On segregated data, the proposed voting- based ensemble machine learning algorithm outperforms all the other models with MAE 0.05708, followed by k-nearest neighbors (with MAE 0.05879), random forest (with MAE 0.07069), and decision tree (with MAE 0.07361).
AB - In power system planning and operation, load forecasting is an important task as it helps ensure a reliable and efficient electricity supply. For effective operation of the smart grid, load forecasting is also an important thing to keep balancing dispatch of power, load management, and load shifting. In this regard, this paper aims to propose an accurate load forecasting based on implementing and integrating different load forecasting models using standalone machine learning and ensemble machine learning models, particularly for segregated real-world load data. In the given context, machine learning models namely, k-nearest neighbor, random forest, decision tree, and voting ensemble regression, are used in this study. The time series load data for this research work was acquired from a real-world load database namely, Pecan Street Dataport. For performance evaluation, two statistical error matrices are used, i.e., mean absolute error (MAE) and mean squared error (MSE). For simulation purposes, Python along with different machine-learning libraries was employed. Moreover, for numerical data analysis and visualization, this research work utilizes different packages like NumPy, pandas, and matplotlib. The empirical study presents the comparative performance analysis of machine learning models for load forecasting utilizing low sampling load data, both at aggregated as well as at segregated levels. Standalone and ensemble machine learning algorithms yield very good forecasting results, and this research has revealed that machine learning models trained on segregated data exhibit superior performance compared to those trained on aggregated data. On segregated data, the proposed voting- based ensemble machine learning algorithm outperforms all the other models with MAE 0.05708, followed by k-nearest neighbors (with MAE 0.05879), random forest (with MAE 0.07069), and decision tree (with MAE 0.07361).
KW - Data models
KW - Electricity
KW - ensemble learning
KW - Forecasting
KW - Load forecasting
KW - load forecasting
KW - Load modeling
KW - low sampling data
KW - Machine learning
KW - machine learning
KW - Predictive models
KW - segregated loads
UR - http://www.scopus.com/inward/record.url?scp=85196102376&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3413679
DO - 10.1109/ACCESS.2024.3413679
M3 - Article
AN - SCOPUS:85196102376
SN - 2169-3536
VL - 12
SP - 84074
EP - 84087
JO - IEEE Access
JF - IEEE Access
ER -