TY - JOUR
T1 - Analysis and visualization of accidents severity based on LightGBM-TPE
AU - Li, Kun
AU - Xu, Haocheng
AU - Liu, Xiao
N1 - Funding Information:
This work is supported by Foundation of Hebei University of Technology , Tianjin, China, under grants 280000-104 .
Funding Information:
This work is supported by Foundation of Hebei University of Technology, Tianjin, China, under grants 280000-104.
Publisher Copyright:
© 2022 The Authors
PY - 2022/4
Y1 - 2022/4
N2 - In recent years, road traffic accidents, as a leading cause of accidental deaths, have been attracting more and more attention across several disciplines. Notably, the feature study on accidents severity can help exactly identify causality between different risk factors and road accidents, thereby substantially improving road traffic safety. Meanwhile, the application of data visualization to traffic safety investigations is still lacking. Motivated by this, we incorporate the visualization method into machine learning to analyze the traffic accidents data of the UK in 2017. A hybrid algorithm, namely Light Gradient Boosting Machine-Tree-structured Parzen Estimator (LightGBM-TPE) is proposed. Compared with other typical machine learning algorithms, it performs better in terms of the metrics f1,accuracy, recall and precision. Using LightGBM-TPE to calculate the SHAP value of each feature, we find that “Longitude”, “Latitude”, “Hour” and “Day_of_Week” are four risk factors most closely related with accident severity. Visualization for the data further verifies this conclusion. Overall, our research tries to explore an innovative way to understand and evaluate feature importance of road traffic accidents, which can help suggest effective solutions to improve traffic safety.
AB - In recent years, road traffic accidents, as a leading cause of accidental deaths, have been attracting more and more attention across several disciplines. Notably, the feature study on accidents severity can help exactly identify causality between different risk factors and road accidents, thereby substantially improving road traffic safety. Meanwhile, the application of data visualization to traffic safety investigations is still lacking. Motivated by this, we incorporate the visualization method into machine learning to analyze the traffic accidents data of the UK in 2017. A hybrid algorithm, namely Light Gradient Boosting Machine-Tree-structured Parzen Estimator (LightGBM-TPE) is proposed. Compared with other typical machine learning algorithms, it performs better in terms of the metrics f1,accuracy, recall and precision. Using LightGBM-TPE to calculate the SHAP value of each feature, we find that “Longitude”, “Latitude”, “Hour” and “Day_of_Week” are four risk factors most closely related with accident severity. Visualization for the data further verifies this conclusion. Overall, our research tries to explore an innovative way to understand and evaluate feature importance of road traffic accidents, which can help suggest effective solutions to improve traffic safety.
KW - Data visualization
KW - Feature importance
KW - LightGBM-TPE
KW - Traffic accidents severity
UR - http://www.scopus.com/inward/record.url?scp=85126607335&partnerID=8YFLogxK
U2 - 10.1016/j.chaos.2022.111987
DO - 10.1016/j.chaos.2022.111987
M3 - Article
AN - SCOPUS:85126607335
SN - 0960-0779
VL - 157
SP - 1
EP - 7
JO - Chaos Solitons and Fractals
JF - Chaos Solitons and Fractals
M1 - 111987
ER -