Abstract
We apply machine learning to automate the root cause analysis in agile software testing environments. In particular, we extract relevant features from raw log data after interviewing testing engineers (human experts). Initial efforts are put into clustering the unlabeled data, and despite obtaining weak correlations between several clusters and failure root causes, the vagueness in the rest of the clusters leads to the consideration of labeling. A new round of interviews with the testing engineers leads to the definition of five ground-truth categories. Using manually labeled data, we train artificial neural networks that either classify the data or pre-process it for clustering. The resulting method achieves an accuracy of 88.9%. The methodology of this paper serves as a prototype or baseline approach for the extraction of expert knowledge and its adaptation to machine learning techniques for root cause analysis in agile environments.
Original language | English |
---|---|
Title of host publication | Proceedings - 2019 IEEE 12th International Conference on Software Testing, Verification and Validation, ICST 2019 |
Publisher | IEEE |
Pages | 379-390 |
Number of pages | 12 |
ISBN (Electronic) | 9781728117355 |
DOIs | |
Publication status | Published - 1 Apr 2019 |
MoE publication type | A4 Article in a conference publication |
Event | IEEE International Conference on Software Testing, Verification and Validation - Xi'an, China Duration: 22 Apr 2019 → 27 Apr 2019 Conference number: 12 |
Conference
Conference | IEEE International Conference on Software Testing, Verification and Validation |
---|---|
Abbreviated title | ICST |
Country/Territory | China |
City | Xi'an |
Period | 22/04/2019 → 27/04/2019 |
Keywords
- Artificial neural networks
- Automation
- Classification
- Clustering
- Log data analysis
- Machine learning
- Root cause analysis
- Software testing