Decision tree enhancer (DTE): Improving decision trees with optimization

Research output: Contribution to journalArticleScientificpeer-review

2 Downloads (Pure)

Abstract

Decision trees are off-the-shelf machine learning models widely used for classification and regression tasks in medical, logistics, financial, and other critical areas where interpretability is a key factor. They can efficiently handle numerical and categorical variables, making them a versatile choice for various applications. However, traditional decision-tree training methods are based on greedy heuristics, which cannot provide guarantees regarding whether further improvements could be achieved. We propose Decision Tree Enhancer (DTE), which employs optimization as a post-training step to improve previously trained decision trees. Moreover, the proposed method precludes the need for a pre-processing step for continuous features such as discretization or bucketization, and can be applied regardless of the model used to first train the decision tree. Lastly, DTE’s mathematical programming formulation enables, for example, the consideration of recall thresholds and class prioritization. Tested on 63 classification datasets from the UCI Machine Learning Repository, using tree depths from 1 to 5, four time limits (1, 5, 10, and 30 s), and 5 randomized train-test splits for cross-validation, the proposed post-training step demonstrated superior performance over CART (Classification And Regression Tree), for both in- and out-of-sample data. With a 30 s time limit, DTE was able to improve the weighted recall in 83.2% of the datasets with an average improvement of 9.0% in training and 5.0% in testing.
Original languageEnglish
Article number115597
Pages (from-to)1-11
Number of pages11
JournalKnowledge-Based Systems
Volume339
DOIs
Publication statusPublished - 22 Apr 2026
MoE publication typeA1 Journal article-refereed

Keywords

  • Interpretability
  • Post-training optimization
  • Decision tree
  • Mathematical programming

Fingerprint

Dive into the research topics of 'Decision tree enhancer (DTE): Improving decision trees with optimization'. Together they form a unique fingerprint.

Cite this