In-Person Poster presentation / poster accept

Average Sensitivity of Decision Tree Learning

Satoshi Hara ⋅ Yuichi Yoshida

Keywords: average sensitivity Trustworthy Machine Learning decision tree Social Aspects of Machine Learning

[ Slides] [ Poster] [ OpenReview]

Abstract

A decision tree is a fundamental model used in data mining and machine learning. In practice, the training data used to construct a decision tree may change over time or contain noise, and a drastic change in the learned tree structure owing to such data perturbation is unfavorable. For example, in data mining, a change in the tree implies a change in the extracted knowledge, which raises the question of whether the extracted knowledge is truly reliable or is only a noisy artifact. To alleviate this issue, we design decision tree learning algorithms that are stable against insignificant perturbations in the training data. Specifically, we adopt the notion of average sensitivity as a stability measure, and design an algorithm with low average sensitivity that outputs a decision tree whose accuracy is close to the optimal decision tree. The experimental results on real-world datasets demonstrate that the proposed algorithm enables users to select suitable decision trees considering the trade-off between average sensitivity and accuracy.

Video

Chat is not available.