Abstract
Machine learning (ML) models are increasingly being used in decision-making, but they can be difficult to understand because most ML models are black boxes, meaning that their inner workings are not transparent. This can make interpreting the results of ML models and understanding the underlying data-generation process (DGP) challenging. In this article, we propose a novel methodology called Simple Interaction Finding Technique (SIFT) that can help make ML models more interpretable. SIFT is a data- and model-agnostic approach that can be used to identify interaction effects between variables in a dataset. This can help improve our understanding of the DGP and make ML models more transparent and explainable to a wider audience. We test the proposed methodology against various factors (such as ML model complexity, dataset noise, spurious variables, and variable distributions) to assess its effectiveness and weaknesses. We show that the methodology is robust against many potential problems in the underlying dataset as well as ML algorithms.
Original language | English |
---|---|
Pages (from-to) | 549-576 |
Number of pages | 28 |
Journal | Decision Sciences |
Volume | 55 |
Issue number | 6 |
DOIs | |
State | Published - Dec 2024 |
Keywords
- explainable AI (XAI)
- hypothesis generation in ML
- interaction effects in ML
- interpretable machine learning (IML)