An interpretable machine learning methodology to generate interaction effect hypotheses from complex datasets

Murtaza Nasir, Nichalin S. Summerfield, Serhat Simsek, Asil Oztekin

Research output: Contribution to journalArticlepeer-review

Abstract

Machine learning (ML) models are increasingly being used in decision-making, but they can be difficult to understand because most ML models are black boxes, meaning that their inner workings are not transparent. This can make interpreting the results of ML models and understanding the underlying data-generation process (DGP) challenging. In this article, we propose a novel methodology called Simple Interaction Finding Technique (SIFT) that can help make ML models more interpretable. SIFT is a data- and model-agnostic approach that can be used to identify interaction effects between variables in a dataset. This can help improve our understanding of the DGP and make ML models more transparent and explainable to a wider audience. We test the proposed methodology against various factors (such as ML model complexity, dataset noise, spurious variables, and variable distributions) to assess its effectiveness and weaknesses. We show that the methodology is robust against many potential problems in the underlying dataset as well as ML algorithms.

Original languageEnglish
Pages (from-to)549-576
Number of pages28
JournalDecision Sciences
Volume55
Issue number6
DOIs
StatePublished - Dec 2024

Keywords

  • explainable AI (XAI)
  • hypothesis generation in ML
  • interaction effects in ML
  • interpretable machine learning (IML)

Fingerprint

Dive into the research topics of 'An interpretable machine learning methodology to generate interaction effect hypotheses from complex datasets'. Together they form a unique fingerprint.

Cite this