TY - JOUR
T1 - Explainable Artificial Intelligence in Deep Learning-Based Solar Storm Predictions
AU - Rawashdeh, Adam O.
AU - Wang, Jason T.L.
AU - Herbert, Katherine G.
N1 - Publisher Copyright:
Copyright © 2025 by the authors.
PY - 2025/5/14
Y1 - 2025/5/14
N2 - A deep learning model is often considered a black-box model, as its internal workings tend to be opaque to the user. Because of the lack of transparency, it is challenging to understand the reasoning behind the model’s predictions. Here, we present an approach to making a deep learning-based solar storm prediction model interpretable, where solar storms include solar flares and coronal mass ejections (CMEs). This deep learning model, built based on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun’s surface that produces a flare within 24 hours will also produce a CME associated with the flare. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. To make the model’s predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model’s behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based solar storm prediction model.
AB - A deep learning model is often considered a black-box model, as its internal workings tend to be opaque to the user. Because of the lack of transparency, it is challenging to understand the reasoning behind the model’s predictions. Here, we present an approach to making a deep learning-based solar storm prediction model interpretable, where solar storms include solar flares and coronal mass ejections (CMEs). This deep learning model, built based on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun’s surface that produces a flare within 24 hours will also produce a CME associated with the flare. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. To make the model’s predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model’s behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based solar storm prediction model.
UR - https://www.scopus.com/pages/publications/105007944215
M3 - Conference article
AN - SCOPUS:105007944215
SN - 2334-0754
VL - 38
JO - Proceedings of the International Florida Artificial Intelligence Research Society Conference, FLAIRS
JF - Proceedings of the International Florida Artificial Intelligence Research Society Conference, FLAIRS
T2 - 38th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2025
Y2 - 20 May 2025 through 23 May 2025
ER -