TY - JOUR
T1 - Stratifying no-show patients into multiple risk groups via a holistic data analytics-based framework
AU - Simsek, Serhat
AU - Tiahrt, Thomas
AU - Dag, Ali
N1 - Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2020/5
Y1 - 2020/5
N2 - Accurate prediction of no-show patients plays a crucial role as it enables researchers to increase the efficiency of their scheduling systems. The purpose of the current study is to formulate a novel hybrid data mining-based methodology to a) accurately predict the no-show patients, b) build a parsimonious model by employing a comprehensive variable selection procedure, c) build a model that does not suffer due to data imbalance, and d) provide healthcare agencies with a patient-specific risk level. Our study suggests that an Artificial Neural Network (ANN) model should be employed as a classification algorithm in predicting patient no-shows by using the variable set that is commonly selected by a Genetic Algorithm (GA) and Simulated Annealing (SA). In addition, we used Random Under Sampling (RUS) to improve the performance of the model in predicting the minority group (no-show) patients. The patient-specific risk scores were justified by applying a threshold sensitivity analysis. Also, the web-based decision support tool that can be adopted by clinics is developed. The clinics can incorporate their own intuition/incentive to make the final decision on the cases where the model is not confident enough (i.e. when the estimated probabilities fall near the decision boundary). These insights enable health care professionals to improve clinic utilization and patient outcomes.
AB - Accurate prediction of no-show patients plays a crucial role as it enables researchers to increase the efficiency of their scheduling systems. The purpose of the current study is to formulate a novel hybrid data mining-based methodology to a) accurately predict the no-show patients, b) build a parsimonious model by employing a comprehensive variable selection procedure, c) build a model that does not suffer due to data imbalance, and d) provide healthcare agencies with a patient-specific risk level. Our study suggests that an Artificial Neural Network (ANN) model should be employed as a classification algorithm in predicting patient no-shows by using the variable set that is commonly selected by a Genetic Algorithm (GA) and Simulated Annealing (SA). In addition, we used Random Under Sampling (RUS) to improve the performance of the model in predicting the minority group (no-show) patients. The patient-specific risk scores were justified by applying a threshold sensitivity analysis. Also, the web-based decision support tool that can be adopted by clinics is developed. The clinics can incorporate their own intuition/incentive to make the final decision on the cases where the model is not confident enough (i.e. when the estimated probabilities fall near the decision boundary). These insights enable health care professionals to improve clinic utilization and patient outcomes.
KW - Data mining
KW - Healthcare informatics
KW - Medical decision making
KW - Patient no-shows
UR - http://www.scopus.com/inward/record.url?scp=85080101302&partnerID=8YFLogxK
U2 - 10.1016/j.dss.2020.113269
DO - 10.1016/j.dss.2020.113269
M3 - Article
AN - SCOPUS:85080101302
SN - 0167-9236
VL - 132
JO - Decision Support Systems
JF - Decision Support Systems
M1 - 113269
ER -