TY - JOUR
T1 - The importance of prediction model validation and assessment in obesity and nutrition research
AU - Ivanescu, A. E.
AU - Li, P.
AU - George, B.
AU - Brown, A. W.
AU - Keith, S. W.
AU - Raju, D.
AU - Allison, D. B.
N1 - Publisher Copyright:
© 2016 Macmillan Publishers Limited All rights reserved.
PY - 2016/6/1
Y1 - 2016/6/1
N2 - Deriving statistical models to predict one variable from one or more other variables, or predictive modeling, is an important activity in obesity and nutrition research. To determine the quality of the model, it is necessary to quantify and report the predictive validity of the derived models. Conducting validation of the predictive measures provides essential information to the research community about the model. Unfortunately, many articles fail to account for the nearly inevitable reduction in predictive ability that occurs when a model derived on one data set is applied to a new data set. Under some circumstances, the predictive validity can be reduced to nearly zero. In this overview, we explain why reductions in predictive validity occur, define the metrics commonly used to estimate the predictive validity of a model (for example, coefficient of determination (R 2), mean squared error, sensitivity, specificity, receiver operating characteristic and concordance index) and describe methods to estimate the predictive validity (for example, cross-validation, bootstrap, and adjusted and shrunken R 2). We emphasize that methods for estimating the expected reduction in predictive ability of a model in new samples are available and this expected reduction should always be reported when new predictive models are introduced.
AB - Deriving statistical models to predict one variable from one or more other variables, or predictive modeling, is an important activity in obesity and nutrition research. To determine the quality of the model, it is necessary to quantify and report the predictive validity of the derived models. Conducting validation of the predictive measures provides essential information to the research community about the model. Unfortunately, many articles fail to account for the nearly inevitable reduction in predictive ability that occurs when a model derived on one data set is applied to a new data set. Under some circumstances, the predictive validity can be reduced to nearly zero. In this overview, we explain why reductions in predictive validity occur, define the metrics commonly used to estimate the predictive validity of a model (for example, coefficient of determination (R 2), mean squared error, sensitivity, specificity, receiver operating characteristic and concordance index) and describe methods to estimate the predictive validity (for example, cross-validation, bootstrap, and adjusted and shrunken R 2). We emphasize that methods for estimating the expected reduction in predictive ability of a model in new samples are available and this expected reduction should always be reported when new predictive models are introduced.
UR - http://www.scopus.com/inward/record.url?scp=84946106320&partnerID=8YFLogxK
U2 - 10.1038/ijo.2015.214
DO - 10.1038/ijo.2015.214
M3 - Review article
C2 - 26449421
AN - SCOPUS:84946106320
SN - 0307-0565
VL - 40
SP - 887
EP - 894
JO - International Journal of Obesity
JF - International Journal of Obesity
IS - 6
ER -