TY - JOUR
T1 - Expressing Uncertainty in Information Systems Analytics Research
T2 - A Demonstration of Bayesian Analysis Applied to Binary Classification Problems
AU - Twitchell, Douglas P.
AU - Fuller, Christie M.
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2023/1
Y1 - 2023/1
N2 - The measures typically used to assess binary classification problems fail to incorporate the uncertainty inherent to many contexts into the results. We propose using a Bayesian model to express the uncertainty in binary classification problems. This study identified 10 previous studies that provided sufficient data to demonstrate the use of Bayesian analysis in Information Systems (IS) contexts with varying levels of uncertainty. The analysis and user study show that the addition of Bayesian analysis is most useful in high uncertainty contexts with a wide interval for positive predictive value. Such an interval will lead to high uncertainty, even with very certain sensitivity and specificity. The usefulness of Bayesian analysis in conditions of medium uncertainty depends on the context. In conditions of low uncertainty, Bayesian analysis does not add much value. The user study showed that presenting models with uncertainty changed researcher perception of which model performed the best with 18 of 21 researchers changing their opinion. We recommend that authors estimate the uncertainty in their models and provide confusion matrices and prevalence estimates in their results to enable Bayesian analysis as research in a domain matures.
AB - The measures typically used to assess binary classification problems fail to incorporate the uncertainty inherent to many contexts into the results. We propose using a Bayesian model to express the uncertainty in binary classification problems. This study identified 10 previous studies that provided sufficient data to demonstrate the use of Bayesian analysis in Information Systems (IS) contexts with varying levels of uncertainty. The analysis and user study show that the addition of Bayesian analysis is most useful in high uncertainty contexts with a wide interval for positive predictive value. Such an interval will lead to high uncertainty, even with very certain sensitivity and specificity. The usefulness of Bayesian analysis in conditions of medium uncertainty depends on the context. In conditions of low uncertainty, Bayesian analysis does not add much value. The user study showed that presenting models with uncertainty changed researcher perception of which model performed the best with 18 of 21 researchers changing their opinion. We recommend that authors estimate the uncertainty in their models and provide confusion matrices and prevalence estimates in their results to enable Bayesian analysis as research in a domain matures.
KW - Bayesian analysis
KW - Binary classification
KW - Positive predictive value
KW - Prevalence
KW - Uncertainty
UR - http://www.scopus.com/inward/record.url?scp=85141385430&partnerID=8YFLogxK
UR - https://scholarworks.boisestate.edu/itscm_facpubs/98
U2 - 10.1016/j.ipm.2022.103132
DO - 10.1016/j.ipm.2022.103132
M3 - Article
SN - 0306-4573
VL - 60
JO - Information Processing & Management
JF - Information Processing & Management
IS - 1
M1 - 103132
ER -