TY - JOUR
T1 - Logistic regression and Bayesian networks to study outcomes using large data sets
AU - Lee, Sun Mi
AU - Abbott, Patricia
AU - Johantgen, Mary
PY - 2005
Y1 - 2005
N2 - Background: In nursing research, the interest in using large health care databases to predict nursing sensitive outcomes is growing rapidly. Traditionally, one of the most frequently used methods is logistic regression (LR), which, although powerful and familiar, has several limitations when used in the analysis of large databases. As a result, innovative approaches are required. Approach: To (a) introduce an innovative/alternative data analysis approach (Bayesian network), (b) discuss the constraints of LR and the complementary advantages of Bayesian networks (BNs) in working with large and multidimensional health care data, and (c) provide a fundamental understanding of the use of BNs in the nursing/health care domain. Results: Studies have shown that BNs have several advantages over LR in analyzing complex and large data: (a) statistical assumptions, such as linearity and additivity, are relaxed; (b) handling of a larger number of predictors and identification of interactions among predictors is less complex; and (c) the discovery of structure, pattern, and knowledge, for example, of unknown, complex, and nonlinear relationships, in data is facilitated. Conclusion: Outcome studies, such as those undertaken by nurse researchers, may benefit from the examination and use of innovative approaches such as BNs to the analysis of very large and complex health care data sets.
AB - Background: In nursing research, the interest in using large health care databases to predict nursing sensitive outcomes is growing rapidly. Traditionally, one of the most frequently used methods is logistic regression (LR), which, although powerful and familiar, has several limitations when used in the analysis of large databases. As a result, innovative approaches are required. Approach: To (a) introduce an innovative/alternative data analysis approach (Bayesian network), (b) discuss the constraints of LR and the complementary advantages of Bayesian networks (BNs) in working with large and multidimensional health care data, and (c) provide a fundamental understanding of the use of BNs in the nursing/health care domain. Results: Studies have shown that BNs have several advantages over LR in analyzing complex and large data: (a) statistical assumptions, such as linearity and additivity, are relaxed; (b) handling of a larger number of predictors and identification of interactions among predictors is less complex; and (c) the discovery of structure, pattern, and knowledge, for example, of unknown, complex, and nonlinear relationships, in data is facilitated. Conclusion: Outcome studies, such as those undertaken by nurse researchers, may benefit from the examination and use of innovative approaches such as BNs to the analysis of very large and complex health care data sets.
KW - Bayesian network
KW - Large databases
KW - Logistic regression
KW - Nursing research
KW - Outcomes
KW - Prediction
UR - http://www.scopus.com/inward/record.url?scp=17244382596&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=17244382596&partnerID=8YFLogxK
U2 - 10.1097/00006199-200503000-00009
DO - 10.1097/00006199-200503000-00009
M3 - Article
C2 - 15778655
AN - SCOPUS:17244382596
SN - 0029-6562
VL - 54
SP - 133
EP - 138
JO - Nursing Research
JF - Nursing Research
IS - 2
ER -