TY - JOUR
T1 - Organizing and Analyzing the Activity Data in NHANES
AU - Leroux, Andrew
AU - Di, Junrui
AU - Smirnova, Ekaterina
AU - McGuffey, Elizabeth J.
AU - Cao, Quy
AU - Bayatmokhtari, Elham
AU - Tabacu, Lucia
AU - Zipunnikov, Vadim
AU - Urbanek, Jacek K.
AU - Crainiceanu, Ciprian
N1 - Funding Information:
This research was supported by the National Heart, Lung, and Blood Institute (R01 HL123407), the National Institute of Neurological Disorders and Stroke (R01 NS060910), and the National Institute on Aging Training Grant (T32 AG000247).
We would like to thank the CDC, specifically the National Center for Health Statistics for collecting, organizing, and making public this unique data resource. We would also like to thank them for the permission to repost the publicly available NHANES and NDI data in analytic format. Also, we would like to thank the thousands of anonymous participants in the NHANES, whose data led to the exciting findings in this paper.
Publisher Copyright:
© 2019, International Chinese Statistical Association.
PY - 2019/7/15
Y1 - 2019/7/15
N2 - The NHANES study contains objectively measured physical activity data collected using hip-worn accelerometers from multiple cohorts. However, using the accelerometry data has proven daunting because (1) currently, there are no agreed-upon standard protocols for data storage and analysis; (2) data exhibit heterogeneous patterns of missingness due to varying degrees of adherence to wear-time protocols; (3) sampling weights need to be carefully adjusted and accounted for in individual analyses; (4) there is a lack of reproducible software that transforms the data from its published format into analytic form; and (5) the high dimensional nature of accelerometry data complicates analyses. Here, we provide a framework for processing, storing, and analyzing the NHANES accelerometry data for the 2003–2004 and 2005–2006 surveys. We also provide an NHANES data package in R, to help disseminate high-quality, processed activity data combined with mortality and demographic information. Thus, we provide the tools to transition from “available data online” to “easily accessible and usable data”, which substantially reduces the large upfront costs of initiating studies of association between physical activity and human health outcomes using NHANES. We apply these tools in an analysis showing that accelerometry features have the potential to predict 5-year all-cause mortality better than known risk factors such as age, cigarette smoking, and various comorbidities.
KW - Accelerometry
KW - NHANES
KW - Physical activity
KW - Prediction
UR - http://www.scopus.com/inward/record.url?scp=85061322934&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061322934&partnerID=8YFLogxK
U2 - 10.1007/s12561-018-09229-9
DO - 10.1007/s12561-018-09229-9
M3 - Article
C2 - 32047572
AN - SCOPUS:85061322934
SN - 1867-1764
VL - 11
SP - 262
EP - 287
JO - Statistics in Biosciences
JF - Statistics in Biosciences
IS - 2
ER -