TY - JOUR
T1 - Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models
AU - Scharfstein, Daniel O.
AU - Rotnitzky, Andrea
AU - Robins, James M.
N1 - Funding Information:
Daniela. Scharfstein is Assistant Professor of Biostatistics, Johns Hopkins School of Hygiene and Public Health, Baltimore, MD 21205. Andrea Rotnitzky is Associate Professor of Biostatistics and James M. Robins is Professor of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA 02115. This research was partially supported by National Institute of Health grants I-R29-0M48704-0, 5ROIA132475-07, ROICA74112, I-ROI-MH56639-0IAI, ROI-HD-38209-01, and I-ROI-DAI0184-0IA2. The authors wish to thank Victor DeOruttola for helpful discussions.
Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 1999/12/1
Y1 - 1999/12/1
N2 - Consider a study whose design calls for the study subjects to be followed from enrollment (time t = 0) to time t = T, at which point a primary endpoint of interest Y is to be measured. The design of the study also calls for measurements on a vector Vt) of covariates to be made at one or more times t during the interval [0, T). We are interested in making inferences about the marginal mean μ0 of Y when some subjects drop out of the study at random times Q prior to the common fixed end of follow-up time T. The purpose of this article is to show how to make inferences about μ0 when the continuous drop-out time Q is modeled semiparametrically and no restrictions are placed on the joint distribution of the outcome and other measured variables. In particular, we consider two models for the conditional hazard of drop-out given (V(T), Y), where V(t) denotes the history of the process Vt) through time t, t ∈ [0, T). In the first model, we assume that λQ(t|V(T), Y) exp(α0Y), where α0 is a scalar parameter and λ0(t|V(t)) is an unrestricted positive function of t and the process V(t). When the process Vt) is high dimensional, estimation in this model is not feasible with moderate sample sizes, due to the curse of dimensionality. For such situations, we consider a second model that imposes the additional restriction that λ0(t|V(t)) = λ0(t) exp(γ′0(t)), where λ0t) is an unspecified baseline hazard function, W(t) = w(t, V(t)), w(·,·) is a known function that maps (t, V(t)) to Rq, and γ0 is a q × 1 unknown parameter vector. When α0 ≠ 0, then drop-out is nonignorable. On account of identifiability problems, joint estimation of the mean μ0 of Y and the selection bias parameter α0 may be difficult or impossible. Therefore, we propose regarding the selection bias parameter α0 as known, rather than estimating it from the data. We then perform a sensitivity analysis to see how inference about α0 changes as we vary α0 over a plausible range of values. We apply our approach to the analysis of ACTG 175, an AIDS clinical trial.
AB - Consider a study whose design calls for the study subjects to be followed from enrollment (time t = 0) to time t = T, at which point a primary endpoint of interest Y is to be measured. The design of the study also calls for measurements on a vector Vt) of covariates to be made at one or more times t during the interval [0, T). We are interested in making inferences about the marginal mean μ0 of Y when some subjects drop out of the study at random times Q prior to the common fixed end of follow-up time T. The purpose of this article is to show how to make inferences about μ0 when the continuous drop-out time Q is modeled semiparametrically and no restrictions are placed on the joint distribution of the outcome and other measured variables. In particular, we consider two models for the conditional hazard of drop-out given (V(T), Y), where V(t) denotes the history of the process Vt) through time t, t ∈ [0, T). In the first model, we assume that λQ(t|V(T), Y) exp(α0Y), where α0 is a scalar parameter and λ0(t|V(t)) is an unrestricted positive function of t and the process V(t). When the process Vt) is high dimensional, estimation in this model is not feasible with moderate sample sizes, due to the curse of dimensionality. For such situations, we consider a second model that imposes the additional restriction that λ0(t|V(t)) = λ0(t) exp(γ′0(t)), where λ0t) is an unspecified baseline hazard function, W(t) = w(t, V(t)), w(·,·) is a known function that maps (t, V(t)) to Rq, and γ0 is a q × 1 unknown parameter vector. When α0 ≠ 0, then drop-out is nonignorable. On account of identifiability problems, joint estimation of the mean μ0 of Y and the selection bias parameter α0 may be difficult or impossible. Therefore, we propose regarding the selection bias parameter α0 as known, rather than estimating it from the data. We then perform a sensitivity analysis to see how inference about α0 changes as we vary α0 over a plausible range of values. We apply our approach to the analysis of ACTG 175, an AIDS clinical trial.
KW - Augmented inverse probability of censoring weighted estimators
KW - Cox proportional hazards model
KW - Identification; Missing data
KW - Noncompliance; Nonparametric methods
KW - Randomized trials
KW - Sensitivity analysis
KW - Time-dependent covariates
UR - http://www.scopus.com/inward/record.url?scp=0442278084&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0442278084&partnerID=8YFLogxK
U2 - 10.1080/01621459.1999.10473862
DO - 10.1080/01621459.1999.10473862
M3 - Article
AN - SCOPUS:0442278084
SN - 0162-1459
VL - 94
SP - 1096
EP - 1120
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 448
ER -