TY - JOUR
T1 - Testing gene-environment interaction in large-scale case-control association studies
T2 - Possible choices and comparisons
AU - Mukherjee, Bhramar
AU - Ahn, Jaeil
AU - Gruber, Stephen B.
AU - Chatterjee, Nilanjan
PY - 2012/2/1
Y1 - 2012/2/1
N2 - Several methods for screening gene-environment interaction have recently been proposed that address the issue of using gene-environment independence in a data-adaptive way. In this report, the authors present a comparative simulation study of power and type I error properties of 3 classes of procedures: 1) the standard 1-step case-control method; 2) the case-only method that requires an assumption of gene-environment independence for the underlying population; and 3) a variety of hybrid methods, including empirical-Bayes, 2-step, and model averaging, that aim at gaining power by exploiting the assumption of gene-environment independence and yet can protect against false positives when the independence assumption is violated. These studies suggest that, although the case-only method generally has maximum power, it has the potential to create substantial false positives in large-scale studies even when a small fraction of markers are associated with the exposure under study in the underlying population. All the hybrid methods perform well in protecting against such false positives and yet can retain substantial power advantages over standard case-control tests. The authors conclude that, for future genome-wide scans for gene-environment interactions, major power gain is possible by using alternatives to standard case-control analysis. Whether a case-only type scan or one of the hybrid methods should be used depends on the strength and direction of gene-environment interaction and association, the level of tolerance for false positives, and the nature of replication strategies.
AB - Several methods for screening gene-environment interaction have recently been proposed that address the issue of using gene-environment independence in a data-adaptive way. In this report, the authors present a comparative simulation study of power and type I error properties of 3 classes of procedures: 1) the standard 1-step case-control method; 2) the case-only method that requires an assumption of gene-environment independence for the underlying population; and 3) a variety of hybrid methods, including empirical-Bayes, 2-step, and model averaging, that aim at gaining power by exploiting the assumption of gene-environment independence and yet can protect against false positives when the independence assumption is violated. These studies suggest that, although the case-only method generally has maximum power, it has the potential to create substantial false positives in large-scale studies even when a small fraction of markers are associated with the exposure under study in the underlying population. All the hybrid methods perform well in protecting against such false positives and yet can retain substantial power advantages over standard case-control tests. The authors conclude that, for future genome-wide scans for gene-environment interactions, major power gain is possible by using alternatives to standard case-control analysis. Whether a case-only type scan or one of the hybrid methods should be used depends on the strength and direction of gene-environment interaction and association, the level of tolerance for false positives, and the nature of replication strategies.
KW - case-control studies
KW - efficiency
KW - familywise error rate
KW - genome-wide association study
KW - profile likelihood
KW - robustness
KW - shrinkage
UR - http://www.scopus.com/inward/record.url?scp=84856181067&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856181067&partnerID=8YFLogxK
U2 - 10.1093/aje/kwr367
DO - 10.1093/aje/kwr367
M3 - Article
C2 - 22199027
AN - SCOPUS:84856181067
SN - 0002-9262
VL - 175
SP - 177
EP - 190
JO - American Journal of Epidemiology
JF - American Journal of Epidemiology
IS - 3
ER -