TY - JOUR
T1 - Testing in microbiome-profiling studies with MiRKAT, the microbiome regression-based kernel association test
AU - Zhao, Ni
AU - Chen, Jun
AU - Carroll, Ian M.
AU - Ringel-Kulka, Tamar
AU - Epstein, Michael P.
AU - Zhou, Hua
AU - Zhou, Jin J.
AU - Ringel, Yehuda
AU - Li, Hongzhe
AU - Wu, Michael C.
N1 - Publisher Copyright:
© 2015 by The American Society of Human Genetics. All rights reserved.
PY - 2015/5/7
Y1 - 2015/5/7
N2 - High-throughput sequencing technology has enabled population-based studies of the role of the human microbiome in disease etiology and exposure response. Distance-based analysis is a popular strategy for evaluating the overall association between microbiome diversity and outcome, wherein the phylogenetic distance between individuals' microbiome profiles is computed and tested for association via permutation. Despite their practical popularity, distance-based approaches suffer from important challenges, especially in selecting the best distance and extending the methods to alternative outcomes, such as survival outcomes. We propose the microbiome regression-based kernel association test (MiRKAT), which directly regresses the outcome on the microbiome profiles via the semi-parametric kernel machine regression framework. MiRKAT allows for easy covariate adjustment and extension to alternative outcomes while non-parametrically modeling the microbiome through a kernel that incorporates phylogenetic distance. It uses a variance-component score statistic to test for the association with analytical p value calculation. The model also allows simultaneous examination of multiple distances, alleviating the problem of choosing the best distance. Our simulations demonstrated that MiRKAT provides correctly controlled type I error and adequate power in detecting overall association. "Optimal" MiRKAT, which considers multiple candidate distances, is robust in that it suffers from little power loss in comparison to when the best distance is used and can achieve tremendous power gain in comparison to when a poor distance is chosen. Finally, we applied MiRKAT to real microbiome datasets to show that microbial communities are associated with smoking and with fecal protease levels after confounders are controlled for.
AB - High-throughput sequencing technology has enabled population-based studies of the role of the human microbiome in disease etiology and exposure response. Distance-based analysis is a popular strategy for evaluating the overall association between microbiome diversity and outcome, wherein the phylogenetic distance between individuals' microbiome profiles is computed and tested for association via permutation. Despite their practical popularity, distance-based approaches suffer from important challenges, especially in selecting the best distance and extending the methods to alternative outcomes, such as survival outcomes. We propose the microbiome regression-based kernel association test (MiRKAT), which directly regresses the outcome on the microbiome profiles via the semi-parametric kernel machine regression framework. MiRKAT allows for easy covariate adjustment and extension to alternative outcomes while non-parametrically modeling the microbiome through a kernel that incorporates phylogenetic distance. It uses a variance-component score statistic to test for the association with analytical p value calculation. The model also allows simultaneous examination of multiple distances, alleviating the problem of choosing the best distance. Our simulations demonstrated that MiRKAT provides correctly controlled type I error and adequate power in detecting overall association. "Optimal" MiRKAT, which considers multiple candidate distances, is robust in that it suffers from little power loss in comparison to when the best distance is used and can achieve tremendous power gain in comparison to when a poor distance is chosen. Finally, we applied MiRKAT to real microbiome datasets to show that microbial communities are associated with smoking and with fecal protease levels after confounders are controlled for.
UR - http://www.scopus.com/inward/record.url?scp=84929159912&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84929159912&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2015.04.003
DO - 10.1016/j.ajhg.2015.04.003
M3 - Article
C2 - 25957468
AN - SCOPUS:84929159912
SN - 0002-9297
VL - 96
SP - 797
EP - 807
JO - American journal of human genetics
JF - American journal of human genetics
IS - 5
ER -