TY - JOUR
T1 - Enrichment analyses identify shared associations for 25 quantitative traits in over 600,000 individuals from seven diverse ancestries
AU - Smith, Samuel Pattillo
AU - Shahamatdar, Sahar
AU - Cheng, Wei
AU - Zhang, Selena
AU - Paik, Joseph
AU - Graff, Misa
AU - Haiman, Christopher
AU - Matise, T. C.
AU - North, Kari E.
AU - Peters, Ulrike
AU - Kenny, Eimear
AU - Gignoux, Chris
AU - Wojcik, Genevieve
AU - Crawford, Lorin
AU - Ramachandran, Sohini
N1 - Funding Information:
We thank Kirk Lohmueller and Alicia R. Martin for helpful comments on an earlier version of this manuscript as well as the Crawford and Ramachandran Labs for helpful discussions. This research was conducted in part with computational resources and services at the Center for Computation and Visualization at Brown University as well as with the UK Biobank Resource under application number 22419. The Population Architecture Using Genomics and Epidemiology (PAGE) program is funded by the National Human Genome Research Institute (NHGRI) with co-funding from the National Institute on Minority Health and Health Disparities (NIMHD). The WHI program is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through contracts 75N92021D00001, 75N92021D00002, 75N92021D00003, 75N92021D00004, and 75N92021D00005. The HCHS/SOL study was carried out as a collaborative study supported by contracts from the National Heart, Lung and Blood Institute (NHLBI) to the University of North Carolina (N01-HC65233), University of Miami (N01-HC65234), Albert Einstein College of Medicine (N01-HC65235), Northwestern University (N01-HC65236), and San Diego State University (N01-HC65237). S.P.S. was a trainee supported under the Brown University Predoctoral Training Program in Biological Data Science (NIH T32 GM128596). L.C. acknowledges the support of an Alfred P. Sloan Research Fellowship and a David & Lucile Packard Fellowship for Science and Engineering. This work was also supported by US National Institutes of Health R01 GM118652 and National Institutes of Health R35 GM139628 to S.R. S.R. acknowledges additional support from National Science Foundation CAREER Award DBI-1452622. C.G. owns stock in 23andMe. E.K. and C.G. are members of the scientific advisory board for Encompass Bioscience. E.K. consults for Illumina.
Funding Information:
We thank Kirk Lohmueller and Alicia R. Martin for helpful comments on an earlier version of this manuscript as well as the Crawford and Ramachandran Labs for helpful discussions. This research was conducted in part with computational resources and services at the Center for Computation and Visualization at Brown University as well as with the UK Biobank Resource under application number 22419. The Population Architecture Using Genomics and Epidemiology (PAGE) program is funded by the National Human Genome Research Institute (NHGRI) with co-funding from the National Institute on Minority Health and Health Disparities (NIMHD). The WHI program is funded by the National Heart, Lung, and Blood Institute , National Institutes of Health , U.S. Department of Health and Human Services through contracts 75N92021D00001 , 75N92021D00002 , 75N92021D00003 , 75N92021D00004 , and 75N92021D00005 . The HCHS/SOL study was carried out as a collaborative study supported by contracts from the National Heart, Lung and Blood Institute (NHLBI) to the University of North Carolina ( N01-HC65233 ), University of Miami ( N01-HC65234 ), Albert Einstein College of Medicine ( N01-HC65235 ), Northwestern University ( N01-HC65236 ), and San Diego State University ( N01-HC65237 ). S.P.S. was a trainee supported under the Brown University Predoctoral Training Program in Biological Data Science ( NIH T32 GM128596 ). L.C. acknowledges the support of an Alfred P. Sloan Research Fellowship and a David & Lucile Packard Fellowship for Science and Engineering. This work was also supported by US National Institutes of Health R01 GM118652 and National Institutes of Health R35 GM139628 to S.R. S.R. acknowledges additional support from National Science Foundation CAREER Award DBI-1452622 .
Publisher Copyright:
© 2022 The Author(s)
PY - 2022/5/5
Y1 - 2022/5/5
N2 - Since 2005, genome-wide association (GWA) datasets have been largely biased toward sampling European ancestry individuals, and recent studies have shown that GWA results estimated from self-identified European individuals are not transferable to non-European individuals because of various confounding challenges. Here, we demonstrate that enrichment analyses that aggregate SNP-level association statistics at multiple genomic scales—from genes to genomic regions and pathways—have been underutilized in the GWA era and can generate biologically interpretable hypotheses regarding the genetic basis of complex trait architecture. We illustrate examples of the robust associations generated by enrichment analyses while studying 25 continuous traits assayed in 566,786 individuals from seven diverse self-identified human ancestries in the UK Biobank and the Biobank Japan as well as 44,348 admixed individuals from the PAGE consortium including cohorts of African American, Hispanic and Latin American, Native Hawaiian, and American Indian/Alaska Native individuals. We identify 1,000 gene-level associations that are genome-wide significant in at least two ancestry cohorts across these 25 traits as well as highly conserved pathway associations with triglyceride levels in European, East Asian, and Native Hawaiian cohorts.
AB - Since 2005, genome-wide association (GWA) datasets have been largely biased toward sampling European ancestry individuals, and recent studies have shown that GWA results estimated from self-identified European individuals are not transferable to non-European individuals because of various confounding challenges. Here, we demonstrate that enrichment analyses that aggregate SNP-level association statistics at multiple genomic scales—from genes to genomic regions and pathways—have been underutilized in the GWA era and can generate biologically interpretable hypotheses regarding the genetic basis of complex trait architecture. We illustrate examples of the robust associations generated by enrichment analyses while studying 25 continuous traits assayed in 566,786 individuals from seven diverse self-identified human ancestries in the UK Biobank and the Biobank Japan as well as 44,348 admixed individuals from the PAGE consortium including cohorts of African American, Hispanic and Latin American, Native Hawaiian, and American Indian/Alaska Native individuals. We identify 1,000 gene-level associations that are genome-wide significant in at least two ancestry cohorts across these 25 traits as well as highly conserved pathway associations with triglyceride levels in European, East Asian, and Native Hawaiian cohorts.
KW - GWAS, multi-ancestry, enrichment analyses
UR - http://www.scopus.com/inward/record.url?scp=85129411168&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129411168&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2022.03.005
DO - 10.1016/j.ajhg.2022.03.005
M3 - Article
C2 - 35349783
AN - SCOPUS:85129411168
SN - 0002-9297
VL - 109
SP - 871
EP - 884
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 5
ER -