TY - JOUR
T1 - Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits
AU - Zhang, Yan
AU - Qi, Guanghao
AU - Park, Ju Hyun
AU - Chatterjee, Nilanjan
N1 - Funding Information:
The authors thank H. Zhang for assistance with computing, P. Kundu for data management for the UK Biobank data, and M. Chatterjee for editing of manuscript. The research was supported by Bloomberg Distinguished Professorship endowment. Some of the simulation studies were conducted using genotype data from the UK Biobank Resource accessed under Application Number 17712.
Publisher Copyright:
© 2018, The Author(s).
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2018/9/1
Y1 - 2018/9/1
N2 - We developed a likelihood-based approach for analyzing summary-level statistics and external linkage disequilibrium information to estimate effect-size distributions of common variants, characterized by the proportion of underlying susceptibility SNPs and a flexible normal-mixture model for their effects. Analysis of results available across 32 genome-wide association studies showed that, while all traits are highly polygenic, there is wide diversity in the degree and nature of polygenicity. Psychiatric diseases and traits related to mental health and ability appear to be most polygenic, involving a continuum of small effects. Most other traits, including major chronic diseases, involve clusters of SNPs that have distinct magnitudes of effects. We predict that the sample sizes needed to identify SNPs that explain most heritability found in genome-wide association studies will range from a few hundred thousand to multiple millions, depending on the underlying effect-size distributions of the traits. Accordingly, we project the risk-prediction ability of polygenic risk scores across a wide variety of diseases.
AB - We developed a likelihood-based approach for analyzing summary-level statistics and external linkage disequilibrium information to estimate effect-size distributions of common variants, characterized by the proportion of underlying susceptibility SNPs and a flexible normal-mixture model for their effects. Analysis of results available across 32 genome-wide association studies showed that, while all traits are highly polygenic, there is wide diversity in the degree and nature of polygenicity. Psychiatric diseases and traits related to mental health and ability appear to be most polygenic, involving a continuum of small effects. Most other traits, including major chronic diseases, involve clusters of SNPs that have distinct magnitudes of effects. We predict that the sample sizes needed to identify SNPs that explain most heritability found in genome-wide association studies will range from a few hundred thousand to multiple millions, depending on the underlying effect-size distributions of the traits. Accordingly, we project the risk-prediction ability of polygenic risk scores across a wide variety of diseases.
UR - http://www.scopus.com/inward/record.url?scp=85052492980&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85052492980&partnerID=8YFLogxK
U2 - 10.1038/s41588-018-0193-x
DO - 10.1038/s41588-018-0193-x
M3 - Article
C2 - 30104760
AN - SCOPUS:85052492980
SN - 1061-4036
VL - 50
SP - 1318
EP - 1326
JO - Nature genetics
JF - Nature genetics
IS - 9
ER -