TY - JOUR
T1 - Polygenic Risk Scores for Prediction of Breast Cancer and Breast Cancer Subtypes
AU - NBCS Collaborators
AU - ABCTB Investigators
AU - kConFab/AOCS Investigators
AU - Mavaddat, Nasim
AU - Michailidou, Kyriaki
AU - Dennis, Joe
AU - Lush, Michael
AU - Fachal, Laura
AU - Lee, Andrew
AU - Tyrer, Jonathan P.
AU - Chen, Ting Huei
AU - Wang, Qin
AU - Bolla, Manjeet K.
AU - Yang, Xin
AU - Adank, Muriel A.
AU - Ahearn, Thomas
AU - Aittomäki, Kristiina
AU - Allen, Jamie
AU - Andrulis, Irene L.
AU - Anton-Culver, Hoda
AU - Antonenkova, Natalia N.
AU - Arndt, Volker
AU - Aronson, Kristan J.
AU - Auer, Paul L.
AU - Auvinen, Päivi
AU - Barrdahl, Myrto
AU - Beane Freeman, Laura E.
AU - Beckmann, Matthias W.
AU - Behrens, Sabine
AU - Benitez, Javier
AU - Bermisheva, Marina
AU - Bernstein, Leslie
AU - Blomqvist, Carl
AU - Bogdanova, Natalia V.
AU - Bojesen, Stig E.
AU - Bonanni, Bernardo
AU - Børresen-Dale, Anne Lise
AU - Brauch, Hiltrud
AU - Bremer, Michael
AU - Brenner, Hermann
AU - Brentnall, Adam
AU - Brock, Ian W.
AU - Brooks-Wilson, Angela
AU - Brucker, Sara Y.
AU - Brüning, Thomas
AU - Burwinkel, Barbara
AU - Campa, Daniele
AU - Carter, Brian D.
AU - Castelao, Jose E.
AU - Chanock, Stephen J.
AU - Chlebowski, Rowan
AU - Christiansen, Hans
AU - Chatterjee, Nilanjan
N1 - Publisher Copyright:
© 2018 The Authors
PY - 2019/1/3
Y1 - 2019/1/3
N2 - Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57–1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628–0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
AB - Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57–1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628–0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
KW - breast
KW - cancer
KW - epidemiology
KW - genetic
KW - polygenic
KW - prediction
KW - risk
KW - score
KW - screening
KW - stratification
UR - http://www.scopus.com/inward/record.url?scp=85059498503&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85059498503&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2018.11.002
DO - 10.1016/j.ajhg.2018.11.002
M3 - Article
C2 - 30554720
AN - SCOPUS:85059498503
SN - 0002-9297
VL - 104
SP - 21
EP - 34
JO - American journal of human genetics
JF - American journal of human genetics
IS - 1
ER -