TY - JOUR
T1 - Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program
AU - NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
AU - Taliun, Daniel
AU - Harris, Daniel N.
AU - Kessler, Michael D.
AU - Carlson, Jedidiah
AU - Szpiech, Zachary A.
AU - Torres, Raul
AU - Taliun, Sarah A.Gagliano
AU - Corvelo, André
AU - Gogarten, Stephanie M.
AU - Kang, Hyun Min
AU - Pitsillides, Achilleas N.
AU - LeFaive, Jonathon
AU - Lee, Seung been
AU - Tian, Xiaowen
AU - Browning, Brian L.
AU - Das, Sayantan
AU - Emde, Anne Katrin
AU - Clarke, Wayne E.
AU - Loesch, Douglas P.
AU - Shetty, Amol C.
AU - Blackwell, Thomas W.
AU - Smith, Albert V.
AU - Wong, Quenna
AU - Liu, Xiaoming
AU - Conomos, Matthew P.
AU - Bobo, Dean M.
AU - Aguet, François
AU - Albert, Christine
AU - Alonso, Alvaro
AU - Ardlie, Kristin G.
AU - Arking, Dan E.
AU - Aslibekyan, Stella
AU - Auer, Paul L.
AU - Barnard, John
AU - Barr, R. Graham
AU - Becker, Lewis
AU - Casella, James F.
AU - Mathias, RASIKA
AU - Post, Wendy S.
AU - Ruczinski, Ingo
AU - Avramopoulos, DIMITRI
AU - Barron-Casella, Emily
AU - Beaty, Terri
AU - Becker, Diane M
AU - Naik, Rakhi
AU - Becker, Julia Powers
AU - Salzberg, Steven
AU - Taub, Margaret
AU - Vaidya, Dhananjay
AU - Yanek, Lisa
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/2/11
Y1 - 2021/2/11
N2 - The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
AB - The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
UR - http://www.scopus.com/inward/record.url?scp=85101000110&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85101000110&partnerID=8YFLogxK
U2 - 10.1038/s41586-021-03205-y
DO - 10.1038/s41586-021-03205-y
M3 - Article
C2 - 33568819
AN - SCOPUS:85101000110
SN - 0028-0836
VL - 590
SP - 290
EP - 299
JO - Nature
JF - Nature
IS - 7845
ER -