TY - JOUR
T1 - Sequence and expression analysis of gaps in human chromosome 20
AU - Minocherhomji, Sheroy
AU - Seemann, Stefan
AU - Mang, Yuan
AU - El-Schich, Zahra
AU - Bak, Mads
AU - Hansen, Claus
AU - Papadopoulos, Nickolas
AU - Josefsen, Knud
AU - Nielsen, Henrik
AU - Gorodkin, Jan
AU - Tommerup, Niels
AU - Silahtaroglu, Asli
N1 - Funding Information:
EU FP6 Marie Curie Research Trainning Network ‘‘Chromatin Plasticity’’ (to A.S.); University of Copenhagen, Faculty of Health Sciences partial PhD Scholarship (to S.M.); Danish Ministry of Science, Technology and Innovation and the Danish National Research Council (to A.S. and N.T.); Danish Council for Independent Research (Technology and Production Sciences); Danish Council for Strategic Research (Program Commission on Strategic Growth Technologies); Danish Centre for Scientific Computation (to J.G.); Lundbeck Foundation (to J.G., N.T. and A.S., in part). Wilhelm Johannsen Centre for Functional Genome Research is established by the Danish National Research Foundation. Funding for open access charge: University of Copenhagen.
PY - 2012/8
Y1 - 2012/8
N2 - The finished human genome-assemblies comprise several hundred un-sequenced euchromatic gaps, which may be rich in long polypurine/polypyrimidine stretches. Human chromosome 20 (chr 20) currently has three unfinished gaps remaining on its q-arm. All three gaps are within gene-dense regions and/or overlap disease-associated loci, including the DLGAP4 locus. In this study, we sequenced ∼99 of all three unfinished gaps on human chr 20, determined their complete genomic sizes and assessed epigenetic profiles using a combination of Sanger sequencing, mate pair paired-end high-throughput sequencing and chromatin, methylation and expression analyses. We found histone 3 trimethylated at Lysine 27 to be distributed across all three gaps in immortalized B-lymphocytes. In one gap, five novel CpG islands were predominantly hypermethylated in genomic DNA from peripheral blood lymphocytes and human cerebellum. One of these CpG islands was differentially methylated and paternally hypermethylated. We found all chr 20 gaps to comprise structured non-coding RNAs (ncRNAs) and to be conserved in primates. We verified expression for 13 candidate ncRNAs, some of which showed tissue specificity. Four ncRNAs expressed within the gap at DLGAP4 show elevated expression in the human brain. Our data suggest that unfinished human genome gaps are likely to comprise numerous functional elements.
AB - The finished human genome-assemblies comprise several hundred un-sequenced euchromatic gaps, which may be rich in long polypurine/polypyrimidine stretches. Human chromosome 20 (chr 20) currently has three unfinished gaps remaining on its q-arm. All three gaps are within gene-dense regions and/or overlap disease-associated loci, including the DLGAP4 locus. In this study, we sequenced ∼99 of all three unfinished gaps on human chr 20, determined their complete genomic sizes and assessed epigenetic profiles using a combination of Sanger sequencing, mate pair paired-end high-throughput sequencing and chromatin, methylation and expression analyses. We found histone 3 trimethylated at Lysine 27 to be distributed across all three gaps in immortalized B-lymphocytes. In one gap, five novel CpG islands were predominantly hypermethylated in genomic DNA from peripheral blood lymphocytes and human cerebellum. One of these CpG islands was differentially methylated and paternally hypermethylated. We found all chr 20 gaps to comprise structured non-coding RNAs (ncRNAs) and to be conserved in primates. We verified expression for 13 candidate ncRNAs, some of which showed tissue specificity. Four ncRNAs expressed within the gap at DLGAP4 show elevated expression in the human brain. Our data suggest that unfinished human genome gaps are likely to comprise numerous functional elements.
UR - http://www.scopus.com/inward/record.url?scp=84864953110&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864953110&partnerID=8YFLogxK
U2 - 10.1093/nar/gks302
DO - 10.1093/nar/gks302
M3 - Article
C2 - 22510267
AN - SCOPUS:84864953110
SN - 1362-4962
VL - 40
SP - 6660
EP - 6672
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 14
ER -