TY - JOUR
T1 - Whole-genome assembly of a hybrid Trypanosoma cruzi strain assembled with Nanopore sequencing alone
AU - Hakim, Jill M.C.
AU - Gutierrez Guarnizo, Sneider A.
AU - Machaca, Edith Málaga
AU - Gilman, Robert H.
AU - Mugnier, Monica R.
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/6
Y1 - 2024/6
N2 - Trypanosoma cruzi is the causative agent of Chagas disease, which causes 10,000 deaths per year. Despite the high mortality associated with Chagas, relatively few parasite genomes have been assembled to date, with genome assemblies unavailable even for some commonly used laboratory strains. This is at least partially due to T. cruzi’s highly complex and highly repetitive genome, which defies investigation using traditional short-read sequencing methods. In this study, we have generated a high-quality whole-genome assembly of the hybrid Tulahuen strain, a commercially available type VI strain, using long-read Nanopore sequencing without short-read scaffolding. The assembled genome contains 25% repeat regions, 17% variable multigene family members, and 27% transposable elements (TEs) and is of comparable quality with T. cruzi genome assemblies that utilized both long- and short-read data. Notably, we find that regions with TEs are significantly enriched for multicopy surface proteins, and that surface proteins are, on average, closer to TEs than to other coding regions. This finding suggests that mobile genetic elements such as transposons may drive recombination within surface protein gene families. This work demonstrates the feasibility of Nanopore sequencing to resolve complex regions of T. cruzi genomes, and with these resolved regions, provides support for a possible mechanism for genomic diversification.
AB - Trypanosoma cruzi is the causative agent of Chagas disease, which causes 10,000 deaths per year. Despite the high mortality associated with Chagas, relatively few parasite genomes have been assembled to date, with genome assemblies unavailable even for some commonly used laboratory strains. This is at least partially due to T. cruzi’s highly complex and highly repetitive genome, which defies investigation using traditional short-read sequencing methods. In this study, we have generated a high-quality whole-genome assembly of the hybrid Tulahuen strain, a commercially available type VI strain, using long-read Nanopore sequencing without short-read scaffolding. The assembled genome contains 25% repeat regions, 17% variable multigene family members, and 27% transposable elements (TEs) and is of comparable quality with T. cruzi genome assemblies that utilized both long- and short-read data. Notably, we find that regions with TEs are significantly enriched for multicopy surface proteins, and that surface proteins are, on average, closer to TEs than to other coding regions. This finding suggests that mobile genetic elements such as transposons may drive recombination within surface protein gene families. This work demonstrates the feasibility of Nanopore sequencing to resolve complex regions of T. cruzi genomes, and with these resolved regions, provides support for a possible mechanism for genomic diversification.
KW - Chagas
KW - Nanopore
KW - Trypanosoma cruzi
KW - genome assembly
UR - http://www.scopus.com/inward/record.url?scp=85195328007&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85195328007&partnerID=8YFLogxK
U2 - 10.1093/g3journal/jkae076
DO - 10.1093/g3journal/jkae076
M3 - Article
C2 - 38592968
AN - SCOPUS:85195328007
SN - 2160-1836
VL - 14
JO - G3: Genes, Genomes, Genetics
JF - G3: Genes, Genomes, Genetics
IS - 6
M1 - jkae076
ER -