TY - JOUR
T1 - Third-generation sequencing revises the molecular karyotype for Toxoplasma gondii and identifies emerging copy number variants in sexual recombinants
AU - Xia, Jing
AU - Venkat, Aarthi
AU - Bainbridge, Rachel E.
AU - Reese, Michael L.
AU - Le Roch, Karine G.
AU - Ay, Ferhat
AU - Boyle, Jon P.
N1 - Funding Information:
We thank members of the Boyle laboratory for critical reading of the manuscript and Josh Quick for public sharing of protocols for DNA isolation that maximize read length. This work was supported by National Institutes of Health (NIH) grants R01AI116855, R01AI114655, and R21AI154386 to J.P.B., State Scholarship Fund from the China Scholar Council (2017084 40340) to J.X., and grants R21AI142506 (NIH) and NIFA-Hatch-225935 (University of California, Riverside) to K.G.L.R.
Publisher Copyright:
© 2021 Xia et al.
PY - 2021
Y1 - 2021
N2 - Toxoplasma gondii is a useful model for intracellular parasitism given its ease of culture in the laboratory and genomic resources. However, as for many other eukaryotes, the T. gondii genome contains hundreds of sequence gaps owing to repetitive and/or unclonable sequences that disrupt the assembly process. Here, we use the Oxford Nanopore Minion platform to generate near-complete de novo genome assemblies for multiple strains of T. gondii and its near relative, N. caninum. We significantly improved T. gondii genome contiguity (average N50 of ∼6.6 Mb) and added ∼2 Mb of newly assembled sequence. For all of the T. gondii strains that we sequenced (RH, ME49, CTG, II×III progeny clones CL13, S27, S21, S26, and D3X1), the largest contig ranged in size between 11.9 and 12.1 Mb in size, which is larger than any previously reported T. gondii chromosome, and found to be due to a consistent fusion of Chromosomes VIIb and VIII. These data were validated by mapping existing T. gondii ME49 Hi-C data to our assembly, providing parallel lines of evidence that the T. gondii karyotype consists of 13, rather than 14, chromosomes. By using this technology, we also resolved hundreds of tandem repeats of varying lengths, including in well-known host-targeting effector loci like rhoptry protein 5 (ROP5) and ROP38. Finally, when we compared T. gondii with N. caninum, we found that although the 13-chromosome karyotype was conserved, extensive, previously unappreciated chromosome-scale rearrangements had occurred in T. gondii and N. caninum since their most recent common ancestry.
AB - Toxoplasma gondii is a useful model for intracellular parasitism given its ease of culture in the laboratory and genomic resources. However, as for many other eukaryotes, the T. gondii genome contains hundreds of sequence gaps owing to repetitive and/or unclonable sequences that disrupt the assembly process. Here, we use the Oxford Nanopore Minion platform to generate near-complete de novo genome assemblies for multiple strains of T. gondii and its near relative, N. caninum. We significantly improved T. gondii genome contiguity (average N50 of ∼6.6 Mb) and added ∼2 Mb of newly assembled sequence. For all of the T. gondii strains that we sequenced (RH, ME49, CTG, II×III progeny clones CL13, S27, S21, S26, and D3X1), the largest contig ranged in size between 11.9 and 12.1 Mb in size, which is larger than any previously reported T. gondii chromosome, and found to be due to a consistent fusion of Chromosomes VIIb and VIII. These data were validated by mapping existing T. gondii ME49 Hi-C data to our assembly, providing parallel lines of evidence that the T. gondii karyotype consists of 13, rather than 14, chromosomes. By using this technology, we also resolved hundreds of tandem repeats of varying lengths, including in well-known host-targeting effector loci like rhoptry protein 5 (ROP5) and ROP38. Finally, when we compared T. gondii with N. caninum, we found that although the 13-chromosome karyotype was conserved, extensive, previously unappreciated chromosome-scale rearrangements had occurred in T. gondii and N. caninum since their most recent common ancestry.
UR - http://www.scopus.com/inward/record.url?scp=85106069734&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106069734&partnerID=8YFLogxK
U2 - 10.1101/GR.262816.120
DO - 10.1101/GR.262816.120
M3 - Article
C2 - 33906962
AN - SCOPUS:85106069734
SN - 1088-9051
VL - 31
SP - 834
EP - 851
JO - Genome Research
JF - Genome Research
IS - 5
ER -