Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing
- authored by
- Annett Erkes, René P. Grove, Milena Žarković, Sebastian Krautwurst, Ralf Koebnik, Richard D. Morgan, Geoffrey G. Wilson, Martin Hölzer, Manja Marz, Jens Boch, Jan Grau
- Abstract
Background: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. Results: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. Conclusions: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.
- Organisation(s)
-
Section Plant Biotechnology
- External Organisation(s)
-
Martin Luther University Halle-Wittenberg
Friedrich Schiller University Jena
Université Montpellier
New England Biolabs
Robert Koch Institute (RKI)
- Type
- Article
- Journal
- BMC GENOMICS
- Volume
- 24
- ISSN
- 1471-2164
- Publication date
- 12.2023
- Publication status
- Published
- Peer reviewed
- Yes
- ASJC Scopus subject areas
- Biotechnology, Genetics
- Electronic version(s)
-
https://doi.org/10.1186/s12864-023-09228-1 (Access:
Open)