De novo assembly using low-coverage short read sequence data from the rice pathogen Pseudomonas syringae pv. oryzae
- 17 November 2008
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (2) , 294-305
- https://doi.org/10.1101/gr.083311.108
Abstract
We developed a novel approach for de novo genome assembly using only sequence data from high-throughput short read sequencing technologies. By combining data generated from 454 Life Sciences (Roche) and Illumina (formerly known as Solexa sequencing) sequencing platforms, we reliably assembled genomes into large scaffolds at a fraction of the traditional cost and without use of a reference sequence. We applied this method to two isolates of the phytopathogenic bacteria Pseudomonas syringae. Sequencing and reassembly of the well-studied tomato and Arabidopsis pathogen, PtoDC3000, facilitated development and testing of our method. Sequencing of a distantly related rice pathogen, Por1_6, demonstrated our method's efficacy for de novo assembly of novel genomes. Our assembly of Por1_6 yielded an N50 scaffold size of 531,821 bp with >75% of the predicted genome covered by scaffolds over 100,000 bp. One of the critical phenotypic differences between strains of P. syringae is the range of plant hosts they infect. This is largely determined by their complement of type III effector proteins. The genome of Por1_6 is the first sequenced for a P. syringae isolate that is a pathogen of monocots, and, as might be predicted, its complement of type III effectors differs substantially from the previously sequenced isolates of this species. The genome of Por1_6 helps to define an expansion of the P. syringae pan-genome, a corresponding contraction of the core genome, and a further diversification of the type III effector complement for this important plant pathogen species.Keywords
This publication has 55 references indexed in Scilit:
- COI1 is a critical component of a receptor for jasmonate and the bacterial virulence factor coronatineProceedings of the National Academy of Sciences, 2008
- Bioinformatics challenges of new sequencing technologyPublished by Elsevier ,2008
- SOAP: short oligonucleotide alignment programBioinformatics, 2008
- Short read fragment assembly of bacterial genomesGenome Research, 2007
- Parallel genomic evolution and metabolic interdependence in an ancient symbiosisProceedings of the National Academy of Sciences, 2007
- SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencingGenome Research, 2007
- Identifying bacterial genes and endosymbiont DNA with GlimmerBioinformatics, 2007
- The plant immune systemNature, 2006
- A Sanger/pyrosequencing hybrid approach for the generation of high-quality draft assemblies of marine microbial genomesProceedings of the National Academy of Sciences, 2006
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002