The Composition and Origins of Genomic Variation among Individuals of the Soybean Reference Cultivar Williams 82
Open Access
- 29 November 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Plant Physiology
- Vol. 155 (2), 645-655
- https://doi.org/10.1104/pp.110.166736
Abstract
Soybean (Glycine max) is a self-pollinating species that has relatively low nucleotide polymorphism rates compared with other crop species. Despite the low rate of nucleotide polymorphisms, a wide range of heritable phenotypic variation exists. There is even evidence for heritable phenotypic variation among individuals within some cultivars. Williams 82, the soybean cultivar used to produce the reference genome sequence, was derived from backcrossing a Phytophthora root rot resistance locus from the donor parent Kingwa into the recurrent parent Williams. To explore the genetic basis of intracultivar variation, we investigated the nucleotide, structural, and gene content variation of different Williams 82 individuals. Williams 82 individuals exhibited variation in the number and size of introgressed Kingwa loci. In these regions of genomic heterogeneity, the reference Williams 82 genome sequence consists of a mosaic of Williams and Kingwa haplotypes. Genomic structural variation between Williams and Kingwa was maintained between the Williams 82 individuals within the regions of heterogeneity. Additionally, the regions of heterogeneity exhibited gene content differences between Williams 82 individuals. These findings show that genetic heterogeneity in Williams 82 primarily originated from the differential segregation of polymorphic chromosomal regions following the backcross and single-seed descent generations of the breeding process. We conclude that soybean haplotypes can possess a high rate of structural and gene content variation, and the impact of intracultivar genetic heterogeneity may be significant. This detailed characterization will be useful for interpreting soybean genomic data sets and highlights important considerations for research communities that are developing or utilizing a reference genome sequence.