Genome sequence of the palaeopolyploid soybean
Open Access
- 14 January 2010
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 463 (7278), 178-183
- https://doi.org/10.1038/nature08670
Abstract
Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.