Analysis of the genome sequence of the flowering plant Arabidopsis thaliana
- 14 December 2000
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 408 (6814) , 796-815
- https://doi.org/10.1038/35048692
Abstract
The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans, the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.