Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana
Open Access
- 8 January 2007
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (2), 175-183
- https://doi.org/10.1101/gr.5509507
Abstract
We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ∼32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene.