Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees
Open Access
- 3 November 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (Database), D787-D792
- https://doi.org/10.1093/nar/gkm878
Abstract
Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is a prerequisite for further studies. However, detection of orthologs can be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called ‘Evola’. In the ortholog detection process, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In ‘dN/dS view’, natural selection on genes can be analyzed between human and other species. In ‘Locus maps’, all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/.
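For readers unfamiliar with the pairwise approach that the abstract contrasts with Evola's curated pipeline, the following is a minimal sketch of reciprocal best-hit pairing. It is not the Evola method and not code from the paper. The hit tuples, gene identifiers, scores, and function names are all hypothetical, and real input would come from parsed BLAST output.

```python
from typing import Dict, List, Tuple

# Hypothetical hit record: (query_id, subject_id, bit_score).
Hit = Tuple[str, str, float]


def best_hits(hits: List[Hit]) -> Dict[str, str]:
    """Return the highest-scoring subject for each query."""
    best: Dict[str, Tuple[str, float]] = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b: List[Hit], b_vs_a: List[Hit]) -> List[Tuple[str, str]]:
    """Pair gene a with gene b only if each is the other's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Made-up scores for two human genes (H1, H2) and two mouse genes (M1, M2).
human_vs_mouse = [("H1", "M1", 200.0), ("H1", "M2", 150.0),
                  ("H2", "M1", 180.0), ("H2", "M2", 90.0)]
mouse_vs_human = [("M1", "H1", 200.0), ("M1", "H2", 180.0),
                  ("M2", "H1", 150.0), ("M2", "H2", 90.0)]

pairs = reciprocal_best_hits(human_vs_mouse, mouse_vs_human)
print(pairs)
```

In this toy data only H1 and M1 are paired, and H2 is left without a partner. Because the decision rests on pairwise scores alone, such a procedure has no view of the gene family's duplication and loss history, which is the kind of case where examining a phylogenetic tree, as described in the abstract, can help.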