Identification of genes encoding hypothetical proteins in open-reading frame expressed sequence tags from mammalian stages of Trypanosoma cruzi
- 1 January 2011
- journal article
- research article
- Published by Genetics and Molecular Research in Genetics and Molecular Research
- Vol. 10 (3) , 1589-1630
- https://doi.org/10.4238/vol10-3gmr1140
Abstract
Approximately 50% of the predicted protein-coding genes of the Trypanosoma cruzi CL Brener strain are annotated as hypothetical or conserved hypothetical proteins. To further characterize these genes, we generated 1161 open-reading frame expressed sequence tags (ORESTES) from the mammalian stages of the VL10 human strain. Sequence clustering resulted in 435 clusters, consisting of 339 singletons and 96 contigs. Significant matches to the T. cruzi predicted gene database were found for approximately 94% of contigs and approximately 69% of singletons. These included genes encoding surface proteins, known to be intensely expressed in the parasite mammalian stages and implicated in host cell invasion and/or immune evasion mechanisms. Among 151 contigs and singletons with similarity to predicted hypothetical protein-coding genes and conserved hypothetical protein-coding genes, 83% showed no match with T. cruzi EST and/or proteome databases. These ORESTES are the first experimental evidence that the corresponding genes are in fact transcribed. Sequences with no significant match were searched against several T. cruzi and National Center for Biotechnology Information non-redundant sequence databases. The ORESTES analysis indicated that 124 predicted conserved hypothetical protein-coding genes and 27 predicted hypothetical protein-coding genes annotated in the CL Brener genome are transcribed in the VL10 mammalian stages. Six ORESTES annotated as hypothetical protein-coding genes showing no match to EST and/or proteome databases were confirmed by Northern blot in VL10. The generation of this set of ORESTES complements the T. cruzi genome annotation and suggests new stage-regulated genes encoding hypothetical proteins.