Nature and Structure of Human Genes that Generate Retropseudogenes
Open Access
- 1 May 2000
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 10 (5), 672-678
- https://doi.org/10.1101/gr.10.5.672
Abstract
The human genome is estimated to contain 23,000 to 33,000 retropseudogenes. To study the properties of genes giving rise to these retroelements, we compared the structure and expression of genes with or without known retropseudogenes. Four main features have emerged from the analysis of 181 genes associated with retropseudogenes: reverse-transcribed genes are (1) widely expressed, (2) highly conserved, (3) short, and (4) GC-poor. The first two properties probably reflect the fact that genes giving rise to retropseudogenes have to be expressed in the germ line. The latter two suggest that reverse transcription and transposition are more efficient for short, GC-poor mRNAs. In addition, this analysis allowed us to reject previous hypotheses that widely expressed genes are GC-rich. Rather, globally, genes with a wide tissue distribution are GC-poor.