Ψ-Φ: Exploring the outer limits of bacterial pseudogenes
Open Access
- 12 October 2004
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (11) , 2273-2278
- https://doi.org/10.1101/gr.2925604
Abstract
Because bacterial chromosomes are tightly packed with genes and were traditionally viewed as being optimized for size and replication speed, it was not surprising that the early annotations of sequenced bacterial genomes reported few, if any, pseudogenes. But because pseudogenes are generally recognized by comparisons with their functional counterparts, as more genome sequences accumulated, many bacterial pathogens were found to harbor large numbers of truncated, inactivated, and degraded genes. Because the mutational events that inactivate genes occur continuously in all genomes, we investigated whether the rarity of pseudogenes in some bacteria was attributable to properties inherent to the organism or to the failure to recognize pseudogenes. By developing a program suite (called Ψ-Φ, for Ψ-gene Finder) that applies a comparative method to identify pseudogenes (attributable both to misannotation and to nonrecognition), we analyzed the pseudogene inventories in the sequenced members of the Escherichia coli/Shigella clade. This approach recovered hundreds of previously unrecognized pseudogenes and showed that pseudogenes are a regular feature of bacterial genomes, even in those whose original annotations registered no truncated or otherwise inactivated genes. In Shigella flexneri 2a, large proportions of pseudogenes are generated by nonsense mutations and IS element insertions, events that seldom produce the pseudogenes present in the other genomes examined. Almost all (>95%) pseudogenes are restricted to only one of the genomes and are of relatively recent origin, suggesting that these bacteria possess active mechanisms to eliminate nonfunctional genes.Keywords
This publication has 44 references indexed in Scilit:
- Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidiaNature Genetics, 2002
- Where are the pseudogenes in bacterial genomes?Trends in Microbiology, 2001
- Genome sequence of Yersinia pestis, the causative agent of plagueNature, 2001
- Massive gene decay in the leprosy bacillusNature, 2001
- Improved microbial gene identification with GLIMMERNucleic Acids Research, 1999
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Genetic studies of the lac repressorJournal of Molecular Biology, 1990
- Missense mutation in the lacI gene of Escherichia coliJournal of Molecular Biology, 1988