First Report on the Systematic Sequencing of the Small Genome ofEncephalitozoon cuniculi(Protozoa, Microspora): Gene Organization of a 4.3 kbp Region on Chromosome I

Abstract
Belonging to a large group of parasitic amitochondrial protozoans (Microspora), Encephalitozoon cuniculi infects humans and other mammals. Because of its medical importance and small genome size (2.9 Mbp), we are systematically sequencing its smallest (217 kbp) chromosome. The shotgun cloning strategy now has produced the sequence of randomly dispersed contigs representing more than 180 kbp of this chromosome. The present report describes analysis of the 4.3 kbp contig, which includes the complete coding regions of dihydrofolate reductase (DHFR), thymidylate synthase (TS), and serine hydroxymethyl transferase (SHMT) genes and the partial coding region of an aminopeptidase (AP) gene. In contrast to the other reported protozoan genes, DHFR and TS are encoded by two different open reading frames (ORFs). The SHMT gene is the first one identified in a protozoan and corresponds to the cytosolic form of the enzyme. No introns were detected, and the intergenic noncoding regions do not exceed 50 bp. The mean GC content is close to 60%, and there is a G or C third-base codon bias. Transcription and translation initiation signals also are analyzed, and a model for the mRNA-ssu rRNA interactions is proposed.