Comparison of the leader sequences of four group A streptococcal M protein genes

Abstract
The 5'' portions and flanking sequences of genes encoding types 1, 12, 24, and 6 M proteins were compared. Although the DNA sequences encoding the amino-termini of the mature M proteins had no obvious similarity, upstream sequences, and those encoding the signal peptides (leader sequences) of the four M protein genes had considerable similarity. In general, the 5'' ends of all the leader sequences were more conserved than the 3'' ends, although the M6 and M24 leader sequences had identical 3'' ends. Sequence similarity among the deduced amino acid sequences of the four signal peptides was more extensive than the corresponding DNA sequences. We found that strict DNA similarity among all four sequences extended only to the ends of the hydrophilic amino-terminal regions of the signal peptides, but that amino acid sequence conservation continued to the ends of the respective hydrophobic cores. With the exception of the M6 and M24 sequences, the regions adjacent to the signal peptidase cleavage sites were highly variable.