Fine structural analysis of the chicken pro alpha 2 collagen gene.

Abstract
Kilobase pairs [42] of cloned chicken DNA containing 80% of the pro .alpha.2 (type I) collagen gene and 8 kbase pairs of 3'' flanking sequences were isolated. Detailed analysis of these clones indicates that this collagen gene spans .apprx. 40 kbase pairs of DNA and contains on the order of 50 introns. The fine structure of 40% of the pro .alpha.2 gene, including its 3'' end, was determined by Southern blot restriction endonuclease mapping using a 2.6 kbase pair procollagen c[complementary]DNA clone, pCg45, as a probe, and by DNA sequence determination of more than 2 kbase pairs of this part of the genome. Exons in the triple-helical coding region are all multiples of the 9 base pairs coding for the Gly-X-Y triplet and vary in size from 45-108 base pairs. The sequences of all 6 exons in a 3.8 kbase pair EcoRI fragment were determined. One of these, a 249-base pair exon, joins the collagen domains; it codes for the last 15 amino acids of the triple-helical coding region, the telopeptide, and the 1st 53 amino acids of the carboxy-terminal propeptide.