Active Alu Element “A-Tails”: Size Does Matter
Open Access
- 21 August 2002
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (9) , 1333-1344
- https://doi.org/10.1101/gr.384802
Abstract
Long and short interspersed elements (LINEs and SINEs) are retroelements that make up almost half of the human genome. L1 and Alu represent the most prolific human LINE and SINE families, respectively. Only a few Alu elements are able to retropose, and the factors determining their retroposition capacity are poorly understood. The data presented in this paper indicate that the length of Alu “A-tails” is one of the principal factors in determining the retropositional capability of an Alu element. The A stretches of the Alu subfamilies analyzed, both old (Alu S and J) and young (Ya5), had a Poisson distribution of A-tail lengths with a mean size of 21 and 26, respectively. In contrast, the A-tails of very recent Alu insertions (disease causing) were all between 40 and 97 bp in length. The L1 elements analyzed displayed a similar tendency, in which the “disease”-associated elements have much longer A-tails (mean of 77) than do the elements even from the young Ta subfamily (mean of 41). Analysis of the draft sequence of the human genome showed that only about 1000 of the over one million Alu elements have tails of 40 or more adenosine residues in length. The presence of these long A stretches shows a strong bias toward the actively amplifying subfamilies, consistent with their playing a major role in the amplification process. Evaluation of the 19 Alu elements retrieved from the draft sequence of the human genome that are identical to the Alu Ya5a2 insert in the NF1 gene showed that only five have tails with 40 or more adenosine residues. Sequence analysis of the loci with the Alu elements containing the longest A-tails (7 of the 19) from the genomes of the NF1 patient and the father revealed that there are at least two loci with A-tails long enough to serve as source elements within our model. Analysis of the A-tail lengths of 12 Ya5a2 elements in diverse human population groups showed substantial variability in both the Alu A-tail length and sequence homogeneity. On the basis of these observations, a model is presented for the role of A-tail length in determining which Alu elements are active.[The sequence data from this study have been submitted to GenBank under accession nos.AF504933–AF505511.]Keywords
This publication has 79 references indexed in Scilit:
- Alu repeats and human genomic diversityNature Reviews Genetics, 2002
- Molecular Fossils in the Human Genome: Identification and Analysis of the Pseudogenes in Chromosomes 21 and 22Genome Research, 2002
- Molecular dynamics simulations of B′-DNA: sequence effects on A-tract-induced bending and flexibilityJournal of Molecular Biology, 2001
- Genomic Characterization of Recent Human LINE-1 Insertions: Evidence Supporting Random InsertionGenome Research, 2001
- Large-scale analysis of the Alu Ya5 and Yb8 subfamilies and their contribution to human genomic diversityJournal of Molecular Biology, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- cDNAs derived from primary and small cytoplasmic Alu (scAlu) transcriptsJournal of Molecular Biology, 1997
- Characterization of a nondeleterious L1 insertion in an intron of the human factor VIII gene and further evidence of open reading frames in functional L1 elementsGenomics, 1989
- ‘Brain-specific’ transcription and evolution of the identifier sequenceNature, 1986
- Base sequence studies of 300 nucleotide renatured repeated human DNA clonesJournal of Molecular Biology, 1981