L1 family of repetitive DNA sequences in primates may be derived from a sequence encoding a reverse transcriptase-related protein
- 5 June 1986
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 321 (6070) , 625-628
- https://doi.org/10.1038/321625a0
Abstract
Primate and rodent genomes contain a family of highly repetitive, long interspersed sequences, designated the L1 family or LINE-1 (refs 1–4). Characteristic features of the L1 family sequences such as an A-rich stretch at the 3′ end, a truncated 5′ end, the existence of significantly long open reading frames (ORFs)5–9 and the presence of L1 family transcripts in various types of cells, including pluripotential embryonic cells10–15, suggest that the L1 family is derived from a sequence encoding a protein(s) and dispersed in the genome through an RNA-mediated process. These features of the L1 family are believed to be due to reverse transcription beginning at the 3′ end of the L1 transcript and terminating prematurely and to the site duplication caused by the insertion of the complementary DNA (reviewed in refs 3, 4). It is likely that this type of transcript is converted to cDNA and inserted into the chromosome through a process similar to that of the formation of processed pseudogenes16. The above model, however, does not necessarily explain why the L1 family should produce the extraordinarily large number of copies (more than 104 per haploid genome17) seen during evolution. It seems likely that the progenitor of the L1 family itself carries (or carried) a function which promotes the active dispersion of the L1 family sequence. We reasoned that such a function, if present, must be conserved during evolution and may be shown by comparative analysis of L1 family sequences from evolutionary distant species. We show here that the L1 family sequence contains an ORF possessing significant sequence homology to several RNA-dependent DNA polymerases of viral and transposable element origins. This provides a plausible explanation for the preferential and active dispersion of the L1 family sequence during evolution.Keywords
This publication has 44 references indexed in Scilit:
- Making sense out of LINES: long interspersed repeat sequences in mammalian genomesTrends in Biochemical Sciences, 1985
- Sequence analysis of a Kpnl family member near the 3′ end of human β-globin geneNucleic Acids Research, 1985
- A large interspersed repeat found in mouse DNA contains a long open reading frame that evolves as if it encodes a protein.Proceedings of the National Academy of Sciences, 1984
- Rearranged sequences of a human Kpn I element.Proceedings of the National Academy of Sciences, 1984
- Kpn I family of long-dispersed repeated DNA sequences of man: evidence for entry into genomic DNA of DNA copies of poly(A)-terminated Kpn I RNAs.Proceedings of the National Academy of Sciences, 1983
- Discrete and heterogeneous high molecular weight RNAs complementary to a long dispersed repeat family (a possible transposon) of human DNAJournal of Molecular Biology, 1983
- Nucleotide sequence definition of a major human repeated DNA, the Hind III 1.9 kb familyNucleic Acids Research, 1982
- SINEs and LINEs: Highly repeated short and long interspersed sequences in mammalian genomesCell, 1982
- Organization and evolutionary progress of a dispersed repetitive family of sequences in widely separated rodent genomesJournal of Molecular Biology, 1981
- A family of long reiterated DNA sequences, one copy of which is next to the human beta globin geneNucleic Acids Research, 1980