Non‐random distribution of T‐DNA insertions at various levels of the genome hierarchy as revealed by analyzing 13 804 T‐DNA flanking sequences from an enhancer‐trap mutant library
Open Access
- 2 February 2007
- journal article
- Published by Wiley in The Plant Journal
- Vol. 49 (5) , 947-959
- https://doi.org/10.1111/j.1365-313x.2006.03001.x
Abstract
We isolated 13 804 T‐DNA flanking sequence tags (FSTs) from a T‐DNA insertion library of rice. A comprehensive analysis of the 13 804 FSTs revealed a number of features demonstrating a highly non‐random distribution of the T‐DNA insertions in the rice genome: T‐DNA insertions were biased towards large chromosomes, not only in the absolute number of insertions but also in the relative density; within chromosomes the insertions occurred more densely in the distal ends, and less densely in the centromeric regions; the distribution of the T‐DNA insertions was highly correlated with that of full‐length cDNAs, but the correlations were highly heterogeneous among the chromosomes; T‐DNA insertions strongly disfavored transposable element (TE)‐related sequences, but favored genic sequences with a strong bias toward the 5′ upstream and 3′ downstream regions of the genes; T‐DNA insertions preferentially occurred among the various classes of functional genes, such that the numbers of insertions were in excess in certain functional categories but were deficient in other categories. The analysis of DNA sequence compositions around the T‐DNA insertion sites also revealed several prominent features, including an elevated bendability from −200 to 200 bp relative to the insertion sites, an inverse relationship between the GC and TA skews, and reversed GC and TA skews in sequences upstream and downstream of the insertion sites, with both GC and TA skews equal to zero at the insertion sites. It was estimated that 365 380 insertions are needed to saturate the genome with P = 0.95, and that the 45 441 FSTs that have been isolated so far by various groups tagged 14 287 of the 42 653 non‐TE related genes.Keywords
This publication has 51 references indexed in Scilit:
- Analysis of T-DNA insertion site distribution patterns in Arabidopsis thaliana reveals special features of genes without insertionsGenomics, 2006
- Generation of a flanking sequence‐tag database for activation‐tagging lines in japonica riceThe Plant Journal, 2005
- The map-based sequence of the rice genomeNature, 2005
- Agrobacterium T-DNA integration in Arabidopsis is correlated with DNA sequence compositions that occur frequently in gene promoter regionsFunctional & Integrative Genomics, 2005
- Identification of Arabidopsis thaliana transformants without selection reveals a high occurrence of silenced T‐DNA integrationsThe Plant Journal, 2004
- Genome-Wide Insertional Mutagenesis of Arabidopsis thalianaScience, 2003
- T‐DNA insertional mutagenesis for functional genomics in riceThe Plant Journal, 2000
- A Preferred Target DNA Structure for Retroviral Integrasein VitroJournal of Biological Chemistry, 1998
- The structures of integration sites in transgenic riceThe Plant Journal, 1997
- Sequence periodicities in chicken nucleosome core DNAJournal of Molecular Biology, 1986