The Mosaic Structure of Human Pericentromeric DNA: A Strategy for Characterizing Complex Regions of the Human Genome
Open Access
- 1 June 2000
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 10 (6) , 839-852
- https://doi.org/10.1101/gr.10.6.839
Abstract
The pericentromeric regions of human chromosomes pose particular problems for both mapping and sequencing. These difficulties are due, in large part, to the presence of duplicated genomic segments that are distributed among multiple human chromosomes. To ensure contiguity of genomic sequence in these regions, we designed a sequence-based strategy to characterize different pericentromeric regions using a single (162 kb) 2p11 seed sequence as a point of reference. Molecular and cytogenetic techniques were first used to construct a paralogy map that delineated the interchromosomal distribution of duplicated segments throughout the human genome. Monochromosomal hybrid DNAs were PCR amplified by primer pairs designed to the 2p11 reference sequence. The PCR products were directly sequenced and used to develop a catalog of sequence tags for each duplicon for each chromosome. A total of 685 paralogous sequence variants were generated by sequencing 34.7 kb of paralogous pericentromeric sequence. Using PCR products as hybridization probes, we were able to identify 702 human BAC clones, of which a subset, 107 clones, were analyzed at the sequence level. We used diagnostic paralogous sequence variants to assign 65 of these BACs to at least 9 chromosomal pericentromeric regions: 1q12, 2p11, 9p11/q12, 10p11, 14q11, 15q11, 16p11, 17p11, and 22q11. Comparisons with existing sequence and physical maps for the human genome suggest that many of these BACs map to regions of the genome with sequence gaps. Our analysis indicates that large portions of pericentromeric DNA are virtually devoid of unique sequences. Instead, they consist of a mosaic of different genomic segments that have had different propensities for duplication. These biologic properties may be exploited for the rapid characterization of, not only pericentromeric DNA, but also other complex paralogous regions of the human genome. [The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AC002038,AC002307, AF182004-AF182009, AF183323-AF183331, AF183333-AF183337,AF183339-AF183350, AF183352-AF183356, AF183358-AF183362,AF183366-AF183369, AF183371-AF183375, and AF262624–AF262695.]Keywords
This publication has 43 references indexed in Scilit:
- Molecular structure and evolution of an alpha satellite/non-alpha satellite junction at 16p11Human Molecular Genetics, 2000
- Chromosome Breakage in the Prader-Willi and Angelman Syndromes Involves Recombination between Large, Transcribed Repeats at Proximal and Distal BreakpointsAmerican Journal of Human Genetics, 1999
- Sequences flanking the centromere of human chromosome 10 are a complex patchwork of arm-specific sequences, stable duplications and unstable sequences with homologies to telomeric and other centromeric locationsHuman Molecular Genetics, 1999
- The human COX10 gene is disrupted during homologous recombination between the 24 kb proximal and distal CMT1A-REPsHuman Molecular Genetics, 1997
- Emergence and scattering of multiple neurofibromatosis (NF1)-related sequences during hominoid evolution suggest a process of pericentromeric interchromosomal transpositionHuman Molecular Genetics, 1997
- Base composition and gene distribution: critical patterns in mammalian genome organizationTrends in Genetics, 1996
- Duplication of a gene-rich cluster between 16p11.1 and Xq28: a novel pericentromeric-directed mechanism for paralogous genome evolutionHuman Molecular Genetics, 1996
- A 9.75-Mb Map across the Centromere of Human Chromosome 10Genomics, 1996
- Identification of complex chromosome rearrangements in the gibbon by fluorescent in situ hybridization (FISH) of a human chromosome 2q specific microlibrary, yeast artificial chromosomes, and reciprocal chromosome paintingCytogenetic and Genome Research, 1996
- Miropeats: graphical DNA sequence comparisonsBioinformatics, 1995