Mouse BAC Ends Quality Assessment and Sequence Analyses
Open Access
- 20 September 2001
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 11 (10) , 1736-1745
- https://doi.org/10.1101/gr.179201
Abstract
A large-scale BAC end-sequencing project at The Institute for Genomic Research (TIGR) has generated one of the most extensive sets of sequence markers for the mouse genome to date. With a sequencing success rate of >80%, an average read length of 485 bp, and ABI3700 capillary sequencers, we have generated 449,234 nonredundant mouse BAC end sequences (mBESs) with 218 Mb total from 257,318 clones from libraries RPCI-23 and RPCI-24, representing 15× clone coverage, 7% sequence coverage, and a marker every 7 kb across the genome. A total of 191,916 BACs have sequences from both ends providing 12× genome coverage. The average Q20 length is 406 bp and 84% of the bases havephred quality scores ≥ 20. RPCI-24 mBESs have more Q20 bases and longer reads on average than RPCI-23 sequences. ABI3700 sequencers and the sample tracking system ensure that > 95% of mBESs are associated with the right clone identifiers. We have found that a significant fraction of mBESs contains L1 repeats and ∼48% of the clones have both ends with ≥ 100 bp contiguous unique Q20 bases. About 3% mBESs match ESTs and > 70% of matches were conserved between the mouse and the human or the rat. Approximately 0.1% mBESs contain STSs. About 0.2% mBESs match human finished sequences and > 70% of these sequences have EST hits. The analyses indicate that our high-quality mouse BAC end sequences will be a valuable resource to the community.Keywords
This publication has 38 references indexed in Scilit:
- A High-Resolution Radiation Hybrid Map of the Human Genome Draft SequenceScience, 2001
- Integration of cytogenetic landmarks into the draft sequence of the human genomeNature, 2001
- A physical map of the human genomeNature, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Human BAC Ends Quality Assessment and Sequence AnalysesGenomics, 2000
- Shotgun Sequencing of the Human GenomeScience, 1998
- A new strategy for genome sequencingNature, 1996
- A comprehensive genetic map of the mouse genomeNature, 1996
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995
- Basic local alignment search toolJournal of Molecular Biology, 1990