The completion of the Mammalian Gene Collection (MGC)
- 18 September 2009
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (12) , 2324-2333
- https://doi.org/10.1101/gr.095976.109
Abstract
Since its start, the Mammalian Gene Collection (MGC) has sought to provide at least one full-protein-coding sequence cDNA clone for every human and mouse gene with a RefSeq transcript, and at least 6200 rat genes. The MGC cloning effort initially relied on random expressed sequence tag screening of cDNA libraries. Here, we summarize our recent progress using directed RT-PCR cloning and DNA synthesis. The MGC now contains clones with the entire protein-coding sequence for 92% of human and 89% of mouse genes with curated RefSeq (NM-accession) transcripts, and for 97% of human and 96% of mouse genes with curated RefSeq transcripts that have one or more PubMed publications, in addition to clones for more than 6300 rat genes. These high-quality MGC clones and their sequences are accessible without restriction to researchers worldwide.