HGVbase: a human sequence variation database emphasizing data quality and a broad spectrum of data sources
- 1 January 2002
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (1) , 387-391
- https://doi.org/10.1093/nar/30.1.387
Abstract
HGVbase (Human Genome Variation database; http://hgvbase.cgb.ki.se, formerly known as HGBASE) is an academic effort to provide a high quality and non-redundant database of available genomic variation data of all types, mostly comprising single nucleotide polymorphisms (SNPs). Records include neutral polymorphisms as well as disease-related mutations. Online search tools facilitate data interrogation by sequence similarity and keyword queries, and searching by genome coordinates is now being implemented. Downloads are freely available in XML, Fasta, SRS, SQL and tagged-text file formats. Each entry is presented in the context of its surrounding sequence and many records are related to neighboring human genes and affected features therein. Population allele frequencies are included wherever available. Thorough semi-automated data checking ensures internal consistency and addresses common errors in the source information. To keep pace with recent growth in the field, we have developed tools for fully automated annotation. All variants have been uniquely mapped to the draft genome sequence and are referenced to positions in EMBL/GenBank files. Data utility is enhanced by provision of genotyping assays and functional predictions. Recent data structure extensions allow the capture of haplotype and genotype information, and a new initiative (along with BiSC and HUGO-MDI) aims to create a central repository for the broad collection of clinical mutations and associated disease phenotypes of interest.Keywords
This publication has 5 references indexed in Scilit:
- Robust and Accurate Single Nucleotide Polymorphism Genotyping by Dynamic Allele-Specific Hybridization (DASH): Design Criteria and Assay ValidationGenome Research, 2001
- dbSNP: the NCBI database of genetic variationNucleic Acids Research, 2001
- Flexible Sequence Similarity Searching with the FASTA3 Program PackagePublished by Springer Nature ,1999
- The HUGO Mutation Database InitiativeScience, 1998
- [8] SRS: Information retrieval system for molecular biology data banksPublished by Elsevier ,1996