The BioSample Database (BioSD) at the European Bioinformatics Institute
Open Access
- 16 November 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 40 (D1), D64-D70
- https://doi.org/10.1093/nar/gkr937
Abstract
The BioSample Database (http://www.ebi.ac.uk/biosamples) is a new database at EBI that stores information about biological samples used in molecular experiments, such as sequencing, gene expression or proteomics. The goals of the BioSample Database include: (i) recording and linking sample information consistently within EBI databases such as ENA, ArrayExpress and PRIDE; (ii) minimizing data entry effort for EBI database submitters by allowing sample descriptions to be submitted once and referenced later in data submissions to assay databases; and (iii) supporting cross-database queries by sample characteristics. Each sample in the database is assigned an accession number. The database includes a growing set of reference samples, such as cell lines, which are used repeatedly in experiments and can be easily referenced from any database by their accession numbers. Accession numbers for the reference samples will be exchanged with a similar database at NCBI. The samples in the database can be queried by their attributes, such as sample types, disease names or sample providers. A simple tab-delimited format facilitates submission of sample information to the database, initially via email to biosamples@ebi.ac.uk.
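The abstract names two mechanisms: a tab-delimited submission format and queries by sample attribute. The sketch below illustrates both ideas in Python. It is only an illustration under stated assumptions: the column names (`accession`, `sample_type`, `disease`, `provider`), the accession strings and the helper functions `load_samples` and `query` are all hypothetical and do not reproduce the actual BioSD submission format or query interface.

```python
import csv
import io

# Hypothetical tab-delimited sample sheet. The column names and accession
# strings are placeholders, not the real BioSD submission format.
SAMPLE_SHEET = (
    "accession\tsample_type\tdisease\tprovider\n"
    "SAMPLE-0001\tcell line\tcervical carcinoma\tProvider A\n"
    "SAMPLE-0002\ttissue\tnone\tProvider B\n"
    "SAMPLE-0003\tcell line\tnone\tProvider A\n"
)


def load_samples(text):
    """Parse tab-delimited text into a dict of rows keyed by accession."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    return {row["accession"]: row for row in reader}


def query(samples, **criteria):
    """Return sorted accessions of samples matching every given attribute."""
    return sorted(
        accession
        for accession, attributes in samples.items()
        if all(attributes.get(key) == value for key, value in criteria.items())
    )


samples = load_samples(SAMPLE_SHEET)

# Attribute query: all samples whose type is "cell line".
cell_lines = query(samples, sample_type="cell line")

# Combined query: samples from one provider with no recorded disease.
healthy_from_a = query(samples, provider="Provider A", disease="none")
```

Keying the parsed rows by accession mirrors the idea that a sample is described once and then referenced by its accession number from other records.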