An XML standard for the dissemination of annotated 2D gel electrophoresis data complemented with mass spectrometry results
Open Access
- 29 January 2004
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 5 (1) , 9
- https://doi.org/10.1186/1471-2105-5-9
Abstract
Many proteomics initiatives require a seamless bioinformatics integration of a range of analytical steps between sample collection and systems modeling immediately assessable to the participants involved in the process. Proteomics profiling by 2D gel electrophoresis to the putative identification of differentially expressed proteins by comparison of mass spectrometry results with reference databases, includes many components of sample processing, not just analysis and interpretation, are regularly revisited and updated. In order for such updates and dissemination of data, a suitable data structure is needed. However, there are no such data structures currently available for the storing of data for multiple gels generated through a single proteomic experiments in a single XML file. This paper proposes a data structure based on XML standards to fill the void that exists between data generated by proteomics experiments and storing of data. In order to address the resulting procedural fluidity we have adopted and implemented a data model centered on the concept of annotated gel (AG) as the format for delivery and management of 2D Gel electrophoresis results. An eXtensible Markup Language (XML) schema is proposed to manage, analyze and disseminate annotated 2D Gel electrophoresis results. The structure of AG objects is formally represented using XML, resulting in the definition of the AGML syntax presented here. The proposed schema accommodates data on the electrophoresis results as well as the mass-spectrometry analysis of selected gel spots. A web-based software library is being developed to handle data storage, analysis and graphic representation. Computational tools described will be made available at http://bioinformatics.musc.edu/agml . Our development of AGML provides a simple data structure for storing 2D gel electrophoresis data.Keywords
This publication has 17 references indexed in Scilit:
- A systematic approach to modeling, capturing, and disseminating proteomics experimental dataNature Biotechnology, 2003
- Constellations in a cellular universeNature, 2003
- From genomics to proteomicsNature, 2003
- proteomicsNature, 2003
- Biological data integration: wrapping data and toolsIEEE Transactions on Information Technology in Biomedicine, 2002
- Standards for modelingNature Biotechnology, 2002
- ProML--the protein markup language for specification of protein sequences, structures and families.2002
- Can we integrate bioinformatics data on the Internet?Trends in Biotechnology, 2001
- 2D or not 2DCurrent Opinion in Chemical Biology, 2001
- Simultaneous modelling of metabolic, genetic and product-interaction networksBriefings in Bioinformatics, 2001