RNAML: A standard syntax for exchanging RNA information
- 1 June 2002
- journal article
- syntax proposal
- Published by Cold Spring Harbor Laboratory in RNA
- Vol. 8 (6) , 707-717
- https://doi.org/10.1017/s1355838202028017
Abstract
Analyzing a single data set using multiple RNA informatics programs often requires a file format conversion between each pair of programs, significantly hampering productivity. To facilitate the interoperation of these programs, we propose a syntax to exchange basic RNA molecular information. This RNAML syntax allows for the storage and the exchange of information about RNA sequence and secondary and tertiary structures. The syntax permits the description of higher level information about the data including, but not restricted to, base pairs, base triples, and pseudoknots. A class-oriented approach allows us to represent data common to a given set of RNA molecules, such as a sequence alignment and a consensus secondary structure. Documentation about experiments and computations, as well as references to journals and external databases, are included in the syntax. The chief challenge in creating such a syntax was to determine the appropriate scope of usage and to ensure extensibility as new needs will arise. The syntax complies with the eXtensible Markup Language (XML) recommendations, a widely accepted standard for syntax specifications. In addition to the various generic packages that exist to read and interpret XML formats, an XML processor was developed and put in the open-source MC-Core library for nucleic acid and protein structure computer manipulation.Keywords
This publication has 14 references indexed in Scilit:
- XML, bioinformatics and data integrationBioinformatics, 2001
- [28] Computational modeling of structural experimental dataPublished by Elsevier ,2000
- The Protein Data BankNucleic Acids Research, 2000
- Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structureJournal of Molecular Biology, 1999
- The Biopolymer Markup Language.Bioinformatics, 1999
- The Ribonuclease P DatabaseNucleic Acids Research, 1999
- MANIP: an interactive tool for modelling RNAJournal of Molecular Graphics and Modelling, 1998
- Summary: the modified nucleosides of RNANucleic Acids Research, 1994
- The nucleic acid database. A comprehensive relational database of three-dimensional structures of nucleic acidsBiophysical Journal, 1992
- The Combination of Symbolic and Numerical Computation or Three-Dimensional Modeling of RNAScience, 1991