The Common Data Elements for Cancer Research: Remarks on Functions and Structure
- 1 January 2006
- journal article
- research article
- Published by Georg Thieme Verlag KG in Methods of Information in Medicine
- Vol. 45 (06) , 594-601
- https://doi.org/10.1055/s-0038-1634121
Abstract
Objectives: The National Cancer Institute (NCI) has developed the Common Data Elements (CDE) to serve as a controlled vocabulary of data descriptors for cancer research, to facilitate data interchange and inter-oper-ability between cancer research centers. We evaluated CDE’s structure to see whether it could represent the elements necessary to support its intended purpose, and whether it could prevent errors and inconsistencies from being accidentally introduced. We also performed automated checks for certain types of content errors that provided a rough measure of curation quality. Methods: Evaluation was performed on CDE content downloaded via the NCI’s CDE Browser, and transformed into relational database form. Evaluation was performed under three categories: 1) compatibility with the ISO/IEC 11179 metadata model, on which CDE structure is based, 2) features necessary for controlled vocabulary support, and 3) support for a stated NCI goal, set up of data collection forms for cancer research. Results: Various limitations were identified both with respect to content (inconsistency, insufficient definition of elements, redundancy) as well as structure – particularly the need for term and relationship support, as well as the need for metadata supporting the explicit representation of electronic forms that utilize sets of common data elements. Conclusions: While there are numerous positive aspects to the CDE effort, there is considerable opportunity for improvement. Our recommendations include review of existing content by diverse experts in the cancer community; integration with the NCI thesaurus to take advantage of the latter’s links to nationally used controlled vocabularies, and various schema enhancements required for electronic form support.Keywords
This publication has 20 references indexed in Scilit:
- NCI Thesaurus: using science-based terminology to integrate cancer research results.2004
- Terminology access methods leveraging LDAP resources.2004
- Screening for depression in head and neck cancerPsycho‐Oncology, 2003
- The Open Terminology Services (OTS) project.2003
- The Common Data Element Dictionary - a standard nomenclature for the reporting of Phase 3 cancer clinical trial dataPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Extending the LOINC Conceptual Schema to Support Standardized Assessment InstrumentsJournal of the American Medical Informatics Association, 2002
- Reengineering a Database for Clinical Trials Management: Lessons for System ArchitectsControlled Clinical Trials, 2000
- WebEAV: Automatic Metadata-driven Generation of Web Interfaces to Entity-Attribute-Value DatabasesJournal of the American Medical Informatics Association, 2000
- Managing Attribute-Value Clinical Trials Data Using the ACT/DB Client-Server Database SystemJournal of the American Medical Informatics Association, 1998
- The GRAIL concept modelling language for medical terminologyArtificial Intelligence in Medicine, 1997