Documenting Research Data
Overview of medical research documentation
Medical and health research have distinct requirements for documentation, ranging from standardized metadata for device interoperability to protocols and standard operating procedures for clinical trials. Good documentation facilitates collaboration among a broad range of specialized roles, translational research for applying results to treatments, and compliance with regulations for protecting patients. These resources are for anyone conducting or supporting research in clinical, biomedical, and public health research, emphasizing planning for sharing data with collaborators and biomedical communities. They accompany our online module on documenting medical research data.
Medical metadata standards
Metadata is information associated with datasets and data files that provides context. See our Metadata and Metadata Standards module.
Metadata standards examples and resources:
- A National Library of Medicine community curated set of metadata standards, research protocols, and standardized assessments in a range of biomedical fields and topics.
A global nonprofit developing data collection standards and protocols for clinical research, pharmogenomics, Standards areas include protocol representation, clinical data acquisition, and pharmcogenomics/genetics testing and description. You must create a free account to access the standards.
- A NIAD project developing standardized human pathogen and vector sequencing metadata to support epidemiologic and genotype-phenotype association studies for human infectious diseases.
- Specialized standards and tools for functional MRI (fMRI) supported by NIMH.
- An international non-profit for developing ANSI-accredited standards for Electronic Health Systems
- Focuses on international common standards for medical terms, including human and machine-readable concepts, descriptions, and relationships
- Healthcare and industry standards for better integrating healthcare computing standards among other standard like DICOM and HCL
- Brain Imaging Data Structure (BIDS) presents a standardized way of organizing neuroimaging and behavioral data that can help with overall documentation. BIDS is compatible with the OpenNeuro data repositor
- CEDAR provides metadata templates, which define the data elements needed to describe particular types of biomedical experiments. The templates include controlled terms and synonyms for specific data elements. CEDAR uses a library of such templates to help scientists submit annotated datasets to appropriate online data repositories.
Types of medical research documentation
Controlled vocabularies: lists of predefined terms, Some authorized and maintained by a community (see metadata standards examples) or developed internally by a research group. Medical research and biomedical professional communities may employ controlled vocabulary standards such as:
- CMS coding and other coding and billing standards
- NLM’s Unified Medical Language System that consolidates several health and biomedical vocabularies and standards to enable interoperability between computer systems.
Ontologies: a variety of controlled vocabulary that defines components and describes relationships among components. Most are used for interoperability among databases, some using (Web Ontology Language (OWL) or Resource Description Frameworks (RDF)). Here are some example of ontologies used in biomedical research:
- Bioportal: the portal for the U.S. National Center for Biomedical Ontology, hosted at Stanford.
- Gene Ontology: a bioinformatics initiative that aims to standardize the representation of gene and gene product attributes across species and databases.
- Medical Subject Headings (MeSH): a controlled vocabulary used for indexing articles for PubMed.
- Chemical Entities of Biological Interest (ChEBI): an ontology of small chemical compounds.
- Microarray Gene Expression Society Ontology (MGED): an ontology designed to describe microarray experiments.