Quality evaluation of cancer study Common Data Elements using the UMLS Semantic Network

Guoqian Jiang, Harold R. Solbrig, Christopher G. Chute

Research output: Contribution to journal (article, peer-reviewed)

6 Scopus citations


The binding of controlled terminology has been regarded as important for the standardization of Common Data Elements (CDEs) in cancer research. However, the potential of such binding has not yet been fully explored, especially with respect to quality assurance. The objective of this study is to explore whether there is a relationship between terminological annotations and the UMLS Semantic Network (SN) that can be exploited to improve those annotations. We profiled the terminological concepts associated with the standard structure of the CDEs in the NCI Cancer Data Standards Repository (caDSR) using the UMLS SN. We processed 17,798 data elements and extracted 17,526 primary object class/property concept pairs. We identified dominant semantic types for the categories "object class" and "property" and determined that, for the preponderance of instances, the two categories were disjoint (i.e., the intersection of their semantic types is empty). We then performed a preliminary evaluation of the data elements whose asserted primary object class/property concept pairs conflict with this observation, that is, where the semantic type of the object class fell into an SN category typically used by properties or vice versa. In conclusion, the UMLS SN-based profiling approach is feasible for the quality assurance and accessibility of the cancer study CDEs. This approach could provide useful insight into how to build quality assurance mechanisms in a metadata repository.
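The profiling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The sample pairs, the function names (`profile`, `dominant`, `flag_conflicts`), and the 80% coverage threshold are all assumptions made for illustration. The semantic type names are real UMLS SN types, but their assignment here is invented and does not reflect caDSR data or the study's findings.

```python
from collections import Counter

# Hypothetical sample: each entry is (semantic types of the object class concept,
# semantic types of the property concept). Invented data, not taken from caDSR.
PAIRS = [
    ({"Disease or Syndrome"}, {"Qualitative Concept"}),
    ({"Disease or Syndrome"}, {"Temporal Concept"}),
    ({"Therapeutic or Preventive Procedure"}, {"Temporal Concept"}),
    ({"Disease or Syndrome"}, {"Qualitative Concept"}),
    ({"Therapeutic or Preventive Procedure"}, {"Qualitative Concept"}),
    ({"Temporal Concept"}, {"Disease or Syndrome"}),  # roles look swapped
]


def profile(pairs):
    """Count how often each semantic type occurs in each category."""
    oc_counts, prop_counts = Counter(), Counter()
    for oc_types, prop_types in pairs:
        oc_counts.update(oc_types)
        prop_counts.update(prop_types)
    return oc_counts, prop_counts


def dominant(counts, share=0.8):
    """Most frequent types that together cover `share` of all instances.

    The 0.8 threshold is an assumed cut-off, not a value from the study.
    """
    total = sum(counts.values())
    chosen, covered = set(), 0
    for sem_type, n in counts.most_common():
        if covered >= share * total:
            break
        chosen.add(sem_type)
        covered += n
    return chosen


def flag_conflicts(pairs, oc_dominant, prop_dominant):
    """Indices of pairs whose types fall into the other category's dominant set."""
    oc_only = oc_dominant - prop_dominant
    prop_only = prop_dominant - oc_dominant
    return [
        i
        for i, (oc_types, prop_types) in enumerate(pairs)
        if (oc_types & prop_only) or (prop_types & oc_only)
    ]


oc_counts, prop_counts = profile(PAIRS)
oc_dom = dominant(oc_counts)
prop_dom = dominant(prop_counts)
conflicts = flag_conflicts(PAIRS, oc_dom, prop_dom)
print("disjoint dominant types:", oc_dom.isdisjoint(prop_dom))
print("pairs to review:", conflicts)
```

In this toy sample the dominant type sets of the two categories do not overlap, and only the last pair is flagged for review. A real run would need the semantic type assignments for each concept taken from the UMLS rather than hand-written sets.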

Original language: English (US)
Pages (from-to): S78-S85
Journal: Journal of Biomedical Informatics
Issue number: SUPPL. 1
State: Published - Dec 2011
Externally published: Yes


  • Cancer study
  • Common Data Elements
  • Quality assurance
  • Semantic Network

ASJC Scopus subject areas

  • Health Informatics
  • Computer Science Applications

