UMLS

Unified Medical Language System

Dataset Information
Modalities
Texts, Graphs
Languages
English
Introduced
2013
License
Unknown
Homepage

Overview

The Unified Medical Language System (UMLS) is a comprehensive resource that integrates and disseminates essential terminology, classification standards, and coding systems. Its purpose is to foster the creation of more effective and interoperable biomedical information systems and services, including electronic health records. Here are the key aspects of the UMLS:

  1. Integration of Vocabularies: The UMLS brings together various health and biomedical vocabularies and standards. It acts as a bridge, enabling interoperability between different computer systems by harmonizing terminology.

  2. Components:

    • Metathesaurus: This component contains terms and codes from multiple vocabularies, including CPT, ICD-10-CM, LOINC, MeSH, RxNorm, and SNOMED CT. It provides hierarchies, definitions, relationships, and other attributes.
    • Semantic Network: The Semantic Network defines broad categories (semantic types) and their relationships (semantic relations).
    • SPECIALIST Lexicon and Lexical Tools: These tools include a large syntactic lexicon for biomedical and general English. They assist in normalizing strings, generating lexical variants, and creating indexes.
  3. Use Cases:

    • Clinical Practice: Link terms and codes across healthcare entities (doctors, pharmacies, insurance companies).
    • Patient Care Coordination: Facilitate communication among hospital departments.
    • Text Processing: Extract concepts, relationships, or knowledge from medical texts.
    • Terminology Mapping: Map between different terminologies.
    • Local Terminology Development: Create and maintain local terminologies.
    • Research: Investigate terminologies or ontologies.
  4. Accessing the UMLS:

    • Web Browsers: You can search and explore UMLS data using UTS applications like the Metathesaurus Browser (retrieve concept information) and the Semantic Network Browser (view semantic types and relations).
    • Local Installation: Download UMLS files and use the MetamorphoSys tool to customize the UMLS for your specific needs. Load the customized data into your own database system or browse it using the MetamorphoSys RRF browser.

Source: Conversation with Bing, 3/18/2024
(1) Unified Medical Language System (UMLS) - National Library of Medicine. https://www.nlm.nih.gov/research/umls/index.html.
(2) UMLS Metathesaurus Browser. https://uts.nlm.nih.gov/uts/umls/home.
(3) GitHub - dongwookim-ml/kg-data: knowledge-graph datasets. https://github.com/dongwookim-ml/kg-data.
(4) UMLS Dataset | Papers With Code. https://paperswithcode.com/dataset/umls.

Variants: UMLS

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Link Prediction KGLM KGLM: Integrating Knowledge Graph Structure … 2022-11-04
Link Prediction PALT PALT: Parameter-Lite Transfer of Language … 2022-10-25
Link Prediction LASS Joint Language Semantic and Structure … 2022-09-19
Link Prediction LP-BERT Multi-task Pre-training Language Model for … 2022-01-13
Link Prediction StAR Structure-Augmented Text Representation Learning for … 2020-04-30
Link Prediction KG-BERT KG-BERT: BERT for Knowledge Graph … 2019-09-07
Link Prediction ConvE Convolutional 2D Knowledge Graph Embeddings 2017-07-05
Link Prediction ComplEx Complex Embeddings for Simple Link … 2016-06-20
Link Prediction DistMult Embedding Entities and Relations for … 2014-12-20

Research Papers

Recent papers with results on this dataset: