CAMELYON16

Cancer Metastases in Lymph Nodes Challenge 2016

Dataset Information
Modalities
Images, Medical
Introduced
2017
License
Unknown
Homepage

Overview

The dataset consists of 400 whole-slide images (WSIs) of lymph node sections stained with hematoxylin and eosin (H&E), collected from two medical centers in the Netherlands. The WSIs are stored in a multi-resolution pyramid format, allowing for efficient retrieval of image subregions at different magnification levels. The training set includes two subsets:

  • 170 WSIs (100 normal, 70 with metastases) from Radboud University Medical Center
  • 100 WSIs (60 normal, 40 with metastases) from University Medical Center Utrecht

The test set consists of 130 WSIs from both institutions. Ground truth data for metastases is provided as XML files with annotated contours and WSI binary masks.

The Camelyon16 dataset aims to reduce the workload and subjectivity in cancer diagnosis by pathologists. It serves as a benchmark for evaluating algorithms that can automatically detect metastases in histopathological images, focusing on breast cancer in sentinel lymph nodes.

Researchers can develop and refine machine learning models for automated detection of metastases. The dataset allows for performance comparisons of different detection algorithms. Automated systems can be integrated into clinical workflows to enhance diagnostic accuracy and efficiency. The dataset is valuable for training medical professionals in digital pathology and AI applications in diagnostics.

Variants: CAMELYON16

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Multiple Instance Learning Snuffy (MAE Adapter) Snuffy: Efficient Whole Slide Image … 2024-08-15
Multiple Instance Learning Snuffy (SimCLR Exhaustive) Snuffy: Efficient Whole Slide Image … 2024-08-15
Multiple Instance Learning Snuffy (DINO Exhaustive) Snuffy: Efficient Whole Slide Image … 2024-08-15
Multiple Instance Learning CAMIL CAMIL: Context-Aware Multiple Instance Learning … 2023-05-09
Multiple Instance Learning CAMIL (CAMIL-L) CAMIL: Context-Aware Multiple Instance Learning … 2023-05-09
Multiple Instance Learning CAMIL (CAMIL-G) CAMIL: Context-Aware Multiple Instance Learning … 2023-05-09
Multiple Instance Learning DGMIL DGMIL: Distribution Guided Multiple Instance … 2022-06-17
Multiple Instance Learning DTFD-MIL (MAS) DTFD-MIL: Double-Tier Feature Distillation Multiple … 2022-03-22
Multiple Instance Learning DTFD-MIL (MaxMinS) DTFD-MIL: Double-Tier Feature Distillation Multiple … 2022-03-22
Multiple Instance Learning DTFD-MIL (MaxS) DTFD-MIL: Double-Tier Feature Distillation Multiple … 2022-03-22
Multiple Instance Learning DTFD-MIL (AFS) DTFD-MIL: Double-Tier Feature Distillation Multiple … 2022-03-22
Multiple Instance Learning TransMIL TransMIL: Transformer based Correlated Multiple … 2021-06-02
Multiple Instance Learning DSMIL-LC Dual-stream Multiple Instance Learning Network … 2020-11-17
Multiple Instance Learning DSMIL Dual-stream Multiple Instance Learning Network … 2020-11-17

Research Papers

Recent papers with results on this dataset: