PubMedQA

Dataset Information
Modalities: Texts
Introduced: 2019

Overview

PubMedQA is a biomedical question answering dataset built from PubMed abstracts. The task is to answer a research question with yes, no, or maybe (e.g., "Do preoperative statins reduce atrial fibrillation after coronary artery bypass grafting?") using the corresponding abstract.
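
As a rough illustration of the task format, the sketch below models one instance as a question, an abstract, and one of the three labels, and scores predictions by plain accuracy. The names QAInstance, LABELS, and accuracy are hypothetical, chosen for this example only; they are not taken from the dataset's official release or evaluation code, and the label lists in the usage lines are toy values, not real dataset entries.

```python
from dataclasses import dataclass
from typing import Sequence

# The three answer classes named in the task description.
LABELS = ("yes", "no", "maybe")


@dataclass(frozen=True)
class QAInstance:
    """Hypothetical container for one question/abstract/label triple."""

    question: str
    abstract: str  # the context the answer has to be drawn from
    label: str     # must be one of LABELS

    def __post_init__(self) -> None:
        if self.label not in LABELS:
            raise ValueError(f"label must be one of {LABELS}, got {self.label!r}")


def accuracy(gold: Sequence[str], predicted: Sequence[str]) -> float:
    """Fraction of predictions that exactly match the gold labels."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(g == p for g, p in zip(gold, predicted))
    return correct / len(gold)


# Toy labels for illustration only.
toy_gold = ["yes", "no", "maybe", "yes"]
toy_pred = ["yes", "no", "yes", "yes"]
print(accuracy(toy_gold, toy_pred))  # 3 of 4 match -> 0.75
```

Treating the task as three-way classification keeps scoring simple; other metrics can be computed from the same pair of label lists.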

PubMedQA has 1k expert-labeled, 61.2k unlabeled, and 211.3k artificially generated QA instances.
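
To put the three subset sizes in proportion, the short sketch below adds them up and prints each subset's share of the total. It assumes the rounded figures quoted above (1k, 61.2k, 211.3k) as inputs, so the results are approximate; the dictionary keys are descriptive labels for this example, not official split names.

```python
# Rounded subset sizes as quoted in the overview (approximate counts).
SUBSET_SIZES = {
    "expert-labeled": 1_000,
    "unlabeled": 61_200,
    "artificially generated": 211_300,
}

total = sum(SUBSET_SIZES.values())
shares = {name: size / total for name, size in SUBSET_SIZES.items()}

for name, size in SUBSET_SIZES.items():
    print(f"{name:>24}: {size:>8,} ({shares[name]:.1%})")
print(f"{'total':>24}: {total:>8,}")
```

On these rounded figures the expert-labeled subset is well under one percent of all instances.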

Source: PubMedQA
Image Source: https://arxiv.org/pdf/1909.06146v1.pdf

Variants: PubMedQA

Associated Benchmarks

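This dataset is used in 3 benchmarks. The submissions listed below span three tasks: Few-Shot Learning, Question Answering, and Retrieval.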

Recent Benchmark Submissions

Task | Model | Paper | Date
Few-Shot Learning | MetaGen Blended RAG (zero-shot) | MetaGen Blended RAG: Higher Accuracy … | 2025-05-23
Question Answering | MetaGen Blended RAG (zero-shot) | MetaGen Blended RAG: Higher Accuracy … | 2025-05-23
Retrieval | MetaGen Blended RAG | MetaGen Blended RAG: Higher Accuracy … | 2025-05-23
Question Answering | RankRAG-llama3-70B (Zero-Shot) | RankRAG: Unifying Context Ranking with … | 2024-07-02
Question Answering | MediSwift-XL | MediSwift: Efficient Sparse Pre-trained Biomedical … | 2024-03-01
Question Answering | Meditron-70B (CoT + SC) | MEDITRON-70B: Scaling Medical Pretraining for … | 2023-11-27
Question Answering | BioMedGPT-10B | BioMedGPT: Open Multimodal Generative Pre-trained … | 2023-08-18
Question Answering | CoT-T5-11B (1024 Shot) | The CoT Collection: Improving Zero-shot … | 2023-05-23
Few-Shot Learning | CoT-T5-11B (1024 Shot) | The CoT Collection: Improving Zero-shot … | 2023-05-23
Question Answering | Med-PaLM 2 (ER) | Towards Expert-Level Medical Question Answering … | 2023-05-16
Question Answering | Med-PaLM 2 (5-shot) | Towards Expert-Level Medical Question Answering … | 2023-05-16
Question Answering | Med-PaLM 2 (CoT + SC) | Towards Expert-Level Medical Question Answering … | 2023-05-16
Question Answering | PaLM (540B, Few-shot) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | Flan-PaLM (540B, Few-shot) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | Flan-PaLM (62B, Few-shot) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | Flan-PaLM (540B, SC) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | Flan-PaLM (8B, Few-shot) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | PaLM (62B, Few-shot) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | PaLM (8B, Few-shot) | Large Language Models Encode Clinical … | 2022-12-26
Question Answering | OPT (zero-shot) | Galactica: A Large Language Model … | 2022-11-16

Research Papers

Recent papers with results on this dataset: