ML Research Wiki / Benchmarks / Few-Shot Learning / MedConceptsQA

MedConceptsQA

Few-Shot Learning Benchmark

Performance Over Time

📊 Showing 12 results | 📏 Metric: Accuracy

Top Performing Models

Rank Model Paper Accuracy Date Code
1 gpt-4-0125-preview GPT-4 Technical Report 61.91 2023-03-15 📦 openai/evals 📦 shmsw25/factscore 📦 unispac/visual-adversarial-examples-jailbreak-large-language-models
2 gpt-3.5-turbo Language Models are Few-Shot Learners 41.48 2020-05-28 📦 ggml-org/llama.cpp 📦 ggerganov/llama.cpp 📦 karpathy/llm.c
3 meta-llama/Meta-Llama-3-8B-Instruct LLaMA: Open and Efficient Foundation Language Models 25.65 2023-02-27 📦 huggingface/transformers 📦 ggml-org/llama.cpp 📦 ggerganov/llama.cpp
4 johnsnowlabs/JSL-MedMNX-7B MedConceptsQA: Open Source Medical Concepts QA Benchmark 25.63 2024-05-12 📦 nadavlab/MedConceptsQA
5 yikuan8/Clinical-Longformer Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences 25.55 2022-01-27 📦 luoyuanlab/clinical-longformer
6 dmis-lab/biobert-v1.1 BioBERT: a pre-trained biomedical language representation model for biomedical text mining 25.46 2019-01-25 📦 dmis-lab/biobert 📦 EmilyAlsentzer/clinicalBERT 📦 naver/biobert-pretrained
7 epfl-llm/meditron-70b MEDITRON-70B: Scaling Medical Pretraining for Large Language Models 25.26 2023-11-27 📦 epfllm/meditron
8 BioMistral/BioMistral-7B-DARE BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains 25.06 2024-02-15 📦 biomistral/biomistral
9 HuggingFaceH4/zephyr-7b-beta Zephyr: Direct Distillation of LM Alignment 25.06 2023-10-25 📦 huggingface/alignment-handbook 📦 Savannah120/alignment-handbook-PoFT
10 dmis-lab/meerkat-7b-v1.0 Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks 24.94 2024-03-30 -

All Papers (12)