Metric: Accuracy | Showing 4 results
Rank | Model | Paper | Accuracy (%) | Date | Code |
---|---|---|---|---|---|
1 | MedVInT | PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering | 42.30 | 2023-05-17 | 📦 xiaoman-zhang/PMC-VQA 📦 zihanzhaosjtu/librisqa |
2 | Open-Flamingo | Flamingo: a Visual Language Model for Few-Shot Learning | 26.40 | 2022-04-29 | 📦 mlfoundations/open_flamingo 📦 lucidrains/flamingo-pytorch 📦 unispac/visual-adversarial-examples-jailbreak-large-language-models 📦 doc-doc/NExT-OE 📦 happen2me/cross-gnn |
3 | PMC-CLIP | PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents | 24.70 | 2023-03-13 | 📦 WeixiongLin/PMC-CLIP 📦 mbzuai-oryx/unimed-clip |
4 | BLIP-2 | BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | 24.30 | 2023-01-30 | 📦 huggingface/transformers 📦 salesforce/lavis 📦 thudm/visualglm-6b |