Model | Paper | In-domain | Date
---|---|---|---
GPT-3 175B (few-shot, k=32) | Language Models are Few-Shot Learners | 85.00 | 2020-05-28
BERT Large Augmented (single model) | BERT: Pre-training of Deep Bidirectional Transfor… | 81.10 | 2018-10-11
SDNet (ensemble) | SDNet: Contextualized Attention-based Deep Networ… | 79.30 | 2018-12-10
BERT-base finetune (single model) | BERT: Pre-training of Deep Bidirectional Transfor… | 78.10 | 2018-10-11
SDNet (single model) | SDNet: Contextualized Attention-based Deep Networ… | 76.60 | 2018-12-10
FlowQA (single model) | FlowQA: Grasping Flow in History for Conversation… | 75.00 | 2018-10-06
BiDAF++ (single model) | A Qualitative Comparison of CoQA, SQuAD 2.0 and Q… | 67.80 | 2018-09-27
DrQA + seq2seq with copy attention (single model) | CoQA: A Conversational Question Answering Challen… | 65.10 | 2018-08-21
Vanilla DrQA (single model) | CoQA: A Conversational Question Answering Challen… | 52.60 | 2018-08-21