📊 Showing 2 results | 📏 Metric: Accuracy
Rank | Model | Paper | Accuracy | Date | Code |
---|---|---|---|---|---|
1 | Vicuna13B v1.1 | This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models | 95.70 | 2023-10-24 | 📦 hitz-zentroa/this-is-not-a-dataset |
2 | Flan-T5-xxl | This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models | 94.10 | 2023-10-24 | 📦 hitz-zentroa/this-is-not-a-dataset |