AlpacaEval

Dataset Information
Modalities
Texts
Languages
English
Introduced
2023
License
Unknown
Homepage

Overview

The AlpacaEval set contains 805 instructions form self-instruct, open-assistant, vicuna, koala, hh-rlhf. Those were selected so that the AlpacaEval ranking of models on the AlpacaEval set would be similar to the ranking on the Alpaca demo data.

Variants: AlpacaEval

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Chatbot Yi 34B Chat Yi: Open Foundation Models by … 2024-03-07

Research Papers

Recent papers with results on this dataset: