The AlpacaEval set contains 805 instructions drawn from self-instruct, open-assistant, vicuna, koala, and hh-rlhf. They were selected so that the ranking of models on the AlpacaEval set would be similar to the ranking on the Alpaca demo data.
Variants: AlpacaEval
This dataset is used in 2 benchmarks:
Task | Model | Paper | Date |
---|---|---|---|
Chatbot | Yi 34B Chat | Yi: Open Foundation Models by … | 2024-03-07 |
Recent papers with results on this dataset: