Andreas Köpf [email protected] Christoph Schuhmann Huu Nguyen, Dimitri Von Rütte Christoph Schuhmann Huu Nguyen, Sotiris Anagnostidis Christoph Schuhmann Huu Nguyen, Zhi-Rui Tam Christoph Schuhmann Huu Nguyen, Keith Stevens Christoph Schuhmann Huu Nguyen, Abdullah Barhoum Nguyen Christoph Schuhmann Huu Nguyen, Minh Duc Christoph Schuhmann Huu Nguyen, Oliver Stanley Christoph Schuhmann Huu Nguyen, Richárd Nagyfi Christoph Schuhmann Huu Nguyen, Shahul Es Christoph Schuhmann Huu Nguyen, Sameer Suri Christoph Schuhmann Huu Nguyen, David Glushkov Christoph Schuhmann Huu Nguyen, Arnav Dantuluri Christoph Schuhmann Huu Nguyen, Andrew Maguire Christoph Schuhmann Huu Nguyen, Alexander Mattick [email protected] Christoph Schuhmann Huu Nguyen (2023)
This paper introduces OpenAssistant Conversations, a dataset aimed at democratizing the alignment of large language models (LLMs) with human preferences. It consists of 161,443 messages across 66,497 conversation trees in 35 languages, generated through a crowd-sourcing effort involving over 13,500 volunteers. The dataset is annotated with 461,292 quality ratings, enabling improved usability and effectiveness of LLMs such as Pythia and LLaMA through techniques like supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF). The paper highlights the importance of high-quality human feedback data for training LLMs and discusses methodology, quality control, safety measures, and the dataset's potential impact on AI alignment research and deployment. Moreover, results show that models trained on this dataset outperform baseline models on various benchmarks, promoting accessibility and inclusiveness in AI research.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: