Yunfan Gao Shanghai Research Institute for Intelligent Autonomous Systems Tongji University, Yun Xiong School of Computer Science Shanghai Key Laboratory of Data Science Fudan University, Xinyu Gao School of Computer Science Shanghai Key Laboratory of Data Science Fudan University, Kangxiang Jia School of Computer Science Shanghai Key Laboratory of Data Science Fudan University, Jinliu Pan School of Computer Science Shanghai Key Laboratory of Data Science Fudan University, Yuxi Bi College of Design and Innovation Tongji University, Yi Dai Shanghai Research Institute for Intelligent Autonomous Systems Tongji University, Jiawei Sun Shanghai Research Institute for Intelligent Autonomous Systems Tongji University, Meng Wang College of Design and Innovation Tongji University, Haofen Wang Shanghai Research Institute for Intelligent Autonomous Systems Tongji University College of Design and Innovation Tongji University (2023)
This paper provides a comprehensive survey of Retrieval-Augmented Generation (RAG) for Large Language Models (LLMs). It outlines the significant advantages of RAG in enhancing LLMs by integrating external knowledge sources, addressing challenges such as hallucination, outdated information, and opaque reasoning. The paper categorizes RAG into three paradigms: Naive RAG, Advanced RAG, and Modular RAG, detailing their structures, methodologies, and the evolution influenced by the emergence of advanced LLMs like ChatGPT. The authors explore the technical intricacies of the RAG process, covering retrieval, generation, and augmentation techniques. They compile key datasets and metrics, propose evaluation frameworks, and discuss current challenges and future research directions in RAG. The survey aims to equip readers with a thorough understanding of RAG's implementation, its foundational components, and its application across various domains. Overall, it emphasizes the need for continuous innovation in RAG technologies amidst the growth of LLM capabilities.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: