当前位置: 首页 > article >正文

Retrieval-Augmented Generation for LargeLanguage Models: A Survey

标题:Retrieval-Augmented Generation for Large Language Models: A Survey

作者:Yunfan Gaoa , Yun Xiongb , Xinyu Gaob , Kangxiang Jiab , Jinliu Panb , Yuxi Bic , Yi Daia , Jiawei Suna , Meng Wangc , and Haofen Wang

1. By referencing external knowledge, RAG effectively reduces the problem of generating factually incorrect content. Its integration into LLMs has resulted in widespread adoption, establishing RAG as a key technology in advancing chatbots and enhancing the suitability of LLMs for real-world applications

2. The RAG research paradigm is continuously evolving, and we categorize it into three stages: Naive RAG, Advanced RAG, and Modular RAG

3. The Naive RAG:

Indexing starts with the cleaning and extraction of raw data

Retrieval. Upon receipt of a user query, the RAG system employs the same encoding model utilized during the indexing phase to transform the query into a vector representation.

Generation. The posed query and selected documents are synthesized into a coherent prompt to which a large language model is tasked with formulating a response.

4. 

Advanced RAG introduces specific improvements to overcome the limitations of Naive RAG. Focusing on enhancing retrieval quality, it employs pre-retrieval and post-retrieval strategies.

5. 

Pre-retrieval process. In this stage, the primary focus is on optimizing the indexing structure and the original query. The goal of optimizing indexing is to enhance the quality of the content being indexed.

Post-Retrieval Process. Once relevant context is retrieved, it’s crucial to integrate it effectively with the query

6. Innovations such as the Rewrite-Retrieve-Read [7]model leverage the LLM’s capabilities to refine retrieval queries through a rewriting module and a LM-feedback mechanism to update rewriting model

7. RAG is often compared with Fine-tuning (FT) and prompt engineering. Each method has distinct characteristics as illustrated in Figure 4.

8. In the context of RAG, it is crucial to efficiently retrieve relevant documents from the data source. There are several key issues involved, such as the retrieval source, retrieval granularity, pre-processing of the retrieval, and selection of the corresponding embedding model.


http://www.kler.cn/a/548572.html

相关文章:

  • [算法]——链表(二)
  • springCloud-2021.0.9 之 GateWay 示例
  • 数学建模与MATLAB实现:数据拟合全解析
  • 华为IPD简介
  • 【AIDevops】Deepseek驱动无界面自动化运维与分布式脚本系统,初探运维革命之路
  • C语言中的强制类型转换:原理、用法及注意事项
  • 从源代码编译构建vLLM并解决常见编译问题
  • LVS 负载均衡集群(NAT模式)
  • Oracle RHEL AS 4 安装 JAVA 1.4.2
  • 双轴伺服电机驱动控制器AGV、AMR专用双伺服电机驱动控制器解决方案
  • elasticsearch8 linux版以服务的方式启动
  • 开发板适配之I2C-RTC
  • 大疆无人机指令飞行JWT认证
  • HTTP 参数污染(HPP)详解
  • c++中什么时候应该使用final关键字?
  • 【在idea中配置两个不同端口,同时运行两个相同的主程序springboot】
  • 5G与物联网的协同发展:打造智能城市的未来
  • nginx的十一个阶段详解
  • unity学习40:导入模型的 Animations文件夹内容,动画属性和修改动画文件
  • Mongodb数据管理