
all-in-rag
🔍 LLM Application Development in Practice I: A Full-Stack Guide to RAG. Read online: https://datawhalechina.github.io/all-in-rag/
Stars: 509

All-in-RAG is a comprehensive, hands-on tutorial for Retrieval-Augmented Generation (RAG) application development with large language models. It walks through the full RAG stack — data preparation and chunking, vector and multimodal embeddings, vector databases such as Milvus, hybrid retrieval, Text2SQL, structured generation, and system evaluation — and pairs each topic with code and end-to-end projects. The repository aims to serve as a one-stop learning path for developers, AI engineers, and researchers who want to build production-grade question-answering and knowledge-retrieval systems.
README:
🎯 Systematic learning: a complete RAG curriculum | 🛠️ Hands-on practice: rich project case studies | 🚀 Production-ready: engineering best practices | 📊 Multimodal support: text + image retrieval
Project Overview (中文 | English)
This project is a full-stack tutorial on RAG (Retrieval-Augmented Generation) for LLM application developers. Through a structured learning path and hands-on projects, it helps developers master RAG application development on top of large language models and build production-grade intelligent question-answering and knowledge-retrieval systems.
Main topics include:
- RAG fundamentals: an accessible introduction to RAG's core concepts, technical principles, and application scenarios
- End-to-end data processing: the complete data-preparation pipeline, from loading and cleaning to text chunking (see the chunking sketch after this list)
- Index construction and optimization: vector embeddings, multimodal embeddings, vector-database construction, and index-optimization techniques
- Advanced retrieval: hybrid retrieval, query construction, Text2SQL, and other advanced retrieval techniques
- Generation integration and evaluation: formatted generation, system evaluation, and optimization methods
- Hands-on projects: complete RAG application development practice, from basic to advanced
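To make the text-chunking step concrete, here is a minimal sketch of fixed-size chunking with overlap in plain Python. The function name, chunk size, and overlap are illustrative choices, not the tutorial's actual splitter or parameters.

```python
# Minimal illustrative chunker: fixed-size windows with overlap.
# The tutorial's data-preparation chapter may use dedicated splitter libraries instead.

def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split `text` into chunks of roughly `chunk_size` characters, overlapping by `overlap`."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must be larger than overlap")
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap  # step forward, keeping shared context between neighbors
    return chunks

if __name__ == "__main__":
    sample = "Retrieval-Augmented Generation grounds LLM answers in retrieved documents. " * 40
    print(f"{len(chunk_text(sample))} chunks produced")
```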
With the rapid progress of large language models, RAG has become a core technique for building intelligent question-answering and knowledge-retrieval applications. Existing RAG tutorials, however, are often fragmented and unsystematic, making it hard for beginners to build a complete mental model of the technology.
Starting from practice and tracking the latest developments in RAG, this project builds a complete learning path that helps developers:
- Systematically master the theoretical foundations and practical skills of RAG
- Understand the complete architecture of a RAG system and the role of each component
- Become able to develop RAG applications independently
- Master methods for evaluating and optimizing RAG systems
This project is intended for:
- Developers with a Python background who are interested in RAG
- AI engineers who want to learn RAG systematically
- Product developers who want to build intelligent question-answering systems
- Researchers who need to learn retrieval-augmented generation
Prerequisites:
- Knowledge of basic Python syntax and common libraries
- Ability to use Docker at a basic level
- Familiarity with basic LLM concepts (recommended but not required)
- Basic Linux command-line skills
Project highlights:
- Systematic learning path: a complete RAG curriculum, from basic concepts to advanced applications
- Theory and practice combined: every chapter pairs conceptual explanations with code exercises so that learning translates into practice
- Multimodal support: covers not only text RAG but also multimodal embedding and retrieval
- Engineering-oriented: addresses real-world engineering concerns such as performance optimization and system evaluation
- Rich hands-on projects: multiple projects, from basic to advanced, to consolidate what you learn
Chapter 1: Unlocking RAG 📖 View chapter
Chapter 2: Data Preparation 📖 View chapter
Chapter 3: Index Construction 📖 View chapter
- [x] Vector embeddings - text vectorization techniques explained in depth
- [x] Multimodal embeddings - image-text multimodal vectorization
- [x] Vector databases - vector storage and retrieval systems
- [x] Milvus in practice - hands-on multimodal retrieval with Milvus (see the indexing sketch after this list)
- [x] Index optimization - index performance-tuning techniques
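As a quick illustration of the indexing workflow covered in this chapter, the sketch below embeds a few text chunks and indexes them with Milvus. It assumes pymilvus (with Milvus Lite) and sentence-transformers are installed; the embedding model, collection name, and sample texts are assumptions for illustration, not the tutorial's exact setup.

```python
# Illustrative sketch: embed text chunks and index/search them in Milvus Lite.
from pymilvus import MilvusClient
from sentence_transformers import SentenceTransformer

texts = [
    "RAG retrieves relevant documents before generation.",
    "Vector databases store and search dense embeddings.",
    "Milvus supports approximate nearest-neighbor indexes such as HNSW.",
]

model = SentenceTransformer("BAAI/bge-small-en-v1.5")   # assumed embedding model
embeddings = model.encode(texts, normalize_embeddings=True)

client = MilvusClient("rag_demo.db")                    # Milvus Lite: local, file-backed
client.create_collection(collection_name="demo_chunks",
                         dimension=int(embeddings.shape[1]))
client.insert(
    collection_name="demo_chunks",
    data=[{"id": i, "vector": emb.tolist(), "text": t}
          for i, (emb, t) in enumerate(zip(embeddings, texts))],
)

query_vec = model.encode(["How does Milvus index vectors?"], normalize_embeddings=True)
results = client.search(collection_name="demo_chunks", data=query_vec.tolist(),
                        limit=2, output_fields=["text"])
for hit in results[0]:
    print(round(hit["distance"], 3), hit["entity"]["text"])
```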
Chapter 4: Retrieval Optimization 📖 View chapter
- [x] Hybrid retrieval - fusing dense and sparse retrieval (see the fusion sketch after this list)
- [x] Query construction - intelligent query understanding and construction
- [x] Text2SQL - translating natural language into SQL queries
- [x] Query rewriting and routing - query optimization strategies
- [x] Advanced retrieval techniques - advanced retrieval algorithms
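To illustrate the hybrid-retrieval idea, here is a minimal sketch that merges a dense ranking and a sparse (BM25-style) ranking with Reciprocal Rank Fusion (RRF), one common fusion scheme. The chapter itself may use a different fusion method or a library implementation, and the document ids below are hypothetical.

```python
# Illustrative Reciprocal Rank Fusion (RRF) over two ranked result lists.
from collections import defaultdict

def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge ranked lists of doc ids; documents ranked highly in several lists score best."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] += 1.0 / (k + rank + 1)  # standard RRF term
    return sorted(scores, key=scores.get, reverse=True)

dense_hits = ["doc3", "doc1", "doc7"]    # hypothetical vector-search ranking
sparse_hits = ["doc1", "doc5", "doc3"]   # hypothetical BM25 ranking
print(rrf_fuse([dense_hits, sparse_hits]))  # docs favored by both retrievers rise to the top
```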
Chapter 5: Generation Integration 📖 View chapter
- [x] Formatted generation - structured output and format control (see the sketch below)
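As a sketch of format-controlled generation, the snippet below prompts for JSON output and validates the reply against a schema with pydantic (v2 assumed). The call_llm function is a hypothetical placeholder for whatever LLM client the chapter actually uses; the schema and prompt are illustrative.

```python
# Illustrative structured-output pattern: constrain the reply format, then validate it.
import json
from pydantic import BaseModel, ValidationError  # assumes pydantic v2

class Answer(BaseModel):
    answer: str
    sources: list[str]

PROMPT_TEMPLATE = (
    "Answer the question using only the provided context.\n"
    'Reply strictly as JSON: {{"answer": "...", "sources": ["..."]}}\n'
    "Context:\n{context}\n\nQuestion: {question}"
)

def call_llm(prompt: str) -> str:
    # Hypothetical placeholder; swap in a real chat-completion call here.
    return '{"answer": "RAG grounds generation in retrieved text.", "sources": ["chunk_1"]}'

def generate_structured(question: str, context: str) -> Answer:
    raw = call_llm(PROMPT_TEMPLATE.format(context=context, question=question))
    try:
        return Answer.model_validate(json.loads(raw))
    except (json.JSONDecodeError, ValidationError) as err:
        raise RuntimeError(f"Model output was not valid structured JSON: {err}") from err

print(generate_structured("What does RAG do?", "RAG retrieves documents before answering."))
```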
Chapter 6: RAG System Evaluation 📖 View chapter
Chapter 7: Advanced RAG Architectures (extended material) 📖 View chapter
- [x] Knowledge-graph-based RAG
Chapter 8: Hands-on Project I 📖 View chapter
Chapter 9: Hands-on Project I Optimization (optional) 📖 View chapter
- [x] Graph RAG architecture design
- [x] Graph data modeling and preparation
- [x] Milvus index construction
- [x] Intelligent query routing and retrieval strategies (see the routing sketch after this list)
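For a flavor of query routing, here is a minimal rule-based sketch that sends relationship-style questions to a graph lookup and everything else to vector search. The routing hints and retriever stubs are hypothetical placeholders, not the project's actual logic.

```python
# Illustrative query router: pick a retrieval strategy per query.
from typing import Callable

RELATION_HINTS = ("related to", "connected to", "relationship", "depends on")

def route_query(query: str) -> str:
    """Return the name of the retrieval strategy to use for this query."""
    q = query.lower()
    if any(hint in q for hint in RELATION_HINTS):
        return "graph"    # relationship-style questions go to the knowledge graph
    return "vector"       # everything else falls back to semantic vector search

def graph_retrieve(query: str) -> list[str]:
    return [f"[graph hit for: {query}]"]     # placeholder retriever

def vector_retrieve(query: str) -> list[str]:
    return [f"[vector hit for: {query}]"]    # placeholder retriever

RETRIEVERS: dict[str, Callable[[str], list[str]]] = {
    "graph": graph_retrieve,
    "vector": vector_retrieve,
}

query = "Which components are related to the indexing module?"
print(RETRIEVERS[route_query(query)](query))
```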
Chapter 10: Hands-on Project II (optional) 📖 View chapter - in planning
all-in-rag/
├── docs/       # tutorial documentation
├── code/       # code examples
├── data/       # sample data
├── models/     # pre-trained models
└── README.md   # project description
Core contributors
- 尹大吕 - project lead (project initiator and main contributor)
Additional chapter contributors
- 孙超 - content creator (Datawhale member, Shanghai University of Engineering Science)
- Thanks to @Sm1les for the help and support given to this project
- Thanks to all the developers who have contributed to this project
- Thanks to the open-source community for its excellent tools and frameworks
- Special thanks to the following developers who contributed to the tutorial!
Made with contrib.rocks.
We welcome contributions of all kinds, including but not limited to:
- 🚨 Bug reports: if you find a problem, please open an Issue
- 💭 Tutorial suggestions: good ideas are welcome in Discussions
- 📚 Documentation improvements: help refine the docs and example code (currently only high-quality PRs for Chapter 7 are accepted)
If this project helps you, please give it a ⭐️ so that more people can discover it!
This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Alternative AI tools for all-in-rag
Similar Open Source Tools


SwanLab
SwanLab is an open-source, lightweight AI experiment tracking tool that provides a platform for tracking, comparing, and collaborating on experiments, aiming to accelerate the research and development efficiency of AI teams by 100 times. It offers a friendly API and a beautiful interface, combining hyperparameter tracking, metric recording, online collaboration, experiment link sharing, real-time message notifications, and more. With SwanLab, researchers can document their training experiences, seamlessly communicate and collaborate with collaborators, and machine learning engineers can develop models for production faster.

qiaoqiaoyun
Qiaoqiaoyun is a new-generation zero-code product that combines an AI application development platform, an AI knowledge base, and a zero-code platform, helping enterprises quickly build personalized business applications with AI. Users can build applications that meet their business needs without writing any code. Qiaoqiaoyun offers comprehensive application-building capabilities, including a form engine, a workflow engine, and a dashboard engine, covering typical enterprise requirements. It is also an AI application development platform built on large language models and an open-source RAG knowledge-base question-answering system.

AIMedia
AIMedia is a fully automated AI media software that automatically fetches hot news, generates news, and publishes on various platforms. It supports hot news fetching from platforms like Douyin, NetEase News, Weibo, The Paper, China Daily, and Sohu News. Additionally, it enables AI-generated images for text-only news to enhance originality and reading experience. The tool is currently commercialized with plans to support video auto-generation for platform publishing in the future. It requires a minimum CPU of 4 cores or above, 8GB RAM, and supports Windows 10 or above. Users can deploy the tool by cloning the repository, modifying the configuration file, creating a virtual environment using Conda, and starting the web interface. Feedback and suggestions can be submitted through issues or pull requests.

LogChat
LogChat is a free, open-source AI chat client that supports various chat models and technologies such as ChatGPT, iFLYTEK Spark (讯飞星火), DeepSeek, LLM, TTS, STT, and Live2D. The tool provides a user-friendly interface built with Qt Creator and runs on Windows without additional environment requirements. Users can interact with different AI models, perform speech synthesis and recognition, and customize Live2D character models. LogChat also offers language translation, AI platform integration, and menu utilities such as screenshot editing, a clock, and an application launcher.

MoneyPrinterPlus
MoneyPrinterPlus is a project designed to help users easily make money in the era of short videos. It leverages AI big model technology to batch generate various short videos, perform video editing, and automatically publish videos to popular platforms like Douyin, Kuaishou, Xiaohongshu, and Video Number. The tool covers a wide range of functionalities including integrating with major AI big model tools, supporting various voice types, offering video transition effects, enabling customization of subtitles, and more. It aims to simplify the process of creating and sharing videos to monetize traffic.

douyin-chatgpt-bot
Douyin ChatGPT Bot is an AI-driven system for automatic replies on Douyin, including comment and private message replies. It offers features such as comment filtering, customizable robot responses, and automated account management. The system aims to enhance user engagement and brand image on the Douyin platform, providing a seamless experience for managing interactions with followers and potential customers.

KubeDoor
KubeDoor is a microservice resource management platform developed using Python and Vue, based on K8S admission control mechanism. It supports unified remote storage, monitoring, alerting, notification, and display for multiple K8S clusters. The platform focuses on resource analysis and control during daily peak hours of microservices, ensuring consistency between resource request rate and actual usage rate.

WeChatMsg
WeChatMsg is a tool designed to help users manage and analyze their WeChat data. It aims to provide users with the ability to preserve their precious memories and create a personalized AI companion. The tool allows users to extract and export various types of data from WeChat, such as text, images, contacts, and more. Additionally, it offers features like analyzing chat data and generating visual annual reports. WeChatMsg is built on the idea of empowering users to take control of their data and foster emotional connections through technology.

aituber-kit
AITuber-Kit is a tool that enables users to interact with AI characters, conduct AITuber live streams, and engage in external integration modes. Users can easily converse with AI characters using various LLM APIs, stream on YouTube with AI character reactions, and send messages to server apps via WebSocket. The tool provides settings for API keys, character configurations, voice synthesis engines, and more. It supports multiple languages and allows customization of VRM models and background images. AITuber-Kit follows the MIT license and offers guidelines for adding new languages to the project.

aice_ps
Aice PS is a powerful web-based AI photo editor that utilizes Google AI Studio's advanced capabilities to make professional image editing and creation simple and intuitive. Users can enhance images, apply creative filters, make professional adjustments, and even generate new images from scratch using simple text prompts. The tool combines various cutting-edge AI capabilities to provide a one-stop creative image and video solution, including AI image generation, intelligent editing, creative filters, professional adjustments, AI inspiration suggestions, intelligent synthesis, texture overlay, one-click cutout, time travel effects, BeatSync for music and image synchronization, NB prompt word library, basic editing toolkit, and more.

airda
airda (Air Data Agent) is a multi-agent system for data analysis. It can understand data development and data analysis requirements, understand the data itself, and generate SQL and Python code for data querying, data visualization, machine learning, and other tasks.

AI-Vtuber
AI-VTuber is a highly customizable AI VTuber project that integrates with Bilibili live streaming, uses the Zhipu API as its base language model, and includes intent recognition, short-term and long-term memory, cognitive library building, song library creation, and integration with various voice conversion, voice synthesis, image generation, and digital human projects. It provides a user-friendly client for operations. The project supports virtual VTuber template construction, multi-person device template management, real-time switching of virtual VTuber templates, and offers various practical tools such as video/audio crawlers, voice recognition, voice separation, voice synthesis, voice conversion, AI drawing, and image background removal.

Operit
Operit AI is a fully functional AI assistant application for mobile devices, running independently on Android devices with powerful tool invocation capabilities. It offers over 40 built-in tools for file system operations, HTTP requests, system operations, UI automation, and media processing. The app combines these tools with rich plugins to enable a wide range of tasks, from simple to complex, providing a comprehensive experience of a smartphone AI assistant.

Code-Review-GPT-Gitlab
A project that utilizes large models to help with Code Review on Gitlab, aimed at improving development efficiency. The project is customized for Gitlab and is developing a Multi-Agent plugin for collaborative review. It integrates various large models for code security issues and stays updated with the latest Code Review trends. The project architecture is designed to be powerful, flexible, and efficient, with easy integration of different models and high customization for developers.

MaiBot
MaiBot is an intelligent QQ group chat bot based on a large language model. It is developed using the nonebot2 framework, with an LLM providing conversation abilities, MongoDB providing data persistence, and NapCat serving as the QQ protocol endpoint. The project is in an active development stage, with features such as chat, emoji handling, schedule management, memory, a knowledge base, and relationship tracking planned for future updates. The project aims to create a 'life form' active in QQ group chats, focusing on companionship and creating a more human-like presence rather than a perfect assistant. The application generates content from AI models, so users are advised to evaluate outputs carefully and not use it for illegal purposes.
For similar tasks

Awesome-LLM4Graph-Papers
A collection of papers and resources about Large Language Models (LLM) for Graph Learning (Graph). Integrating LLMs with graph learning techniques to enhance performance in graph learning tasks. Categorizes approaches based on four primary paradigms and nine secondary-level categories. Valuable for research or practice in self-supervised learning for recommendation systems.

Graph-CoT
This repository contains the source code and datasets for Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs accepted to ACL 2024. It proposes a framework called Graph Chain-of-thought (Graph-CoT) to enable Language Models to traverse graphs step-by-step for reasoning, interaction, and execution. The motivation is to alleviate hallucination issues in Language Models by augmenting them with structured knowledge sources represented as graphs.

Awesome-Graph-LLM
Awesome-Graph-LLM is a curated collection of research papers exploring the intersection of graph-based techniques with Large Language Models (LLMs). The repository aims to bridge the gap between LLMs and graph structures prevalent in real-world applications by providing a comprehensive list of papers covering various aspects of graph reasoning, node classification, graph classification/regression, knowledge graphs, multimodal models, applications, and tools. It serves as a valuable resource for researchers and practitioners interested in leveraging LLMs for graph-related tasks.


Interview-for-Algorithm-Engineer
This repository provides a collection of interview questions and answers for algorithm engineers. The questions are organized by topic, and each question includes a detailed explanation of the answer. This repository is a valuable resource for anyone preparing for an algorithm engineering interview.

lmql
LMQL is a programming language designed for large language models (LLMs) that offers a unique way of integrating traditional programming with LLM interaction. It allows users to write programs that combine algorithmic logic with LLM calls, enabling model reasoning capabilities within the context of the program. LMQL provides features such as Python syntax integration, rich control-flow options, advanced decoding techniques, powerful constraints via logit masking, runtime optimization, sync and async API support, multi-model compatibility, and extensive applications like JSON decoding and interactive chat interfaces. The tool also offers library integration, flexible tooling, and output streaming options for easy model output handling.

learnopencv
LearnOpenCV is a repository containing code for Computer Vision, Deep learning, and AI research articles shared on the blog LearnOpenCV.com. It serves as a resource for individuals looking to enhance their expertise in AI through various courses offered by OpenCV. The repository includes a wide range of topics such as image inpainting, instance segmentation, robotics, deep learning models, and more, providing practical implementations and code examples for readers to explore and learn from.

Java-AI-Book-Code
The Java-AI-Book-Code repository contains code examples for the 2020 edition of 'Practical Artificial Intelligence With Java'. It is a comprehensive update of the previous 2013 edition, featuring new content on deep learning, knowledge graphs, anomaly detection, linked data, genetic algorithms, search algorithms, and more. The repository serves as a valuable resource for Java developers interested in AI applications and provides practical implementations of various AI techniques and algorithms.
For similar jobs

Perplexica
Perplexica is an open-source AI-powered search engine that utilizes advanced machine learning algorithms to provide clear answers with sources cited. It offers various modes like Copilot Mode, Normal Mode, and Focus Modes for specific types of questions. Perplexica ensures up-to-date information by using SearxNG metasearch engine. It also features image and video search capabilities and upcoming features include finalizing Copilot Mode and adding Discover and History Saving features.

KULLM
KULLM (구름) is a Korean Large Language Model developed by Korea University NLP & AI Lab and HIAI Research Institute. It is based on the upstage/SOLAR-10.7B-v1.0 model and has been fine-tuned for instruction. The model has been trained on 8×A100 GPUs and is capable of generating responses in Korean language. KULLM exhibits hallucination and repetition phenomena due to its decoding strategy. Users should be cautious as the model may produce inaccurate or harmful results. Performance may vary in benchmarks without a fixed system prompt.

MMMU
MMMU is a benchmark designed to evaluate multimodal models on college-level subject knowledge tasks, covering 30 subjects and 183 subfields with 11.5K questions. It focuses on advanced perception and reasoning with domain-specific knowledge, challenging models to perform tasks akin to those faced by experts. The evaluation of various models highlights substantial challenges, with room for improvement to stimulate the community towards expert artificial general intelligence (AGI).

1filellm
1filellm is a command-line data aggregation tool designed for LLM ingestion. It aggregates and preprocesses data from various sources into a single text file, facilitating the creation of information-dense prompts for large language models. The tool supports automatic source type detection, handling of multiple file formats, web crawling functionality, integration with Sci-Hub for research paper downloads, text preprocessing, and token count reporting. Users can input local files, directories, GitHub repositories, pull requests, issues, ArXiv papers, YouTube transcripts, web pages, Sci-Hub papers via DOI or PMID. The tool provides uncompressed and compressed text outputs, with the uncompressed text automatically copied to the clipboard for easy pasting into LLMs.

gpt-researcher
GPT Researcher is an autonomous agent designed for comprehensive online research on a variety of tasks. It can produce detailed, factual, and unbiased research reports with customization options. The tool addresses issues of speed, determinism, and reliability by leveraging parallelized agent work. The main idea involves running 'planner' and 'execution' agents to generate research questions, seek related information, and create research reports. GPT Researcher optimizes costs and completes tasks in around 3 minutes. Features include generating long research reports, aggregating web sources, an easy-to-use web interface, scraping web sources, and exporting reports to various formats.

ChatTTS
ChatTTS is a generative speech model optimized for dialogue scenarios, providing natural and expressive speech synthesis with fine-grained control over prosodic features. It supports multiple speakers and surpasses most open-source TTS models in terms of prosody. The model is trained with 100,000+ hours of Chinese and English audio data, and the open-source version on HuggingFace is a 40,000-hour pre-trained model without SFT. The roadmap includes open-sourcing additional features like VQ encoder, multi-emotion control, and streaming audio generation. The tool is intended for academic and research use only, with precautions taken to limit potential misuse.

HebTTS
HebTTS is a language modeling approach to diacritic-free Hebrew text-to-speech (TTS) system. It addresses the challenge of accurately mapping text to speech in Hebrew by proposing a language model that operates on discrete speech representations and is conditioned on a word-piece tokenizer. The system is optimized using weakly supervised recordings and outperforms diacritic-based Hebrew TTS systems in terms of content preservation and naturalness of generated speech.

do-research-in-AI
This repository is a collection of research lectures and experience sharing posts from frontline researchers in the field of AI. It aims to help individuals upgrade their research skills and knowledge through insightful talks and experiences shared by experts. The content covers various topics such as evaluating research papers, choosing research directions, research methodologies, and tips for writing high-quality scientific papers. The repository also includes discussions on academic career paths, research ethics, and the emotional aspects of research work. Overall, it serves as a valuable resource for individuals interested in advancing their research capabilities in the field of AI.