RAGHub
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
Stars: 171
RAGHub is a community-driven project focused on cataloging new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. It aims to help users stay ahead of changes in the field by providing a platform for the latest innovations in RAG. The repository includes information on RAG frameworks, evaluation frameworks, optimization frameworks, citation frameworks, engines, search reranker frameworks, projects, resources, and real-world use cases across industries and professions.
README:
Welcome to RAGHub, a living collection of new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. This is a community-driven project for r/RAG, where we aim to catalog the rapid growth of RAG tools and projects that are pushing the boundaries of the field.
Each day, it feels like a new tool or framework emerges, and choosing the right one is becoming more of an art than a science. Is the framework from three months ago still relevant? Or was it just hype, rehashing old concepts with a fresh look? RAGHub exists to help you stay ahead of these changes, providing a platform for the latest innovations in RAG.
This is a community project, and we welcome contributions from everyone! If you’d like to add a new framework, project, or resource, please check out our Contribution Guidelines for details on how to get started.
Name | Description | Website | Github | Stars | Activity |
---|---|---|---|---|---|
LangChain | A framework for building applications with LLMs | Website | Github | 93.2k | 9h ago |
Haystack | A framework for building search engines using neural networks | Website | Github | 17k | Last week |
LlamaIndex | A framework for building data-driven LLM applications | Website | Github | 35.9k | 7h ago |
BentoML | Build Inference APIs, LLM apps, Multi-model chains, RAG service | Website | Github | 7k | 1h ago |
LightRAG | Simple and fast Retrieval-Augmented Generation | Website | Github | 268 | 1d ago |
Swarm by OpenAI | Educational framework for lightweight multi-agent orchestration | - | Github | 7k | 1d ago |
Langroid | Python framework to easily build LLM-powered applications | Website | Github | 2.4k | 10h ago |
NeMo-Guardrails | Toolkit for adding programmable guardrails to LLM-based applications | Website | Github | 4k | Last week |
Swiftide | A Rust library for building fast, streaming applications with LLMs | Website | Github | 222 | 1h ago |
Korvus | The entire RAG pipeline in a single database query | Website | Github | 1.3k | Last month |
semantic-router | A framework for routing LLM requests using semantic vectors. | Website | Github | 2k | 4h ago |
AWS Bedrock Knowledge Bases | Service to build, scale, and deploy RAG-powered applications | Website | - | - | 1h ago |
langflow | A framework to build, scale, and deploy RAG and multi-agent AI apps. | Website | Github | 31.4k | 1h ago |
dspy | A framework to build language model apps with modular programming. | Website | Github | 17.8k | 13h ago |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
Trulens | Measures and enhancse LLM app quality with feedback functions for scalable evaluation | Website | Github | 2.1k | 11h ago |
Phoenix | AI observability platform designed for experimentation, evaluation, and troubleshooting | Website | Github | 3.6k | 1d ago |
ragas | Evaluates and quantifies the performance of RAG pipelines that enhance LLM context with external data | Website | Github | 6.8k | 3h ago |
Deepchecks | Continuous validation of AI & ML models, detecting data drift and model issues | Website | Github | 3.6k | 8m ago |
AutoRAG | End-to-end RAG optimization: parsing, chunking, evaluation dataset creation, and pipeline deployment | Website | Github | 1.6k | 1h ago |
evalmy.ai | Fine tuned lightweight RAG evaluation service + python client lib | Website | Github | -- | -- |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
R2R | The Elasticsearch for RAG, helps you quickly build and launch scalable RAG solutions | Website | Github | 3.4k | 6h ago |
RAGFlow | Open-source RAG engine based on deep document understanding | Website | Github | 18.7k | 1h ago |
Vertex AI Knowledge Engine | A data framework for context-augmented LLM applications | Website | - | - | 1d ago |
Embedchain | Open Source Framework for personalizing LLM responses under 10 lines of code | Website | Github | 22.2k | Last week |
txtai | All-in-one embeddings database for semantic search, LLM orchestration, and RAG workflows | Website | Github | 8.8k | Last week |
dsRAG | High-performance retrieval engine for unstructured data | - | Github | 815 | Last week |
Flash-Rank | Use Pairwise or Listwise rerankers to improve search accuracy before passing to LLMs. | Github | 606 | 2w ago | |
Graphlit | API-first platform for building knowledge-driven AI applications and agents | Website | Github | 16 | 8h ago |
rag-citation | Combines RAG with automatic citation generation to enhance content credibility | Website | Github | 6 | Last week |
PostgresML | Postgres + GPUs with functions for chunking, embedding, transforming and ranking | Website | Github | 6k | Yesterday |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
LlamaParse | GenAI-native document parsing platform | Website | Github | 2.8k | 2d ago |
Langchain-extract | Web server to extract information from text and files using LLMs | Website | Github | 1k | 4m ago |
Unstructured.io | build custom preprocessing pipelines for labeling, training, or production ml | Website | Github | 8.7k | 3d ago |
Verba | RAG chatbot powered by Weaviate | Website | Github | 6.1k | 2w ago |
Unstract | No-code Platform to launch APIs and ETL Pipelines to structure unstructured documents | Website | Github | 2.3k | 4h ago |
Humata.ai | Ask questions across all of your document files | Website | 4h ago | ||
Ragie.ai | Fully managed RAG-as-a-Service for developers. | Website | Github | 12 | 12h ago |
Reducto | Parses complex documents and creates LLM-ready inputs | Website | Github | 16 | 2w ago |
Midship | Extract document data straight into your spreadsheet/erp/crm | Website | Github | - | - |
DocuPanda | Convert documents into a structured, standard set of fields and values | Website | - | - | - |
contextual-doc-retrieval-opneai-reranker | Using GPT-4 and Cohere for query expansion and re-ranking with BM25 | GitHub | 20 | Last week | |
Raggenie | Low-code platform to build custom RAG-based AI applications | Website | Github | 60 | 10h ago |
Chunkr | Vision model based PDF chunking and OCR, optimized for fast processing of large datasets | Website | Github | 651 | 11h ago |
Site/Article | Description | Link |
---|---|---|
Contextual Retrieval | Anthropic introducing Contextual Retrieval | Website |
Open-RAG | Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Website |
ColPali | Efficient Document Retrieval with Vision Language Models | Website |
RAG_Techniques | Showcases various advanced techniques for RAG systems | Website |
GenAI_Agents | Tutorials and implementations for various AI Agent techniques | Website |
Name | Description | Link |
---|---|---|
Artificial Analysis | LLM Comparison | Website |
HuggingFace/mteb | Embedding models leaderboard | Website |
If you're looking for mainstream RAG frameworks and techniques**, check out the excellent repository by Nir Diamant: RAG Techniques. This repository focuses on more established tools and methods that have already gained traction in the community.
This project is licensed under the MIT License. See the LICENSE file for details.
This project is part of the r/RAG community. Have feedback or suggestions? Feel free to open an issue, start a discussion, or join the conversation on our Discord server! We want to make this repository a valuable resource for everyone exploring the RAG ecosystem, and your input is crucial.
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for RAGHub
Similar Open Source Tools
RAGHub
RAGHub is a community-driven project focused on cataloging new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. It aims to help users stay ahead of changes in the field by providing a platform for the latest innovations in RAG. The repository includes information on RAG frameworks, evaluation frameworks, optimization frameworks, citation frameworks, engines, search reranker frameworks, projects, resources, and real-world use cases across industries and professions.
Model-References
The 'Model-References' repository contains examples for training and inference using Intel Gaudi AI Accelerator. It includes models for computer vision, natural language processing, audio, generative models, MLPerf™ training, and MLPerf™ inference. The repository provides performance data and model validation information for various frameworks like PyTorch. Users can find examples of popular models like ResNet, BERT, and Stable Diffusion optimized for Intel Gaudi AI accelerator.
models
The Intel® AI Reference Models repository contains links to pre-trained models, sample scripts, best practices, and tutorials for popular open-source machine learning models optimized by Intel to run on Intel® Xeon® Scalable processors and Intel® Data Center GPUs. It aims to replicate the best-known performance of target model/dataset combinations in optimally-configured hardware environments. The repository will be deprecated upon the publication of v3.2.0 and will no longer be maintained or published.
ai-reference-models
The Intel® AI Reference Models repository contains links to pre-trained models, sample scripts, best practices, and tutorials for popular open-source machine learning models optimized by Intel to run on Intel® Xeon® Scalable processors and Intel® Data Center GPUs. The purpose is to quickly replicate complete software environments showcasing the AI capabilities of Intel platforms. It includes optimizations for popular deep learning frameworks like TensorFlow and PyTorch, with additional plugins/extensions for improved performance. The repository is licensed under Apache License Version 2.0.
Top-AI-Tools
Top AI Tools is a comprehensive, community-curated directory that aims to catalog and showcase the most outstanding AI-powered products. This index is not exhaustive, but rather a compilation of our research and contributions from the community.
ai-game-development-tools
Here we will keep track of the AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥 * Tool (AI LLM) * Game (Agent) * Code * Framework * Writer * Image * Texture * Shader * 3D Model * Avatar * Animation * Video * Audio * Music * Singing Voice * Speech * Analytics * Video Tool
CogVLM2
CogVLM2 is a new generation of open source models that offer significant improvements in benchmarks such as TextVQA and DocVQA. It supports 8K content length, image resolution up to 1344 * 1344, and both Chinese and English languages. The project provides basic calling methods, fine-tuning examples, and OpenAI API format calling examples to help developers quickly get started with the model.
are-copilots-local-yet
Current trends and state of the art for using open & local LLM models as copilots to complete code, generate projects, act as shell assistants, automatically fix bugs, and more. This document is a curated list of local Copilots, shell assistants, and related projects, intended to be a resource for those interested in a survey of the existing tools and to help developers discover the state of the art for projects like these.
cool-ai-stuff
This repository contains an uncensored list of free to use APIs and sites for several AI models. > _This list is mainly managed by @zukixa, the queen of zukijourney, so any decisions may have bias!~_ > > **Scroll down for the sites, APIs come first!** * * * > [!WARNING] > We are not endorsing _any_ of the listed services! Some of them might be considered controversial. We are not responsible for any legal, technical or any other damage caused by using the listed services. Data is provided without warranty of any kind. **Use these at your own risk!** * * * # APIs Table of Contents #### Overview of Existing APIs #### Overview of Existing APIs -- Top LLM Models Available #### Overview of Existing APIs -- Top Image Models Available #### Overview of Existing APIs -- Top Other Features & Models Available #### Overview of Existing APIs -- Available Donator Perks * * * ## API List:* *: This list solely covers all providers I (@zukixa) was able to collect metrics in. Any mistakes are not my responsibility, as I am either banned, or not aware of x API. \ 1: Last Updated 4/14/24 ### Overview of APIs: | Service | # of Users1 | Link | Stablity | NSFW Ok? | Open Source? | Owner(s) | Other Notes | | ----------- | ---------- | ------------------------------------------ | ------------------------------------------ | --------------------------- | ------------------------------------------------------ | -------------------------- | ----------------------------------------------------------------------------------------------------------- | | zukijourney| 4441 | D | High | On /unf/, not /v1/ | ✅, Here | @zukixa | Largest & Oldest GPT-4 API still continuously around. Offers other popular AI-related Bots too. | | Hyzenberg| 1234 | D | High | Forbidden | ❌ | @thatlukinhasguy & @voidiii | Experimental sister API to Zukijourney. Successor to HentAI | | NagaAI | 2883 | D | High | Forbidden | ❌ | @zentixua | Honorary successor to ChimeraGPT, the largest API in history (15k users). | | WebRaftAI | 993 | D | High | Forbidden | ❌ | @ds_gamer | Largest API by model count. Provides a lot of service/hosting related stuff too. | | KrakenAI | 388 | D | High | Discouraged | ❌ | @paninico | It is an API of all time. | | ShuttleAI | 3585 | D | Medium | Generally Permitted | ❌ | @xtristan | Faked GPT-4 Before 1, 2 | | Mandrill | 931 | D | Medium | Enterprise-Tier-Only | ❌ | @fredipy | DALL-E-3 access pioneering API. Has some issues with speed & stability nowadays. | oxygen | 742 | D | Medium | Donator-Only | ❌ | @thesketchubuser | Bri'ish 🤮 & Fren'sh 🤮 | | Skailar | 399 | D | Medium | Forbidden | ❌ | @aquadraws | Service is the personification of the word 'feature creep'. Lots of things announced, not much operational. |
awesome-llm-planning-reasoning
The 'Awesome LLMs Planning Reasoning' repository is a curated collection focusing on exploring the capabilities of Large Language Models (LLMs) in planning and reasoning tasks. It includes research papers, code repositories, and benchmarks that delve into innovative techniques, reasoning limitations, and standardized evaluations related to LLMs' performance in complex cognitive tasks. The repository serves as a comprehensive resource for researchers, developers, and enthusiasts interested in understanding the advancements and challenges in leveraging LLMs for planning and reasoning in real-world scenarios.
OpenGPT-4o
OpenGPT 4o is a free alternative to OpenAI GPT 4o. It offers various features such as free pricing, image and video generation, image QnA, voice and video chat, multilingual support, high customization, and continuous learning capability. The tool aims to provide an alternative to OpenAI GPT 4o with enhanced capabilities and features for users.
Github-Ranking-AI
This repository provides a list of the most starred and forked repositories on GitHub. It is updated automatically and includes information such as the project name, number of stars, number of forks, language, number of open issues, description, and last commit date. The repository is divided into two sections: LLM and chatGPT. The LLM section includes repositories related to large language models, while the chatGPT section includes repositories related to the chatGPT chatbot.
EAGLE
Eagle is a family of Vision-Centric High-Resolution Multimodal LLMs that enhance multimodal LLM perception using a mix of vision encoders and various input resolutions. The model features a channel-concatenation-based fusion for vision experts with different architectures and knowledge, supporting up to over 1K input resolution. It excels in resolution-sensitive tasks like optical character recognition and document understanding.
oss-fuzz-gen
This framework generates fuzz targets for real-world `C`/`C++` projects with various Large Language Models (LLM) and benchmarks them via the `OSS-Fuzz` platform. It manages to successfully leverage LLMs to generate valid fuzz targets (which generate non-zero coverage increase) for 160 C/C++ projects. The maximum line coverage increase is 29% from the existing human-written targets.
open-llms
Open LLMs is a repository containing various Large Language Models licensed for commercial use. It includes models like T5, GPT-NeoX, UL2, Bloom, Cerebras-GPT, Pythia, Dolly, and more. These models are designed for tasks such as transfer learning, language understanding, chatbot development, code generation, and more. The repository provides information on release dates, checkpoints, papers/blogs, parameters, context length, and licenses for each model. Contributions to the repository are welcome, and it serves as a resource for exploring the capabilities of different language models.
For similar tasks
lance
Lance is a modern columnar data format optimized for ML workflows and datasets. It offers high-performance random access, vector search, zero-copy automatic versioning, and ecosystem integrations with Apache Arrow, Pandas, Polars, and DuckDB. Lance is designed to address the challenges of the ML development cycle, providing a unified data format for collection, exploration, analytics, feature engineering, training, evaluation, deployment, and monitoring. It aims to reduce data silos and streamline the ML development process.
ai-powered-search
AI-Powered Search provides code examples for the book 'AI-Powered Search' by Trey Grainger, Doug Turnbull, and Max Irwin. The book teaches modern machine learning techniques for building search engines that continuously learn from users and content to deliver more intelligent and domain-aware search experiences. It covers semantic search, retrieval augmented generation, question answering, summarization, fine-tuning transformer-based models, personalized search, machine-learned ranking, click models, and more. The code examples are in Python, leveraging PySpark for data processing and Apache Solr as the default search engine. The repository is open source under the Apache License, Version 2.0.
RAGHub
RAGHub is a community-driven project focused on cataloging new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. It aims to help users stay ahead of changes in the field by providing a platform for the latest innovations in RAG. The repository includes information on RAG frameworks, evaluation frameworks, optimization frameworks, citation frameworks, engines, search reranker frameworks, projects, resources, and real-world use cases across industries and professions.
Co-LLM-Agents
This repository contains code for building cooperative embodied agents modularly with large language models. The agents are trained to perform tasks in two different environments: ThreeDWorld Multi-Agent Transport (TDW-MAT) and Communicative Watch-And-Help (C-WAH). TDW-MAT is a multi-agent environment where agents must transport objects to a goal position using containers. C-WAH is an extension of the Watch-And-Help challenge, which enables agents to send messages to each other. The code in this repository can be used to train agents to perform tasks in both of these environments.
GPT4Point
GPT4Point is a unified framework for point-language understanding and generation. It aligns 3D point clouds with language, providing a comprehensive solution for tasks such as 3D captioning and controlled 3D generation. The project includes an automated point-language dataset annotation engine, a novel object-level point cloud benchmark, and a 3D multi-modality model. Users can train and evaluate models using the provided code and datasets, with a focus on improving models' understanding capabilities and facilitating the generation of 3D objects.
asreview
The ASReview project implements active learning for systematic reviews, utilizing AI-aided pipelines to assist in finding relevant texts for search tasks. It accelerates the screening of textual data with minimal human input, saving time and increasing output quality. The software offers three modes: Oracle for interactive screening, Exploration for teaching purposes, and Simulation for evaluating active learning models. ASReview LAB is designed to support decision-making in any discipline or industry by improving efficiency and transparency in screening large amounts of textual data.
Groma
Groma is a grounded multimodal assistant that excels in region understanding and visual grounding. It can process user-defined region inputs and generate contextually grounded long-form responses. The tool presents a unique paradigm for multimodal large language models, focusing on visual tokenization for localization. Groma achieves state-of-the-art performance in referring expression comprehension benchmarks. The tool provides pretrained model weights and instructions for data preparation, training, inference, and evaluation. Users can customize training by starting from intermediate checkpoints. Groma is designed to handle tasks related to detection pretraining, alignment pretraining, instruction finetuning, instruction following, and more.
amber-train
Amber is the first model in the LLM360 family, an initiative for comprehensive and fully open-sourced LLMs. It is a 7B English language model with the LLaMA architecture. The model type is a language model with the same architecture as LLaMA-7B. It is licensed under Apache 2.0. The resources available include training code, data preparation, metrics, and fully processed Amber pretraining data. The model has been trained on various datasets like Arxiv, Book, C4, Refined-Web, StarCoder, StackExchange, and Wikipedia. The hyperparameters include a total of 6.7B parameters, hidden size of 4096, intermediate size of 11008, 32 attention heads, 32 hidden layers, RMSNorm ε of 1e^-6, max sequence length of 2048, and a vocabulary size of 32000.
For similar jobs
weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.
kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.
PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.
tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.
spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.
Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.