vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

Stars: 845


This repository contains examples, applications, starter code, and tutorials to help you kickstart your GenAI projects.

These are built using LanceDB, a free, open-source, serverless vectorDB that requires no setup.
It integrates into the Python data ecosystem, so you can start using these in your existing data pipelines in pandas, arrow, pydantic, etc.
LanceDB has a native TypeScript SDK that lets you run vector search in serverless functions!

This repository is divided into 3 sections:

Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes!
Applications - Ready-to-use Python and web apps using applied LLMs, VectorDB and GenAI tools
Tutorials - A curated list of tutorials, blogs, Colabs and courses to get you started with GenAI in greater depth.

README:

VectorDB-recipes

Dive into building GenAI applications! This repository contains examples, applications, starter code, & tutorials to help you kickstart your GenAI projects.

These are built using LanceDB, a free, open-source, serverless vectorDB that requires no setup.
It integrates into the Python data ecosystem, so you can start using these in your existing data pipelines in pandas, arrow, pydantic, etc.
LanceDB has a native TypeScript SDK that lets you run vector search in serverless functions!

Join our community for support - Discord • Twitter

This repository is divided into 2 sections:

Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes!
Applications - Ready to use Python and web apps using applied LLMs, VectorDB and GenAI tools

The following examples are organized into different tables to make similar types of examples easily accessible.

Sections

Build from Scratch - Step-by-step guides to create AI applications from scratch.
Multimodal - Build apps that process and search across both text and images.
RAG - Combine document retrieval with LLM-powered responses.
Vector Search - Learn to efficiently find relevant documents using vector-based search.
Chatbot - Create AI chatbots that fetch information and generate intelligent replies.
Evaluation - Measure the quality and accuracy of AI-generated answers.
AI Agents - Build LLM-driven applications where multiple agents collaborate and interact.
Recommender Systems - Develop AI-powered recommendation systems for personalized suggestions.
Concepts - Tutorials and explanations of key techniques used in AI applications.

🌟 New 🌟

Stay up to date with the latest projects, tools, and improvements added to the repository.

V-JEPA Video Search

Build from Scratch

Start with the basics! These examples guide you through creating AI applications from the ground up using LanceDB for efficient document retrieval and search.

Build from Scratch	Interactive Notebook & Scripts

Build RAG from Scratch
Local RAG from Scratch with Llama3
Multi-Head RAG from Scratch
Fintech AI Agent from Scratch

Multimodal

Search across different types of data (text, images, and more). Build powerful search applications that work with diverse inputs.

Multimodal	Interactive Notebook & Scripts	Blog

V-JEPA Video Search
Multimodal CLIP: DiffusionDB
Multimodal CLIP: Youtube videos
Cambrian-1: Vision centric exploration of images
Multimodal Jina CLIP-V2 : Food Search
Multimodal vector search: Voyage AI X LanceDB

RAG

Generate responses by retrieving relevant documents before answering. This section covers different approaches to implementing RAG in your projects.

RAG	Interactive Notebook & Scripts	Blog

RAG using Deepseek R1 vs OpenAI o1
RAG On PDF
RAG with Contextual Retrieval and Hybrid search
RAG with Matryoshka Embeddings and LlamaIndex
RAG with IBM Watsonx
Cognee RAG
Improve RAG with Re-ranking
Improve RAG with HyDE
Improve RAG with LOTR
Advanced RAG: Context Enrichment Window
Advanced RAG: Late Chunking
Corrective RAG with Langgraph
Contextual-Compression-with-RAG
Improve RAG with FLARE
Agentic RAG
GraphRAG
GraphRAG with CSV File
GraphRAG with cognee - Multimedia
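
To make the retrieve-then-answer idea behind these RAG examples concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the method used by any notebook listed above: retrieval is approximated with simple word overlap instead of LanceDB vector search, no LLM is called, and the names `tokenize`, `retrieve`, `build_prompt`, and `DOCS` are hypothetical.

```python
import re

def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, documents, k=1):
    """Rank documents by word overlap with the question.

    Assumption: word overlap is a stand-in for embedding-based vector search.
    """
    q_tokens = tokenize(question)
    ranked = sorted(documents, key=lambda d: len(q_tokens & tokenize(d)), reverse=True)
    return ranked[:k]

def build_prompt(question, context_docs):
    """Assemble the prompt that would be sent to an LLM (the call itself is omitted)."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

# Hypothetical toy corpus.
DOCS = [
    "LanceDB is an open-source serverless vector database.",
    "Pandas is a Python library for tabular data.",
    "CLIP embeds images and text into a shared space.",
]

question = "What kind of database is LanceDB?"
prompt = build_prompt(question, retrieve(question, DOCS, k=1))
print(prompt)
```

In a real pipeline, `retrieve` would query a vector index and the prompt would be passed to a model; the structure of the two steps stays the same.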

Vector Search

Find relevant documents quickly! These projects show how to use vector-based search techniques to make AI-powered searches faster and smarter.

Vector Search	Interactive Notebook & Scripts	Blog

Inbuilt Hybrid Search
Hybrid search BM25 & lancedb
NER powered Semantic Search
Vector Arithmetic with LanceDB
Summarize and Search Reddit Posts
Imagebind demo app
Search Within Images
Zero Shot Object Detection with CLIP
Vector Search with TransformersJS
Geospatial Recommendation System
Accelerate Vector Search Applications Using OpenVINO
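
As a rough illustration of what vector-based search does under the hood, the sketch below ranks items by cosine similarity with a brute-force scan. It is a simplified assumption-laden example, not LanceDB's implementation: the function names and the three-dimensional `ITEMS` vectors are hypothetical, and real systems typically use learned embeddings and approximate indexes.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def top_k(query, items, k=2):
    """Return the k (score, label) pairs most similar to the query vector."""
    scored = [(cosine_similarity(query, vec), label) for label, vec in items]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]

# Hypothetical 3-dimensional "embeddings"; real ones have hundreds of dimensions.
ITEMS = [
    ("cat", [0.9, 0.1, 0.0]),
    ("dog", [0.8, 0.2, 0.1]),
    ("car", [0.0, 0.1, 0.9]),
]

results = top_k([1.0, 0.0, 0.0], ITEMS, k=2)
print([label for _, label in results])
```

A brute-force scan like this is fine for small collections; the examples above rely on a vector database precisely because scanning every vector does not scale.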

Chatbot

Create chatbots that understand user queries and fetch relevant responses using LanceDB’s vector search capabilities.

Chatbot	Interactive Notebook & Scripts	Blog

Databricks DBRX Website Bot
CLI-based SDK Manual Chatbot with Phidata
Youtube transcript search bot
Langchain: Code Docs QA bot
Chatbot with any website using Crawl4AI
Context-Aware Chatbot using Llama 2 & LanceDB

Evaluation

These projects provide tools to compare AI-generated responses against reference data and fine-tune accuracy.

Evaluation	Interactive Notebook & Scripts	Blog

Monitoring and Tracing RAG using HoneyHive
Evaluating RAG with RAGAs
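
One simple way to compare a generated answer against a reference is token-level F1. The sketch below is a generic illustration and is not how RAGAs or HoneyHive compute their scores; the function name `token_f1` is hypothetical, and whitespace tokenization is a simplifying assumption.

```python
from collections import Counter

def token_f1(prediction, reference):
    """Token-level F1 between a predicted answer and a reference answer."""
    pred = prediction.lower().split()
    ref = reference.lower().split()
    if not pred or not ref:
        # Both empty counts as a perfect match; one empty counts as a miss.
        return float(pred == ref)
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)

print(token_f1("the cat sat", "the cat"))
```

Overlap metrics like this reward matching wording rather than meaning, which is one reason the projects above also use model-based judgments.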

AI Agents

Build applications where multiple AI agents interact to complete tasks efficiently. These projects show how agents can collaborate, exchange data, and automate workflows.

AI Agents	Interactive Notebook & Scripts	Blog

Trip Planner Swarm style Agent
Multi Source Agent
AI email assistant with Composio
Assistant Bot with OpenAI Swarm
AI Trends Searcher with CrewAI
SuperAgent Autogen
Build autonomous Customer support agent using Langgraph
AI Agents: Reducing Hallucination
Multi Document Agentic RAG
RASA: Customer Support Bot

Recommender Systems

Personalized AI recommendations! These projects help you build recommendation engines that suggest content based on user preferences.

Recommender Systems	Interactive Notebook & Scripts	Blog

Movie Recommender
Product Recommender
Arxiv paper recommender
Music Recommender

Concepts

Learn the core ideas behind AI applications—including text chunking, retrieval strategies, and optimization techniques—to improve your understanding of vector search and AI pipelines.

Concepts	Interactive Notebook	Blog

A Primer on Text Chunking and its Types
Langchain LlamaIndex Chunking
Create structured dataset using Instructor
Comparing Cohere Rerankers with LanceDB
Product Quantization: Compress High Dimensional Vectors
LLMs, RAG, & the missing storage layer for AI
Fine-Tuning LLM using PEFT & QLoRA
Extracting Complex tables-text from PDFs using LlamaParse
Convert any Image dataset to lance Format
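
Text chunking, one of the concepts covered above, can be sketched as a fixed-size sliding window with overlap. This is a minimal character-based version for illustration only; the linked tutorials discuss several other strategies, and the name `chunk_text` and its default parameters are hypothetical.

```python
def chunk_text(text, chunk_size=20, overlap=5):
    """Split text into fixed-size character chunks that overlap by `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks

print(chunk_text("abcdefghij", chunk_size=4, overlap=2))
```

The overlap keeps text that straddles a chunk boundary available in both neighboring chunks, at the cost of storing some characters twice.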

Projects & Applications

Ready-to-use AI applications built with LanceDB! Use these projects as-is, customize them, or integrate them into your own applications.

Node applications powered by LanceDB

Project Name	Description	Screenshot
Writing assistant	Writing assistant app using LangChain.js with LanceDB. It gives you real-time, relevant suggestions and facts based on your written text to help you with your writing.
Sentence Auto-Complete	Sentence auto-complete app using LangChain.js with LanceDB. It gives you real-time, relevant auto-complete suggestions and facts based on your written text to help you with your writing. You can also upload your data source as a PDF file and switch between GPT models to get faster results.
Article Recommendation	Article Recommender: explore a vast dataset of articles with instant, context-aware suggestions. Using NLP, vector search, and customizable datasets, the app delivers real-time article recommendations. Suited to research, content curation, and staying informed.
AI Powered Job Search	Transform your job search experience with this AI-driven application. Powered by LangChain.js, LanceDB, and advanced semantic search, it provides real-time, highly accurate job listings tailored to your preferences. Featuring customizable datasets and advanced filtering options (e.g., skills, location, job type, and salary range), this app ensures you find the right opportunities quickly and effortlessly. Best suited for job seekers, recruiters, career platforms, custom job boards.
AI Powered Multimodal meme search	An advanced AI-powered meme search engine that allows users to find memes using both text and image queries. By leveraging LanceDB as a high-performance vector database and Roboflow's CLIP model for embedding generation, the platform delivers fast and accurate meme retrieval.
AI Powered Feedback search and analysis	An AI-powered employee feedback analysis platform designed to collect, store, analyze, and retrieve insightful employee feedback. This system leverages LanceDB for high-speed vector-based semantic search, React.js for an interactive UI, Node.js for backend processing, and LangChain.js with an Ambient Agent for intelligent analysis and actionable insights.
Hierarchical Multi Agent	The AI-Powered Law Assistant is a Hierarchical Multi-Agent System leveraging LangGraph, LangChain, and LanceDB for efficient legal query processing. It features a Supervisor Agent that delegates tasks to specialized agents for IPC and NDPS laws, each with sub-agents for case retrieval and legal summarization. Using LanceDB, it stores and retrieves vectorized legal documents, enabling fast, structured, and context-aware responses for legal professionals, researchers, and law students.

Project Name	Description	Screenshot
YOLOExplorer	Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds
Website Chatbot (Deployable Vercel Template)	Create a chatbot from the sitemap of any website/docs of your choice. Built using the vectorDB serverless native JavaScript package.
Advanced Chatbot with Parler TTS	This chatbot app uses LanceDB hybrid search, FTS, and a reranker with the Parler TTS library.
Multi-Modal Search Engine	Create a multi-modal search engine app to search images using either images or text
Evaluate RAG	A working Streamlit RAG app designed to demonstrate end-to-end, production-grade evaluation using 50+ scores and metrics, which include guards, software metrics, traditional metrics and LLM-as-judge metrics. It uses a mixture of specialised deep learning models and LLM-as-judge models to do the evaluations
Multi-Agent Collaboration Chatbot	Multi-agent collaboration chatbot using LangGraph for a share-market use case, built with LanceDB and tools such as Polygon and Tavily
Multimodal Myntra Fashion Search Engine	This app uses OpenAI's CLIP to make a search engine that can understand and deal with both written words and pictures.
Multilingual-RAG	Multilingual RAG with Cohere embeddings and support for 100+ languages
Music Recommender	Music Recommendation system using audio feature extraction and vector similarity search. By utilizing LanceDB, PANNs for audio tagging, and Librosa for audio feature extraction, the system finds and recommends tracks with similar audio characteristics based on a query song.
NoOCR	End-to-end solution for complex PDFs, powered by ColPali and LanceDB.

🌟 New! 🌟 Applied GenAI and VectorDB course on Udacity

Learn about GenAI and vectorDBs using LanceDB in the recently launched Udacity course.

Contributing Examples

If you're working on some cool applications that you'd like to add to this repo, please open a PR!

Alternative AI tools for vectordb-recipes

Similar Open Source Tools

miniLLMFlow

Mini LLM Flow is a 100-line minimalist LLM framework designed for agents, task decomposition, RAG, etc. It aims to be the framework used by LLMs, focusing on high-level programming paradigms while stripping away low-level implementation details. It serves as a learning resource and allows LLMs to design, build, and maintain projects themselves.

GitHub stars: 52

obsidian-textgenerator-plugin

Text Generator is an open-source AI Assistant Tool that leverages Generative Artificial Intelligence to enhance knowledge creation and organization in Obsidian. It allows users to generate ideas, titles, summaries, outlines, and paragraphs based on their knowledge database, offering endless possibilities. The plugin is free and open source, compatible with Obsidian for a powerful Personal Knowledge Management system. It provides flexible prompts, template engine for repetitive tasks, community templates for shared use cases, and highly flexible configuration with services like Google Generative AI, OpenAI, and HuggingFace.

GitHub stars: 1.6k

ServerlessLLM

ServerlessLLM is a fast, affordable, and easy-to-use library designed for multi-LLM serving, optimized for environments with limited GPU resources. It supports loading various leading LLM inference libraries, achieving fast load times, and reducing model switching overhead. The library facilitates easy deployment via Ray Cluster and Kubernetes, integrates with the OpenAI Query API, and is actively maintained by contributors.

GitHub stars: 447

meilisearch

Meilisearch is a lightning-fast search engine that seamlessly integrates into apps, websites, and workflows. It offers features like hybrid search, search-as-you-type, typo tolerance, filtering, sorting, synonym support, geosearch, extensive language support, security management, multi-tenancy, RESTful API, AI-readiness, easy installation, deployment, and maintenance.

GitHub stars: 53.4k

languine

Languine is a CLI tool powered by AI that helps developers streamline the localization process by providing AI-powered translations, automation features, consistent localization, developer-centric design, and time-saving workflows. It automates the identification of translation keys, supports multiple file formats, delivers accurate translations in over 100 languages, aligns translations with the original text's tone and intent, extracts translation keys from codebase, and supports hooks for content formatting with Biome or Prettier. Languine is designed to simplify and enhance the localization experience for developers.

GitHub stars: 1.7k

moon

Moon is a monitoring and alerting platform suitable for multiple domains, supporting various application scenarios such as cloud-native, Internet of Things (IoT), and Artificial Intelligence (AI). It simplifies operational work of cloud-native monitoring, boasts strong IoT and AI support capabilities, and meets diverse monitoring needs across industries. Capable of real-time data monitoring, intelligent alerts, and fault response for various fields.

GitHub stars: 246

ChainForge

ChainForge is a visual programming environment for battle-testing prompts to LLMs. It is geared towards early-stage, quick-and-dirty exploration of prompts, chat responses, and response quality that goes beyond ad-hoc chatting with individual LLMs. With ChainForge, you can:

* Query multiple LLMs at once to test prompt ideas and variations quickly and effectively.
* Compare response quality across prompt permutations, across models, and across model settings to choose the best prompt and model for your use case.
* Setup evaluation metrics (scoring function) and immediately visualize results across prompts, prompt parameters, models, and model settings.
* Hold multiple conversations at once across template parameters and chat models. Template not just prompts, but follow-up chat messages, and inspect and evaluate outputs at each turn of a chat conversation.

ChainForge comes with a number of example evaluation flows to give you a sense of what's possible, including 188 example flows generated from benchmarks in OpenAI evals. This is an open beta of Chainforge. We support model providers OpenAI, HuggingFace, Anthropic, Google PaLM2, Azure OpenAI endpoints, and Dalai-hosted models Alpaca and Llama. You can change the exact model and individual model settings. Visualization nodes support numeric and boolean evaluation metrics. ChainForge is built on ReactFlow and Flask.

GitHub stars: 2.8k

llama-github

Llama-github is a powerful tool that helps retrieve relevant code snippets, issues, and repository information from GitHub based on queries. It empowers AI agents and developers to solve coding tasks efficiently. With features like intelligent GitHub retrieval, repository pool caching, LLM-powered question analysis, and comprehensive context generation, llama-github excels at providing valuable knowledge context for development needs. It supports asynchronous processing, flexible LLM integration, robust authentication options, and logging/error handling for smooth operations and troubleshooting. The vision is to seamlessly integrate with GitHub for AI-driven development solutions, while the roadmap focuses on empowering LLMs to automatically resolve complex coding tasks.

GitHub stars: 285

nextpy

Nextpy is a cutting-edge software development framework optimized for AI-based code generation. It provides guardrails for defining AI system boundaries, structured outputs for prompt engineering, a powerful prompt engine for efficient processing, better AI generations with precise output control, modularity for multiplatform and extensible usage, developer-first approach for transferable knowledge, and containerized & scalable deployment options. It offers 4-10x faster performance compared to Streamlit apps, with a focus on cooperation within the open-source community and integration of key components from various projects.

GitHub stars: 2.1k

tiledesk-chatbot

Tiledesk Chatbot Engine is a Node.js-based framework for creating and managing interactive chatbots. It is designed to work seamlessly with the Tiledesk Design Studio, allowing easy design and customization of chatbot behavior. The engine is scalable, performant, and encourages collaboration and innovation through its open-source nature under the MIT license.

GitHub stars: 53

k8sgateway

K8sGateway is a feature-rich, fast, and flexible Kubernetes-native API gateway built on Envoy proxy and Kubernetes Gateway API. It excels in function-level routing, supports legacy apps, microservices, and serverless. It offers robust discovery capabilities, seamless integration with open-source projects, and supports hybrid applications with various technologies, architectures, protocols, and clouds.

GitHub stars: 4.2k

hopsworks

Hopsworks is a data platform for ML with a Python-centric Feature Store and MLOps capabilities. It provides collaboration for ML teams, offering a secure, governed platform for developing, managing, and sharing ML assets. Hopsworks supports project-based multi-tenancy, team collaboration, development tools for Data Science, and is available on any platform including managed cloud services and on-premise installations. The platform enables end-to-end responsibility from raw data to managed features and models, supports versioning, lineage, and provenance, and facilitates the complete MLOps life cycle.

GitHub stars: 1.1k

mindsdb

MindsDB is a platform for customizing AI from enterprise data. You can create, serve, and fine-tune models in real-time from your database, vector store, and application data. MindsDB "enhances" SQL syntax with AI capabilities to make it accessible for developers worldwide. With MindsDB’s nearly 200 integrations, any developer can create AI customized for their purpose, faster and more securely. Their AI systems will constantly improve themselves — using companies’ own data, in real-time.

GitHub stars: 36.1k

lsp-ai

LSP-AI is an open source language server designed to enhance software engineers' productivity by integrating AI-powered functionality into various text editors. It serves as a backend for completion with large language models and offers features like unified AI capabilities, simplified plugin development, enhanced collaboration, broad compatibility with editors supporting Language Server Protocol, flexible LLM backend support, and commitment to staying updated with the latest advancements in LLM-driven software development. The tool aims to centralize open-source development work, provide a collaborative platform for developers, and offer a future-ready solution for AI-powered assistants in text editors.

GitHub stars: 2.1k

coze-studio

Coze Studio is an all-in-one AI agent development tool that offers the most convenient AI agent development environment, from development to deployment. It provides core technologies for AI agent development, complete app templates, and build frameworks. Coze Studio aims to simplify creating, debugging, and deploying AI agents through visual design and build tools, enabling powerful AI app development and customized business logic. The tool is developed using Golang for the backend, React + TypeScript for the frontend, and follows microservices architecture based on domain-driven design principles.

GitHub stars: 17.0k

For similar tasks

imodels

Python package for concise, transparent, and accurate predictive modeling. All sklearn-compatible and easy to use. For interpretability in NLP, check out our new package: imodelsX.

GitHub stars: 1.4k

raft

RAFT (Reusable Accelerated Functions and Tools) is a C++ header-only template library with an optional shared library that contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

GitHub stars: 933

superduperdb

SuperDuperDB is a Python framework for integrating AI models, APIs, and vector search engines directly with your existing databases, including hosting of your own models, streaming inference and scalable model training/fine-tuning. Build, deploy and manage any AI application without the need for complex pipelines, infrastructure as well as specialized vector databases, and moving our data there, by integrating AI at your data's source:

- Generative AI, LLMs, RAG, vector search
- Standard machine learning use-cases (classification, segmentation, regression, forecasting recommendation etc.)
- Custom AI use-cases involving specialized models
- Even the most complex applications/workflows in which different models work together

SuperDuperDB is not a database. Think `db = superduper(db)`: SuperDuperDB transforms your databases into an intelligent platform that allows you to leverage the full AI and Python ecosystem. A single development and deployment environment for all your AI applications in one place, fully scalable and easy to manage.

GitHub stars: 4.5k

AI-YinMei

AI-YinMei is an AI virtual anchor Vtuber development tool (N card version). It supports fastgpt knowledge base chat dialogue, a complete set of solutions for LLM large language models: [fastgpt] + [one-api] + [Xinference], supports docking bilibili live broadcast barrage reply and entering live broadcast welcome speech, supports Microsoft edge-tts speech synthesis, supports Bert-VITS2 speech synthesis, supports GPT-SoVITS speech synthesis, supports expression control Vtuber Studio, supports painting stable-diffusion-webui output OBS live broadcast room, supports painting picture pornography public-NSFW-y-distinguish, supports search and image search service duckduckgo (requires magic Internet access), supports image search service Baidu image search (no magic Internet access), supports AI reply chat box [html plug-in], supports AI singing Auto-Convert-Music, supports playlist [html plug-in], supports dancing function, supports expression video playback, supports head touching action, supports gift smashing action, supports singing automatic start dancing function, chat and singing automatic cycle swing action, supports multi scene switching, background music switching, day and night automatic switching scene, supports open singing and painting, let AI automatically judge the content.

GitHub stars: 529

RWKV-LM

RWKV is an RNN with Transformer-level LLM performance, which can also be directly trained like a GPT transformer (parallelizable). And it's 100% attention-free. You only need the hidden state at position t to compute the state at position t+1. You can use the "GPT" mode to quickly compute the hidden state for the "RNN" mode. So it's combining the best of RNN and transformer - **great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding** (using the final hidden state).

GitHub stars: 13.0k

LLMs-from-scratch

This repository contains the code for coding, pretraining, and finetuning a GPT-like LLM and is the official code repository for the book Build a Large Language Model (From Scratch). In _Build a Large Language Model (From Scratch)_, you'll discover how LLMs work from the inside out. In this book, I'll guide you step by step through creating your own LLM, explaining each stage with clear text, diagrams, and examples. The method described in this book for training and developing your own small-but-functional model for educational purposes mirrors the approach used in creating large-scale foundational models such as those behind ChatGPT.

GitHub stars: 73.0k

Tutorial

The Bookworm·Puyu large model training camp aims to promote the implementation of large models in more industries and provide developers with a more efficient platform for learning the development and application of large models. Within two weeks, you will learn the entire process of fine-tuning, deploying, and evaluating large models.

GitHub stars: 1.6k

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

GitHub stars: 980

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

GitHub stars: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

GitHub stars: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

GitHub stars: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

GitHub stars: 2.9k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features:

* Self-contained, with no need for a DBMS or cloud service.
* OpenAPI interface, easy to integrate with existing infrastructure (e.g., Cloud IDE).
* Supports consumer-grade GPUs.

GitHub stars: 32.1k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

GitHub stars: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

GitHub stars: 675