orbit

An adaptable, open-source context-aware inference engine designed for privacy, control, and independence from proprietary models.

Stars: 144

Visit

ORBIT (Open Retrieval-Based Inference Toolkit) is a middleware platform that provides a unified API for AI inference. It acts as a central gateway, allowing you to connect various local and remote AI models with your private data sources like SQL databases, vector stores, and local files. ORBIT uses a flexible adapter architecture to connect your data to AI models, creating specialized 'agents' for specific tasks. It supports scenarios like Knowledge Base Q&A and Chat with Your SQL Database, enabling users to interact with AI models seamlessly. The tool offers a RESTful API for programmatic access and includes features like authentication, API key management, system prompts, health monitoring, and file management. ORBIT is designed to streamline AI inference tasks and facilitate interactions between users and AI models.

README:

ORBIT – Unified, self‑hosted AI inference with your data

ORBIT gives you a single, consistent API to run LLMs (local or cloud) against your private data sources with portability, performance, high-availability, and security at the core.

Quick Start • Why ORBIT • Architecture • Examples • Docs • Issues

🚀 Quick Start

1. Deploy with Docker

Refer to the Docker Setup Guide or run from the docker/ directory:

cd docker
chmod +x docker-init.sh orbit-docker.sh
./docker-init.sh --build --profile minimal

2. Deploy Locally

# Download the latest release
curl -L https://github.com/schmitech/orbit/releases/download/v1.5.2/orbit-1.5.2.tar.gz -o orbit-1.5.2.tar.gz
tar -xzf orbit-1.5.2.tar.gz
cd orbit-1.5.2

# Run the quick setup script (downloads a small model)
cp .env.example .env
./install/setup.sh --profile minimal --download-gguf gemma3-270m

# Start the ORBIT server
source venv/bin/activate
./bin/orbit.sh start # Logs are at /logs/orbit.log

Your ORBIT instance is now running at http://localhost:3000.

3. Chat with ORBIT using the CLI tool

Use the orbit-chat CLI tool to interact with your instance.

pip install schmitech-orbit-client

# Start chatting!
orbit-chat

Your browser does not support the video tag.
Using 'orbit-chat' tool. Add -h for usage.

4. Chat with ORBIT using the Web Widget

cd clients/chat-app
npm install
npm run dev

Your browser does not support the video tag.
Chatting with ORBIT using the React client.

🏗️ Architecture Overview

Click to learn more about the Core Components

Core Components

ORBIT Server (/server/): FastAPI-based inference middleware

Inference Layer: Supports multiple LLM providers (OpenAI, Anthropic, Cohere, Ollama, etc.) via unified interface
RAG System: Retrieval-Augmented Generation with SQL and Vector DB adapters (file-based / multimodal retrieval underway, it will be available in release 2.0.0)
Authentication: PBKDF2-SHA256 with bearer tokens, MongoDB-backed sessions
Fault Tolerance: Circuit breaker pattern with exponential backoff for provider failures
Content Moderation: Multi-layered safety with LLM Guard and configurable moderators

Configuration (/config/): YAML-based modular configuration

Main config in config.yaml with environment variable support
Separate configs for adapters, datasources, embeddings, inference, moderators, and rerankers
Dynamic loading with validation and resolver system

Client Libraries:

React-based chat application with Zustand state management
Embeddable chat widget with theming support
Node.js and Python API client libraries

Client Libraries

New language clients are available under clients/ with examples and integration tests:

Elixir: clients/elixir — make deps, make example, ORBIT_INTEGRATION=1 make test
Swift: clients/swift — make example, ORBIT_INTEGRATION=1 make test
Kotlin: clients/kotlin — make example, ORBIT_INTEGRATION=1 make test
Scala: clients/scala — make example, ORBIT_INTEGRATION=1 make test
PHP: clients/php — make example, ORBIT_INTEGRATION=1 make test
Haskell: clients/haskell — make example
Clojure: clients/clojure — make example, ORBIT_INTEGRATION=1 make test
Perl: clients/perl — make example, ORBIT_INTEGRATION=1 make test
R: clients/r — make example, ORBIT_INTEGRATION=1 make test

Set ORBIT_INTEGRATION=1 to enable integration tests. Optionally set ORBIT_URL (defaults to http://localhost:3000).

Dependencies

MongoDB (Required): Authentication, RAG storage, conversation history
Redis (Optional): Caching layer
Vector DBs (Optional): Chroma, Qdrant, Pinecone, Milvus for semantic search
SQL DBs (Optional): PostgreSQL, MySQL, SQLite for structured data retrieval

✨ What Can You Build with ORBIT?

ORBIT uses a flexible adapter architecture to connect your data to AI models. An API key is tied to a specific adapter, effectively creating a specialized "agent" for a certain task. Here are a few examples:

Scenario 1: Knowledge Base Q&A

Provide instant, semantically-aware answers from a knowledge base. Perfect for customer support or internal documentation.

Sample Questions:

"What are the summer camp programs for kids?"
"How do I register for the contemporary dance class?"

NOTE: You need an instance of MongoDB to enable adapters

Setup the sample SQLite Database with Q/A records about a municipality.

Here's the Sample Q/A datasets for this example. The knowledge base corresponds to a municipal services assistant.

#Login as admin first. Default password is admin123. You should change after installing ORBIT.
./bin/orbit.sh login

# Set up SQLite database with Q&A data
./examples/sample-db-setup.sh sqlite

Your browser does not support the video tag.
Setting up the sample SQLite Q/A dataset

Testing with the node client:

# Test using node client
cd clients/node-api
npm install
npm run build
npm run test-query-from-pairs ../../examples/city-qa-pairs.json "http://localhost:3000" "your-api-key" 5 134444

Your browser does not support the video tag.
Testing the Q/A Adapter using the node API client

Scenario 2: Chat with Your SQL Database

Ask questions about your data in natural language and get answers without writing SQL.

Sample Questions:

"Show me all orders from John Smith"
"What are the top 10 customers by order value?"

# Set up PostgreSQL with a sample schema and data
cd examples/postgres

# Update with  your connection parameters
cp env.example .env

# Test connection
python test_connection.py

# Create the DB
python setup_schema.py

# Install faker to generate synthetic data
pip install faker

# Add sample data
python customer-order.py --action insert --clean --customers 100 --orders 1000

# Create an API key for the SQL intent adapter.
# Make sure you are logged in as admin if auth is enabled in `/config/config.yaml`.
python bin/orbit.py key create \
  --adapter intent-sql-postgres \
  --name "Order Management Assistant" \
  --prompt-file examples/postgres/prompts/customer-assistant-enhanced-prompt.txt

#make sure the sample SQL intent adapter is enabled in `/config/adapters.yaml`
- name: "intent-sql-postgres"
    enabled: false
    type: "retriever"
    datasource: "postgres"
    adapter: "intent"
    implementation: "retrievers.implementations.intent.IntentPostgreSQLRetriever"
    inference_provider: "ollama"

# Start or restart the server
./bin/orbit.sh start --delete-logs

# Start chatting with your new key
orbit-chat --url http://localhost:3000 --api-key YOUR_API_KEY

Your browser does not support the video tag.
Testing the SQL Intent Adapter using the ORBIT CLI tool

⭐ Like this project? Give it a star!

If you find ORBIT useful, please consider giving it a star on GitHub. It helps more people discover the project.

📖 Documentation

For more detailed information, please refer to the official documentation.

Full API Reference

ORBIT provides a RESTful API for programmatic access. The full API reference with examples is available at /docs (Swagger UI) when the server is running.

Core Chat & Inference

POST /v1/chat - MCP protocol chat endpoint (JSON-RPC 2.0 format)
GET /health - Overall system health

Authentication

POST /auth/login - User authentication
POST /auth/logout - End session
GET /auth/me - Get current user info
POST /auth/register - Register new user
POST /auth/change-password - Change user password

API Key Management (Admin)

GET /admin/api-keys - List API keys
POST /admin/api-keys - Create new API key
DELETE /admin/api-keys/{api_key} - Delete API key
POST /admin/api-keys/deactivate - Deactivate API key
GET /admin/api-keys/{api_key}/status - Get API key status

System Prompts (Admin)

GET /admin/prompts - List system prompts
POST /admin/prompts - Create system prompt
PUT /admin/prompts/{prompt_id} - Update system prompt
DELETE /admin/prompts/{prompt_id} - Delete system prompt

Health & Monitoring

GET /health - System health overview
GET /health/adapters - Adapter health status
GET /health/embedding-services - Embedding service status
GET /health/mongodb-services - MongoDB connection status
GET /health/ready - Readiness check
GET /health/system - System resource usage

File Management (Experimental)

POST /upload - Single file upload
POST /upload/batch - Batch file upload
GET /info/{file_id} - File information
DELETE /{file_id} - Delete file
GET /status - File system status

🤝 Community & Support

Questions? Open an issue
Updates: Check the changelog
Commercial Support: Contact schmitech.ai
Maintained by: Remsy Schmilinsky

📄 License

Apache 2.0 - See LICENSE for details.

For Tasks:

Click tags to check more tools for each tasks

connect data to ai models provide instant answers chat with sql database setup q&a datasets test ai adapters

For Jobs:

ai engineer data scientist machine learning engineer software developer research scientist

Alternative AI tools for orbit

Similar Open Source Tools

orbit

github

: 144

backend.ai

Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs. It allocates and isolates the underlying computing resources for multi-tenant computation sessions on-demand or in batches with customizable job schedulers with its own orchestrator. All its functions are exposed as REST/GraphQL/WebSocket APIs.

github

: 579

LEANN

LEANN is an innovative vector database that democratizes personal AI, transforming your laptop into a powerful RAG system that can index and search through millions of documents using 97% less storage than traditional solutions without accuracy loss. It achieves this through graph-based selective recomputation and high-degree preserving pruning, computing embeddings on-demand instead of storing them all. LEANN allows semantic search of file system, emails, browser history, chat history, codebase, or external knowledge bases on your laptop with zero cloud costs and complete privacy. It is a drop-in semantic search MCP service fully compatible with Claude Code, enabling intelligent retrieval without changing your workflow.

github

: 2.6k

pentagi

PentAGI is an innovative tool for automated security testing that leverages cutting-edge artificial intelligence technologies. It is designed for information security professionals, researchers, and enthusiasts who need a powerful and flexible solution for conducting penetration tests. The tool provides secure and isolated operations in a sandboxed Docker environment, fully autonomous AI-powered agent for penetration testing steps, a suite of 20+ professional security tools, smart memory system for storing research results, web intelligence for gathering information, integration with external search systems, team delegation system, comprehensive monitoring and reporting, modern interface, API integration, persistent storage, scalable architecture, self-hosted solution, flexible authentication, and quick deployment through Docker Compose.

github

: 170

openai-edge-tts

This project provides a local, OpenAI-compatible text-to-speech (TTS) API using `edge-tts`. It emulates the OpenAI TTS endpoint (`/v1/audio/speech`), enabling users to generate speech from text with various voice options and playback speeds, just like the OpenAI API. `edge-tts` uses Microsoft Edge's online text-to-speech service, making it completely free. The project supports multiple audio formats, adjustable playback speed, and voice selection options, providing a flexible and customizable TTS solution for users.

github

: 412

well-architected-iac-analyzer

Well-Architected Infrastructure as Code (IaC) Analyzer is a project demonstrating how generative AI can evaluate infrastructure code for alignment with best practices. It features a modern web application allowing users to upload IaC documents, complete IaC projects, or architecture diagrams for assessment. The tool provides insights into infrastructure code alignment with AWS best practices, offers suggestions for improving cloud architecture designs, and can generate IaC templates from architecture diagrams. Users can analyze CloudFormation, Terraform, or AWS CDK templates, architecture diagrams in PNG or JPEG format, and complete IaC projects with supporting documents. Real-time analysis against Well-Architected best practices, integration with AWS Well-Architected Tool, and export of analysis results and recommendations are included.

github

: 196

comp

Comp AI is an open-source compliance automation platform designed to assist companies in achieving compliance with standards like SOC 2, ISO 27001, and GDPR. It transforms compliance into an engineering problem solved through code, automating evidence collection, policy management, and control implementation while maintaining data and infrastructure control.

github

: 1.1k

mcpd

mcpd is a tool developed by Mozilla AI to declaratively manage Model Context Protocol (MCP) servers, enabling consistent interface for defining and running tools across different environments. It bridges the gap between local development and enterprise deployment by providing secure secrets management, declarative configuration, and seamless environment promotion. mcpd simplifies the developer experience by offering zero-config tool setup, language-agnostic tooling, version-controlled configuration files, enterprise-ready secrets management, and smooth transition from local to production environments.

github

: 65

dexto

Dexto is a lightweight runtime for creating and running AI agents that turn natural language into real-world actions. It serves as the missing intelligence layer for building AI applications, standalone chatbots, or as the reasoning engine inside larger products. Dexto features a powerful CLI and Web UI for running AI agents, supports multiple interfaces, allows hot-swapping of LLMs from various providers, connects to remote tool servers via the Model Context Protocol, is config-driven with version-controlled YAML, offers production-ready core features, extensibility for custom services, and enables multi-agent collaboration via MCP and A2A.

github

: 225

aira-dojo

aira-dojo is a scalable and customizable framework for AI research agents, designed to accelerate hill-climbing on research capabilities toward a fully automated AI research scientist. The framework provides a general abstraction for tasks and agents, implements the MLE-bench task, and includes state-of-the-art agents. It features an isolated code execution environment that integrates smoothly with job schedulers like Slurm, enabling large-scale experiments and rapid iteration across a portfolio of tasks and solvers.

github

: 94

forge

Forge is a powerful open-source tool for building modern web applications. It provides a simple and intuitive interface for developers to quickly scaffold and deploy projects. With Forge, you can easily create custom components, manage dependencies, and streamline your development workflow. Whether you are a beginner or an experienced developer, Forge offers a flexible and efficient solution for your web development needs.

github

: 4.5k

hound

Hound is a security audit automation pipeline for AI-assisted code review that mirrors how expert auditors think, learn, and collaborate. It features graph-driven analysis, sessionized audits, provider-agnostic models, belief system and hypotheses, precise code grounding, and adaptive planning. The system employs a senior/junior auditor pattern where the Scout actively navigates the codebase and annotates knowledge graphs while the Strategist handles high-level planning and vulnerability analysis. Hound is optimized for small-to-medium sized projects like smart contract applications and is language-agnostic.

github

: 325

TalkWithGemini

Talk With Gemini is a web application that allows users to deploy their private Gemini application for free with one click. It supports Gemini Pro and Gemini Pro Vision models. The application features talk mode for direct communication with Gemini, visual recognition for understanding picture content, full Markdown support, automatic compression of chat records, privacy and security with local data storage, well-designed UI with responsive design, fast loading speed, and multi-language support. The tool is designed to be user-friendly and versatile for various deployment options and language preferences.

github

: 616

Flowise

Flowise is a tool that allows users to build customized LLM flows with a drag-and-drop UI. It is open-source and self-hostable, and it supports various deployments, including AWS, Azure, Digital Ocean, GCP, Railway, Render, HuggingFace Spaces, Elestio, Sealos, and RepoCloud. Flowise has three different modules in a single mono repository: server, ui, and components. The server module is a Node backend that serves API logics, the ui module is a React frontend, and the components module contains third-party node integrations. Flowise supports different environment variables to configure your instance, and you can specify these variables in the .env file inside the packages/server folder.

github

: 44.0k

AirCasting

AirCasting is a platform for gathering, visualizing, and sharing environmental data. It aims to provide a central hub for environmental data, making it easier for people to access and use this information to make informed decisions about their environment.

github

: 63

ai-commits-intellij-plugin

AI Commits is a plugin for IntelliJ-based IDEs and Android Studio that generates commit messages using git diff and OpenAI. It offers features such as generating commit messages from diff using OpenAI API, computing diff only from selected files and lines in the commit dialog, creating custom prompts for commit message generation, using predefined variables and hints to customize prompts, choosing any of the models available in OpenAI API, setting OpenAI network proxy, and setting custom OpenAI compatible API endpoint.

github

: 664

For similar tasks

orbit

github

: 144

For similar jobs

sweep

Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

github

: 7.1k

teams-ai

The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

github

: 502

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

github

: 159

classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

github

: 668

chatbot-ui

Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

github

: 27.7k

BricksLLM

BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

github

: 953

uAgents

uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

github

: 1.3k

griptape

Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.

github

: 2.2k