OrionChat

Orion is a web-based chat interface that simplifies interactions with multiple AI model providers.

Stars: 86

Orion provides a unified platform for chatting with and exploring large language models (LLMs) from providers such as Ollama, OpenAI (GPT models), Cohere (Command-r models), Google (Gemini models), Anthropic (Claude models), Groq Inc., Cerebras, and SambaNova. Users can easily navigate and assess different AI models through an intuitive, user-friendly interface. Orion offers features like browser-based access, code execution with Google Gemini, text-to-speech (TTS), speech-to-text (STT), seamless integration with multiple AI models, customizable system prompts, language translation tasks, document uploads for analysis, and more. API keys are stored locally, and requests are sent directly to official providers' APIs without external proxies.

README:

Orion

Orion is a web-based chat interface that simplifies interactions with multiple AI model providers. It provides a unified platform for chatting and exploring multiple large language models (LLMs), including:

💚 Ollama – An open-source tool for running LLMs locally 🏡
🐳 DeepSeek (DeepSeek-R1 rivaling OpenAI's o1 model)
🤖 OpenAI (GPT and o-series models)
🎯 Cohere (Command-r models)
🌌 Google (Gemini models)
🟡 Anthropic (Claude models)
🚀 Groq Inc. – Optimized for fast inference (open source models) ⚡️
⚡️ Cerebras – Also optimized for fast inference 🚀
🟣 SambaNova - Fast inference and support for Meta-Llama-3.1-405B-Instruct 🦙🦙🦙.🦙
♾️ OpenRouter - A unified interface for LLMs
☁️ Together AI - The AI Acceleration Cloud
📡 Deep Infra

With Orion, you can easily navigate and assess the strengths and limitations of different AI models through an intuitive, user-friendly interface.

Key Features

🖥️ Browser - No need to download anything ⚡️
✅ Code Execution (Execute code with Google Gemini)
🗣️ TTS - Realistic text-to-speech using ElevenLabs
🎙️ STT - Speech-to-Text using Groq/Whisper
🔄 Seamless integration with multiple AI models
✨ Clean and responsive web interface 🌐
🌈 Syntax highlighting for code snippets 🖌️
⬇️ One-click download for AI-generated code outputs
🎛️ Customizable system prompts to tailor responses 🛠️
🌐 Special command for quick and easy language translation tasks
📁 Upload a variety of documents (text, PDF, images, video) to Google Gemini for analysis and processing
🧠 Awesome Prompts - 150+ prompts, most of them from Awesome ChatGPT Prompts, selectable with one click

API Key Management

Your API keys are stored locally in the browser's localStorage, and requests are sent directly to the official providers' APIs (OpenAI, Anthropic, Google, Groq, Cerebras) without routing through any external proxy.
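As a rough illustration of the key handling described above, the sketch below saves a key through a localStorage-like object and builds a request aimed straight at a provider endpoint. The function names and the `orion_api_key_` prefix are hypothetical and not taken from Orion's source; the chat completions URL and Bearer header follow OpenAI's public API.

```javascript
// Hypothetical helpers; Orion's actual storage keys and function names may differ.
const KEY_PREFIX = 'orion_api_key_'; // assumed naming scheme

// `storage` is anything with the Web Storage interface, e.g. window.localStorage.
function saveApiKey(storage, provider, apiKey) {
  storage.setItem(KEY_PREFIX + provider, apiKey);
}

function loadApiKey(storage, provider) {
  return storage.getItem(KEY_PREFIX + provider);
}

// Build a request that goes directly to the provider, with no proxy in between.
function buildOpenAIRequest(storage, model, messages) {
  const apiKey = loadApiKey(storage, 'openai');
  if (!apiKey) throw new Error('No OpenAI API key saved');
  return {
    url: 'https://api.openai.com/v1/chat/completions',
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, messages }),
    },
  };
}
```

In a browser this could be used as `saveApiKey(window.localStorage, 'openai', key)` followed by `fetch(request.url, request.options)`.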

API Keys

Some companies offer free API access. Check their terms and conditions before you get started.

Google Gemini: Get your Gemini API key - 🟢 free
Cerebras: Get your Cerebras API key - 🟢 free
Cohere: Get your Cohere API key - 🟢 free
Groq: Get your Groq API key - 🟢 free
SambaNova: Get your SambaNova API key - 🟢 free
Together AI: Get your Together AI API key - 🟢 free for deepseek-ai/DeepSeek-R1-Distill-Llama-70B-free
OpenAI: OpenAI key
Anthropic: Anthropic API key
DeepSeek: DeepSeek API Key

Special Commands

Use special commands to perform an action quickly and easily.

Grounding with Google Search

g: what's the main news today

The Grounding with Google Search feature in the Gemini API can be used to improve the accuracy and recency of the model's responses, essentially an internal RAG within the Gemini API itself.

You can now use this feature when using any Gemini 2.0 model, whether flash or pro, simply by typing g: + your prompt.
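A minimal sketch of how the g: prefix might be turned into a Gemini request body. The helper name is hypothetical and this is not Orion's actual code; the `google_search` tool entry reflects how the Gemini API documents grounding for 2.0 models, so check the current API reference before relying on it.

```javascript
// Hypothetical helper: maps a chat prompt to a Gemini generateContent body.
function buildGeminiBody(prompt) {
  const prefix = /^g:\s*/i;
  const grounded = prefix.test(prompt);
  const text = grounded ? prompt.replace(prefix, '') : prompt;

  const body = { contents: [{ role: 'user', parts: [{ text }] }] };
  if (grounded) {
    // Ask the API to ground the answer with Google Search results.
    body.tools = [{ google_search: {} }];
  }
  return body;
}

console.log(JSON.stringify(buildGeminiBody("g: what's the main news today"), null, 2));
```

Prompts without the prefix produce a plain body with no `tools` field, so the same function can serve both cases.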

Deep Thinking

dt: Explain quantum entanglement

Claude 3.7 Sonnet is a hybrid model capable of "standard thinking" and "extended thinking".

When you want to use this model's deep thinking, simply type dt: + your prompt.

This gives you easy control over switching between the two modes.
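The switch between the two modes could look roughly like the sketch below. The helper name and default token numbers are assumptions made for illustration; the `thinking` field with `type: 'enabled'` and `budget_tokens` follows Anthropic's Messages API for extended thinking, where the budget must stay below `max_tokens`.

```javascript
// Hypothetical helper: builds an Anthropic Messages API body from a chat prompt.
function buildClaudeBody(prompt, { budgetTokens = 2048, maxTokens = 4096 } = {}) {
  const match = prompt.match(/^dt:\s*([\s\S]*)$/i);
  const body = {
    model: 'claude-3-7-sonnet-20250219',
    max_tokens: maxTokens,
    messages: [{ role: 'user', content: match ? match[1] : prompt }],
  };
  if (match) {
    if (budgetTokens >= maxTokens) {
      throw new Error('budget_tokens must be smaller than max_tokens');
    }
    // Extended thinking is only requested when the dt: prefix is present.
    body.thinking = { type: 'enabled', budget_tokens: budgetTokens };
  }
  return body;
}

console.log(buildClaudeBody('dt: Explain quantum entanglement'));
```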

Translation

t:spanish Hello everyone!

Translate text with ease using a special command.

To translate "Hello everyone!" into Spanish, use: translate:spanish Hello everyone! or its short form t:spanish Hello everyone!.
The AI detects the source language automatically, so you only need to specify the target language.
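One way such a command could be parsed is sketched below. The function name and the wording of the generated prompt are hypothetical; only the `translate:` and `t:` prefixes come from the text above.

```javascript
// Hypothetical parser for "translate:<language> <text>" and "t:<language> <text>".
function parseTranslateCommand(input) {
  const match = input.match(/^(?:translate|t):\s*(\S+)\s+([\s\S]+)$/i);
  if (!match) return null; // not a translation command
  const [, targetLanguage, text] = match;
  return {
    targetLanguage,
    text,
    // Assumed prompt wording; the source language is left for the model to detect.
    prompt: `Translate the following text into ${targetLanguage}:\n\n${text}`,
  };
}

console.log(parseTranslateCommand('t:spanish Hello everyone!'));
```

Because the pattern is anchored at the start of the input, other prefixes that end in "t:" (such as dt:) are not mistaken for a translation request.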

YouTube Summary

Ask the AI to summarize a YouTube video or answer any question based on the video captions. See the examples below.

Summarize this video https://www.youtube.com/watch?v=r7pEdhnS3tI
What is this video about? https://www.youtube.com/watch?v=qQviI1d_hFA

It is recommended to use a model with a larger context window, such as Google Gemini, when using this functionality.
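Detecting the video in a prompt is the first step of this feature. The sketch below shows one possible way to pull the video ID out of a message; the function name is hypothetical and the pattern only covers the common `watch?v=` and `youtu.be/` URL shapes.

```javascript
// Hypothetical helper: returns the 11-character YouTube video ID found in a prompt, or null.
function extractYouTubeId(prompt) {
  const pattern = /https?:\/\/(?:www\.)?(?:youtube\.com\/watch\?v=|youtu\.be\/)([\w-]{11})/;
  const match = prompt.match(pattern);
  return match ? match[1] : null;
}

console.log(extractYouTubeId('Summarize this video https://www.youtube.com/watch?v=r7pEdhnS3tI'));
// prints "r7pEdhnS3tI"
```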

Keyboard Shortcuts

Press Ctrl + Q to close the current chat and start a new one.
Press Shift + T on your keyboard to switch between the dark and light theme modes.
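For readers curious how shortcuts like these are typically wired up, here is a small sketch. The action names and the function are hypothetical and this is not Orion's implementation; it only maps a keydown event to one of the two actions listed above.

```javascript
// Hypothetical mapping from a keydown event to an action name.
function shortcutAction(event) {
  const key = String(event.key).toLowerCase();
  if (event.ctrlKey && key === 'q') return 'new-chat';      // Ctrl + Q
  if (event.shiftKey && key === 't') return 'toggle-theme'; // Shift + T
  return null;
}

// Browser wiring, skipped when no DOM is available (e.g. under Node).
if (typeof document !== 'undefined') {
  document.addEventListener('keydown', (event) => {
    const action = shortcutAction(event);
    if (action) console.log(`shortcut triggered: ${action}`);
  });
}
```

A real handler would probably also ignore Shift + T while the user is typing in the message box, since that combination simply produces a capital T.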

Retrieval-Augmented Generation (RAG)

Search: Perform quick searches and retrieve relevant information with ease from Google.

Example: search: What is the latest news? or s: What is the latest news?

Execution of JavaScript code in the user's browser.

Use this feature with caution, and always review the code before allowing it to run.

Example: javascript: How Many R's in 'Strawberry'? or js: How Many R's in 'Strawberry'?
This will allow the AI to generate JavaScript code that will run in your browser.
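For the example prompt above, the generated code might resemble the following. This is only an illustration of the kind of snippet a model could return, not actual output from Orion.

```javascript
// Count how often a letter occurs in a word, ignoring case.
function countLetter(word, letter) {
  const target = letter.toLowerCase();
  return [...word.toLowerCase()].filter((ch) => ch === target).length;
}

console.log(countLetter('Strawberry', 'r')); // prints 3
```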

Executing Python code in a remote environment

When using Google Gemini, you can ask it to execute code directly in Google's own remote environment. For now, only Python code is executed. The code and its output will be returned.
Command example: py: Run a python code to write "tseb eht sI noirO" in the inverse order
Another example: py: The United States has a population of 342,524,968, of which 480,000 die annually due to tobacco use. What percentage is that?
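A rough sketch of how a py: prompt could be mapped to a Gemini request with code execution enabled. The helper names are hypothetical; the `code_execution` tool entry follows the Gemini API documentation. The second function simply reproduces the arithmetic from the population example so the returned answer can be sanity-checked locally.

```javascript
// Hypothetical helper: builds a Gemini body that enables the code execution tool.
function buildCodeExecutionBody(prompt) {
  const match = prompt.match(/^py:\s*([\s\S]+)$/i);
  if (!match) return null; // not a py: command
  return {
    contents: [{ role: 'user', parts: [{ text: match[1] }] }],
    tools: [{ code_execution: {} }],
  };
}

// Local check for the second example: share of the population, in percent.
function percentage(part, total) {
  return (part / total) * 100;
}

console.log(percentage(480000, 342524968).toFixed(2)); // prints "0.14"
```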

How to Run

To run Orion, first download this repository.

You can download it by running the following command in your terminal:

git clone https://github.com/EliasPereirah/OrionChat.git

Or download the zip file from the repository by clicking on the green button "Code" and then Download ZIP.

After downloading, open the folder and double-click index.html if you don't have a web server. If you do, open the directory where the project was saved through your server. It's that simple.

Google CSE API Key

*This is useful when you are not using Google's Gemini models, because the g: command only works with Gemini.

Sometimes you might want AI to search the web and respond based on that information.

To allow the AI to search using Google, you will need a Google CSE (Custom Search Engine) API key and CX ID.

First, create a custom search engine in the Google CSE Panel
Copy your CX ID
Go to Google Developers and click on Get a Key to get your API key
Enter the CX ID and API key in Orion: go to Options -> More Options and fill in the configuration.
To use the functionality in chat, put "s:" at the beginning of your prompt, e.g. "s: what's today's news"
*Note: Google Search only returns snippets of search results, which may not give the AI enough context to respond. For best results, it is recommended to set up the RAG endpoint described below.
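The steps above can be sketched as follows. The function names are hypothetical and not taken from Orion; the endpoint, the `key`, `cx`, and `q` parameters, and the `items[].snippet` field follow Google's Custom Search JSON API.

```javascript
// Hypothetical helper: extracts the query from "search: ..." or "s: ...".
function parseSearchCommand(input) {
  const match = input.match(/^(?:search|s):\s*([\s\S]+)$/i);
  return match ? match[1].trim() : null;
}

// Build the Custom Search JSON API URL from the API key, CX ID, and query.
function buildCseUrl(apiKey, cx, query) {
  const url = new URL('https://www.googleapis.com/customsearch/v1');
  url.searchParams.set('key', apiKey);
  url.searchParams.set('cx', cx);
  url.searchParams.set('q', query);
  return url.toString();
}

// Collect the result snippets that would be handed to the model as context.
function snippetsFromResponse(json) {
  return (json.items || []).map((item) => item.snippet);
}

const query = parseSearchCommand("s: what's today's news");
console.log(buildCseUrl('YOUR_API_KEY', 'YOUR_CX_ID', query));
```

The snippets returned this way are short, which is the limitation the note above refers to.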

RAG Endpoint

For better search results, you can configure a "RAG endpoint".

Just follow the instructions at https://github.com/EliasPereirah/SearchAugmentedLLM/

After that, you can enter the search endpoint you just created in the Orion Chat interface. Click on "Options" -> "Advanced", enter the "RAG endpoint", and click "Activate".

Now, whenever you want the AI to do a search to answer your question, write "s:" + your question at the beginning of your prompt, e.g. "s: what's the news today?". The AI will search the web and respond based on the information found.

CORS

To get around CORS errors when working with SambaNova, API requests pass through cors-proxy.php, which forwards them to the platform. This does not hide your IP address; it only forwards the request.

This is necessary because direct requests from JavaScript in the browser to this platform are not possible.

YouTube Caption

You can simply share the URL of a YouTube video in the chat to be able to ask something about the video content, or ask the AI to make a summary.

To do this, you will need to use this other project: https://github.com/EliasPereirah/YoutubeSubtitlesDownloader/

You can run it on localhost or on a remote server.

After activating the mentioned project, go to "Options" -> "YouTube Captions" and point to the URL where you deployed the project, for example: http://localhost/YoutubeSubtitlesDownloader/

For Tasks:

summarize video, translate text, execute python code, search information, generate code

For Jobs:

data scientist, ai engineer, software developer, machine learning engineer, research scientist

Alternative AI tools for OrionChat

Similar Open Source Tools

classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

GitHub stars: 620

tribe

Tribe AI is a low code tool designed to rapidly build and coordinate multi-agent teams. It leverages the langgraph framework to customize and coordinate teams of agents, allowing tasks to be split among agents with different strengths for faster and better problem-solving. The tool supports persistent conversations, observability, tool calling, human-in-the-loop functionality, easy deployment with Docker, and multi-tenancy for managing multiple users and teams.

GitHub stars: 919

litlytics

LitLytics is an affordable analytics platform leveraging LLMs for automated data analysis. It simplifies analytics for teams without data scientists, generates custom pipelines, and allows customization. Cost-efficient with low data processing costs. Scalable and flexible, works with CSV, PDF, and plain text data formats.

GitHub stars: 83

devika

Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika utilizes large language models, planning and reasoning algorithms, and web browsing abilities to intelligently develop software. Devika aims to revolutionize the way we build software by providing an AI pair programmer who can take on complex coding tasks with minimal human guidance. Whether you need to create a new feature, fix a bug, or develop an entire project from scratch, Devika is here to assist you.

GitHub stars: 18.1k

dream-team

Build your dream team with Autogen is a repository that leverages Microsoft Autogen 0.4, Azure OpenAI, and Streamlit to create an end-to-end multi-agent application. It provides an advanced multi-agent framework based on Magentic One, with features such as a friendly UI, single-line deployment, secure code execution, managed identities, and observability & debugging tools. Users can deploy Azure resources and the app with simple commands, work locally with virtual environments, install dependencies, update configurations, and run the application. The repository also offers resources for learning more about building applications with Autogen.

GitHub stars: 168

Sentient

Sentient is a personal, private, and interactive AI companion developed by Existence. The project aims to build a completely private AI companion that is deeply personalized and context-aware of the user. It utilizes automation and privacy to create a true companion for humans. The tool is designed to remember information about the user and use it to respond to queries and perform various actions. Sentient features a local and private environment, MBTI personality test, integrations with LinkedIn, Reddit, and more, self-managed graph memory, web search capabilities, multi-chat functionality, and auto-updates for the app. The project is built using technologies like ElectronJS, Next.js, TailwindCSS, FastAPI, Neo4j, and various APIs.

GitHub stars: 52

blinkid-react-native

BlinkID SDK wrapper for React Native provides best-in-class ID scanning software for cross-platform apps built with React Native. It offers complete guidance on installing and linking BlinkID library with iOS and Android apps. The SDK requires a valid license key for scanning, with offline data extraction. It supports React Native v0.71.2 and includes installation and linking instructions for iOS and Android. The repository also contains a script to create a sample React Native project and dependencies. Video tutorials demonstrate using documentVerificationOverlay and CombinedRecognizer for scanning various document types.

GitHub stars: 176

Open_Data_QnA

Open Data QnA is a Python library that allows users to interact with their PostgreSQL or BigQuery databases in a conversational manner, without needing to write SQL queries. The library leverages Large Language Models (LLMs) to bridge the gap between human language and database queries, enabling users to ask questions in natural language and receive informative responses. It offers features such as conversational querying with multiturn support, table grouping, multi schema/dataset support, SQL generation, query refinement, natural language responses, visualizations, and extensibility. The library is built on a modular design and supports various components like Database Connectors, Vector Stores, and Agents for SQL generation, validation, debugging, descriptions, embeddings, responses, and visualizations.

GitHub stars: 127

actions

Sema4.ai Action Server is a tool that allows users to build semantic actions in Python to connect AI agents with real-world applications. It enables users to create custom actions, skills, loaders, and plugins that securely connect any AI Assistant platform to data and applications. The tool automatically creates and exposes an API based on function declaration, type hints, and docstrings by adding '@action' to Python scripts. It provides an end-to-end stack supporting various connections between AI and user's apps and data, offering ease of use, security, and scalability.

GitHub stars: 109

MiniSearch

MiniSearch is a minimalist search engine with integrated browser-based AI. It is privacy-focused, easy to use, cross-platform, integrated, time-saving, efficient, optimized, and open-source. MiniSearch can be used for a variety of tasks, including searching the web, finding files on your computer, and getting answers to questions. It is a great tool for anyone who wants a fast, private, and easy-to-use search engine.

GitHub stars: 384

genai-for-marketing

This repository provides a deployment guide for utilizing Google Cloud's Generative AI tools in marketing scenarios. It includes step-by-step instructions, examples of crafting marketing materials, and supplementary Jupyter notebooks. The demos cover marketing insights, audience analysis, trendspotting, content search, content generation, and workspace integration. Users can access and visualize marketing data, analyze trends, improve search experience, and generate compelling content. The repository structure includes backend APIs, frontend code, sample notebooks, templates, and installation scripts.

GitHub stars: 220

langdrive

LangDrive is an open-source AI library that simplifies training, deploying, and querying open-source large language models (LLMs) using private data. It supports data ingestion, fine-tuning, and deployment via a command-line interface, YAML file, or API, with a quick, easy setup. Users can build AI applications such as question/answering systems, chatbots, AI agents, and content generators. The library provides features like data connectors for ingestion, fine-tuning of LLMs, deployment to Hugging Face hub, inference querying, data utilities for CRUD operations, and APIs for model access. LangDrive is designed to streamline the process of working with LLMs and making AI development more accessible.

GitHub stars: 59

conversational-agent-langchain

This repository contains a Rest-Backend for a Conversational Agent that allows embedding documents, semantic search, QA based on documents, and document processing with Large Language Models. It uses Aleph Alpha and OpenAI Large Language Models to generate responses to user queries, includes a vector database, and provides a REST API built with FastAPI. The project also features semantic search, secret management for API keys, installation instructions, and development guidelines for both backend and frontend components.

GitHub stars: 148

OpenCopilot

OpenCopilot allows you to have your own product's AI copilot. It integrates with your underlying APIs and can execute API calls whenever needed. It uses LLMs to determine if the user's request requires calling an API endpoint. Then, it decides which endpoint to call and passes the appropriate payload based on the given API definition.

GitHub stars: 5.0k

STMP

SillyTavern MultiPlayer (STMP) is an LLM chat interface that enables multiple users to chat with an AI. It features a sidebar chat for users, tools for the Host to manage the AI's behavior and moderate users. Users can change display names, chat in different windows, and the Host can control AI settings. STMP supports Text Completions, Chat Completions, and HordeAI. Users can add/edit APIs, manage past chats, view user lists, and control delays. Hosts have access to various controls, including AI configuration, adding presets, and managing characters. Planned features include smarter retry logic, host controls enhancements, and quality of life improvements like user list fading and highlighting exact usernames in AI responses.

GitHub stars: 69

For similar tasks

codebox-api

CodeBox is a cloud infrastructure tool designed for running Python code in an isolated environment. It also offers simple file input/output capabilities and will soon support vector database operations. Users can install CodeBox using pip and utilize it by setting up an API key. The tool allows users to execute Python code snippets and interact with the isolated environment. CodeBox is currently in early development stages and requires manual handling for certain operations like refunds and cancellations. The tool is open for contributions through issue reporting and pull requests. It is licensed under MIT.

GitHub stars: 270

GraphLLM

GraphLLM is a graph-based framework designed to process data using LLMs. It offers a set of tools including a web scraper, PDF parser, YouTube subtitles downloader, Python sandbox, and TTS engine. The framework provides a GUI for building and debugging graphs with advanced features like loops, conditionals, parallel execution, streaming of results, hierarchical graphs, external tool integration, and dynamic scheduling. GraphLLM is a low-level framework that gives users full control over the raw prompt and output of models, with a steeper learning curve. It is tested with llama70b and qwen 32b, under heavy development with breaking changes expected.

GitHub stars: 195

lyraios

LYRAIOS (LLM-based Your Reliable AI Operating System) is an advanced AI assistant platform built with FastAPI and Streamlit, designed to serve as an operating system for AI applications. It offers core features such as AI process management, memory system, and I/O system. The platform includes built-in tools like Calculator, Web Search, Financial Analysis, File Management, and Research Tools. It also provides specialized assistant teams for Python and research tasks. LYRAIOS is built on a technical architecture comprising FastAPI backend, Streamlit frontend, Vector Database, PostgreSQL storage, and Docker support. It offers features like knowledge management, process control, and security & access control. The roadmap includes enhancements in core platform, AI process management, memory system, tools & integrations, security & access control, open protocol architecture, multi-agent collaboration, and cross-platform support.

GitHub stars: 202

langmanus

LangManus is a community-driven AI automation framework that combines language models with specialized tools for tasks like web search, crawling, and Python code execution. It implements a hierarchical multi-agent system with agents like Coordinator, Planner, Supervisor, Researcher, Coder, Browser, and Reporter. The framework supports LLM integration, search and retrieval tools, Python integration, workflow management, and visualization. LangManus aims to give back to the open-source community and welcomes contributions in various forms.

GitHub stars: 4.9k

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

GitHub stars: 159

onnxruntime-genai

ONNX Runtime Generative AI is a library that provides the generative AI loop for ONNX models, including inference with ONNX Runtime, logits processing, search and sampling, and KV cache management. Users can call a high level `generate()` method, or run each iteration of the model in a loop. It supports greedy/beam search and TopP, TopK sampling to generate token sequences, has built in logits processing like repetition penalties, and allows for easy custom scoring.

GitHub stars: 442

mistral.rs

Mistral.rs is a fast LLM inference platform written in Rust. It supports inference on a variety of devices, quantization, and easy application integration through an OpenAI-API-compatible HTTP server and Python bindings.

GitHub stars: 5.4k

For similar jobs

sweep

Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

GitHub stars: 7.1k

teams-ai

The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

GitHub stars: 502

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

GitHub stars: 159

classifai

GitHub stars: 620

chatbot-ui

Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

GitHub stars: 27.7k

BricksLLM

BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM:
* Set LLM usage limits for users on different pricing tiers
* Track LLM usage on a per user and per organization basis
* Block or redact requests containing PIIs
* Improve LLM reliability with failovers, retries and caching
* Distribute API keys with rate limits and cost limits for internal development/production use cases
* Distribute API keys with rate limits and cost limits for students

GitHub stars: 953

uAgents

uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

GitHub stars: 1.3k

griptape

Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.

GitHub stars: 2.2k