Awesome-AITools
Collection of AI-related utilities. Issues and pull requests are welcome.
Stars: 4328
README:
This repo collects AI-related utilities.
All Categories
- ChatGPT and other closed-source LLMs
- AI Search engine
- Open Source LLMs
- GPT/LLMs Applications
- AI Image Creation
- LLM Prompts
- LLM Leaderboard
- LLM training platform
- Applications that integrate multiple LLMs
- AI Agent
- Writing
- Programming Development
- Translation
- AI Conversation or AI Voice Conversation
- Speech Recognition
- Text To Speech
- Voice Processing
- AI generated music or sound effects
- Speech translation
- Video Creation
- Video Content Summary
- OCR(Optical Character Recognition)
ChatGPT and other closed-source LLMs
Name | Description | Links | Fees |
---|---|---|---|
ChatGPT | OpenAI's ChatGPT | URL | Free, with paid upgrades |
Claude | Anthropic's AI assistant | URL | Free |
Gemini | Google's conversational AI chat service and latest LLM family, including Gemini Nano, Gemini Pro and Gemini Ultra. Gemini Pro is open for API and SDK use. Gemini is built from the ground up for multimodality, reasoning seamlessly across text, images, video, audio, and code. | URL, dev: URL | Free |
Microsoft Copilot | Microsoft's AI assistant. | URL | Free |
Le Chat | Mistral.ai's conversational, AI chat service | URL | Free |
AI Search engine
Name | Description | Links | Fees |
---|---|---|---|
Perplexity.ai | AI-driven conversational search engine. | URL | Free |
You.com | A search engine in conversation mode | URL | Free |
Open Source LLMs
Name | Description | Links | Fees |
---|---|---|---|
Llama 3 | Llama 3 is a large language model developed by Meta AI. It is the successor to Meta's Llama 2 language model. Online test address: huggingface.co/Meta-Llama-3-70B-Instruct | GitHub | Free |
Mixtral | Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT3.5 on most standard benchmarks. Paper: https://arxiv.org/pdf/2401.04088.pdf News: https://mistral.ai/news/mixtral-of-experts/ | mistral-inference, mistral-finetune | Free |
grok-1 | A large language model open sourced by xAI | Github | Free |
Phi-3 | Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks. | Github | Free |
GPT/LLMs Applications
Name | Description | Links | Fees |
---|---|---|---|
Poe | AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free | URL | Free, with paid upgrades |
monica | AI assistant that provides help with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available | URL, Chrome extension | Free, with paid upgrades |
ollama | Get up and running with Llama 2, Mistral, Gemma, and other large language models. | Github | Free |
openai/openai-python | The official Python library for the OpenAI API. It is generated from the OpenAPI specification with Stainless | Github | Free, requires an OpenAI API key |
sashabaranov/go-openai | Unofficial Go client library for the OpenAI API. Supports ChatGPT, GPT-3, GPT-4, DALL·E 2 | Github | Free |
langchain | LangChain is a framework for developing applications powered by language models. | Github | Free |
Helicone AI | Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications. | Github | Free |
ChatGPT-Next-Web | Get a well-designed cross-platform ChatGPT web UI in one click, with GPT3, GPT4 & Gemini Pro support. | Github | Free |
screenshot-to-code | This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website! | GitHub | Free, need access to GPT-4 Vision |
Chatbox | Desktop application that uses the ChatGPT API (OpenAI API) and stores all chat messages and prompts locally, reducing the risk of data loss. A bit more stable to use than the web version | GitHub | Free, requires an OpenAI API key |
gpt-crawler | Crawl a site to generate knowledge files to create your own custom GPT from a URL | Github | Free |
ChatGPT-Shortcut | Open source. ChatGPT shortcut commands that aim to double productivity, organized by domain and function. Prompts can be filtered by tag, searched by keyword, and copied in one click. | GitHub | Free |
ChatGPT Sidebar | ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website. | URL | Free |
WebChatGPT | Open source. Gives ChatGPT access to the web | GitHub | Free |
AIPRM for ChatGPT | Browser plug-in that provides a curated set of ChatGPT prompt templates, lets you create your own, and adjusts AI tone and writing style | URL | Free |
GPTCache | ⚡ GPTCache is a library for creating a semantic cache to store responses from LLM queries. It can be used to speed up and lower the cost of chat applications that rely on an LLM service. It plays a role similar to Redis in AIGC scenarios. | Github | Free |
MindMac | Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages. | URL | Free, with paid upgrades |
MemFree | Open source hybrid AI search engine. Instantly get accurate answers from the Internet, bookmarks, notes, and docs. Supports one-click deployment. | Github | Free, supports one-click self-hosting |
AI Image Creation
Name | Description | Links | Fees |
---|---|---|---|
Midjourney | Enter text or pictures to create pictures | URL | Free account with limited usage minutes; paid upgrade available |
Photoshop AI | Adobe Photoshop generative-fill | URL | Paid |
Stable diffusion webui | Open source project. Stable Diffusion web UI is a graphical interface for Stable Diffusion that creates pictures from text or pictures. It also integrates many other useful extension scripts. | GitHub | Free |
civitai | civitai.com is a platform for sharing AI image creation model resources. With a large number of models, it has become the main model exchange in the Stable Diffusion open source community | URL | Free |
clipdrop | clipdrop by stability.ai. Has many AI image processing tools, such as stable diffusion XL, uncrop, reimage XL, stable doodle. | URL | Free/Paid |
firefly | Adobe's AI image processing web site | URL | Free/Paid |
ideogram.ai | Enter text to create pictures. A product developed by a company founded by many ex-Googlers | URL | Free/Paid |
Skybox AI | Generate 360-degree panoramic images using text prompts | URL | Free/Paid |
DragGAN | Interactive Point-based Manipulation on the Generative Image Manifold | GitHub | Free |
visual-chatgpt | Create images with ChatGPT | GitHub | Free |
Microsoft Bing Image Creator | Image Creator is a tool for creating pictures using DALL-E technology. In our tests, generated portraits looked unattractive | URL | Free |
remove.bg | Remove Image Background | URL | Free/Paid |
ControlNet | ControlNet is a neural network structure to control diffusion models by adding extra conditions. | Github | Free |
StreamDiffusion | A Pipeline-Level Solution for Real-Time Interactive Generation | Github | Free |
LLM Prompts
Name | Description | Links | Fees |
---|---|---|---|
f/awesome-chatgpt-prompts | This repo includes ChatGPT prompt curation to use ChatGPT better. | Github | Free |
LLM Leaderboard
Name | Description | Links | Fees |
---|---|---|---|
LMSYS Chatbot Arena Leaderboard | LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. Collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale. | URL | Free |
Artificial Analysis | Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost. | URL | Free |
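The Chatbot Arena entry above mentions ranking models from pairwise comparisons with the Bradley-Terry model and showing ratings on an Elo scale. The following is a minimal sketch of that idea, not the leaderboard's actual code: the vote counts and model names are hypothetical, the fitting loop is the classic MM (minorization-maximization) update, and the 1000-point base rating is an arbitrary assumption.

```python
import math


def fit_bradley_terry(wins, iterations=200):
    """Estimate Bradley-Terry strengths from pairwise win counts.

    wins[(a, b)] is the number of comparisons in which a was preferred over b.
    Returns {model: strength}, normalized so the geometric mean is 1.
    """
    models = sorted({m for pair in wins for m in pair})
    strength = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for i in models:
            total_wins = sum(w for (a, _), w in wins.items() if a == i)
            denom = 0.0
            for j in models:
                if j == i:
                    continue
                games = wins.get((i, j), 0) + wins.get((j, i), 0)
                if games:
                    denom += games / (strength[i] + strength[j])
            # Tiny floor keeps a model with zero wins at a positive strength.
            updated[i] = max(total_wins, 1e-9) / denom if denom else strength[i]
        log_mean = sum(math.log(s) for s in updated.values()) / len(updated)
        strength = {m: s / math.exp(log_mean) for m, s in updated.items()}
    return strength


def to_elo_scale(strength, base=1000.0):
    """Map strengths to an Elo-like scale (400 points per factor of 10)."""
    return {m: base + 400.0 * math.log10(s) for m, s in strength.items()}


# Hypothetical vote counts for three made-up models.
votes = {
    ("model_a", "model_b"): 7, ("model_b", "model_a"): 3,
    ("model_b", "model_c"): 6, ("model_c", "model_b"): 4,
    ("model_a", "model_c"): 8, ("model_c", "model_a"): 2,
}
ratings = to_elo_scale(fit_bradley_terry(votes))
for name, rating in sorted(ratings.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {rating:.0f}")
```

Under this model, the probability that one model beats another is its strength divided by the sum of the two strengths, so a 400-point rating gap corresponds to 10:1 odds.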
LLM training platform
Name | Description | Links | Fees |
---|---|---|---|
lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. | Github | Free |
Applications that integrate multiple LLMs
Name | Description | Links | Fees |
---|---|---|---|
chathub | Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc. | GitHub | Free/Paid |
ChatALL | Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, and more, discover the best answers | GitHub | Free |
AI Agent
Name | Description | Links | Fees |
---|---|---|---|
Auto-GPT | An experimental open-source attempt to make GPT-4 fully autonomous. | GitHub | Free |
OthersideAI/self-operating-computer | A framework to enable multimodal models to operate a computer. | Github | Free, GPT-4V required |
AppAgent | Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. | Github | Free |
microsoft/autogen | AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks. | Github | Free |
Writing
Name | Description | Links | Fees |
---|---|---|---|
Notion AI | AI-assisted note-taking software | URL | Limited free AI trial; AI features $10/month |
DeepL Write | English and German writing tool that fixes writing errors and rewrites sentences promptly. | URL | Free version with a word limit; paid upgrade available |
grammarly | Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor. | URL | Free/Paid |
Programming Development
Name | Description | Links | Fees |
---|---|---|---|
GitHub Copilot | A code writing assistant developed by GitHub and OpenAI | URL | Paid |
Cursor | A collaborative code editor using GPT | URL | Paid/Free Trial |
ai-code-translator | Open source project. Translates code from one language to another using ChatGPT. | GitHub | Free, requires OpenAI API key |
Amazon CodeWhisperer | A code writing assistant developed by Amazon | URL | Free for Individual Use |
gpt-engineer | GPT Engineer is made to be easy to adapt, extend, and make your agent learn how you want your code to look. It generates an entire codebase based on a prompt. | GitHub | Free |
Codeium | Powerful in-IDE AI coding assistant | URL | Free/Paid |
scalene | Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals | Github | Free |
Fitten Code | Fitten Code is an AI programming assistant driven by Fitten LLM models, which can automatically generate code, improve development efficiency, help you debug, and save your time. It can also chat with you and solve your programming problems. It is free and supports over 80 languages: Python, C++, JavaScript, TypeScript, Java, etc. Fitten Code supports Visual Studio Code and JetBrains series IDEs, including IntelliJ IDEA, PyCharm, WebStorm, etc. | URL | Free |
flappy | Production-Ready LLM Agent SDK for Every Developer | GitHub | Free |
Plandex | Open source, terminal-based AI programming engine for complex tasks | GitHub | Free |
Mistral/Codestral | Empowering developers and democratising coding with Mistral AI. Models: https://huggingface.co/mistralai/Codestral-22B-v0.1 | URL | Free |
Translation
Name | Description | Links | Fees |
---|---|---|---|
immersive-translate | Open source project. Immersive bilingual web translation extension | GitHub | Free |
DeepL | Accurate and instant translation tool, currently supporting 31 languages | URL | Free/Paid |
openai-translator | Open source project. Browser plugin and cross-platform desktop application that translates selected text, based on the ChatGPT API | GitHub | Free, requires OpenAI API key |
AI Conversation or AI Voice Conversation
Name | Description | Links | Fees |
---|---|---|---|
pi.ai | A conversational AI that is very good at chatting, so it can keep a conversation going all day. It supports both text and speech. Voice input requires Apple's input system. Good for practicing English conversation and listening. | URL | Free |
Voice Control for ChatGPT | This Chrome extension allows you to have voice conversations with ChatGPT. | URL | Free, requires a ChatGPT account |
SpeechGPT | SpeechGPT is a web application that enables you to converse with ChatGPT. | GitHub | Free, requires OpenAI API key |
Speech Recognition
Name | Description | Links | Fees |
---|---|---|---|
whisper | OpenAI's open source speech recognition model, made robust through large-scale weak supervision | GitHub | Free |
buzz | An open source desktop software based on OpenAI's Whisper to recognize speech and generate subtitles | GitHub | Free |
WhisperDesktop | Open source desktop application for Windows based on OpenAI's Whisper. It uses the GPU for processing, which is faster than the CPU when the GPU is capable. | GitHub | Free |
whisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) | whisperX | Free |
whisper-web | ML-powered speech recognition directly in your browser. Built with Transformers.js. Demo | GitHub | Free |
Text To Speech
Name | Description | Links | Fees |
---|---|---|---|
Azure Text to speech | The best and most realistic voice tools currently available | URL | Paid / 500,000 characters per month free |
coqui-ai/tts | A deep learning toolkit for Text-to-Speech, battle-tested in research and production. Online Demo: https://huggingface.co/spaces/coqui/xtts | Github | Free |
elevenlabs | Intelligent AI Text to Speech | URL | Free/Paid |
netease-youdao/EmotiVoice | A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, and with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others. | Github | Free |
tetos | A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS | Github | Free |
ChatTTS | ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistants. It supports both English and Chinese. The model is trained on 100,000+ hours of Chinese and English data. Website: https://chattts.com/ | Github | Free |
Voice Processing
Name | Description | Links | Fees |
---|---|---|---|
so-vits-svc | SoftVC VITS Singing Voice Conversion. | GitHub | Free |
vocalremover | Extract vocal and music | URL | Free |
lala.ai | Extract vocal, accompaniment and various instruments from any audio and video | URL | Free/Paid |
AI generated music or sound effects
Name | Description | Links | Fees |
---|---|---|---|
suno.ai | The AI music creation tool Suno can generate custom songs based on text prompts in mere seconds. You can also create your own AI songs with the Copilot extension | URL | |
udio | Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks. | URL | |
elevenlabs/sound-effects | Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community. | URL | Free |
suno-ai/bark | Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. | Github | Free |
audiocraft | Open source library for audio/music generation by Meta, which mainly includes two models: MusicGen, a text-to-music model, and AudioGen, a text-to-sound model. MusicGen Online Demo | GitHub | Free |
Stable Audio | AI music and sound effect generation application by stability.ai | URL | Free/Paid |
OptimizerAI | Sound effect generation. Official Introduction | URL | Free/Paid |
Speech translation
Name | Description | Links | Fees |
---|---|---|---|
Seamless | Seamless is a family of AI models that enable more natural and authentic communication across languages. Online Demo | Github | Free |
Video Creation
Name | Description | Links | Fees |
---|---|---|---|
KLING AI | AI video creation tool by Kuaishou. | URL | Free/Paid |
Dream Machine | By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images. Official introductory video | URL | Free/Paid |
Sora | Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions. Access is not fully open; some visual artists, designers and filmmakers have been given access | URL | - |
capcut | Speech generated from subtitles, speech recognition, and convenient, powerful video editing | URL | Free/Paid |
Runway | Gen-2: Text/Image to video. Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks | URL | Paid/Free trial |
Pika | Text/Image to video | URL | Paid/Free trial |
Fliki | A website that converts text into audio and video | URL | Free/Paid |
d-id | Generate a dubbed digital-human video from text | URL | Paid/Free trial |
HeyGen | Generate a dubbed digital-human video from text | URL | Paid/Free trial |
AnimateDiff | AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training. | Github | Free |
vivago.ai/video | Text to Video; Image to Video; 4K enhance | URL | Free |
Video Content Summary
Name | Description | Links | Fees |
---|---|---|---|
ChatGPT for YouTube | Chrome plugin that quickly summarizes YouTube video content. Requires logging in to a ChatGPT account or providing an API key | URL | Free |
Chat Youtube | Give it a YouTube link and it returns a summary; you can then ask it questions about the content of the video | URL | Free |
OCR(Optical Character Recognition)
Name | Description | Links | Fees |
---|---|---|---|
Umi-OCR | Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services. | Github | Free |
Awesome-AITools Discord Link: https://discord.gg/7hAvJQME
Alternative AI tools for Awesome-AITools
Similar Open Source Tools
GenAI-Showcase
The Generative AI Use Cases Repository showcases a wide range of applications in generative AI, including Retrieval-Augmented Generation (RAG), AI Agents, and industry-specific use cases. It provides practical notebooks and guidance on utilizing frameworks such as LlamaIndex and LangChain, and demonstrates how to integrate models from leading AI research companies like Anthropic and OpenAI.
unstract
Unstract is a no-code platform that enables users to launch APIs and ETL pipelines to structure unstructured documents. With Unstract, users can go beyond co-pilots by enabling machine-to-machine automation. Unstract's Prompt Studio provides a simple, no-code approach to creating prompts for LLMs, vector databases, embedding models, and text extractors. Users can then configure Prompt Studio projects as API deployments or ETL pipelines to automate critical business processes that involve complex documents. Unstract supports a wide range of LLM providers, vector databases, embeddings, text extractors, ETL sources, and ETL destinations, providing users with the flexibility to choose the best tools for their needs.
openinference
OpenInference is a set of conventions and plugins that complement OpenTelemetry to enable tracing of AI applications. It provides a way to capture and analyze the performance and behavior of AI models, including their interactions with other components of the application. OpenInference is designed to be language-agnostic and can be used with any OpenTelemetry-compatible backend. It includes a set of instrumentations for popular machine learning SDKs and frameworks, making it easy to add tracing to your AI applications.
pr-agent
PR-Agent is a tool that helps to efficiently review and handle pull requests by providing AI feedback and suggestions. It supports various commands such as generating PR descriptions, providing code suggestions, answering questions about the PR, and updating the CHANGELOG.md file. PR-Agent can be used via CLI, GitHub Action, GitHub App, or Docker, and supports multiple git providers and models. It emphasizes real-life practical usage, with each tool making a single GPT-4 call for quick and affordable responses. The PR Compression strategy enables effective handling of both short and long PRs, while the JSON prompting strategy allows for modular and customizable tools. PR-Agent Pro, the hosted version by CodiumAI, provides additional benefits such as full management, improved privacy, priority support, and extra features.
openkore
OpenKore is a custom client and intelligent automated assistant for Ragnarok Online. It is a free, open source, and cross-platform program (Linux, Windows, and MacOS are supported). To run OpenKore, you need to download and extract it or clone the repository using Git. Configure OpenKore according to the documentation and run openkore.pl to start. The tool provides a FAQ section for troubleshooting, guidelines for reporting issues, and information about botting status on official servers. OpenKore is developed by a global team, and contributions are welcome through pull requests. Various community resources are available for support and communication. Users are advised to comply with the GNU General Public License when using and distributing the software.
llava-docker
This Docker image for LLaVA (Large Language and Vision Assistant) provides a convenient way to run LLaVA locally or on RunPod. LLaVA is a powerful AI tool that combines natural language processing and computer vision capabilities. With this Docker image, you can easily access LLaVA's functionalities for various tasks, including image captioning, visual question answering, text summarization, and more. The image comes pre-installed with LLaVA v1.2.0, Torch 2.1.2, xformers 0.0.23.post1, and other necessary dependencies. You can customize the model used by setting the MODEL environment variable. The image also includes a Jupyter Lab environment for interactive development and exploration. Overall, this Docker image offers a comprehensive and user-friendly platform for leveraging LLaVA's capabilities.
recommenders
Recommenders is a project under the Linux Foundation of AI and Data that assists researchers, developers, and enthusiasts in prototyping, experimenting with, and bringing to production a range of classic and state-of-the-art recommendation systems. The repository contains examples and best practices for building recommendation systems, provided as Jupyter notebooks. It covers tasks such as preparing data, building models using various recommendation algorithms, evaluating algorithms, tuning hyperparameters, and operationalizing models in a production environment on Azure. The project provides utilities to support common tasks like loading datasets, evaluating model outputs, and splitting training/test data. It includes implementations of state-of-the-art algorithms for self-study and customization in applications.
dl_model_infer
This project is a C++ AI inference library that supports inference of TensorRT models. It provides accelerated deployment examples for popular deep learning CV models and supports dynamic-batch image processing, inference, decoding, and NMS. The project has been updated with various models and provides tutorials for model exports. It also includes a producer-consumer inference model for specific tasks. The project directory includes implementations for model inference applications, backend inference classes, post-processing, pre-processing, and object detection and tracking. Speed tests have been conducted on various models, and ONNX downloads are available for different models.
chat-your-doc
Chat Your Doc is an experimental project exploring various applications based on LLM technology. It goes beyond being just a chatbot project, focusing on researching LLM applications using tools like LangChain and LlamaIndex. The project delves into UX, computer vision, and offers a range of examples in the 'Lab Apps' section. It includes links to different apps, descriptions, launch commands, and demos, aiming to showcase the versatility and potential of LLM applications.
langchain_dart
LangChain.dart is a Dart port of the popular LangChain Python framework created by Harrison Chase. LangChain provides a set of ready-to-use components for working with language models and a standard interface for chaining them together to formulate more advanced use cases (e.g. chatbots, Q&A with RAG, agents, summarization, extraction, etc.). The components can be grouped into a few core modules:
- **Model I/O:** LangChain offers a unified API for interacting with various LLM providers (e.g. OpenAI, Google, Mistral, Ollama, etc.), allowing developers to switch between them with ease. Additionally, it provides tools for managing model inputs (prompt templates and example selectors) and parsing the resulting model outputs (output parsers).
- **Retrieval:** assists in loading user data (via document loaders), transforming it (with text splitters), extracting its meaning (using embedding models), storing (in vector stores) and retrieving it (through retrievers) so that it can be used to ground the model's responses (i.e. Retrieval-Augmented Generation or RAG).
- **Agents:** "bots" that leverage LLMs to make informed decisions about which available tools (such as web search, calculators, database lookup, etc.) to use to accomplish the designated task.
The different components can be composed together using the LangChain Expression Language (LCEL).
LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.
txtai
Txtai is an all-in-one embeddings database for semantic search, LLM orchestration, and language model workflows. It combines vector indexes, graph networks, and relational databases to enable vector search with SQL, topic modeling, retrieval augmented generation, and more. Txtai can stand alone or serve as a knowledge source for large language models (LLMs). Key features include vector search with SQL, object storage, topic modeling, graph analysis, multimodal indexing, embedding creation for various data types, pipelines powered by language models, workflows to connect pipelines, and support for Python, JavaScript, Java, Rust, and Go. Txtai is open-source under the Apache 2.0 license.
llm-awq
AWQ (Activation-aware Weight Quantization) is a tool designed for efficient and accurate low-bit weight quantization (INT3/4) for Large Language Models (LLMs). It supports instruction-tuned models and multi-modal LMs, providing features such as AWQ search for accurate quantization, pre-computed AWQ model zoo for various LLMs, memory-efficient 4-bit linear in PyTorch, and efficient CUDA kernel implementation for fast inference. The tool enables users to run large models on resource-constrained edge platforms, delivering more efficient responses with LLM/VLM chatbots through 4-bit inference.
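To make "low-bit weight quantization" concrete, here is a minimal sketch of plain round-to-nearest quantization of one weight group to 4 bits and back. It is not AWQ itself: the activation-aware scaling search that gives AWQ its name is omitted, and the function names and sample weights are hypothetical.

```python
def quantize_group(weights, bits=4):
    """Asymmetric round-to-nearest quantization of one group of weights.

    Returns (codes, scale, zero) where each code is an integer in
    [0, 2**bits - 1] and weight ~= code * scale + zero.
    """
    levels = (1 << bits) - 1
    lo, hi = min(weights), max(weights)
    # A constant group has no range; any non-zero scale works.
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo


def dequantize_group(codes, scale, zero):
    """Reconstruct approximate weights from integer codes."""
    return [c * scale + zero for c in codes]


# Hypothetical weights for one small group.
group = [-0.8, -0.1, 0.0, 0.35, 0.7]
codes, scale, zero = quantize_group(group, bits=4)
restored = dequantize_group(codes, scale, zero)
max_error = max(abs(a - b) for a, b in zip(group, restored))
print(codes, f"max error = {max_error:.4f}")
```

Each weight is stored as a 4-bit code plus a shared scale and offset per group, so the reconstruction error is bounded by half a quantization step.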
refact
This repository contains Refact WebUI for fine-tuning and self-hosting of code models, which can be used inside Refact plugins for code completion and chat. Users can fine-tune open-source code models, self-host them, download and upload LoRAs, use models for code completion and chat inside Refact plugins, shard models, host multiple small models on one GPU, and connect GPT models for chat using OpenAI and Anthropic keys. The repository provides a Docker container for running the self-hosted server and supports various models for completion, chat, and fine-tuning. Refact is free for individuals and small teams under the BSD-3-Clause license, with custom installation options available for GPU support. The community and support include contributing guidelines, GitHub issues for bugs, a community forum, Discord for chatting, and Twitter for product news and updates.
For similar tasks
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.
onnxruntime-genai
ONNX Runtime Generative AI is a library that provides the generative AI loop for ONNX models, including inference with ONNX Runtime, logits processing, search and sampling, and KV cache management. Users can call a high level `generate()` method, or run each iteration of the model in a loop. It supports greedy/beam search and TopP, TopK sampling to generate token sequences, has built in logits processing like repetition penalties, and allows for easy custom scoring.
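The search and sampling strategies named above (greedy, TopK, TopP) can be illustrated in a few lines of plain Python. This is a sketch of the general techniques only, not the onnxruntime-genai API; the function names and example logits are hypothetical.

```python
import math
import random


def softmax(logits):
    """Convert raw logits to probabilities (shifted by the max for stability)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def filter_top_k_top_p(logits, top_k=0, top_p=1.0):
    """Apply top-k, then top-p (nucleus) filtering; return {token_id: prob}."""
    probs = softmax(logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k > 0:
        ranked = ranked[:top_k]  # keep only the k most likely tokens
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:  # smallest prefix whose mass reaches top_p
            break
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}  # renormalize the survivors


def sample_token(logits, top_k=0, top_p=1.0, rng=random):
    """Draw one token id from the filtered distribution."""
    dist = filter_top_k_top_p(logits, top_k, top_p)
    ids = list(dist)
    return rng.choices(ids, weights=[dist[i] for i in ids], k=1)[0]


def greedy_token(logits):
    """Greedy search step: always pick the highest-scoring token."""
    return max(range(len(logits)), key=lambda i: logits[i])


# Hypothetical logits over a 4-token vocabulary.
logits = [2.0, 1.0, 0.1, -1.0]
print(greedy_token(logits), sample_token(logits, top_k=3, top_p=0.9))
```

A generation loop would call one of these once per step on the model's latest logits and append the chosen token to the sequence.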
jupyter-ai
Jupyter AI connects generative AI with Jupyter notebooks. It provides a user-friendly and powerful way to explore generative AI models in notebooks and improve your productivity in JupyterLab and the Jupyter Notebook. Specifically, Jupyter AI offers:
- An `%%ai` magic that turns the Jupyter notebook into a reproducible generative AI playground. This works anywhere the IPython kernel runs (JupyterLab, Jupyter Notebook, Google Colab, Kaggle, VSCode, etc.).
- A native chat UI in JupyterLab that enables you to work with generative AI as a conversational assistant.
- Support for a wide range of generative model providers, including AI21, Anthropic, AWS, Cohere, Gemini, Hugging Face, NVIDIA, and OpenAI.
- Local model support through GPT4All, enabling use of generative AI models on consumer grade machines with ease and privacy.
khoj
Khoj is an open-source, personal AI assistant that extends your capabilities by creating always-available AI agents. You can share your notes and documents to extend your digital brain, and your AI agents have access to the internet, allowing you to incorporate real-time information. Khoj is accessible on Desktop, Emacs, Obsidian, Web, and WhatsApp, and you can share PDF, Markdown, org-mode, and Notion files as well as GitHub repositories. You'll get fast, accurate semantic search on top of your docs, and your agents can create deeply personal images and understand your speech. Khoj is self-hostable and always will be.
langchain_dart
LangChain.dart is a Dart port of the popular LangChain Python framework created by Harrison Chase. LangChain provides a set of ready-to-use components for working with language models and a standard interface for chaining them together to formulate more advanced use cases (e.g. chatbots, Q&A with RAG, agents, summarization, extraction, etc.). The components can be grouped into a few core modules:
* **Model I/O:** LangChain offers a unified API for interacting with various LLM providers (e.g. OpenAI, Google, Mistral, Ollama, etc.), allowing developers to switch between them with ease. Additionally, it provides tools for managing model inputs (prompt templates and example selectors) and parsing the resulting model outputs (output parsers).
* **Retrieval:** assists in loading user data (via document loaders), transforming it (with text splitters), extracting its meaning (using embedding models), storing (in vector stores) and retrieving it (through retrievers) so that it can be used to ground the model's responses (i.e. Retrieval-Augmented Generation or RAG).
* **Agents:** "bots" that leverage LLMs to make informed decisions about which available tools (such as web search, calculators, database lookup, etc.) to use to accomplish the designated task.

The different components can be composed together using the LangChain Expression Language (LCEL).
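To make the idea of composing a prompt template, a model, and an output parser more concrete, here is a minimal Python sketch of pipe-style chaining. It does not use LangChain or LangChain.dart: the `Step` class, its `invoke` method, and the stand-in model are hypothetical and exist only to show the composition pattern.

```python
class Step:
    """Hypothetical building block: wraps a function and composes with `|`."""

    def __init__(self, fn):
        self.fn = fn

    def invoke(self, value):
        """Run this step on a single input value."""
        return self.fn(value)

    def __or__(self, other):
        """`a | b` returns a new step that feeds a's output into b."""
        return Step(lambda value: other.invoke(self.invoke(value)))


# Prompt template: fills named variables into a fixed string.
prompt = Step(lambda variables: "Summarize in one line: {text}".format(**variables))

# Stand-in for a language model call (no network access, assumption for the demo).
fake_model = Step(lambda text: f"  [model output for: {text}]  ")

# Output parser: trims surrounding whitespace from the raw model output.
parser = Step(lambda raw: raw.strip())

chain = prompt | fake_model | parser
result = chain.invoke({"text": "LangChain.dart is a Dart port."})
print(result)
```

Because each step exposes the same `invoke` interface, swapping the stand-in model for a different provider would not require changes to the prompt or the parser, which is the design point behind a unified component API.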
danswer
Danswer is an open-source Gen-AI Chat and Unified Search tool that connects to your company's docs, apps, and people. It provides a Chat interface and plugs into any LLM of your choice. Danswer can be deployed anywhere and for any scale - on a laptop, on-premise, or to cloud. Since you own the deployment, your user data and chats are fully in your own control. Danswer is MIT licensed and designed to be modular and easily extensible. The system also comes fully ready for production usage with user authentication, role management (admin/basic users), chat persistence, and a UI for configuring Personas (AI Assistants) and their Prompts. Danswer also serves as a Unified Search across all common workplace tools such as Slack, Google Drive, Confluence, etc. By combining LLMs and team specific knowledge, Danswer becomes a subject matter expert for the team. Imagine ChatGPT if it had access to your team's unique knowledge! It enables questions such as "A customer wants feature X, is this already supported?" or "Where's the pull request for feature Y?"
infinity
Infinity is an AI-native database designed for LLM applications, providing incredibly fast full-text and vector search capabilities. It supports a wide range of data types, including vectors, full-text, and structured data, and offers a fused search feature that combines multiple embeddings and full text. Infinity is easy to use, with an intuitive Python API and a single-binary architecture that simplifies deployment. It achieves high performance, with 0.1 milliseconds query latency on million-scale vector datasets and up to 15K QPS.
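As an illustration of what fusing several result lists can look like, the sketch below merges a vector-search ranking and a full-text ranking with reciprocal rank fusion (RRF). This is an assumption chosen for the example: the description above does not say which fusion method Infinity uses, and the function name, the constant `k=60`, and the document IDs are hypothetical rather than part of Infinity's Python API.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in
    (rank starts at 1); documents are returned by descending total score,
    with ties broken by document ID for determinism.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from two independent searches over the same collection.
vector_hits = ["doc3", "doc1", "doc7"]
fulltext_hits = ["doc1", "doc9", "doc3"]

fused = reciprocal_rank_fusion([vector_hits, fulltext_hits])
print(fused)
```

Documents that appear in both lists accumulate score from each, so they tend to rise above documents found by only one search.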
For similar jobs
weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.
kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.
PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.
tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features:
* Self-contained, with no need for a DBMS or cloud service.
* OpenAPI interface, easy to integrate with existing infrastructure (e.g., Cloud IDE).
* Supports consumer-grade GPUs.
spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.
Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.