Awesome-AITools
Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具,欢迎提交issues 或者pull requests
Stars: 5618
This repo collects AI-related utilities. ## All Categories * All Categories * ChatGPT and other closed-source LLMs * AI Search engine * Open Source LLMs * GPT/LLMs Applications * LLM training platform * Applications that integrate multiple LLMs * AI Agent * Writing * Programming Development * Translation * AI Conversation or AI Voice Conversation * Image Creation * Speech Recognition * Text To Speech * Voice Processing * AI generated music or sound effects * Speech translation * Video Creation * Video Content Summary * OCR(Optical Character Recognition)
README:
English | 中文
This repo collects awesome AI tools. Welcome everyone to recommend more awesome AI tools together! Please use the following template as a reference for your recommendations. issue
-
All Categories
- ChatGPT and other AI chat assistant
- AI Agent
- AI Search engine
- Open Source LLMs
- LLM Leaderboard
- GPT LLMs Applications
- Programming Development
- AI Image Creation
- Video Creation
- AI Cloud Platform
- LLM Prompts
- LLM training platform
- Writing
- Translation
- Speech Recognition
- Text To Speech
- Music Recognition
- Voice Processing
- AI generated music or sound effects
- Speech translation
- Video Content Summary
- Academic research
- OCR
- AI Detection
| Name | Description | Links | Fees |
|---|---|---|---|
| Gemini | Google's LLMs, including Gemini-3 pro. ai.google.dev |
URL |
Free/Paid |
| ChatGPT | OpenAI's LLMs, including GPT-5.2 | URL | Free/Paid |
| Claude | Anthropic's LLMs, including Claude Opus 4.6 | URL | Free/Paid |
| DeepSeek | DeepSeek's LLMs. API | URL | Free/Paid |
| Grok | xAI's LLMs, including grok-4.1-thinking. x.com/grok | URL | Free |
| Microsoft Copilot | Microsoft's AI assistant. | URL | Free |
| Le Chat | Mistral.ai's conversational, AI chat service | URL | Free |
| qwen | Alibaba's LLMs. Includes Qwen3, Qwen3-Code and other Qwen LLMs | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Manus | Manus is the action engine that goes beyond answers to execute tasks, automate workflows, and extend your human reach | URL | Free Trial/Paid |
| AnyGen | AnyGen is the AI assistant that truly "gets work done" for you. From writing and analysis to planning and reporting, it transforms your ideas into ready-to-use professional deliverables in minutes. The AI Assistant Built for Work | URL | Free Trial/Paid |
| Gemini CLI | An open-source AI agent that brings the power of Gemini directly into your terminal. |
Github |
Free |
| agentscope | Agent-Oriented Programming for Building LLM Applications, Open-sourced by Alibaba |
Github |
Free |
| Auto-GPT | Open source, An experimental open-source attempt to make GPT-4 fully autonomous. |
GitHub |
Free |
| OthersideAI/self-operating-computer | A framework to enable multimodal models to operate a computer. |
Github |
Free,GPT-4v required |
| microsoft/autogen | AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks. |
Github |
Free |
| potpie-ai/potpie | Open Source AI Agents for your codebase in minutes. Use pre-built agents for Q&A, Testing, Debugging and System Design or create your own purpose-built agents. |
URL , Github |
Free Trial |
| saplings | A framework for building agents that use search algorithms to complete tasks. |
Github |
Free |
| MastraAI | Mastra is an opinionated TypeScript framework that helps you build AI applications and features quickly. It gives you the set of primitives you need: workflows, agents, RAG, integrations and evals |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Perplexity.ai | AI-driven conversational search engine. | URL | Free |
| You.com | A search engine in conversation mode | URL | Free |
| Morphik.ai | Open source AI-driven search engine for private documents |
URL Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| DeepSeek-R1 | DeepSeek's first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning. |
Github |
Free |
| DeepSeek-V3 | A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. |
Github |
Free |
| Qwen3 | Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. |
Github |
Free |
| Llama 3 | Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model. Online test address: huggingface.co/Meta-Llama-3-70B-Instruct |
GitHub |
Free |
| Mixtral | Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT3.5 on most standard benchmarks. paper:https://arxiv.org/pdf/2401.04088.pdf news:https://mistral.ai/news/mixtral-of-experts/ |
mistral-inference mistral-finetune |
Free |
| grok-1 | A large language model open sourced by xAI |
Github |
Free |
| Phi-3 | Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks. |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| LMSYS Chatbot Arena Leaderboard | LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. Collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale. | URL | Free |
| Artificial Analysis | Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost. | URL | Free |
| LiveCodeBench | LiveCodeBench is a holistic and contamination-free evaluation benchmark of LLMs for code that continuously collects new problems over time. Particularly, LiveCodeBench also focuses on broader code-related capabilities, such as self-repair, code execution, and test output prediction, beyond mere code generation. | URL | Free |
| LLM Stats | LLM Stats, the most comprehensive LLM leaderboard, benchmarks and compares API models using daily‑updated, open‑source community data on capability, price, speed, and context length. | URL | Free |
| Price Per Token | Compare LLM API pricing across 200+ models from OpenAI, Anthropic, Google, and more. Includes token counters, cost calculators, and benchmark comparisons. | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| NotebookLM | AI Research Assistant developed by Google. Upload PDFs, websites, YouTube videos, audio files, Google Docs, or Google Slides, and NotebookLM will summarize them and make interesting connections between topics. Audio Overview feature can turn your sources into engaging “Deep Dive” discussions with one click. | URL | Free |
| Google AI Studio | Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app development. Available regions | URL | Free |
| Poe | AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free | URL | Free, with paid upgrades |
| Cherry Studio | Cherry Studio is a desktop client that supports for multiple LLM providers, available on Windows, Mac and Linux. Support major LLM Cloud Services: OpenAI, Gemini, Anthropic, and more AI Web Service Integration: Claude, Peplexity, Poe, and others Local Model Support with Ollama, LM Studio |
Github |
Free |
| HuggingChat | Open source codebase powering the HuggingChat app. URL |
Github |
Free |
| Learn about | AI learning Assistant developed by Google.Grasp new topics and deepen your understanding with a conversational learning companion that adapts to your unique curiosity and learning goals. | URL | Free |
| monica | AI assistant that provides help with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available |
URL chrome extension |
Free, with paid upgrades |
| ollama | Get up and running with Llama 2, Mistral, Gemma, and other large language models. |
Github |
Free |
| openai/openai-python | The official Python library for the OpenAI API, It is generated from OpenAPI specification with Stainless |
Github |
Free, need OpenAPI apikey |
| sashabaranov/go-openai | This library provides unofficial Go clients for OpenAI API. support: ChatGPT, GPT-3, GPT-4, DALL·E 2 |
Github |
Free |
| langchain | LangChain is a framework for developing applications powered by language models. |
Github |
Free |
| Helicone AI | Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications. |
Github |
Free |
| ChatGPT-Next-Web | One-Click to get a well-designed cross-platform ChatGPT web UI, with GPT3, GPT4 & Gemini Pro support. |
Github |
Free |
| screenshot-to-code | This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website! |
GitHub |
Free, need access to GPT-4 Vision |
| Chatbox | Desktop application that uses ChatGPT API (OpenAI API) to store all chat messages and prompts locally, thus reducing the risk of data loss. A bit more stable to use than the web version |
GitHub |
Free, requires apikey with OpenAPI |
| together.ai chat | Similar to HuggingChat, with the option of different open source models, support for DeepSeek R1, LLaMA, QWen, Flux Schnell. 60 free messages per day. | URL | Free/Paid |
| gpt-crawler | Crawl a site to generate knowledge files to create your own custom GPT from a URL |
Github |
Free |
| ChatGPT-Shortcut | Open source, ChatGPT shortcut commands that double productivity, partitioned by domain and function, can filter prompt words by tag, keyword search and one-click copy. |
GitHub |
Free |
| ChatGPT Sidebar | ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website. | URL | Free |
| WebChatGPT | Open source, expand the ability of networking to chatgpt |
GitHub |
Free |
| AIPRM for ChatGPT | Browser plug-in, providing a series of selected ChatGPT instruction templates, and even creating your own, and adjusting AI tone and writing style | URL | Free |
| MindMac | Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages. | URL | Free, with paid upgrades |
| chathub | Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc. |
GitHub |
Free/Paid |
| Harbor | Effortlessly run LLM backends, APIs, frontends, and services with one command. |
GitHub |
Free |
| gemini-fullstack-langgraph-quickstart | Get started with building Fullstack Agents using Gemini 2.5 and LangGraph |
Github |
Free |
| NoteGPT | NoteGPT is a smart note-taking tool that can record, transcribe, and summarize various content, such as meetings, lectures, podcasts, YouTube videos, news briefings, and articles. | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| Cursor | A collaborative code editor using GPT | URL | Paid/Free Trial |
| GitHub Copilot | A code writing assistant developed by GitHub and OpenAI | URL | Paid |
| Trae | ByteDance's AI coding IDE. Trae is your helpful coding partner. It offers features like AI Q&A, code auto-completion, and agent-based AI programming capabilities. | URL | Free |
| Amazon CodeWhisperer | A code writing assistant developed by Amazon | URL | Free for Individual Use |
| Codeium | Powerful in-IDE AI coding assistant | URL | Free/Paid |
| scalene | Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals |
Github |
Free |
| Fitten Code | Fitten Code is an AI programming assistant driven by Fitten LLM models, which can automatically generate code, improve development efficiency, help you debug, and save your time. It can also chat with you and solve your programming problems.freeand supports over 80 languages: Python, C++,JavaScript, TypeScript, Java, etc. Fitten Code supports Visual Studio Code and JetBrains series IDEs, including IntelliJ IDEA, PyCharm, WebStorm, etc. | URL | Free |
| Plandex | Open source, terminal-based AI programming engine for complex tasks |
GitHub |
Free |
| Roundtable | Zero-configuration MCP server that unifies multiple AI coding assistants for enhanced development workflows. Intelligent client management platform enabling seamless coordination between Claude Code, Cursor, GPT-4, and other AI development tools. |
GitHub |
Free |
| Mistral/Codestral | Empowering developers and democratising coding with Mistral AI., models:https://huggingface.co/mistralai/Codestral-22B-v0.1 | URL | Free |
| Kodus | Open Source Code Review Agent |
GitHub |
Free/Paid |
| Kagan | AI-powered Kanban TUI for autonomous development workflows. Integrates with Claude Code and OpenCode for ticket-driven AI coding with git worktree isolation and MCP server support. |
GitHub |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Nano Banana/Nano Banana Pro | Google's advanced AI model for image generation and editing. No. 1 in the LMArea Text to Image and Image Edit leadboard. Online website: 1. gemini 2.aistudio 3. lmarea.ai |
URL | Free/Paid |
| Z-Image | Z-Image is a high-performance image generation model recently open-sourced by Alibaba's Tongyi Lab. It strikes a balance between "extreme speed" and "high quality," making it highly suitable for scenarios requiring rapid image generation. Z-Image-Turbo Online Demo: https://huggingface.co/spaces/mrfakename/Z-Image-Turbo |
Github |
Free |
| Midjourney | Enter text or pictures to create pictures | URL | Paid |
| ChatGPT Images | GPT Image 1.5 | URL | Free/Paid |
| Photoshop AI | Adobe Photoshop generative-fill | URL | Paid |
| Stable diffusion webui | Open source project, input text or pictures to create pictures, Stable diffusion webui is the GUI of Stable diffusion, and it is an image user interface that visualizes stable diffusion. It also integrates many other useful extension scripts. |
GitHub |
Free |
| civitai | civitai.com is a website platform for sharing AI image creation model resources, with a large number of models, has become the main model exchange place in the SD open source community | URL | Free |
| clipdrop | clipdrop by stability.ai. Has many AI image processing tools, such as stable diffusion XL, uncrop, reimage XL, stable doodle. | URL | Free/Paid |
| firefly | Adobe's AI image processing web site | URL | Free/Paid |
| ideogram.ai | Enter text to create pictures. A product developed by a company founded by many ex-Googlers | URL | Free/Paid |
| Nero AI | AI picture upscale, AI repair scratches, AI picture coloring, AI picture noise removal, AI one-click to change the background, AI magical erasing pen, AI portrait. API doc:https://ai.nero.com/ai-api/docs/ | URL | Paid/Trial |
| Skybox AI | Generate 360-degree panoramic images using text prompts | URL | Free/Paid |
| remove.bg | Remove Image Background | URL | Free/Paid |
| ControlNet | ControlNet is a neural network structure to control diffusion models by adding extra conditions. |
Github |
Free |
| PixelPanda | AI-powered platform that creates professional product photos, marketing images, UGC-style videos, and AI avatars — no camera or studio needed. | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| Wan2.6 | AI Video Creation Tool by Alibaba | URL | Paid/Free trial |
| Sora | Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions. | URL | Paid |
| KLING AI | AI Video Creation Tool by kuaishou. Support text to video, image to video, start-end frame and motion control | URL | Free/Paid |
| hailuoai | AI Video Creation Tool by Minimax | URL | Free/Paid |
| Dream Machine | By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images.Official introductory video | URL | Free/Paid |
| capcut | Subtitle-generated speech, speech recognition, and very convenient and powerful video editing | URL | Free/Paid |
| Runway | Gen-2: Text/Image to video Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks |
URL | Paid/Free trial |
| pixverse | Create Amazing AI Videos from Text & Photos | URL | Paid/Free trial |
| Pika | Text/Image to video | URL | Paid/Free trial |
| Fliki | A website that converts text into audio and video | URL | Free/Paid |
| d-id | Generate digital human dubbing video based on text | URL | Paid/Free trial |
| HeyGen | Generate digital human dubbing video based on text | URL | Paid/Free trial |
| AnimateDiff | AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training. |
Github |
Free |
| vivago.ai/video | Text to Video; Image to Video; 4K enhance | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| together.ai | The AI Acceleration Cloud. Train, fine-tune-and run inference on AI models blazing fast, at low cost, and at production scale. | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| f/awesome-chatgpt-prompts | This repo includes ChatGPT prompt curation to use ChatGPT better. |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Notion AI | AI-assisted note-taking software | URL | with certain free AI trials, AI features $10/month |
| Deep L Write | English and German writing tools to fix writing errors and rewrite sentences promptly. | URL | Free version to use with text word limit / paid upgrade available |
| grammarly | Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor. | URL | Free/Paid |
| TextCraft | Add-in for Microsoft Word that seamlessly integrates essential AI tools, including text generation, proofreading, and more, directly into the user interface. | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Google Translate | Support text, picture, document and URL | URL | Free |
| Deep L | Accurate and instant translation tool, currently supporting 31 languages | URL | Free/Paid |
| immersive-translate | Open source project. Immersive bilingual web translation extension |
GitHub |
Free |
| openai-translator | Open source project. Crossword translation browser plugin and cross-platform desktop application based on ChatGPT API |
GitHub |
Free, requires OpenAI API key |
| RTranslator | RTranslator is an open-source, free, and offline real-time translation app for Android. |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| whisper | OpenAPI open source robust speech recognition model through large-scale weak supervision |
GitHub |
Free |
| whisper.cpp | Port of OpenAI's Whisper model in C/C++ |
Github |
Free |
| buzz | An open source desktop software based on OpenAI's Whisper to recognize speech and generate subtitles |
GitHub |
Free |
| WhisperDesktop | Open source, OpenAI-based Whisper, a desktop application for Windows, uses the GPU for processing, which will be faster than on the CPU with good GPU performance. |
GitHub |
Free |
| whisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
whisperX |
Free |
| whisper-web | ML-powered speech recognition directly in your browser. Built with Transformers.js. Demo |
GitHub |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| index-tts2 | Bilibili's Open-Source Industrial-Grade Controllable High-Efficiency Zero-Sample Text-to-Speech System. Online Demo: https://huggingface.co/spaces/IndexTeam/IndexTTS-2-Demo Paper: https://arxiv.org/abs/2506.21619 |
Github |
Free |
| Azure Text to speech | The best and most realistic voice tools currently available | URL | Paid / 500,000 characters per month free |
| Hailuo AI Text to Speech | Offer over 300 voices in 17 languages and multiple accents, covering a wide range of styles and age groups to provide the voice effects you need. | URL | Limited-time Free |
| coqui-ai/tts | A deep learning toolkit for Text-to-Speech, battle-tested in research and production Online Demo: https://huggingface.co/spaces/coqui/xtts |
Github |
Free |
| elevenlabs | Intelligent AI Text to Speech | URL | Free/Paid |
| netease-youdao/EmotiVoice | A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, and with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others. |
Github |
Free |
| tetos | A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS |
Github |
Free |
| ChatTTS | ChatTTS is a text-to-speech model designed specifically for dialogue scenario such as LLM assistant. It supports both English and Chinese languages. Our model is trained with 100,000+ hours composed of chinese and english. Website:https://chattts.com/ |
Github |
Free |
| Name | Description | Links | Fee |
|---|---|---|---|
| shazam | Download the shazaom app for music recognition, which is pretty fast | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| so-vits-svc | SoftVC VITS Singing Voice Conversion. |
GitHub |
Free |
| vocalremover | Extract vocal and music | URL | Free |
| lala.ai | Extract vocal, accompaniment and various instruments from any audio and video | URL | Free/Paid |
| Name | Description | Link | Fees |
|---|---|---|---|
| suno.ai | The AI music creation tool Suno can generate custom songs based on text prompts in mere second | URL | |
| udio | Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks. | URL | |
| mureka.ai | Text to music | URL | Free/Paid |
| elevenlabs/sound-effects | Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community. | URL | Free |
| suno-ai/bark | Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. |
Github |
Free |
| audiocraft | Open source library for audio/music generation by Meta, which mainly includes two models, MusicGen: text-to-music model, AudioGen: text-generated sound model. MusicGen Online Demo |
GitHub |
Free |
| Stable Audio | AI music and sound effect generation application by stability.ai | URL | Free/Paid |
| OptimizerAI | Sound effect generation Official Introduction |
URL | Free/Paid |
| SFX Engine | AI Sound effect generation | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| Seamless | Seamless is a family of AI models that enable more natural and authentic communication across languages.Online Demo |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| ChatGPT for YouTube | Chrome plugin, quickly summarize Youtube video content, need to log in chatgpt account or apikey | URL | Free |
| Chat Youtube | Give a Youtube link, it will give a summary, and you can ask it questions about the content of the video | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| alphaxiv | An open academic discussion community based on the arXiv platform that allows users to comment line-by-line, ask questions, and interact in real-time by replacing the paper's linking domain (arxiv.org for alphaxiv.org) directly on the paper's page. And provides AI features such as Ask AI and AI-generated article blogs | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Umi-OCR | Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services. |
Github |
Free |
| allenai/olmocr | A toolkit for training language models to work with PDF documents in the wild. Online demo: https://olmocr.allenai.org/ |
Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| AI Virtual Staging | Stage empty rooms instantly with realistic furniture using AI. MLS compliant, fast, and affordable virtual staging for real estate listings. With furniture removal, day to dusk, 2D to 3D floor plan features support. | URL |
| Name | Description | Links | Fees |
|---|---|---|---|
| AI Detect Lab | Professional AI image and deepfake detector specifically optimized for Midjourney v7 and Flux. | URL | Free |
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for Awesome-AITools
Similar Open Source Tools
Awesome-AITools
This repo collects AI-related utilities. ## All Categories * All Categories * ChatGPT and other closed-source LLMs * AI Search engine * Open Source LLMs * GPT/LLMs Applications * LLM training platform * Applications that integrate multiple LLMs * AI Agent * Writing * Programming Development * Translation * AI Conversation or AI Voice Conversation * Image Creation * Speech Recognition * Text To Speech * Voice Processing * AI generated music or sound effects * Speech translation * Video Creation * Video Content Summary * OCR(Optical Character Recognition)
MiniCPM-V-CookBook
MiniCPM-V & o Cookbook is a comprehensive repository for building multimodal AI applications effortlessly. It provides easy-to-use documentation, supports a wide range of users, and offers versatile deployment scenarios. The repository includes live demonstrations, inference recipes for vision and audio capabilities, fine-tuning recipes, serving recipes, quantization recipes, and a framework support matrix. Users can customize models, deploy them efficiently, and compress models to improve efficiency. The repository also showcases awesome works using MiniCPM-V & o and encourages community contributions.
openkore
OpenKore is a custom client and intelligent automated assistant for Ragnarok Online. It is a free, open source, and cross-platform program (Linux, Windows, and MacOS are supported). To run OpenKore, you need to download and extract it or clone the repository using Git. Configure OpenKore according to the documentation and run openkore.pl to start. The tool provides a FAQ section for troubleshooting, guidelines for reporting issues, and information about botting status on official servers. OpenKore is developed by a global team, and contributions are welcome through pull requests. Various community resources are available for support and communication. Users are advised to comply with the GNU General Public License when using and distributing the software.
generative-ai-with-javascript
The 'Generative AI with JavaScript' repository is a comprehensive resource hub for JavaScript developers interested in delving into the world of Generative AI. It provides code samples, tutorials, and resources from a video series, offering best practices and tips to enhance AI skills. The repository covers the basics of generative AI, guides on building AI applications using JavaScript, from local development to deployment on Azure, and scaling AI models. It is a living repository with continuous updates, making it a valuable resource for both beginners and experienced developers looking to explore AI with JavaScript.
RustySEO
RustySEO is a free, modern SEO/GEO toolkit designed to help users crawl and analyze websites and server logs without crawl limits. It is an all-in-one, cross-platform marketing toolkit for comprehensive SEO & GEO analysis, providing actionable insights into marketing and SEO strategies. The tool offers features such as shallow & deep crawl, technical diagnostics, on-page SEO analysis, dashboards, reporting, topic and keyword generators, AI chatbot, crawl history, image conversion and optimization, and more. RustySEO aims to be a robust, free alternative to costly commercial SEO tools, with integrations like Google PageSpeed Insights, Google Gemini, and more.
together-cookbook
The Together Cookbook is a collection of code and guides designed to help developers build with open source models using Together AI. The recipes provide examples on how to chain multiple LLM calls, create agents that route tasks to specialized models, run multiple LLMs in parallel, break down tasks into parallel subtasks, build agents that iteratively improve responses, perform LoRA fine-tuning and inference, fine-tune LLMs for repetition, improve summarization capabilities, fine-tune LLMs on multi-step conversations, implement retrieval-augmented generation, conduct multimodal search and conditional image generation, visualize vector embeddings, improve search results with rerankers, implement vector search with embedding models, extract structured text from images, summarize and evaluate outputs with LLMs, generate podcasts from PDF content, and get LLMs to generate knowledge graphs.
ai-agents-for-beginners
AI Agents for Beginners is a course that covers the fundamentals of building AI Agents. It consists of 10 lessons with code examples using Azure AI Foundry and GitHub Model Catalogs. The course utilizes AI Agent frameworks and services from Microsoft, such as Azure AI Agent Service, Semantic Kernel, and AutoGen. Learners can access written lessons, Python code samples, and additional learning resources for each lesson. The course encourages contributions and suggestions from the community and provides multi-language support for learners worldwide.
txtai
Txtai is an all-in-one embeddings database for semantic search, LLM orchestration, and language model workflows. It combines vector indexes, graph networks, and relational databases to enable vector search with SQL, topic modeling, retrieval augmented generation, and more. Txtai can stand alone or serve as a knowledge source for large language models (LLMs). Key features include vector search with SQL, object storage, topic modeling, graph analysis, multimodal indexing, embedding creation for various data types, pipelines powered by language models, workflows to connect pipelines, and support for Python, JavaScript, Java, Rust, and Go. Txtai is open-source under the Apache 2.0 license.
generative-ai-for-beginners
This course has 18 lessons. Each lesson covers its own topic so start wherever you like! Lessons are labeled either "Learn" lessons explaining a Generative AI concept or "Build" lessons that explain a concept and code examples in both **Python** and **TypeScript** when possible. Each lesson also includes a "Keep Learning" section with additional learning tools. **What You Need** * Access to the Azure OpenAI Service **OR** OpenAI API - _Only required to complete coding lessons_ * Basic knowledge of Python or Typescript is helpful - *For absolute beginners check out these Python and TypeScript courses. * A Github account to fork this entire repo to your own GitHub account We have created a **Course Setup** lesson to help you with setting up your development environment. Don't forget to star (🌟) this repo to find it easier later. ## 🧠 Ready to Deploy? If you are looking for more advanced code samples, check out our collection of Generative AI Code Samples in both **Python** and **TypeScript**. ## 🗣️ Meet Other Learners, Get Support Join our official AI Discord server to meet and network with other learners taking this course and get support. ## 🚀 Building a Startup? Sign up for Microsoft for Startups Founders Hub to receive **free OpenAI credits** and up to **$150k towards Azure credits to access OpenAI models through Azure OpenAI Services**. ## 🙏 Want to help? Do you have suggestions or found spelling or code errors? Raise an issue or Create a pull request ## 📂 Each lesson includes: * A short video introduction to the topic * A written lesson located in the README * Python and TypeScript code samples supporting Azure OpenAI and OpenAI API * Links to extra resources to continue your learning ## 🗃️ Lessons | | Lesson Link | Description | Additional Learning | | :-: | :------------------------------------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------: | ------------------------------------------------------------------------------ | | 00 | Course Setup | **Learn:** How to Setup Your Development Environment | Learn More | | 01 | Introduction to Generative AI and LLMs | **Learn:** Understanding what Generative AI is and how Large Language Models (LLMs) work. | Learn More | | 02 | Exploring and comparing different LLMs | **Learn:** How to select the right model for your use case | Learn More | | 03 | Using Generative AI Responsibly | **Learn:** How to build Generative AI Applications responsibly | Learn More | | 04 | Understanding Prompt Engineering Fundamentals | **Learn:** Hands-on Prompt Engineering Best Practices | Learn More | | 05 | Creating Advanced Prompts | **Learn:** How to apply prompt engineering techniques that improve the outcome of your prompts. | Learn More | | 06 | Building Text Generation Applications | **Build:** A text generation app using Azure OpenAI | Learn More | | 07 | Building Chat Applications | **Build:** Techniques for efficiently building and integrating chat applications. | Learn More | | 08 | Building Search Apps Vector Databases | **Build:** A search application that uses Embeddings to search for data. | Learn More | | 09 | Building Image Generation Applications | **Build:** A image generation application | Learn More | | 10 | Building Low Code AI Applications | **Build:** A Generative AI application using Low Code tools | Learn More | | 11 | Integrating External Applications with Function Calling | **Build:** What is function calling and its use cases for applications | Learn More | | 12 | Designing UX for AI Applications | **Learn:** How to apply UX design principles when developing Generative AI Applications | Learn More | | 13 | Securing Your Generative AI Applications | **Learn:** The threats and risks to AI systems and methods to secure these systems. | Learn More | | 14 | The Generative AI Application Lifecycle | **Learn:** The tools and metrics to manage the LLM Lifecycle and LLMOps | Learn More | | 15 | Retrieval Augmented Generation (RAG) and Vector Databases | **Build:** An application using a RAG Framework to retrieve embeddings from a Vector Databases | Learn More | | 16 | Open Source Models and Hugging Face | **Build:** An application using open source models available on Hugging Face | Learn More | | 17 | AI Agents | **Build:** An application using an AI Agent Framework | Learn More | | 18 | Fine-Tuning LLMs | **Learn:** The what, why and how of fine-tuning LLMs | Learn More |
SemanticFinder
SemanticFinder is a frontend-only live semantic search tool that calculates embeddings and cosine similarity client-side using transformers.js and SOTA embedding models from Huggingface. It allows users to search through large texts like books with pre-indexed examples, customize search parameters, and offers data privacy by keeping input text in the browser. The tool can be used for basic search tasks, analyzing texts for recurring themes, and has potential integrations with various applications like wikis, chat apps, and personal history search. It also provides options for building browser extensions and future ideas for further enhancements and integrations.
AI-For-Beginners
AI-For-Beginners is a comprehensive 12-week, 24-lesson curriculum designed by experts at Microsoft to introduce beginners to the world of Artificial Intelligence (AI). The curriculum covers various topics such as Symbolic AI, Neural Networks, Computer Vision, Natural Language Processing, Genetic Algorithms, and Multi-Agent Systems. It includes hands-on lessons, quizzes, and labs using popular frameworks like TensorFlow and PyTorch. The focus is on providing a foundational understanding of AI concepts and principles, making it an ideal starting point for individuals interested in AI.
chat-your-doc
Chat Your Doc is an experimental project exploring various applications based on LLM technology. It goes beyond being just a chatbot project, focusing on researching LLM applications using tools like LangChain and LlamaIndex. The project delves into UX, computer vision, and offers a range of examples in the 'Lab Apps' section. It includes links to different apps, descriptions, launch commands, and demos, aiming to showcase the versatility and potential of LLM applications.
FFAIVideo
FFAIVideo is a lightweight node.js project that utilizes popular AI LLM to intelligently generate short videos. It supports multiple AI LLM models such as OpenAI, Moonshot, Azure, g4f, Google Gemini, etc. Users can input text to automatically synthesize exciting video content with subtitles, background music, and customizable settings. The project integrates Microsoft Edge's online text-to-speech service for voice options and uses Pexels website for video resources. Installation of FFmpeg is essential for smooth operation. Inspired by MoneyPrinterTurbo, MoneyPrinter, and MsEdgeTTS, FFAIVideo is designed for front-end developers with minimal dependencies and simple usage.
llava-docker
This Docker image for LLaVA (Large Language and Vision Assistant) provides a convenient way to run LLaVA locally or on RunPod. LLaVA is a powerful AI tool that combines natural language processing and computer vision capabilities. With this Docker image, you can easily access LLaVA's functionalities for various tasks, including image captioning, visual question answering, text summarization, and more. The image comes pre-installed with LLaVA v1.2.0, Torch 2.1.2, xformers 0.0.23.post1, and other necessary dependencies. You can customize the model used by setting the MODEL environment variable. The image also includes a Jupyter Lab environment for interactive development and exploration. Overall, this Docker image offers a comprehensive and user-friendly platform for leveraging LLaVA's capabilities.
dataforce.studio
DataForce Studio is an open-source MLOps platform designed to help build, manage, and deploy AI/ML models with ease. It supports the entire model lifecycle, from creation to deployment and monitoring, within a user-friendly interface. The platform is in active early development, aiming to provide features like post-deployment monitoring, model deployment, data science agent, experiment snapshots, model cards, Python SDK, model registry, notebooks, in-browser runtime, and express tasks for prompt optimization and tabular data.
awesome-generative-ai-data-scientist
A curated list of 50+ resources to help you become a Generative AI Data Scientist. This repository includes resources on building GenAI applications with Large Language Models (LLMs), and deploying LLMs and GenAI with Cloud-based solutions.
For similar tasks
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.
onnxruntime-genai
ONNX Runtime Generative AI is a library that provides the generative AI loop for ONNX models, including inference with ONNX Runtime, logits processing, search and sampling, and KV cache management. Users can call a high level `generate()` method, or run each iteration of the model in a loop. It supports greedy/beam search and TopP, TopK sampling to generate token sequences, has built in logits processing like repetition penalties, and allows for easy custom scoring.
jupyter-ai
Jupyter AI connects generative AI with Jupyter notebooks. It provides a user-friendly and powerful way to explore generative AI models in notebooks and improve your productivity in JupyterLab and the Jupyter Notebook. Specifically, Jupyter AI offers: * An `%%ai` magic that turns the Jupyter notebook into a reproducible generative AI playground. This works anywhere the IPython kernel runs (JupyterLab, Jupyter Notebook, Google Colab, Kaggle, VSCode, etc.). * A native chat UI in JupyterLab that enables you to work with generative AI as a conversational assistant. * Support for a wide range of generative model providers, including AI21, Anthropic, AWS, Cohere, Gemini, Hugging Face, NVIDIA, and OpenAI. * Local model support through GPT4All, enabling use of generative AI models on consumer grade machines with ease and privacy.
khoj
Khoj is an open-source, personal AI assistant that extends your capabilities by creating always-available AI agents. You can share your notes and documents to extend your digital brain, and your AI agents have access to the internet, allowing you to incorporate real-time information. Khoj is accessible on Desktop, Emacs, Obsidian, Web, and Whatsapp, and you can share PDF, markdown, org-mode, notion files, and GitHub repositories. You'll get fast, accurate semantic search on top of your docs, and your agents can create deeply personal images and understand your speech. Khoj is self-hostable and always will be.
langchain_dart
LangChain.dart is a Dart port of the popular LangChain Python framework created by Harrison Chase. LangChain provides a set of ready-to-use components for working with language models and a standard interface for chaining them together to formulate more advanced use cases (e.g. chatbots, Q&A with RAG, agents, summarization, extraction, etc.). The components can be grouped into a few core modules: * **Model I/O:** LangChain offers a unified API for interacting with various LLM providers (e.g. OpenAI, Google, Mistral, Ollama, etc.), allowing developers to switch between them with ease. Additionally, it provides tools for managing model inputs (prompt templates and example selectors) and parsing the resulting model outputs (output parsers). * **Retrieval:** assists in loading user data (via document loaders), transforming it (with text splitters), extracting its meaning (using embedding models), storing (in vector stores) and retrieving it (through retrievers) so that it can be used to ground the model's responses (i.e. Retrieval-Augmented Generation or RAG). * **Agents:** "bots" that leverage LLMs to make informed decisions about which available tools (such as web search, calculators, database lookup, etc.) to use to accomplish the designated task. The different components can be composed together using the LangChain Expression Language (LCEL).
danswer
Danswer is an open-source Gen-AI Chat and Unified Search tool that connects to your company's docs, apps, and people. It provides a Chat interface and plugs into any LLM of your choice. Danswer can be deployed anywhere and for any scale - on a laptop, on-premise, or to cloud. Since you own the deployment, your user data and chats are fully in your own control. Danswer is MIT licensed and designed to be modular and easily extensible. The system also comes fully ready for production usage with user authentication, role management (admin/basic users), chat persistence, and a UI for configuring Personas (AI Assistants) and their Prompts. Danswer also serves as a Unified Search across all common workplace tools such as Slack, Google Drive, Confluence, etc. By combining LLMs and team specific knowledge, Danswer becomes a subject matter expert for the team. Imagine ChatGPT if it had access to your team's unique knowledge! It enables questions such as "A customer wants feature X, is this already supported?" or "Where's the pull request for feature Y?"
infinity
Infinity is an AI-native database designed for LLM applications, providing incredibly fast full-text and vector search capabilities. It supports a wide range of data types, including vectors, full-text, and structured data, and offers a fused search feature that combines multiple embeddings and full text. Infinity is easy to use, with an intuitive Python API and a single-binary architecture that simplifies deployment. It achieves high performance, with 0.1 milliseconds query latency on million-scale vector datasets and up to 15K QPS.
For similar jobs
weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.
kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.
PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.
tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.
spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.
Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.