Awesome-AITools

Awesome-AITools

Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具,欢迎提交issues 或者pull requests

Stars: 4328

Visit
 screenshot

This repo collects AI-related utilities. ## All Categories * All Categories * ChatGPT and other closed-source LLMs * AI Search engine * Open Source LLMs * GPT/LLMs Applications * LLM training platform * Applications that integrate multiple LLMs * AI Agent * Writing * Programming Development * Translation * AI Conversation or AI Voice Conversation * Image Creation * Speech Recognition * Text To Speech * Voice Processing * AI generated music or sound effects * Speech translation * Video Creation * Video Content Summary * OCR(Optical Character Recognition)

README:

Awesome AI Tools

English | 中文 | हिन्दी

This repo collects AI-related utilities.

Buy Me A Coffee

All Categories

ChatGPT and other closed-source LLMs

Name Description Links Fees
ChatGPT OpenAI's chatgpt URL Free, with paid upgrades
Claude Anthropic's AI assistant URL Free
Gemini Google's conversational, AI chat service. Google's latest LLM, including Gemini Nono, Gemini Pro and Gemini Ultra. Gemini Pro is open for api and sdk use. Gemini is built from the ground up for multimodality — reasoning seamlessly across text, images, video, audio, and code URL
dev: URL
Free
Microsoft Copilot Microsoft's AI assistant. URL Free
Le Chat Mistral.ai's conversational, AI chat service URL Free

AI Search engine

Name Description Links Fees
Perplexity.ai AI-driven conversational search engine. URL Free
You.com A search engine in conversation mode URL Free

Open Source LLMs

Name Description Links Fees
Llama 3 Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model.
Online test address:
huggingface.co/Meta-Llama-3-70B-Instruct
GitHub GitHub Repo stars Free
Mixtral Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT3.5 on most standard benchmarks.
paper:https://arxiv.org/pdf/2401.04088.pdf
news:https://mistral.ai/news/mixtral-of-experts/
mistral-inference GitHub Repo stars
mistral-finetune GitHub Repo stars
Free
grok-1 A large language model open sourced by xAI Github GitHub Repo stars Free
Phi-3 Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks. Github GitHub Repo stars Free

GPT LLMs Applications

Name Description Links Fees
Poe AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free URL Free, with paid upgrades
monica AI assistant that provides help with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available URL
chrome extension
Free, with paid upgrades
ollama Get up and running with Llama 2, Mistral, Gemma, and other large language models. Github GitHub Repo stars Free
openai/openai-python The official Python library for the OpenAI API, It is generated from OpenAPI specification with Stainless GithubGitHub Repo stars Free, need OpenAPI apikey
sashabaranov/go-openai This library provides unofficial Go clients for OpenAI API. support: ChatGPT, GPT-3, GPT-4, DALL·E 2 GithubGitHub Repo stars Free
langchain LangChain is a framework for developing applications powered by language models. Github GitHub Repo stars Free
Helicone AI Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications. Github GitHub Repo stars Free
ChatGPT-Next-Web One-Click to get a well-designed cross-platform ChatGPT web UI, with GPT3, GPT4 & Gemini Pro support. Github GitHub Repo stars Free
screenshot-to-code This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website! GitHub GitHub Repo stars Free, need access to GPT-4 Vision
Chatbox Desktop application that uses ChatGPT API (OpenAI API) to store all chat messages and prompts locally, thus reducing the risk of data loss. A bit more stable to use than the web version GitHub GitHub Repo stars Free, requires apikey with OpenAPI
gpt-crawler Crawl a site to generate knowledge files to create your own custom GPT from a URL GithubGitHub Repo stars Free
ChatGPT-Shortcut Open source, ChatGPT shortcut commands that double productivity, partitioned by domain and function, can filter prompt words by tag, keyword search and one-click copy. GitHub GitHub Repo stars Free
ChatGPT Sidebar ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website. URL Free
WebChatGPT Open source, expand the ability of networking to chatgpt GitHub GitHub Repo stars Free
AIPRM for ChatGPT Browser plug-in, providing a series of selected ChatGPT instruction templates, and even creating your own, and adjusting AI tone and writing style URL Free
GPTCache ⚡ GPTCache is a library for creating semantic cache to store responses from LLM queries. It can be used to speed up and lower the cost of chat applications that rely on the LLM service. And it's similar to redis in an aigc scenario. Github GitHub Repo stars Free
MindMac Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages. URL Free, with paid upgrades
MemFree Open Source Hybrid AI Search Engine, Instantly Get Accurate Answers from the Internet, Bookmarks, Notes, and Docs. Support One-Click Deployment. Github GitHub Repo stars Free & Suport one-click self-host

AI Image Creation

Name Description Links Fees
Midjourney Enter text or pictures to create pictures URL Free account has a certain usage minutes limit, and there is a paid upgrade version
Photoshop AI Adobe Photoshop generative-fill URL Paid
Stable diffusion webui Open source project, input text or pictures to create pictures, Stable diffusion webui is the GUI of Stable diffusion, and it is an image user interface that visualizes stable diffusion. It also integrates many other useful extension scripts. GitHub GitHub Repo stars Free
civitai civitai.com is a website platform for sharing AI image creation model resources, with a large number of models, has become the main model exchange place in the SD open source community URL Free
clipdrop clipdrop by stability.ai. Has many AI image processing tools, such as stable diffusion XL, uncrop, reimage XL, stable doodle. URL Free/Paid
firefly Adobe's AI image processing web site URL Free/Paid
ideogram.ai Enter text to create pictures. A product developed by a company founded by many ex-Googlers URL Free/Paid
Skybox AI Generate 360-degree panoramic images using text prompts URL Free/Paid
DragGAN Interactive Point-based Manipulation on the Generative Image Manifold GitHub GitHub Repo stars Free
visual-chatgpt Create images with ChatGPT GitHub GitHub Repo stars Free
Microsoft Bing Image Creator Image Creator is a tool for creating pictures using DALL-E technology. Tried Generating portrait pictures is unsightly URL Free
remove.bg Remove Image Background URL Free/Paid
ControlNet ControlNet is a neural network structure to control diffusion models by adding extra conditions. Github GitHub Repo stars Free
StreamDiffusion A Pipeline-Level Solution for Real-Time Interactive Generation Github GitHub Repo stars Free

LLM Prompts

Name Description Links Fees
f/awesome-chatgpt-prompts This repo includes ChatGPT prompt curation to use ChatGPT better. Github GitHub Repo stars Free

LLM Leaderboard

Name Description Links Fees
LMSYS Chatbot Arena Leaderboard LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. Collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale. URL Free
Artificial Analysis Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost. URL Free

LLM training platform

Name Description Links Fees
lm-sys/FastChat An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. Github GitHub Repo stars Free

Applications that integrate multiple LLMs

Name Description Links Fees
chathub Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc. GitHub GitHub Repo stars Free/Paid
ChatALL Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, and more, discover the best answers GitHub GitHub Repo stars Free

AI Agent

Name Description Links Fees
Auto-GPT Open source, An experimental open-source attempt to make GPT-4 fully autonomous. GitHub GitHub Repo stars Free
OthersideAI/self-operating-computer A framework to enable multimodal models to operate a computer. Github GitHub Repo stars Free,GPT-4v required
AppAgent Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. Github GitHub Repo stars Free
microsoft/autogen AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks. Github GitHub Repo stars Free

Writing

Name Description Links Fees
Notion AI AI-assisted note-taking software URL with certain free AI trials, AI features $10/month
Deep L Write English and German writing tools to fix writing errors and rewrite sentences promptly. URL Free version to use with text word limit / paid upgrade available
grammarly Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor. URL Free/Paid

Programming Development

Name Description Links Fees
GitHub Copilot A code writing assistant developed by GitHub and OpenAI URL Paid
Cursor A collaborative code editor using GPT URL Paid/Free Trial
ai-code-translator Open source project. Translates code from one language to another using chatgpt. GitHub GitHub Repo stars Free, requires OpenAI API key
Amazon CodeWhisperer A code writing assistant developed by Amazon URL Free for Individual Use
gpt-engineer GPT Engineer is made to be easy to adapt, extend, and make your agent learn how you want your code to look. It generates an entire codebase based on a prompt. GitHub GitHub Repo stars Free
Codeium Powerful in-IDE AI coding assistant URL Free/Paid
scalene Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals Github GitHub Repo stars Free
Fitten Code Fitten Code is an AI programming assistant driven by Fitten LLM models, which can automatically generate code, improve development efficiency, help you debug, and save your time. It can also chat with you and solve your programming problems.freeand supports over 80 languages: Python, C++,JavaScript, TypeScript, Java, etc. Fitten Code supports Visual Studio Code and JetBrains series IDEs, including IntelliJ IDEA, PyCharm, WebStorm, etc. URL Free
flappy Production-Ready LLM Agent SDK for Every Developer GitHub GitHub Repo stars Free
Plandex Open source, terminal-based AI programming engine for complex tasks GitHub GitHub Repo stars Free
Mistral/Codestral Empowering developers and democratising coding with Mistral AI., models:https://huggingface.co/mistralai/Codestral-22B-v0.1 URL Free

Translation

Name Description Links Fees
immersive-translate Open source project. Immersive bilingual web translation extension GitHub GitHub Repo stars Free
Deep L Accurate and instant translation tool, currently supporting 31 languages URL Free/Paid
openai-translator Open source project. Crossword translation browser plugin and cross-platform desktop application based on ChatGPT API GitHub GitHub Repo stars Free, requires OpenAI API key

AI Conversation or AI Voice Conversation

Name Description Links Fees
pi.ai An AI that's been shown to be very good at chatting, so you don't have to worry about talking all day. It supports both text and speech. Voice input is required with Apple's input system. Good for practicing English conversation and listening. URL Free
Voice Control for ChatGPT This Chrome extension allows you to have voice conversations with ChatGPT. URL Free, requires chatgpt account
SpeechGPT SpeechGPT is a web application that enables you to converse with ChatGPT. GitHub GitHub Repo stars Free,requires OpenAI API key

Speech Recognition

Name Description Links Fees
whisper OpenAPI open source robust speech recognition model through large-scale weak supervision GitHub GitHub Repo stars Free
buzz An open source desktop software based on OpenAI's Whisper to recognize speech and generate subtitles GitHub GitHub Repo stars Free
WhisperDesktop Open source, OpenAI-based Whisper, a desktop application for Windows, uses the GPU for processing, which will be faster than on the CPU with good GPU performance. GitHub GitHub Repo stars Free
whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) whisperX GitHub Repo stars Free
whisper-web ML-powered speech recognition directly in your browser. Built with Transformers.js. Demo GitHub GitHub Repo stars Free

Text To Speech

Name Description Links Fees
Azure Text to speech The best and most realistic voice tools currently available URL Paid / 500,000 characters per month free
coqui-ai/tts A deep learning toolkit for Text-to-Speech, battle-tested in research and production
Online Demo: https://huggingface.co/spaces/coqui/xtts
Github GitHub Repo stars Free
elevenlabs Intelligent AI Text to Speech URL Free/Paid
netease-youdao/EmotiVoice A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, and with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others. Github GitHub Repo stars Free
tetos A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS Github GitHub Repo stars Free
ChatTTS ChatTTS is a text-to-speech model designed specifically for dialogue scenario such as LLM assistant. It supports both English and Chinese languages. Our model is trained with 100,000+ hours composed of chinese and english. Website:https://chattts.com/ GithubGitHub Repo stars Free

Voice Processing

Name Description Links Fees
so-vits-svc SoftVC VITS Singing Voice Conversion. GitHub GitHub Repo stars Free
vocalremover Extract vocal and music URL Free
lala.ai Extract vocal, accompaniment and various instruments from any audio and video URL Free/Paid

AI generated music or sound effects

Name Description Link Fees
suno.ai The AI music creation tool Suno can generate custom songs based on text prompts in mere second You can create your own AI songs with this new Copilot extension URL
udio Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks. URL
elevenlabs/sound-effects Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community. URL Free
suno-ai/bark Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. Github GitHub Repo stars Free
audiocraft Open source library for audio/music generation by Meta, which mainly includes two models, MusicGen: text-to-music model, AudioGen: text-generated sound model. MusicGen Online Demo GitHub GitHub Repo stars Free
Stable Audio AI music and sound effect generation application by stability.ai URL Free/Paid
OptimizerAI Sound effect generation
Official Introduction
URL Free/Paid

Speech translation

Name Description Links Fees
Seamless Seamless is a family of AI models that enable more natural and authentic communication across languages.Online Demo Github GitHub Repo stars Free

Video Creation

Name Description Links Fees
KLING AI AI Video Creation Tool by kuaishou. URL Free/Paid
Dream Machine By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images.Official introductory video URL Free/Paid
Sora Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions. Sora access not fully open, some visual artists, designers and filmmakers given access URL -
capcut Subtitle-generated speech, speech recognition, and very convenient and powerful video editing URL Free/Paid
Runway Gen-2: Text/Image to video
Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks
URL Paid/Free trial
Pika Text/Image to video URL Paid/Free trial
Fliki A website that converts text into audio and video URL Free/Paid
d-id Generate digital human dubbing video based on text URL Paid/Free trial
HeyGen Generate digital human dubbing video based on text URL Paid/Free trial
AnimateDiff AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training. Github GitHub Repo stars Free
vivago.ai/video Text to Video; Image to Video; 4K enhance URL Free

Video Content Summary

Name Description Links Fees
ChatGPT for YouTube Chrome plugin, quickly summarize Youtube video content, need to log in chatgpt account or apikey URL Free
Chat Youtube Give a Youtube link, it will give a summary, and you can ask it questions about the content of the video URL Free

OCR

Name Description Links Fees
Umi-OCR Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services. Github GitHub Repo stars Free

Star History

Awesome-AITools Discord Link: https://discord.gg/7hAvJQME

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for Awesome-AITools

Similar Open Source Tools

For similar tasks

For similar jobs