Awesome-AITools

Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具，欢迎提交issues 或者pull requests

Stars: 5618

Visit

This repo collects AI-related utilities. ## All Categories * All Categories * ChatGPT and other closed-source LLMs * AI Search engine * Open Source LLMs * GPT/LLMs Applications * LLM training platform * Applications that integrate multiple LLMs * AI Agent * Writing * Programming Development * Translation * AI Conversation or AI Voice Conversation * Image Creation * Speech Recognition * Text To Speech * Voice Processing * AI generated music or sound effects * Speech translation * Video Creation * Video Content Summary * OCR(Optical Character Recognition)

README:

Awesome AI Tools

English | 中文

This repo collects awesome AI tools. Welcome everyone to recommend more awesome AI tools together! Please use the following template as a reference for your recommendations. issue

All Categories

All Categories

ChatGPT and other AI chat assistant

Name	Description	Links	Fees
Gemini	Google's LLMs, including Gemini-3 pro. ai.google.dev	URL	Free/Paid
ChatGPT	OpenAI's LLMs, including GPT-5.2	URL	Free/Paid
Claude	Anthropic's LLMs, including Claude Opus 4.6	URL	Free/Paid
DeepSeek	DeepSeek's LLMs. API	URL	Free/Paid
Grok	xAI's LLMs, including grok-4.1-thinking. x.com/grok	URL	Free
Microsoft Copilot	Microsoft's AI assistant.	URL	Free
Le Chat	Mistral.ai's conversational, AI chat service	URL	Free
qwen	Alibaba's LLMs. Includes Qwen3, Qwen3-Code and other Qwen LLMs	URL	Free

AI Agent

Name	Description	Links	Fees
Manus	Manus is the action engine that goes beyond answers to execute tasks, automate workflows, and extend your human reach	URL	Free Trial/Paid
AnyGen	AnyGen is the AI assistant that truly "gets work done" for you. From writing and analysis to planning and reporting, it transforms your ideas into ready-to-use professional deliverables in minutes. The AI Assistant Built for Work	URL	Free Trial/Paid
Gemini CLI	An open-source AI agent that brings the power of Gemini directly into your terminal.	Github	Free
agentscope	Agent-Oriented Programming for Building LLM Applications, Open-sourced by Alibaba	Github	Free
Auto-GPT	Open source, An experimental open-source attempt to make GPT-4 fully autonomous.	GitHub	Free
OthersideAI/self-operating-computer	A framework to enable multimodal models to operate a computer.	Github	Free，GPT-4v required
microsoft/autogen	AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks.	Github	Free
potpie-ai/potpie	Open Source AI Agents for your codebase in minutes. Use pre-built agents for Q&A, Testing, Debugging and System Design or create your own purpose-built agents.	URL , Github	Free Trial
saplings	A framework for building agents that use search algorithms to complete tasks.	Github	Free
MastraAI	Mastra is an opinionated TypeScript framework that helps you build AI applications and features quickly. It gives you the set of primitives you need: workflows, agents, RAG, integrations and evals	Github	Free

AI Search engine

Name	Description	Links	Fees
Perplexity.ai	AI-driven conversational search engine.	URL	Free
You.com	A search engine in conversation mode	URL	Free
Morphik.ai	Open source AI-driven search engine for private documents	URL Github	Free

Open Source LLMs

Name	Description	Links	Fees
DeepSeek-R1	DeepSeek's first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning.	Github	Free
DeepSeek-V3	A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.	Github	Free
Qwen3	Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.	Github	Free
Llama 3	Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model. Online test address: huggingface.co/Meta-Llama-3-70B-Instruct	GitHub	Free
Mixtral	Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT3.5 on most standard benchmarks. paper：https://arxiv.org/pdf/2401.04088.pdf news：https://mistral.ai/news/mixtral-of-experts/	mistral-inference mistral-finetune	Free
grok-1	A large language model open sourced by xAI	Github	Free
Phi-3	Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.	Github	Free

LLM Leaderboard

Name	Description	Links	Fees
LMSYS Chatbot Arena Leaderboard	LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. Collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale.	URL	Free
Artificial Analysis	Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost.	URL	Free
LiveCodeBench	LiveCodeBench is a holistic and contamination-free evaluation benchmark of LLMs for code that continuously collects new problems over time. Particularly, LiveCodeBench also focuses on broader code-related capabilities, such as self-repair, code execution, and test output prediction, beyond mere code generation.	URL	Free
LLM Stats	LLM Stats, the most comprehensive LLM leaderboard, benchmarks and compares API models using daily‑updated, open‑source community data on capability, price, speed, and context length.	URL	Free
Price Per Token	Compare LLM API pricing across 200+ models from OpenAI, Anthropic, Google, and more. Includes token counters, cost calculators, and benchmark comparisons.	URL	Free

GPT LLMs Applications

Name	Description	Links	Fees
NotebookLM	AI Research Assistant developed by Google. Upload PDFs, websites, YouTube videos, audio files, Google Docs, or Google Slides, and NotebookLM will summarize them and make interesting connections between topics. Audio Overview feature can turn your sources into engaging “Deep Dive” discussions with one click.	URL	Free
Google AI Studio	Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app development. Available regions	URL	Free
Poe	AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free	URL	Free, with paid upgrades
Cherry Studio	Cherry Studio is a desktop client that supports for multiple LLM providers, available on Windows, Mac and Linux. Support major LLM Cloud Services: OpenAI, Gemini, Anthropic, and more AI Web Service Integration: Claude, Peplexity, Poe, and others Local Model Support with Ollama, LM Studio	Github	Free
HuggingChat	Open source codebase powering the HuggingChat app. URL	Github	Free
Learn about	AI learning Assistant developed by Google.Grasp new topics and deepen your understanding with a conversational learning companion that adapts to your unique curiosity and learning goals.	URL	Free
monica	AI assistant that provides help with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available	URL chrome extension	Free, with paid upgrades
ollama	Get up and running with Llama 2, Mistral, Gemma, and other large language models.	Github	Free
openai/openai-python	The official Python library for the OpenAI API, It is generated from OpenAPI specification with Stainless	Github	Free, need OpenAPI apikey
sashabaranov/go-openai	This library provides unofficial Go clients for OpenAI API. support: ChatGPT, GPT-3, GPT-4, DALL·E 2	Github	Free
langchain	LangChain is a framework for developing applications powered by language models.	Github	Free
Helicone AI	Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications.	Github	Free
ChatGPT-Next-Web	One-Click to get a well-designed cross-platform ChatGPT web UI, with GPT3, GPT4 & Gemini Pro support.	Github	Free
screenshot-to-code	This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website!	GitHub	Free, need access to GPT-4 Vision
Chatbox	Desktop application that uses ChatGPT API (OpenAI API) to store all chat messages and prompts locally, thus reducing the risk of data loss. A bit more stable to use than the web version	GitHub	Free, requires apikey with OpenAPI
together.ai chat	Similar to HuggingChat, with the option of different open source models, support for DeepSeek R1, LLaMA, QWen, Flux Schnell. 60 free messages per day.	URL	Free/Paid
gpt-crawler	Crawl a site to generate knowledge files to create your own custom GPT from a URL	Github	Free
ChatGPT-Shortcut	Open source, ChatGPT shortcut commands that double productivity, partitioned by domain and function, can filter prompt words by tag, keyword search and one-click copy.	GitHub	Free
ChatGPT Sidebar	ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website.	URL	Free
WebChatGPT	Open source, expand the ability of networking to chatgpt	GitHub	Free
AIPRM for ChatGPT	Browser plug-in, providing a series of selected ChatGPT instruction templates, and even creating your own, and adjusting AI tone and writing style	URL	Free
MindMac	Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages.	URL	Free, with paid upgrades
chathub	Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc.	GitHub	Free/Paid
Harbor	Effortlessly run LLM backends, APIs, frontends, and services with one command.	GitHub	Free
gemini-fullstack-langgraph-quickstart	Get started with building Fullstack Agents using Gemini 2.5 and LangGraph	Github	Free
NoteGPT	NoteGPT is a smart note-taking tool that can record, transcribe, and summarize various content, such as meetings, lectures, podcasts, YouTube videos, news briefings, and articles.	URL	Free/Paid

Programming Development

Name	Description	Links	Fees
Cursor	A collaborative code editor using GPT	URL	Paid/Free Trial
GitHub Copilot	A code writing assistant developed by GitHub and OpenAI	URL	Paid
Trae	ByteDance's AI coding IDE. Trae is your helpful coding partner. It offers features like AI Q&A, code auto-completion, and agent-based AI programming capabilities.	URL	Free
Amazon CodeWhisperer	A code writing assistant developed by Amazon	URL	Free for Individual Use
Codeium	Powerful in-IDE AI coding assistant	URL	Free/Paid
scalene	Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals	Github	Free
Fitten Code	Fitten Code is an AI programming assistant driven by Fitten LLM models, which can automatically generate code, improve development efficiency, help you debug, and save your time. It can also chat with you and solve your programming problems.freeand supports over 80 languages: Python, C++,JavaScript, TypeScript, Java, etc. Fitten Code supports Visual Studio Code and JetBrains series IDEs, including IntelliJ IDEA, PyCharm, WebStorm, etc.	URL	Free
Plandex	Open source, terminal-based AI programming engine for complex tasks	GitHub	Free
Roundtable	Zero-configuration MCP server that unifies multiple AI coding assistants for enhanced development workflows. Intelligent client management platform enabling seamless coordination between Claude Code, Cursor, GPT-4, and other AI development tools.	GitHub Website	Free
Mistral/Codestral	Empowering developers and democratising coding with Mistral AI., models:https://huggingface.co/mistralai/Codestral-22B-v0.1	URL	Free
Kodus	Open Source Code Review Agent	GitHub	Free/Paid
Kagan	AI-powered Kanban TUI for autonomous development workflows. Integrates with Claude Code and OpenCode for ticket-driven AI coding with git worktree isolation and MCP server support.	GitHub	Free

AI Image Creation

Name	Description	Links	Fees
Nano Banana/Nano Banana Pro	Google's advanced AI model for image generation and editing. No. 1 in the LMArea Text to Image and Image Edit leadboard. Online website: 1. gemini 2.aistudio 3. lmarea.ai	URL	Free/Paid
Z-Image	Z-Image is a high-performance image generation model recently open-sourced by Alibaba's Tongyi Lab. It strikes a balance between "extreme speed" and "high quality," making it highly suitable for scenarios requiring rapid image generation. Z-Image-Turbo Online Demo: https://huggingface.co/spaces/mrfakename/Z-Image-Turbo	Github	Free
Midjourney	Enter text or pictures to create pictures	URL	Paid
ChatGPT Images	GPT Image 1.5	URL	Free/Paid
Photoshop AI	Adobe Photoshop generative-fill	URL	Paid
Stable diffusion webui	Open source project, input text or pictures to create pictures, Stable diffusion webui is the GUI of Stable diffusion, and it is an image user interface that visualizes stable diffusion. It also integrates many other useful extension scripts.	GitHub	Free
civitai	civitai.com is a website platform for sharing AI image creation model resources, with a large number of models, has become the main model exchange place in the SD open source community	URL	Free
clipdrop	clipdrop by stability.ai. Has many AI image processing tools, such as stable diffusion XL, uncrop, reimage XL, stable doodle.	URL	Free/Paid
firefly	Adobe's AI image processing web site	URL	Free/Paid
ideogram.ai	Enter text to create pictures. A product developed by a company founded by many ex-Googlers	URL	Free/Paid
Nero AI	AI picture upscale, AI repair scratches, AI picture coloring, AI picture noise removal, AI one-click to change the background, AI magical erasing pen, AI portrait. API doc：https://ai.nero.com/ai-api/docs/	URL	Paid/Trial
Skybox AI	Generate 360-degree panoramic images using text prompts	URL	Free/Paid
remove.bg	Remove Image Background	URL	Free/Paid
ControlNet	ControlNet is a neural network structure to control diffusion models by adding extra conditions.	Github	Free
PixelPanda	AI-powered platform that creates professional product photos, marketing images, UGC-style videos, and AI avatars — no camera or studio needed.	URL	Free/Paid

Video Creation

Name	Description	Links	Fees
Wan2.6	AI Video Creation Tool by Alibaba	URL	Paid/Free trial
Sora	Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions.	URL	Paid
KLING AI	AI Video Creation Tool by kuaishou. Support text to video, image to video, start-end frame and motion control	URL	Free/Paid
hailuoai	AI Video Creation Tool by Minimax	URL	Free/Paid
Dream Machine	By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images.Official introductory video	URL	Free/Paid
capcut	Subtitle-generated speech, speech recognition, and very convenient and powerful video editing	URL	Free/Paid
Runway	Gen-2: Text/Image to video Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks	URL	Paid/Free trial
pixverse	Create Amazing AI Videos from Text & Photos	URL	Paid/Free trial
Pika	Text/Image to video	URL	Paid/Free trial
Fliki	A website that converts text into audio and video	URL	Free/Paid
d-id	Generate digital human dubbing video based on text	URL	Paid/Free trial
HeyGen	Generate digital human dubbing video based on text	URL	Paid/Free trial
AnimateDiff	AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training.	Github	Free
vivago.ai/video	Text to Video; Image to Video; 4K enhance	URL	Free

AI Cloud Platform

Name	Description	Links	Fees
together.ai	The AI Acceleration Cloud. Train, fine-tune-and run inference on AI models blazing fast, at low cost, and at production scale.	URL	Free/Paid

LLM Prompts

Name	Description	Links	Fees
f/awesome-chatgpt-prompts	This repo includes ChatGPT prompt curation to use ChatGPT better.	Github	Free

LLM training platform

Name	Description	Links	Fees
lm-sys/FastChat	An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.	Github	Free

Writing

Name	Description	Links	Fees
Notion AI	AI-assisted note-taking software	URL	with certain free AI trials, AI features $10/month
Deep L Write	English and German writing tools to fix writing errors and rewrite sentences promptly.	URL	Free version to use with text word limit / paid upgrade available
grammarly	Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor.	URL	Free/Paid
TextCraft	Add-in for Microsoft Word that seamlessly integrates essential AI tools, including text generation, proofreading, and more, directly into the user interface.	URL	Free

Translation

Name	Description	Links	Fees
Google Translate	Support text, picture, document and URL	URL	Free
Deep L	Accurate and instant translation tool, currently supporting 31 languages	URL	Free/Paid
immersive-translate	Open source project. Immersive bilingual web translation extension	GitHub	Free
openai-translator	Open source project. Crossword translation browser plugin and cross-platform desktop application based on ChatGPT API	GitHub	Free, requires OpenAI API key
RTranslator	RTranslator is an open-source, free, and offline real-time translation app for Android.	Github	Free

Speech Recognition

Name	Description	Links	Fees
whisper	OpenAPI open source robust speech recognition model through large-scale weak supervision	GitHub	Free
whisper.cpp	Port of OpenAI's Whisper model in C/C++	Github	Free
buzz	An open source desktop software based on OpenAI's Whisper to recognize speech and generate subtitles	GitHub	Free
WhisperDesktop	Open source, OpenAI-based Whisper, a desktop application for Windows, uses the GPU for processing, which will be faster than on the CPU with good GPU performance.	GitHub	Free
whisperX	WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)	whisperX	Free
whisper-web	ML-powered speech recognition directly in your browser. Built with Transformers.js. Demo	GitHub	Free

Text To Speech

Name	Description	Links	Fees
index-tts2	Bilibili's Open-Source Industrial-Grade Controllable High-Efficiency Zero-Sample Text-to-Speech System. Online Demo: https://huggingface.co/spaces/IndexTeam/IndexTTS-2-Demo Paper: https://arxiv.org/abs/2506.21619	Github	Free
Azure Text to speech	The best and most realistic voice tools currently available	URL	Paid / 500,000 characters per month free
Hailuo AI Text to Speech	Offer over 300 voices in 17 languages and multiple accents, covering a wide range of styles and age groups to provide the voice effects you need.	URL	Limited-time Free
coqui-ai/tts	A deep learning toolkit for Text-to-Speech, battle-tested in research and production Online Demo: https://huggingface.co/spaces/coqui/xtts	Github	Free
elevenlabs	Intelligent AI Text to Speech	URL	Free/Paid
netease-youdao/EmotiVoice	A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, and with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others.	Github	Free
tetos	A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS	Github	Free
ChatTTS	ChatTTS is a text-to-speech model designed specifically for dialogue scenario such as LLM assistant. It supports both English and Chinese languages. Our model is trained with 100,000+ hours composed of chinese and english. Website：https://chattts.com/	Github	Free

Music Recognition

Name	Description	Links	Fee
shazam	Download the shazaom app for music recognition, which is pretty fast	URL	Free

Voice Processing

Name	Description	Links	Fees
so-vits-svc	SoftVC VITS Singing Voice Conversion.	GitHub	Free
vocalremover	Extract vocal and music	URL	Free
lala.ai	Extract vocal, accompaniment and various instruments from any audio and video	URL	Free/Paid

AI generated music or sound effects

Name	Description	Link	Fees
suno.ai	The AI music creation tool Suno can generate custom songs based on text prompts in mere second	URL
udio	Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks.	URL
mureka.ai	Text to music	URL	Free/Paid
elevenlabs/sound-effects	Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community.	URL	Free
suno-ai/bark	Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects.	Github	Free
audiocraft	Open source library for audio/music generation by Meta, which mainly includes two models, MusicGen: text-to-music model, AudioGen: text-generated sound model. MusicGen Online Demo	GitHub	Free
Stable Audio	AI music and sound effect generation application by stability.ai	URL	Free/Paid
OptimizerAI	Sound effect generation Official Introduction	URL	Free/Paid
SFX Engine	AI Sound effect generation	URL	Free/Paid

Speech translation

Name	Description	Links	Fees
Seamless	Seamless is a family of AI models that enable more natural and authentic communication across languages.Online Demo	Github	Free

Video Content Summary

Name	Description	Links	Fees
ChatGPT for YouTube	Chrome plugin, quickly summarize Youtube video content, need to log in chatgpt account or apikey	URL	Free
Chat Youtube	Give a Youtube link, it will give a summary, and you can ask it questions about the content of the video	URL	Free

Academic research

Name	Description	Links	Fees
alphaxiv	An open academic discussion community based on the arXiv platform that allows users to comment line-by-line, ask questions, and interact in real-time by replacing the paper's linking domain (arxiv.org for alphaxiv.org) directly on the paper's page. And provides AI features such as Ask AI and AI-generated article blogs	URL	Free

OCR

Name	Description	Links	Fees
Umi-OCR	Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services.	Github	Free
allenai/olmocr	A toolkit for training language models to work with PDF documents in the wild. Online demo: https://olmocr.allenai.org/	Github	Free

Real Estate

Name	Description	Links	Fees
AI Virtual Staging	Stage empty rooms instantly with realistic furniture using AI. MLS compliant, fast, and affordable virtual staging for real estate listings. With furniture removal, day to dusk, 2D to 3D floor plan features support.	URL

AI Detection

Name	Description	Links	Fees
AI Detect Lab	Professional AI image and deepfake detector specifically optimized for Midjourney v7 and Flux.	URL	Free

For Tasks:

Click tags to check more tools for each tasks

write code translate text answer questions generate images create music

For Jobs:

content writer software engineer translator ai researcher data scientist

Alternative AI tools for Awesome-AITools

Similar Open Source Tools

Awesome-AITools

github

: 5.6k

MiniCPM-V-CookBook

MiniCPM-V & o Cookbook is a comprehensive repository for building multimodal AI applications effortlessly. It provides easy-to-use documentation, supports a wide range of users, and offers versatile deployment scenarios. The repository includes live demonstrations, inference recipes for vision and audio capabilities, fine-tuning recipes, serving recipes, quantization recipes, and a framework support matrix. Users can customize models, deploy them efficiently, and compress models to improve efficiency. The repository also showcases awesome works using MiniCPM-V & o and encourages community contributions.

github

: 192

openkore

OpenKore is a custom client and intelligent automated assistant for Ragnarok Online. It is a free, open source, and cross-platform program (Linux, Windows, and MacOS are supported). To run OpenKore, you need to download and extract it or clone the repository using Git. Configure OpenKore according to the documentation and run openkore.pl to start. The tool provides a FAQ section for troubleshooting, guidelines for reporting issues, and information about botting status on official servers. OpenKore is developed by a global team, and contributions are welcome through pull requests. Various community resources are available for support and communication. Users are advised to comply with the GNU General Public License when using and distributing the software.

github

: 1.3k

generative-ai-with-javascript

The 'Generative AI with JavaScript' repository is a comprehensive resource hub for JavaScript developers interested in delving into the world of Generative AI. It provides code samples, tutorials, and resources from a video series, offering best practices and tips to enhance AI skills. The repository covers the basics of generative AI, guides on building AI applications using JavaScript, from local development to deployment on Azure, and scaling AI models. It is a living repository with continuous updates, making it a valuable resource for both beginners and experienced developers looking to explore AI with JavaScript.

github

: 1.1k

RustySEO

RustySEO is a free, modern SEO/GEO toolkit designed to help users crawl and analyze websites and server logs without crawl limits. It is an all-in-one, cross-platform marketing toolkit for comprehensive SEO & GEO analysis, providing actionable insights into marketing and SEO strategies. The tool offers features such as shallow & deep crawl, technical diagnostics, on-page SEO analysis, dashboards, reporting, topic and keyword generators, AI chatbot, crawl history, image conversion and optimization, and more. RustySEO aims to be a robust, free alternative to costly commercial SEO tools, with integrations like Google PageSpeed Insights, Google Gemini, and more.

github

: 151

together-cookbook

The Together Cookbook is a collection of code and guides designed to help developers build with open source models using Together AI. The recipes provide examples on how to chain multiple LLM calls, create agents that route tasks to specialized models, run multiple LLMs in parallel, break down tasks into parallel subtasks, build agents that iteratively improve responses, perform LoRA fine-tuning and inference, fine-tune LLMs for repetition, improve summarization capabilities, fine-tune LLMs on multi-step conversations, implement retrieval-augmented generation, conduct multimodal search and conditional image generation, visualize vector embeddings, improve search results with rerankers, implement vector search with embedding models, extract structured text from images, summarize and evaluate outputs with LLMs, generate podcasts from PDF content, and get LLMs to generate knowledge graphs.

github

: 769

ai-agents-for-beginners

AI Agents for Beginners is a course that covers the fundamentals of building AI Agents. It consists of 10 lessons with code examples using Azure AI Foundry and GitHub Model Catalogs. The course utilizes AI Agent frameworks and services from Microsoft, such as Azure AI Agent Service, Semantic Kernel, and AutoGen. Learners can access written lessons, Python code samples, and additional learning resources for each lesson. The course encourages contributions and suggestions from the community and provides multi-language support for learners worldwide.

github

: 38.7k

txtai

Txtai is an all-in-one embeddings database for semantic search, LLM orchestration, and language model workflows. It combines vector indexes, graph networks, and relational databases to enable vector search with SQL, topic modeling, retrieval augmented generation, and more. Txtai can stand alone or serve as a knowledge source for large language models (LLMs). Key features include vector search with SQL, object storage, topic modeling, graph analysis, multimodal indexing, embedding creation for various data types, pipelines powered by language models, workflows to connect pipelines, and support for Python, JavaScript, Java, Rust, and Go. Txtai is open-source under the Apache 2.0 license.

github

: 12.2k

generative-ai-for-beginners

This course has 18 lessons. Each lesson covers its own topic so start wherever you like! Lessons are labeled either "Learn" lessons explaining a Generative AI concept or "Build" lessons that explain a concept and code examples in both **Python** and **TypeScript** when possible. Each lesson also includes a "Keep Learning" section with additional learning tools. **What You Need** * Access to the Azure OpenAI Service **OR** OpenAI API - _Only required to complete coding lessons_ * Basic knowledge of Python or Typescript is helpful - *For absolute beginners check out these Python and TypeScript courses. * A Github account to fork this entire repo to your own GitHub account We have created a **Course Setup** lesson to help you with setting up your development environment. Don't forget to star (🌟) this repo to find it easier later. ## 🧠 Ready to Deploy? If you are looking for more advanced code samples, check out our collection of Generative AI Code Samples in both **Python** and **TypeScript**. ## 🗣️ Meet Other Learners, Get Support Join our official AI Discord server to meet and network with other learners taking this course and get support. ## 🚀 Building a Startup? Sign up for Microsoft for Startups Founders Hub to receive **free OpenAI credits** and up to **$150k towards Azure credits to access OpenAI models through Azure OpenAI Services**. ## 🙏 Want to help? Do you have suggestions or found spelling or code errors? Raise an issue or Create a pull request ## 📂 Each lesson includes: * A short video introduction to the topic * A written lesson located in the README * Python and TypeScript code samples supporting Azure OpenAI and OpenAI API * Links to extra resources to continue your learning ## 🗃️ Lessons | | Lesson Link | Description | Additional Learning | | :-: | :------------------------------------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------: | ------------------------------------------------------------------------------ | | 00 | Course Setup | **Learn:** How to Setup Your Development Environment | Learn More | | 01 | Introduction to Generative AI and LLMs | **Learn:** Understanding what Generative AI is and how Large Language Models (LLMs) work. | Learn More | | 02 | Exploring and comparing different LLMs | **Learn:** How to select the right model for your use case | Learn More | | 03 | Using Generative AI Responsibly | **Learn:** How to build Generative AI Applications responsibly | Learn More | | 04 | Understanding Prompt Engineering Fundamentals | **Learn:** Hands-on Prompt Engineering Best Practices | Learn More | | 05 | Creating Advanced Prompts | **Learn:** How to apply prompt engineering techniques that improve the outcome of your prompts. | Learn More | | 06 | Building Text Generation Applications | **Build:** A text generation app using Azure OpenAI | Learn More | | 07 | Building Chat Applications | **Build:** Techniques for efficiently building and integrating chat applications. | Learn More | | 08 | Building Search Apps Vector Databases | **Build:** A search application that uses Embeddings to search for data. | Learn More | | 09 | Building Image Generation Applications | **Build:** A image generation application | Learn More | | 10 | Building Low Code AI Applications | **Build:** A Generative AI application using Low Code tools | Learn More | | 11 | Integrating External Applications with Function Calling | **Build:** What is function calling and its use cases for applications | Learn More | | 12 | Designing UX for AI Applications | **Learn:** How to apply UX design principles when developing Generative AI Applications | Learn More | | 13 | Securing Your Generative AI Applications | **Learn:** The threats and risks to AI systems and methods to secure these systems. | Learn More | | 14 | The Generative AI Application Lifecycle | **Learn:** The tools and metrics to manage the LLM Lifecycle and LLMOps | Learn More | | 15 | Retrieval Augmented Generation (RAG) and Vector Databases | **Build:** An application using a RAG Framework to retrieve embeddings from a Vector Databases | Learn More | | 16 | Open Source Models and Hugging Face | **Build:** An application using open source models available on Hugging Face | Learn More | | 17 | AI Agents | **Build:** An application using an AI Agent Framework | Learn More | | 18 | Fine-Tuning LLMs | **Learn:** The what, why and how of fine-tuning LLMs | Learn More |

github

: 106.2k

SemanticFinder

SemanticFinder is a frontend-only live semantic search tool that calculates embeddings and cosine similarity client-side using transformers.js and SOTA embedding models from Huggingface. It allows users to search through large texts like books with pre-indexed examples, customize search parameters, and offers data privacy by keeping input text in the browser. The tool can be used for basic search tasks, analyzing texts for recurring themes, and has potential integrations with various applications like wikis, chat apps, and personal history search. It also provides options for building browser extensions and future ideas for further enhancements and integrations.

github

: 204

AI-For-Beginners

AI-For-Beginners is a comprehensive 12-week, 24-lesson curriculum designed by experts at Microsoft to introduce beginners to the world of Artificial Intelligence (AI). The curriculum covers various topics such as Symbolic AI, Neural Networks, Computer Vision, Natural Language Processing, Genetic Algorithms, and Multi-Agent Systems. It includes hands-on lessons, quizzes, and labs using popular frameworks like TensorFlow and PyTorch. The focus is on providing a foundational understanding of AI concepts and principles, making it an ideal starting point for individuals interested in AI.

github

: 42.1k

chat-your-doc

Chat Your Doc is an experimental project exploring various applications based on LLM technology. It goes beyond being just a chatbot project, focusing on researching LLM applications using tools like LangChain and LlamaIndex. The project delves into UX, computer vision, and offers a range of examples in the 'Lab Apps' section. It includes links to different apps, descriptions, launch commands, and demos, aiming to showcase the versatility and potential of LLM applications.

github

: 67

FFAIVideo

FFAIVideo is a lightweight node.js project that utilizes popular AI LLM to intelligently generate short videos. It supports multiple AI LLM models such as OpenAI, Moonshot, Azure, g4f, Google Gemini, etc. Users can input text to automatically synthesize exciting video content with subtitles, background music, and customizable settings. The project integrates Microsoft Edge's online text-to-speech service for voice options and uses Pexels website for video resources. Installation of FFmpeg is essential for smooth operation. Inspired by MoneyPrinterTurbo, MoneyPrinter, and MsEdgeTTS, FFAIVideo is designed for front-end developers with minimal dependencies and simple usage.

github

: 55

llava-docker

This Docker image for LLaVA (Large Language and Vision Assistant) provides a convenient way to run LLaVA locally or on RunPod. LLaVA is a powerful AI tool that combines natural language processing and computer vision capabilities. With this Docker image, you can easily access LLaVA's functionalities for various tasks, including image captioning, visual question answering, text summarization, and more. The image comes pre-installed with LLaVA v1.2.0, Torch 2.1.2, xformers 0.0.23.post1, and other necessary dependencies. You can customize the model used by setting the MODEL environment variable. The image also includes a Jupyter Lab environment for interactive development and exploration. Overall, this Docker image offers a comprehensive and user-friendly platform for leveraging LLaVA's capabilities.

github

: 59

dataforce.studio

DataForce Studio is an open-source MLOps platform designed to help build, manage, and deploy AI/ML models with ease. It supports the entire model lifecycle, from creation to deployment and monitoring, within a user-friendly interface. The platform is in active early development, aiming to provide features like post-deployment monitoring, model deployment, data science agent, experiment snapshots, model cards, Python SDK, model registry, notebooks, in-browser runtime, and express tasks for prompt optimization and tabular data.

github

: 90

awesome-generative-ai-data-scientist

A curated list of 50+ resources to help you become a Generative AI Data Scientist. This repository includes resources on building GenAI applications with Large Language Models (LLMs), and deploying LLMs and GenAI with Cloud-based solutions.

github

: 425

For similar tasks

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

github

: 1.5k

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

github

: 159

onnxruntime-genai

ONNX Runtime Generative AI is a library that provides the generative AI loop for ONNX models, including inference with ONNX Runtime, logits processing, search and sampling, and KV cache management. Users can call a high level `generate()` method, or run each iteration of the model in a loop. It supports greedy/beam search and TopP, TopK sampling to generate token sequences, has built in logits processing like repetition penalties, and allows for easy custom scoring.

github

: 831

jupyter-ai

Jupyter AI connects generative AI with Jupyter notebooks. It provides a user-friendly and powerful way to explore generative AI models in notebooks and improve your productivity in JupyterLab and the Jupyter Notebook. Specifically, Jupyter AI offers: * An `%%ai` magic that turns the Jupyter notebook into a reproducible generative AI playground. This works anywhere the IPython kernel runs (JupyterLab, Jupyter Notebook, Google Colab, Kaggle, VSCode, etc.). * A native chat UI in JupyterLab that enables you to work with generative AI as a conversational assistant. * Support for a wide range of generative model providers, including AI21, Anthropic, AWS, Cohere, Gemini, Hugging Face, NVIDIA, and OpenAI. * Local model support through GPT4All, enabling use of generative AI models on consumer grade machines with ease and privacy.

github

: 3.5k

khoj

Khoj is an open-source, personal AI assistant that extends your capabilities by creating always-available AI agents. You can share your notes and documents to extend your digital brain, and your AI agents have access to the internet, allowing you to incorporate real-time information. Khoj is accessible on Desktop, Emacs, Obsidian, Web, and Whatsapp, and you can share PDF, markdown, org-mode, notion files, and GitHub repositories. You'll get fast, accurate semantic search on top of your docs, and your agents can create deeply personal images and understand your speech. Khoj is self-hostable and always will be.

github

: 28.5k

langchain_dart

LangChain.dart is a Dart port of the popular LangChain Python framework created by Harrison Chase. LangChain provides a set of ready-to-use components for working with language models and a standard interface for chaining them together to formulate more advanced use cases (e.g. chatbots, Q&A with RAG, agents, summarization, extraction, etc.). The components can be grouped into a few core modules: * **Model I/O:** LangChain offers a unified API for interacting with various LLM providers (e.g. OpenAI, Google, Mistral, Ollama, etc.), allowing developers to switch between them with ease. Additionally, it provides tools for managing model inputs (prompt templates and example selectors) and parsing the resulting model outputs (output parsers). * **Retrieval:** assists in loading user data (via document loaders), transforming it (with text splitters), extracting its meaning (using embedding models), storing (in vector stores) and retrieving it (through retrievers) so that it can be used to ground the model's responses (i.e. Retrieval-Augmented Generation or RAG). * **Agents:** "bots" that leverage LLMs to make informed decisions about which available tools (such as web search, calculators, database lookup, etc.) to use to accomplish the designated task. The different components can be composed together using the LangChain Expression Language (LCEL).

github

: 660

danswer

Danswer is an open-source Gen-AI Chat and Unified Search tool that connects to your company's docs, apps, and people. It provides a Chat interface and plugs into any LLM of your choice. Danswer can be deployed anywhere and for any scale - on a laptop, on-premise, or to cloud. Since you own the deployment, your user data and chats are fully in your own control. Danswer is MIT licensed and designed to be modular and easily extensible. The system also comes fully ready for production usage with user authentication, role management (admin/basic users), chat persistence, and a UI for configuring Personas (AI Assistants) and their Prompts. Danswer also serves as a Unified Search across all common workplace tools such as Slack, Google Drive, Confluence, etc. By combining LLMs and team specific knowledge, Danswer becomes a subject matter expert for the team. Imagine ChatGPT if it had access to your team's unique knowledge! It enables questions such as "A customer wants feature X, is this already supported?" or "Where's the pull request for feature Y?"

github

: 10.5k

infinity

Infinity is an AI-native database designed for LLM applications, providing incredibly fast full-text and vector search capabilities. It supports a wide range of data types, including vectors, full-text, and structured data, and offers a fused search feature that combines multiple embeddings and full text. Infinity is easy to use, with an intuitive Python API and a single-binary architecture that simplifies deployment. It achieves high performance, with 0.1 milliseconds query latency on million-scale vector datasets and up to 15K QPS.

github

: 4.4k

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

github

: 1.1k

LLMStack

github

: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

github

: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

github

: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

github

: 2.9k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

github

: 32.9k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

github

: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

github

: 675