
AITreasureBox
🤖 Collect practical AI repos, tools, websites, papers and tutorials on AI. 实用的AI百宝箱 💎
Stars: 663

AITreasureBox is a comprehensive collection of AI tools and resources designed to simplify and accelerate the development of AI projects. It provides a wide range of pre-trained models, datasets, and utilities that can be easily integrated into various AI applications. With AITreasureBox, developers can quickly prototype, test, and deploy AI solutions without having to build everything from scratch. Whether you are working on computer vision, natural language processing, or reinforcement learning projects, AITreasureBox has something to offer for everyone. The repository is regularly updated with new tools and resources to keep up with the latest advancements in the field of artificial intelligence.
README:
English | 中文
Collect practical AI repos, tools, websites, papers and tutorials on AI.
Translated from ChatGPT, picture from Midjourney.
updated repos and stars every 2 hours and re-ranking automatically.
No. |
Repos |
Description |
---|---|---|
1 |
codecrafters-io/build-your-own-x |
Master programming by recreating your favorite technologies from scratch. |
2 |
sindresorhus/awesome |
😎 Awesome lists about all kinds of interesting topics |
3 |
public-apis/public-apis |
A collective list of free APIs |
4 |
kamranahmedse/developer-roadmap |
Interactive roadmaps, guides and other educational content to help developers grow in their careers. |
5 |
vinta/awesome-python |
A curated list of awesome Python frameworks, libraries, software and resources |
6 |
practical-tutorials/project-based-learning |
Curated list of project-based tutorials |
7 |
TheAlgorithms/Python |
All Algorithms implemented in Python |
8 |
tensorflow/tensorflow |
An Open Source Machine Learning Framework for Everyone |
9 |
Significant-Gravitas/AutoGPT |
An experimental open-source attempt to make GPT-4 fully autonomous. |
10 |
jackfrued/Python-100-Days |
Python - 100天从新手到大师 |
11 |
AUTOMATIC1111/stable-diffusion-webui |
Stable Diffusion web UI |
12 |
ollama/ollama |
Get up and running with Llama 2, Mistral, Gemma, and other large language models. |
13 |
huggingface/transformers |
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. |
14 |
n8n-io/n8n |
Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services. |
15 |
f/awesome-chatgpt-prompts |
This repo includes ChatGPT prompt curation to use ChatGPT better. |
16 |
yt-dlp/yt-dlp |
A feature-rich command-line audio/video downloader |
⭐ 17 |
microsoft/PowerToys |
Windows system utilities to maximize productivity |
18 |
krahets/hello-algo |
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing |
19 |
justjavac/free-programming-books-zh_CN |
📚 免费的计算机编程类中文书籍,欢迎投稿 |
20 |
langchain-ai/langchain |
⚡ Building applications with LLMs through composability ⚡ |
21 |
langgenius/dify |
One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications. |
22 | 🔥langflow-ai/langflow |
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database. |
23 |
open-webui/open-webui |
User-friendly WebUI for LLMs (Formerly Ollama WebUI) |
24 |
goldbergyoni/nodebestpractices |
✅ The Node.js best practices list (July 2024) |
25 |
iptv-org/iptv |
Collection of publicly available IPTV channels from all over the world |
26 |
microsoft/generative-ai-for-beginners |
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/ |
27 |
pytorch/pytorch |
Tensors and Dynamic neural networks in Python with strong GPU acceleration |
28 |
neovim/neovim |
Vim-fork focused on extensibility and usability |
29 |
puppeteer/puppeteer |
JavaScript API for Chrome and Firefox |
30 |
mtdvio/every-programmer-should-know |
A collection of (mostly) technical things every software developer should know about |
31 |
supabase/supabase |
The open source Firebase alternative. |
32 |
openai/whisper |
Robust Speech Recognition via Large-Scale Weak Supervision |
33 |
comfyanonymous/ComfyUI |
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. |
34 |
ggerganov/llama.cpp |
Port of Facebook's LLaMA model in C/C++ |
35 |
ChatGPTNextWeb/NextChat |
✨ Light and Fast AI Assistant. Support: Web |
36 |
home-assistant/core |
🏡 Open source home automation that puts local control and privacy first. |
37 |
x1xhlol/system-prompts-and-models-of-ai-tools |
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models. |
38 |
3b1b/manim |
Animation engine for explanatory math videos |
39 |
bregman-arie/devops-exercises |
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions |
40 |
tensorflow/models |
Models and examples built with TensorFlow |
41 |
nomic-ai/gpt4all |
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. |
42 |
fighting41love/funNLP |
The Most Powerful NLP-Weapon Arsenal |
43 |
anuraghazra/github-readme-stats |
⚡ Dynamically generated stats for your GitHub readmes |
44 |
immich-app/immich |
High performance self-hosted photo and video management solution. |
45 |
hoppscotch/hoppscotch |
Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia) |
|
google-gemini/gemini-cli |
An open-source AI agent that brings the power of Gemini directly into your terminal |
|
elastic/elasticsearch |
Free and Open, Distributed, RESTful Search Engine |
48 |
clash-verge-rev/clash-verge-rev |
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux) |
49 |
hacksider/Deep-Live-Cam |
real time face swap and one-click video deepfake with only a single image (uncensored) |
50 |
d2l-ai/d2l-zh |
Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries |
51 |
abi/screenshot-to-code |
Drop in a screenshot and convert it to clean HTML/Tailwind/JS code |
52 |
grafana/grafana |
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more. |
53 |
josephmisiti/awesome-machine-learning |
A curated list of awesome Machine Learning frameworks, libraries and software. |
54 |
browser-use/browser-use |
Make websites accessible for AI agents |
55 |
binary-husky/gpt_academic |
Academic Optimization of GPT |
56 |
rasbt/LLMs-from-scratch |
Implementing a ChatGPT-like LLM from scratch, step by step |
57 |
python/cpython |
The Python programming language |
58 |
sherlock-project/sherlock |
Hunt down social media accounts by username across social networks |
59 |
apache/superset |
Apache Superset is a Data Visualization and Data Exploration Platform |
60 |
openai/openai-cookbook |
Examples and guides for using the OpenAI API |
⭐ 61 |
ansible/ansible |
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com. |
62 |
Shubhamsaboo/awesome-llm-apps |
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini, and open-source models. |
63 |
awesomedata/awesome-public-datasets |
A topic-centric list of HQ open datasets. |
64 |
AppFlowy-IO/AppFlowy |
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion. |
65 |
lobehub/lobe-chat |
🤖 Lobe Chat - an open-source, extensible (Function Calling), high-performance chatbot framework. It supports one-click free deployment of your private ChatGPT/LLM web application. |
66 |
binhnguyennus/awesome-scalability |
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems |
67 |
xtekky/gpt4free |
decentralizing the Ai Industry, free gpt-4/3.5 scripts through several reverse engineered API's ( poe.com, phind.com, chat.openai.com etc...) |
68 |
Stirling-Tools/Stirling-PDF |
#1 Locally hosted web application that allows you to perform various operations on PDF files |
69 |
infiniflow/ragflow |
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. |
70 |
twitter/the-algorithm |
Source code for Twitter's Recommendation Algorithm |
71 |
keras-team/keras |
Deep Learning for humans |
72 |
TheAlgorithms/Java |
All Algorithms implemented in Java |
73 |
scikit-learn/scikit-learn |
scikit-learn: machine learning in Python |
74 |
All-Hands-AI/OpenHands |
🙌 OpenHands: Code Less, Make More |
75 |
labmlai/annotated_deep_learning_paper_implementations |
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠 |
76 |
mlabonne/llm-course |
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. |
77 |
dair-ai/Prompt-Engineering-Guide |
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering |
78 |
OpenInterpreter/open-interpreter |
A natural language interface for computers |
79 |
meta-llama/llama |
Inference code for Llama models |
80 |
geekan/MetaGPT |
The Multi-Agent Meta Programming Framework: Given one line Requirement, return PRD, Design, Tasks, Repo |
81 |
nuxt/nuxt |
The Intuitive Vue Framework. |
82 |
hiyouga/LLaMA-Factory |
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM) |
83 |
vllm-project/vllm |
A high-throughput and memory-efficient inference and serving engine for LLMs |
84 |
imartinez/privateGPT |
Interact with your documents using the power of GPT, 100% privately, no data leaks |
85 |
PlexPt/awesome-chatgpt-prompts-zh |
ChatGPT Chinese Training Guide. Guidelines for various scenarios. Learn how to make it listen to you |
86 |
commaai/openpilot |
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars. |
87 |
ageitgey/face_recognition |
The world's simplest facial recognition api for Python and the command line |
88 |
firecrawl/firecrawl |
The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥 |
89 |
CorentinJ/Real-Time-Voice-Cloning |
Clone a voice in 5 seconds to generate arbitrary speech in real-time |
90 |
gpt-engineer-org/gpt-engineer |
Specify what you want it to build, the AI asks for clarification, and then builds it. |
91 |
deepfakes/faceswap |
Deepfakes Software For All |
92 |
lencx/ChatGPT |
🔮 ChatGPT Desktop Application (Mac, Windows and Linux) |
93 |
PaddlePaddle/PaddleOCR |
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices) |
94 |
meilisearch/meilisearch |
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow |
95 |
unclecode/crawl4ai |
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper |
96 |
OpenBB-finance/OpenBB |
Financial data aggregator for humans and AI agents. |
97 |
facebookresearch/segment-anything |
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. |
98 |
RVC-Boss/GPT-SoVITS |
1 min voice data can also be used to train a good TTS model! (few shot voice cloning) |
⭐ 99 |
pocketbase/pocketbase |
Open Source realtime backend in 1 file |
100 |
xai-org/grok-1 |
Grok open release |
101 |
cline/cline |
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, and more with your permission every step of the way. |
102 |
jgthms/bulma |
Modern CSS framework based on Flexbox |
103 |
microsoft/autogen |
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ |
104 |
Asabeneh/30-Days-Of-Python |
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw |
105 |
dani-garcia/vaultwarden |
Unofficial Bitwarden compatible server written in Rust, formerly known as bitwarden_rs |
106 |
Mintplex-Labs/anything-llm |
A full-stack application that turns any documents into an intelligent chatbot with a sleek UI and easier way to manage your workspaces. |
107 |
alist-org/alist |
🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。 |
108 |
Avik-Jain/100-Days-Of-ML-Code |
100 Days of ML Coding |
109 |
type-challenges/type-challenges |
Collection of TypeScript type challenges with online judge |
110 |
lllyasviel/Fooocus |
Focus on prompting and generating |
111 |
v2ray/v2ray-core |
A platform for building proxies to bypass network restrictions. |
112 |
typst/typst |
A new markup-based typesetting system that is powerful and easy to learn. |
113 |
ultralytics/ultralytics |
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite |
114 |
unslothai/unsloth |
5X faster 50% less memory LLM finetuning |
115 |
oobabooga/text-generation-webui |
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA. |
116 |
2dust/v2rayNG |
A V2Ray client for Android, support Xray core and v2fly core |
117 |
karpathy/nanoGPT |
The simplest, fastest repository for training/finetuning medium-sized GPTs |
118 |
jeecgboot/JeecgBoot |
🔥「企业级低代码平台」前后端分离架构SpringBoot 2.x/3.x,SpringCloud,Ant Design&Vue3,Mybatis,Shiro,JWT。强大的代码生成器让前后端代码一键生成,无需写任何代码! 引领新的开发模式OnlineCoding->代码生成->手工MERGE,帮助Java项目解决70%重复工作,让开发更关注业务,既能快速提高效率,帮助公司节省成本,同时又不失灵活性。 |
119 |
FlowiseAI/Flowise |
Drag & drop UI to build your customized LLM flow using LangchainJS |
120 |
opendatalab/MinerU |
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。 |
121 |
ggerganov/whisper.cpp |
Port of OpenAI's Whisper model in C/C++ |
122 |
coqui-ai/TTS |
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
123 |
NaiboWang/EasySpider |
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。 |
124 |
tw93/Pake |
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 Easily build lightweight cross-platform desktop applications with Rust. |
125 |
getsentry/sentry |
Developer-first error tracking and performance monitoring |
⭐ 126 |
apache/airflow |
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows |
127 |
freqtrade/freqtrade |
Free, open source crypto trading bot |
128 |
bevyengine/bevy |
A refreshingly simple data-driven game engine built in Rust |
129 |
Kong/kong |
🦍 The Cloud-Native API Gateway and AI Gateway. |
130 |
Stability-AI/stablediffusion |
High-Resolution Image Synthesis with Latent Diffusion Models |
131 |
mingrammer/diagrams |
🎨 Diagram as Code for prototyping cloud system architectures |
132 |
streamlit/streamlit |
Streamlit — A faster way to build and share data apps. |
133 |
KRTirtho/spotube |
🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile! |
134 |
hpcaitech/ColossalAI |
Making large AI models cheaper, faster and more accessible |
135 |
THUDM/ChatGLM-6B |
ChatGLM-6B: An Open Bilingual Dialogue Language Model |
136 |
psf/black |
The uncompromising Python code formatter |
137 |
SimplifyJobs/Summer2025-Internships |
Collection of Summer 2025 tech internships! |
138 |
fastlane/fastlane |
🚀 The easiest way to automate building and releasing your iOS and Android apps |
139 |
penpot/penpot |
Penpot: The open-source design tool for design and code collaboration |
140 |
microsoft/DeepSpeed |
A deep learning optimization library that makes distributed training and inference easy, efficient, and effective |
141 |
LC044/WeChatMsg |
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手 |
142 |
virattt/ai-hedge-fund |
An AI Hedge Fund Team |
143 |
gradio-app/gradio |
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! |
144 |
harry0703/MoneyPrinterTurbo |
Generate short videos with one click using a large model |
145 |
upscayl/upscayl |
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy. |
146 |
mem0ai/mem0 |
The memory layer for Personalized AI |
147 |
lm-sys/FastChat |
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5. |
148 |
zhayujie/chatgpt-on-wechat |
Wechat robot based on ChatGPT, which uses OpenAI api and itchat library |
149 |
ray-project/ray |
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads. |
150 |
openai/codex |
Lightweight coding agent that runs in your terminal |
151 |
suno-ai/bark |
🔊 Text-Prompted Generative Audio Model |
152 |
QuivrHQ/quivr |
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation. |
153 |
DS4SD/docling |
Get your docs ready for gen AI |
154 |
karanpratapsingh/system-design |
Learn how to design systems at scale and prepare for system design interviews |
155 |
2noise/ChatTTS |
ChatTTS is a generative speech model for daily dialogue. |
⭐ 156 |
crewAIInc/crewAI |
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks. |
157 |
menloresearch/jan |
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. |
158 |
LAION-AI/Open-Assistant |
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so. |
159 |
DataExpert-io/data-engineer-handbook |
This is a repo with links to everything you'd ever want to learn about data engineering |
160 |
freeCodeCamp/devdocs |
API Documentation Browser |
161 |
lapce/lapce |
Lightning-fast and Powerful Code Editor written in Rust |
162 |
paul-gauthier/aider |
aider is GPT powered coding in your terminal |
163 |
TencentARC/GFPGAN |
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration. |
164 |
milvus-io/milvus |
A cloud-native vector database, storage for next generation AI applications |
165 |
babysor/MockingBird |
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time |
166 |
Bin-Huang/chatbox |
A desktop app for GPT-4 / GPT-3.5 (OpenAI API) that supports Windows, Mac & Linux |
167 |
hiroi-sora/Umi-OCR |
OCR图片转文字识别软件,完全离线。截屏/批量导入图片,支持多国语言、合并段落、竖排文字。可排除水印区域,提取干净的文本。基于 PaddleOCR 。 |
168 |
google-research/google-research |
Google Research |
169 |
microsoft/ai-agents-for-beginners |
10 Lessons to Get Started Building AI Agents |
170 |
chatchat-space/Langchain-Chatchat |
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain |
171 |
HeyPuter/puter |
🌐 The Internet OS! Free, Open-Source, and Self-Hostable. |
172 |
XingangPan/DragGAN |
Code for DragGAN (SIGGRAPH 2023) |
173 |
mindsdb/mindsdb |
The platform for customizing AI from enterprise data |
174 |
pola-rs/polars |
Fast multi-threaded, hybrid-out-of-core query engine focussing on DataFrame front-ends |
175 |
mudler/LocalAI |
🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities. |
176 |
roboflow/supervision |
We write your reusable computer vision tools. 💜 |
177 |
microsoft/TaskMatrix |
Talking, Drawing and Editing with Visual Foundation Models |
178 |
myshell-ai/OpenVoice |
Instant voice cloning by MyShell |
179 |
deepseek-ai/awesome-deepseek-integration |
|
180 |
pathwaycom/pathway |
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. |
181 |
google/jax |
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more |
182 |
Pythagora-io/gpt-pilot |
PoC for a scalable dev tool that writes entire apps from scratch while the developer oversees the implementation |
183 |
danielmiessler/Fabric |
Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere. |
184 |
hashicorp/vault |
A tool for secrets management, encryption as a service, and privileged access management |
185 |
lllyasviel/ControlNet |
Let us control diffusion models! |
186 |
TheAlgorithms/C-Plus-Plus |
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes. |
187 |
agno-agi/agno |
Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, knowledge, tools and reasoning. |
188 |
facebookresearch/detectron2 |
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. |
189 |
anthropics/claude-code |
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands. |
190 |
DataTalksClub/data-engineering-zoomcamp |
Free Data Engineering course! |
191 |
drawdb-io/drawdb |
Free, simple, and intuitive online database design tool and SQL generator. |
192 |
xinntao/Real-ESRGAN |
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. |
193 |
mckaywrigley/chatbot-ui |
The open-source AI chat interface for everyone. |
194 |
chatanywhere/GPT_API_free |
Free ChatGPT API Key, Free ChatGPT API, supports GPT-4 API (free), ChatGPT offers a free domestic forwarding API that allows direct connections without the need for a proxy. It can be used in conjunction with software/plugins like ChatBox, significantly reducing interface usage costs. Enjoy unlimited and unrestricted chatting within China |
195 |
TabbyML/tabby |
Self-hosted AI coding assistant |
196 |
alibaba/nacos |
an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications. |
197 |
Chanzhaoyu/chatgpt-web |
A demonstration website built with Express and Vue3 called ChatGPT |
198 |
v2fly/v2ray-core |
A platform for building proxies to bypass network restrictions. |
199 |
mouredev/Hello-Python |
Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA... |
200 |
RVC-Project/Retrieval-based-Voice-Conversion-WebUI |
Voice data <= 10 mins can also be used to train a good VC model! |
201 |
facebookresearch/fairseq |
Facebook AI Research Sequence-to-Sequence Toolkit written in Python. |
202 |
shardeum/shardeum |
Shardeum is an EVM based autoscaling blockchain |
203 |
yunjey/pytorch-tutorial |
PyTorch Tutorial for Deep Learning Researchers |
204 |
paperless-ngx/paperless-ngx |
A community-supported supercharged document management system: scan, index and archive all your documents |
205 |
google/comprehensive-rust |
This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust. |
206 |
pathwaycom/llm-app |
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more. |
207 |
google-ai-edge/mediapipe |
Cross-platform, customizable ML solutions for live and streaming media. |
208 |
getcursor/cursor |
An editor made for programming with AI |
209 |
mli/paper-reading |
Classic Deep Learning and In-Depth Reading of New Papers Paragraph by Paragraph |
210 |
ocrmypdf/OCRmyPDF |
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched |
211 |
XTLS/Xray-core |
Xray, Penetrates Everything. Also the best v2ray-core, with XTLS support. Fully compatible configuration. |
212 |
khoj-ai/khoj |
Your AI second brain. A copilot to get answers to your questions, whether they be from your own notes or from the internet. Use powerful, online (e.g gpt4) or private, local (e.g mistral) LLMs. Self-host locally or use our web app. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp. |
213 |
exo-explore/exo |
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ |
214 |
huggingface/diffusers |
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX. |
215 |
openai/CLIP |
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image |
216 |
linexjlin/GPTs |
leaked prompts of GPTs |
217 |
firecracker-microvm/firecracker |
Secure and fast microVMs for serverless computing. |
218 |
s0md3v/roop |
one-click deepfake (face swap) |
219 |
tatsu-lab/stanford_alpaca |
Code and documentation to train Stanford's Alpaca models, and generate the data. |
220 |
Lightning-AI/pytorch-lightning |
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes. |
221 |
tinygrad/tinygrad |
You like pytorch? You like micrograd? You love tinygrad! ❤️ |
222 |
microsoft/qlib |
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL. |
223 |
danny-avila/LibreChat |
Enhanced ChatGPT Clone: Features OpenAI, GPT-4 Vision, Bing, Anthropic, OpenRouter, Google Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development |
224 |
facebook/folly |
An open-source C++ library developed and used at Facebook. |
225 |
iperov/DeepFaceLive |
Real-time face swap for PC streaming or video calls |
226 |
JushBJJ/Mr.-Ranedeer-AI-Tutor |
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences. |
227 |
tokio-rs/tokio |
A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ... |
228 |
google-research/tuning_playbook |
A playbook for systematically maximizing the performance of deep learning models. |
229 |
meta-llama/llama3 |
The official Meta Llama 3 GitHub site |
230 |
PostHog/posthog |
🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host. |
|
continuedev/continue |
⏩ the open-source copilot chat for software development—bring the power of ChatGPT to VS Code |
|
feder-cr/Jobs_Applier_AI_Agent |
Auto_Jobs_Applier_AI_Agent aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way. |
233 |
openai/openai-python |
The OpenAI Python library provides convenient access to the OpenAI API from applications written in the Python language. |
⭐ 234 |
aquasecurity/trivy |
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more |
235 |
BerriAI/litellm |
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs) |
236 |
eugeneyan/applied-ml |
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production. |
⭐ 237 |
eriklindernoren/ML-From-Scratch |
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning. |
238 |
frappe/erpnext |
Free and Open Source Enterprise Resource Planning (ERP) |
239 |
acheong08/ChatGPT |
Reverse engineered ChatGPT API |
240 |
stanfordnlp/dspy |
DSPy: The framework for programming—not prompting—language models |
241 |
microsoft/graphrag |
A modular graph-based Retrieval-Augmented Generation (RAG) system |
242 |
svc-develop-team/so-vits-svc |
SoftVC VITS Singing Voice Conversion |
243 |
karpathy/llm.c |
LLM training in simple, raw C/CUDA |
244 |
taichi-dev/taichi |
Productive, portable, and performant GPU programming in Python. |
245 |
OpenBMB/ChatDev |
Create Customized Software using Natural Language Idea (through Multi-Agent Collaboration) |
246 |
academic/awesome-datascience |
📝 An awesome Data Science repository to learn and apply for real world problems. |
247 |
stanford-oval/storm |
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. |
248 |
gabime/spdlog |
Fast C++ logging library. |
249 |
Genesis-Embodied-AI/Genesis |
A generative world for general-purpose robotics & embodied AI learning. |
250 |
songquanpeng/one-api |
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI. |
251 |
ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code |
500 AI Machine learning Deep learning Computer vision NLP Projects with code |
252 |
Byaidu/PDFMathTranslate |
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker |
253 |
nrwl/nx |
Smart Monorepos · Fast CI |
254 |
Budibase/budibase |
Create business apps and automate workflows in minutes. Supports PostgreSQL, MySQL, MariaDB, MSSQL, MongoDB, Rest API, Docker, K8s, and more 🚀 No code / Low code platform.. |
255 |
opentofu/opentofu |
OpenTofu lets you declaratively manage your cloud infrastructure. |
256 |
airbnb/lottie-ios |
An iOS library to natively render After Effects vector animations |
257 |
deepinsight/insightface |
State-of-the-art 2D and 3D Face Analysis Project |
258 |
Stability-AI/generative-models |
Generative Models by Stability AI |
259 |
microsoft/semantic-kernel |
Integrate cutting-edge LLM technology quickly and easily into your apps |
260 |
Raphire/Win11Debloat |
A simple, easy to use PowerShell script to remove pre-installed apps from windows, disable telemetry, remove Bing from windows search as well as perform various other changes to declutter and improve your windows experience. This script works for both windows 10 and windows 11. |
261 |
invoke-ai/InvokeAI |
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products. |
262 |
ashishps1/awesome-system-design-resources |
This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems |
263 |
blakeblackshear/frigate |
NVR with realtime local object detection for IP cameras |
264 |
Vision-CAIR/MiniGPT-4 |
Enhancing Vision-language Understanding with Advanced Large Language Models |
265 |
qdrant/qdrant |
Qdrant - Vector Database for the next generation of AI applications. Also available in the cloud https://cloud.qdrant.io/ |
266 |
labring/FastGPT |
A platform that uses the OpenAI API to quickly build an AI knowledge base, supporting many-to-many relationships. |
267 |
ComposioHQ/composio |
Composio equips agents with well-crafted tools empowering them to tackle complex tasks |
268 |
jingyaogong/minimind |
「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练! |
269 |
apache/flink |
Apache Flink |
270 |
stretchr/testify |
A toolkit with common assertions and mocks that plays nicely with the standard library |
271 |
pydantic/pydantic |
Data validation using Python type hints |
272 |
facefusion/facefusion |
Next generation face swapper and enhancer |
273 |
Hannibal046/Awesome-LLM |
Awesome-LLM: a curated list of Large Language Model |
274 |
modularml/mojo |
The Mojo Programming Language |
275 |
google/flatbuffers |
FlatBuffers: Memory Efficient Serialization Library |
276 |
rancher/rancher |
Complete container management platform |
277 |
TheAlgorithms/Rust |
All Algorithms implemented in Rust |
278 |
warpdotdev/Warp |
Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster. |
279 |
HumanSignal/label-studio |
Label Studio is a multi-type data labeling and annotation tool with standardized output format |
280 |
yetone/openai-translator |
Browser extension and cross-platform desktop application for translation based on ChatGPT API |
281 |
bazelbuild/bazel |
a fast, scalable, multi-language and extensible build system |
282 |
QwenLM/Qwen3 |
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. |
283 |
microsoft/JARVIS |
a system to connect LLMs with ML community |
284 |
yuliskov/SmartTube |
SmartTube - an advanced player for set-top boxes and tvs running Android OS |
285 |
black-forest-labs/flux |
Official inference repo for FLUX.1 models |
286 |
CodePhiliaX/Chat2DB |
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more. |
287 |
openai/gpt-2 |
Code for the paper "Language Models are Unsupervised Multitask Learners" |
288 |
datawhalechina/self-llm |
“Open-Source Large Model Guide”: A concise tutorial for Chinese users on quick fine‑tuning (full‑parameter/LoRA) in Linux and deploying open-source LLMs and multimodal large models (MLLM). |
289 |
ItzCrazyKns/Perplexica |
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI |
290 |
haotian-liu/LLaVA |
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities. |
291 |
microsoft/OmniParser |
A simple screen parsing tool towards pure vision based GUI agent |
292 |
zulip/zulip |
Zulip server and web application. Open-source team chat that helps teams stay productive and focused. |
293 |
assafelovic/gpt-researcher |
GPT based autonomous agent that does online comprehensive research on any given topic |
294 |
gitroomhq/postiz-app |
📨 Schedule social posts, measure them, exchange with other members and get a lot of help from AI 🚀 |
295 |
PaddlePaddle/Paddle |
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署) |
296 |
Mozilla-Ocho/llamafile |
Distribute and run LLMs with a single file. |
297 |
chroma-core/chroma |
the AI-native open-source embedding database |
298 |
srbhr/Resume-Matcher |
Open Source Free ATS Tool to compare Resumes with Job Descriptions and create a score to rank them. |
299 |
Cinnamon/kotaemon |
An open-source RAG-based tool for chatting with your documents. |
300 |
fishaudio/fish-speech |
Brand new TTS solution |
301 |
valkey-io/valkey |
A flexible distributed key-value datastore that supports both caching and beyond caching workloads. |
302 |
CopilotKit/CopilotKit |
Build in-app AI chatbots 🤖, and AI-powered Textareas ✨, into react web apps. |
303 |
karpathy/minGPT |
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training |
304 |
facebookresearch/audiocraft |
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. |
305 |
jbhuang0604/awesome-computer-vision |
A curated list of awesome computer vision resources |
306 |
onlook-dev/onlook |
The Cursor for Designers • An Open-Source Visual Vibecoding Editor • Visually build, style, and edit your React App with AI |
307 |
LazyVim/LazyVim |
Neovim config for the lazy |
308 |
huggingface/agents-course |
This repository contains the Hugging Face Agents Course. |
309 |
grpc/grpc-go |
The Go language implementation of gRPC. HTTP/2 based RPC |
310 |
deepset-ai/haystack |
🔍 Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more. |
311 |
deepseek-ai/DeepSeek-Coder |
DeepSeek Coder: Let the Code Write Itself |
312 |
ml-explore/mlx |
MLX: An array framework for Apple silicon |
313 |
sst/opencode |
AI coding agent, built for the terminal. |
314 |
Sanster/lama-cleaner |
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures. |
315 |
sinaptik-ai/pandas-ai |
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG. |
316 |
e2b-dev/awesome-ai-agents |
A list of AI autonomous agents |
317 |
mlflow/mlflow |
Open source platform for the machine learning lifecycle |
318 |
tree-sitter/tree-sitter |
An incremental parsing system for programming tools |
319 |
spotDL/spotify-downloader |
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found). |
320 |
BuilderIO/gpt-crawler |
Crawl a site to generate knowledge files to create your own custom GPT from a URL |
321 |
ml-tooling/best-of-ml-python |
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. |
322 |
PromtEngineer/localGPT |
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private. |
323 |
yoheinakajima/babyagi |
uses OpenAI and Pinecone APIs to create, prioritize, and execute tasks, This is a pared-down version of the original Task-Driven Autonomous Agent |
324 |
Anjok07/ultimatevocalremovergui |
GUI for a Vocal Remover that uses Deep Neural Networks. |
325 |
microsoft/unilm |
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities |
326 |
wez/wezterm |
A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust |
327 |
eyaltoledano/claude-task-master |
An AI-powered task-management system you can drop into Cursor, Lovable, Windsurf, Roo, and others. |
328 |
ente-io/ente |
Fully open source, End to End Encrypted alternative to Google Photos and Apple Photos |
329 |
daytonaio/daytona |
Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code |
⭐ 330 |
microsoft/BitNet |
Official inference framework for 1-bit LLMs |
331 |
OpenBMB/MiniCPM-o |
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone |
332 |
mlc-ai/mlc-llm |
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices. |
333 |
openai/chatgpt-retrieval-plugin |
Plugins are chat extensions designed specifically for language models like ChatGPT, enabling them to access up-to-date information, run computations, or interact with third-party services in response to a user's request. |
334 |
VinciGit00/Scrapegraph-ai |
Python scraper based on AI |
335 |
HqWu-HITCS/Awesome-Chinese-LLM |
Organizing smaller, cost-effective, privately deployable open-source Chinese language models, including related datasets and tutorials |
336 |
datawhalechina/llm-cookbook |
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版 |
⭐ 337 |
kestra-io/kestra |
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow... |
|
trufflesecurity/trufflehog |
Find, verify, and analyze leaked credentials |
|
elastic/kibana |
Your window into the Elastic Stack |
|
guidance-ai/guidance |
A guidance language for controlling large language models. |
341 |
NirDiamant/RAG_Techniques |
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses. |
342 |
HKUDS/LightRAG |
"LightRAG: Simple and Fast Retrieval-Augmented Generation" |
343 |
LiLittleCat/awesome-free-chatgpt |
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated. |
344 |
serengil/deepface |
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python |
345 |
openai/swarm |
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. |
346 |
kenjihiranabe/The-Art-of-Linear-Algebra |
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone" |
347 |
PrefectHQ/prefect |
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines |
348 |
vanna-ai/vanna |
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄. |
349 |
samber/lo |
💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...) |
350 |
brave/brave-browser |
Brave browser for Android, iOS, Linux, macOS, Windows. |
351 |
DarkFlippers/unleashed-firmware |
Flipper Zero Unleashed Firmware |
352 |
anthropics/anthropic-cookbook |
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. |
353 |
thingsboard/thingsboard |
Open-source IoT Platform - Device management, data collection, processing and visualization. |
354 |
renovatebot/renovate |
Universal dependency update tool that fits into your workflows. |
355 |
neondatabase/neon |
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero. |
356 |
dipakkr/A-to-Z-Resources-for-Students |
✅ Curated list of resources for college students |
357 |
joonspk-research/generative_agents |
Generative Agents: Interactive Simulacra of Human Behavior |
358 |
stitionai/devika |
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI. |
359 |
teableio/teable |
✨ The Next Gen Airtable Alternative: No-Code Postgres |
360 |
Dao-AILab/flash-attention |
Fast and memory-efficient exact attention |
361 |
apify/crawlee |
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation. |
362 |
block/goose |
an open-source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM |
363 |
QwenLM/Qwen-7B |
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud. |
364 |
Zeyi-Lin/HivisionIDPhotos |
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。 |
|
asgeirtj/system_prompts_leaks |
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini |
|
yamadashy/repomix |
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini. |
367 |
tloen/alpaca-lora |
Instruct-tune LLaMA on consumer hardware |
368 |
ymcui/Chinese-LLaMA-Alpaca |
Chinese LLaMA & Alpaca LLMs |
369 |
jasontaylordev/CleanArchitecture |
Clean Architecture Solution Template for ASP.NET Core |
370 |
w-okada/voice-changer |
リアルタイムボイスチェンジャー Realtime Voice Changer |
371 |
Avaiga/taipy |
Turns Data and AI algorithms into production-ready web applications in no time. |
372 |
VikParuchuri/surya |
Accurate line-level text detection and recognition (OCR) in any language |
373 |
graphdeco-inria/gaussian-splatting |
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering" |
374 |
bytedance/UI-TARS-desktop |
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language. |
375 |
78/xiaozhi-esp32 |
Build your own AI friend |
376 |
langchain-ai/langgraph |
|
377 |
letta-ai/letta |
Letta (formerly MemGPT) is a framework for creating LLM services with memory. |
378 |
rlabbe/Kalman-and-Bayesian-Filters-in-Python |
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions. |
379 |
SillyTavern/SillyTavern |
LLM Frontend for Power Users. |
380 |
nats-io/nats-server |
High-Performance server for NATS.io, the cloud and edge native messaging system. |
381 |
anthropics/prompt-eng-interactive-tutorial |
Anthropic's Interactive Prompt Engineering Tutorial |
382 |
guillaumekln/faster-whisper |
Faster Whisper transcription with CTranslate2 |
383 |
transitive-bullshit/chatgpt-api |
Node.js client for the official ChatGPT API. |
384 |
1Panel-dev/MaxKB |
? Based on LLM large language model knowledge base Q&A system. Ready to use out of the box, supports quick integration into third-party business systems. Officially produced by 1Panel |
385 |
patchy631/ai-engineering-hub |
In-depth tutorials on LLMs, RAGs and real-world AI agent applications. |
386 |
kortix-ai/suna |
Suna - Open Source Generalist AI Agent |
387 |
vercel/ai-chatbot |
A full-featured, hackable Next.js AI chatbot built by Vercel |
388 |
meta-llama/llama-cookbook |
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services |
389 |
Mikubill/sd-webui-controlnet |
WebUI extension for ControlNet |
390 |
microsoft/onnxruntime |
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator |
391 |
getzep/graphiti |
Build Real-Time Knowledge Graphs for AI Agents |
392 |
sgl-project/sglang |
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable. |
393 |
m-bain/whisperX |
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
394 |
activepieces/activepieces |
Your friendliest open source all-in-one automation tool ✨ Workflow automation tool 100+ integration / Enterprise automation tool / Zapier Alternative |
395 |
apple/ml-stable-diffusion |
Stable Diffusion with Core ML on Apple Silicon |
396 |
microsoft/LightGBM |
A fast, distributed, high-performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. |
397 |
soxoj/maigret |
🕵️♂️ Collect a dossier on a person by username from thousands of sites |
398 |
deepseek-ai/Janus |
Janus-Series: Unified Multimodal Understanding and Generation Models |
399 |
vercel-labs/ai |
Build AI-powered applications with React, Svelte, and Vue |
400 |
ardalis/CleanArchitecture |
Clean Architecture Solution Template: A starting point for Clean Architecture with ASP.NET Core |
401 |
pgvector/pgvector |
Open-source vector similarity search for Postgres |
402 |
RPCS3/rpcs3 |
PS3 emulator/debugger |
403 |
karpathy/nn-zero-to-hero |
Neural Networks: Zero to Hero |
404 |
eosphoros-ai/DB-GPT |
Revolutionizing Database Interactions with Private LLM Technology |
405 |
princeton-nlp/SWE-agent |
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models |
|
AI4Finance-Foundation/FinGPT |
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model. |
|
pybind/pybind11 |
Seamless operability between C++11 and Python |
408 |
xx025/carrot |
Free ChatGPT Site List |
409 |
arc53/DocsGPT |
GPT-powered chat for documentation, chat with your documents |
410 |
huggingface/lerobot |
🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch |
411 |
anthropics/courses |
Anthropic's educational courses |
412 |
openai/evals |
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. |
413 |
IDEA-Research/Grounded-Segment-Anything |
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs |
414 |
dair-ai/ML-YouTube-Courses |
📺 Discover the latest machine learning / AI courses on YouTube. |
415 |
browserbase/stagehand |
An AI web browsing framework focused on simplicity and extensibility. |
416 |
ai16z/eliza |
Conversational Agent for Twitter and Discord |
417 |
triton-lang/triton |
Development repository for the Triton language and compiler |
418 |
musistudio/claude-code-router |
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic. |
419 |
TransformerOptimus/SuperAGI |
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably. |
420 |
aishwaryanr/awesome-generative-ai-guide |
A one stop repository for generative AI research updates, interview resources, notebooks and much more! |
421 |
nocobase/nocobase |
NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions. |
422 |
emilwallner/Screenshot-to-code |
A neural network that transforms a design mock-up into a static website. |
423 |
overleaf/overleaf |
A web-based collaborative LaTeX editor |
424 |
datawhalechina/happy-llm |
📚 A Beginner's Guide to the Principles and Practice of Large Language Models |
425 |
mlc-ai/web-llm |
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. |
426 |
meta-llama/codellama |
Inference code for CodeLlama models |
|
mastra-ai/mastra |
the TypeScript AI agent framework |
|
abseil/abseil-cpp |
Abseil Common Libraries (C++) |
429 |
NirDiamant/GenAI_Agents |
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems. |
430 |
BradyFU/Awesome-Multimodal-Large-Language-Models |
Latest Papers and Datasets on Multimodal Large Language Models |
431 |
espressif/esp-idf |
Espressif IoT Development Framework. Official development framework for Espressif SoCs. |
432 |
influxdata/telegraf |
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data. |
433 |
NVIDIA/open-gpu-kernel-modules |
NVIDIA Linux open GPU kernel module source |
434 |
ddbourgin/numpy-ml |
Machine learning, in numpy |
435 |
sunner/ChatALL |
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers |
436 |
mayooear/gpt4-pdf-chatbot-langchain |
GPT4 & LangChain Chatbot for large PDF docs |
437 |
typescript-eslint/typescript-eslint |
✨ Monorepo for all the tooling which enables ESLint to support TypeScript |
438 |
apache/arrow |
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing |
439 |
umijs/umi |
A framework in react community ✨ |
440 |
KindXiaoming/pykan |
Kolmogorov Arnold Networks |
441 |
langfuse/langfuse |
🪢 Open source LLM engineering platform. Observability, metrics, evals, prompt management, testing, prompt playground, datasets, LLM evaluations -- 🍊YC W23 🤖 integrate via Typescript, Python / Decorators, OpenAI, Langchain, LlamaIndex, Litellm, Instructor, Mistral, Perplexity, Claude, Gemini, Vertex |
|
amplication/amplication |
🔥🔥🔥 The Only Production-Ready AI-Powered Backend Code Generation |
|
Stability-AI/StableLM |
Stability AI Language Models |
|
openai/tiktoken |
tiktoken is a fast BPE tokeniser for use with OpenAI's models. |
445 |
state-spaces/mamba |
Mamba: Linear-Time Sequence Modeling with Selective State Spaces |
446 |
marimo-team/marimo |
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git. |
447 |
THUDM/ChatGLM2-6B |
ChatGLM2-6B: An Open Bilingual Chat LLM |
448 |
alibaba/lowcode-engine |
An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系 |
449 |
datawhalechina/leedl-tutorial |
《李宏毅深度学习教程》(李宏毅老师推荐👍),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases |
450 |
NVIDIA/NeMo |
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech) |
451 |
microsoft/Bringing-Old-Photos-Back-to-Life |
Bringing Old Photo Back to Life (CVPR 2020 oral) |
452 |
stefan-jansen/machine-learning-for-trading |
Code for Machine Learning for Algorithmic Trading, 2nd edition. |
453 |
mediar-ai/screenpipe |
24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust. |
454 |
mrdbourke/pytorch-deep-learning |
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course. |
455 |
xming521/WeClone |
🚀从聊天记录创造数字分身的一站式解决方案💡 使用微信聊天记录微调大语言模型,让大模型有“那味儿”,并绑定到聊天机器人,实现自己的数字分身。 数字克隆/数字分身/数字永生/声音克隆/LLM/大语言模型/微信聊天机器人/LoRA |
456 |
GaiZhenbiao/ChuanhuChatGPT |
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. |
457 |
huggingface/trl |
Train transformer language models with reinforcement learning. |
458 |
steven-tey/novel |
Notion-style WYSIWYG editor with AI-powered autocompletions |
459 |
HandsOnLLM/Hands-On-Large-Language-Models |
Official code repo for the O'Reilly Book - "Hands-On Large Language Models" |
460 |
lukas-blecher/LaTeX-OCR |
pix2tex: Using a ViT to convert images of equations into LaTeX code. |
461 |
chidiwilliams/buzz |
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. |
462 |
naklecha/llama3-from-scratch |
llama3 implementation one matrix multiplication at a time |
463 |
nautechsystems/nautilus_trader |
A high-performance algorithmic trading platform and event-driven backtester |
464 |
stas00/ml-engineering |
Machine Learning Engineering Guides and Tools |
465 |
udecode/plate |
A rich-text editor powered by AI |
466 |
bleedline/aimoneyhunter |
AI Side Hustle Money Mega Collection: Teaching You How to Utilize AI for Various Side Projects to Earn Extra Income. |
467 |
Huanshere/VideoLingo |
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team |
468 |
xcanwin/KeepChatGPT |
Using ChatGPT is more efficient and smoother, perfectly solving ChatGPT network errors. No longer do you need to frequently refresh the webpage, saving over 10 unnecessary steps |
469 |
browser-use/web-ui |
Run AI Agent in your browser. |
470 |
HumanAIGC/AnimateAnyone |
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation |
471 |
fauxpilot/fauxpilot |
An open-source GitHub Copilot server |
472 |
LlamaFamily/Llama-Chinese |
Llama Chinese Community, the best Chinese Llama large model, fully open source and commercially available |
473 |
neonbjb/tortoise-tts |
A multi-voice TTS system trained with an emphasis on quality |
474 |
xenova/transformers.js |
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! |
475 |
dyad-sh/dyad |
Free, local, open-source AI app builder ✨ v0 / lovable / Bolt alternative 🌟 Star if you like it! |
476 |
plandex-ai/plandex |
An AI coding engine for complex tasks |
477 |
llmware-ai/llmware |
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models. |
478 |
adobe/react-spectrum |
A collection of libraries and tools that help you build adaptive, accessible, and robust user experiences. |
479 |
linshenkx/prompt-optimizer |
A prompt word optimizer, designed to assist in crafting high-quality prompts. |
480 |
unifyai/ivy |
Unified AI |
481 |
laramies/theHarvester |
E-mails, subdomains and names Harvester - OSINT |
482 |
Skyvern-AI/skyvern |
Automate browser-based workflows with LLMs and Computer Vision |
483 |
simstudioai/sim |
Sim is an open-source AI agent workflow builder. Sim's interface is a lightweight, intuitive way to rapidly build and deploy LLMs that connect with your favorite tools. |
484 |
lightaime/camel |
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society |
485 |
allenai/olmocr |
Toolkit for linearizing PDFs for LLM datasets/training |
486 |
BlinkDL/RWKV-LM |
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it combines the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding. |
487 |
Koenkk/zigbee2mqtt |
Zigbee 🐝 to MQTT bridge 🌉, get rid of your proprietary Zigbee bridges 🔨 |
488 |
THUDM/ChatGLM3 |
ChatGLM3 series: Open Bilingual Chat LLMs |
489 |
humanlayer/12-factor-agents |
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers? |
490 |
official-stockfish/Stockfish |
UCI chess engine |
491 |
microsoft/data-formulator |
🪄 Create rich visualizations with AI |
492 |
netease-youdao/QAnything |
Question and Answer based on Anything. |
493 |
getmaxun/maxun |
🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta] |
494 |
NVIDIA/Megatron-LM |
Ongoing research training transformer models at scale |
495 |
comet-ml/opik |
Open-source end-to-end LLM Development Platform |
496 |
danswer-ai/danswer |
Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc. |
497 |
RockChinQ/LangBot |
😎丰富生态、🧩支持扩展、🦄多模态 - 大模型原生即时通信机器人平台 🤖 |
498 |
DataTalksClub/mlops-zoomcamp |
Free MLOps course from DataTalks.Club |
499 |
fuergaosi233/wechat-chatgpt |
Use ChatGPT On Wechat via wechaty |
⭐ 500 |
wazuh/wazuh |
Wazuh - The Open Source Security Platform. Unified XDR and SIEM protection for endpoints and cloud workloads. |
501 |
UFund-Me/Qbot |
Qbot is an AI-oriented quantitative investment platform, which aims to realize the potential, empower AI technologies in quantitative investment |
502 |
wong2/chatgpt-google-extension |
A browser extension that enhances search engines with ChatGPT, this repos will not be updated from 2023-02-20 |
503 |
OpenTalker/SadTalker |
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation |
504 |
QwenLM/Qwen3-Coder |
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud. |
505 |
alphacep/vosk-api |
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node |
506 |
ggerganov/ggml |
Tensor library for machine learning |
507 |
volcengine/verl |
veRL: Volcano Engine Reinforcement Learning for LLM |
508 |
vosen/ZLUDA |
CUDA on AMD GPUs |
509 |
alibaba/MNN |
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:MNN-LLM-Android |
510 |
bytebase/bytebase |
World's most advanced database DevOps and CI/CD for Developer, DBA and Platform Engineering teams. The GitLab/GitHub for database DevOps. |
511 |
plasma-umass/scalene |
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals |
512 |
pytube/pytube |
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos. |
513 |
moeru-ai/airi |
💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minecraft, Factorio playing. Web / macOS / Windows supported. |
514 |
tracel-ai/burn |
Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability. |
515 |
electric-sql/pglite |
Lightweight Postgres packaged as WASM into a TypeScript library for the browser, Node.js, Bun and Deno from https://electric-sql.com |
516 |
google/adk-python |
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control. |
517 |
kgrzybek/modular-monolith-with-ddd |
Full Modular Monolith application with Domain-Driven Design approach. |
518 |
Lightning-AI/litgpt |
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more. |
519 |
karpathy/micrograd |
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API |
520 |
facebookresearch/AnimatedDrawings |
Code to accompany "A Method for Animating Children's Drawings of the Human Figure" |
521 |
Unstructured-IO/unstructured |
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. |
522 |
LibreTranslate/LibreTranslate |
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup. |
523 |
normal-computing/outlines |
Generative Model Programming |
524 |
modelscope/FunASR |
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. |
525 |
hajimehoshi/ebiten |
Ebitengine - A dead simple 2D game engine for Go |
526 |
ShishirPatil/gorilla |
Gorilla: An API store for LLMs |
527 |
Rudrabha/Wav2Lip |
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. |
528 |
QwenLM/Qwen2.5-VL |
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. |
529 |
andrewyng/aisuite |
Simple, unified interface to multiple Generative AI providers |
530 |
eugeneyan/open-llms |
A list of open LLMs available for commercial use. |
531 |
wasp-lang/open-saas |
A free, open-source SaaS app starter for React & Node.js with superpowers. Production-ready. Community-driven. |
532 |
owainlewis/awesome-artificial-intelligence |
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers. |
533 |
willwulfken/MidJourney-Styles-and-Keywords-Reference |
A reference containing Styles and Keywords that you can use with MidJourney AI |
534 |
pydantic/pydantic-ai |
Agent Framework / shim to use Pydantic with LLMs |
535 |
illacloud/illa-builder |
Create AI-Driven Apps like Assembling Blocks |
536 |
n8n-io/self-hosted-ai-starter-kit |
The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows. |
537 |
smol-ai/developer |
With 100k context windows on the way, it's now feasible for every dev to have their own smol developer |
538 |
NVIDIA/TensorRT |
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT. |
539 |
OpenLMLab/MOSS |
An open-source tool-augmented conversational language model from Fudan University |
540 |
openai/shap-e |
Generate 3D objects conditioned on text or images |
541 |
PKU-YuanGroup/Open-Sora-Plan |
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project. |
542 |
Soulter/AstrBot |
✨易上手的多平台 LLM 聊天机器人及开发框架✨。支持 QQ、QQ频道、Telegram、微信平台(Gewechat, 企业微信)、内置 Web Chat,OpenAI GPT、DeepSeek、Ollama、Llama、GLM、Gemini、OneAPI、LLMTuner,支持 LLM Agent 插件开发,可视化面板。一键部署。支持 Dify 工作流、代码执行器、Whisper 语音转文字。 |
|
resemble-ai/chatterbox |
SoTA open-source TTS |
|
h2oai/h2ogpt |
Come join the movement to make the world's best open source GPT led by H2O.ai - 100% private chat and document search, no data leaks, Apache 2.0 |
545 |
THUDM/CogVideo |
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) |
546 |
dair-ai/ML-Papers-of-the-Week |
🔥Highlighting the top ML papers every week. |
547 |
getomni-ai/zerox |
Zero shot pdf OCR with gpt-4o-mini |
548 |
libretro/RetroArch |
Cross-platform, sophisticated frontend for the libretro API. Licensed GPLv3. |
549 |
InstantID/InstantID |
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥 |
550 |
midday-ai/midday |
Invoicing, Time tracking, File reconciliation, Storage, Financial Overview & your own Assistant made for Freelancers |
551 |
RUCAIBox/LLMSurvey |
A collection of papers and resources related to Large Language Models. |
552 |
TheRamU/Fay |
Fay is a complete open source project that includes Fay controller and numeral models, which can be used in different applications such as virtual hosts, live promotion, numeral human interaction and so on |
553 |
bentoml/OpenLLM |
An open platform for operating large language models (LLMs) in production. Fine-tune, serve, deploy, and monitor any LLMs with ease. |
554 |
Canner/WrenAI |
🤖 Open-source AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📈📊📋🧑💻 |
555 |
facebookresearch/seamless_communication |
Foundational Models for State-of-the-Art Speech and Text Translation |
556 |
frdel/agent-zero |
Agent Zero AI framework |
557 |
coleam00/Archon |
Beta release of Archon OS - the knowledge and task management backbone for AI coding assistants. |
558 |
Tencent/Hunyuan3D-2 |
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. |
559 |
lllyasviel/stable-diffusion-webui-forge |
a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference |
560 |
idootop/mi-gpt |
? Transform XiaoAi speaker into a personal voice assistant with ChatGPT and DouBao integration. |
|
oraios/serena |
A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & Agno integration) |
|
facebookresearch/dinov2 |
PyTorch code and models for the DINOv2 self-supervised learning method. |
|
NVIDIA/TensorRT-LLM |
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines. |
564 |
kubeshark/kubeshark |
The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clusters. Inspired by Wireshark, purposely built for Kubernetes |
565 |
GoogleCloudPlatform/generative-ai |
Sample code and notebooks for Generative AI on Google Cloud |
566 |
SakanaAI/AI-Scientist |
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬 |
567 |
srush/GPU-Puzzles |
Solve puzzles. Learn CUDA. |
568 |
PaperMC/Paper |
The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies |
569 |
WordPress/gutenberg |
The Block Editor project for WordPress and beyond. Plugin is available from the official repository. |
570 |
jxnl/instructor |
structured outputs for llms |
571 |
Z3Prover/z3 |
The Z3 Theorem Prover |
572 |
TheR1D/shell_gpt |
A command-line productivity tool powered by ChatGPT, will help you accomplish your tasks faster and more efficiently |
573 |
QwenLM/Qwen-Agent |
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. |
574 |
lz4/lz4 |
Extremely Fast Compression algorithm |
575 |
manticoresoftware/manticoresearch |
Easy to use open source fast database for search |
576 |
DrewThomasson/ebook2audiobook |
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages! |
577 |
NielsRogge/Transformers-Tutorials |
This repository contains demos I made with the Transformers library by HuggingFace. |
578 |
HigherOrderCO/HVM |
A massively parallel, optimal functional runtime in Rust |
579 |
DataTalksClub/machine-learning-zoomcamp |
The code from the Machine Learning Bookcamp book and a free course based on the book |
580 |
zyronon/douyin |
Vue3 + Pinia + Vite5 仿抖音,Vue 在移动端的最佳实践 . Imitate TikTok ,Vue Best practices on Mobile |
581 |
getumbrel/llama-gpt |
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. |
582 |
THU-MIG/yolov10 |
YOLOv10: Real-Time End-to-End Object Detection |
583 |
dream-num/univer |
Univer is a full-stack framework for creating and editing spreadsheets, documents, and slides on both web and server. |
584 |
salesforce/LAVIS |
LAVIS - A One-stop Library for Language-Vision Intelligence |
585 |
projectdiscovery/nuclei-templates |
Community curated list of templates for the nuclei engine to find security vulnerabilities. |
586 |
magic-research/magic-animate |
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model |
587 |
databrickslabs/dolly |
A large language model trained on the Databricks Machine Learning Platform |
588 |
apache/thrift |
Apache Thrift |
589 |
plait-board/drawnix |
All in one open-source whiteboard tool with mind, flowchart, freehand and etc. |
590 |
microsoft/promptflow |
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring. |
591 |
aws/amazon-sagemaker-examples |
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker. |
592 |
artidoro/qlora |
QLoRA: Efficient Finetuning of Quantized LLMs |
|
zakirullin/cognitive-load |
🧠 Cognitive Load is what matters |
|
keephq/keep |
The open-source alert management and AIOps platform |
|
szimek/sharedrop |
Easy P2P file transfer powered by WebRTC - inspired by Apple AirDrop |
596 |
confident-ai/deepeval |
The LLM Evaluation Framework |
597 |
nerfstudio-project/nerfstudio |
A collaboration friendly studio for NeRFs |
598 |
Chainlit/chainlit |
Build Conversational AI in minutes ⚡️ |
599 |
zaidmukaddam/scira |
Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0. |
600 |
kedro-org/kedro |
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular. |
601 |
FlagOpen/FlagEmbedding |
Dense Retrieval and Retrieval-augmented LLMs |
602 |
huggingface/text-generation-inference |
Large Language Model Text Generation Inference |
603 |
cupy/cupy |
NumPy & SciPy for GPU |
604 |
mistralai/mistral-src |
Reference implementation of Mistral AI 7B v0.1 model. |
605 |
cumulo-autumn/StreamDiffusion |
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation |
606 |
chathub-dev/chathub |
All-in-one chatbot client |
|
microsoft/mcp-for-beginners |
This open-source curriculum introduces the fundamentals of Model Context Protocol (MCP) through real-world, cross-language examples in .NET, Java, TypeScript, JavaScript, and Python. Designed for developers, it focuses on practical techniques for building modular, scalable, and secure AI workflows from session setup to service orchestration. |
|
autowarefoundation/autoware |
Autoware - the world's leading open-source software project for autonomous driving |
609 |
wandb/wandb |
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API. |
610 |
sashabaranov/go-openai |
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go |
611 |
googleapis/genai-toolbox |
MCP Toolbox for Databases is an open source MCP server for databases. |
612 |
web-infra-dev/midscene |
An AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language. |
613 |
ubicloud/ubicloud |
Open source alternative to AWS. Elastic compute, block storage (non replicated), firewall and load balancer, managed Postgres, K8s, AI inference, and IAM services. |
614 |
AIGC-Audio/AudioGPT |
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head |
⭐ 615 |
tensorzero/tensorzero |
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models. |
616 |
steven2358/awesome-generative-ai |
A curated list of modern Generative Artificial Intelligence projects and services |
617 |
tadata-org/fastapi_mcp |
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth! |
618 |
openai/openai-node |
Official JavaScript / TypeScript library for the OpenAI API |
619 |
timlrx/tailwind-nextjs-starter-blog |
This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical writing a breeze. Easily configurable and customizable. Perfect as a replacement to existing Jekyll and Hugo individual blogs. |
620 |
EleutherAI/lm-evaluation-harness |
A framework for few-shot evaluation of autoregressive language models. |
621 |
0xk1h0/ChatGPT_DAN |
ChatGPT DAN, Jailbreaks prompt |
622 |
ashishpatel26/500-AI-Agents-Projects |
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation, illustrating how AI agents are transforming sectors such as healthcare, finance, education, retail, and more. |
623 |
karpathy/minbpe |
Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. |
624 |
thuml/Time-Series-Library |
A Library for Advanced Deep Time Series Models. |
625 |
modelscope/DiffSynth-Studio |
Enjoy the magic of Diffusion models! |
626 |
OthersideAI/self-operating-computer |
A framework to enable multimodal models to operate a computer. |
627 |
popcorn-official/popcorn-desktop |
Popcorn Time is a multi-platform, free software BitTorrent client that includes an integrated media player ( Windows / Mac / Linux ) A Butter-Project Fork |
628 |
lucidrains/denoising-diffusion-pytorch |
Implementation of Denoising Diffusion Probabilistic Model in Pytorch |
629 |
bigscience-workshop/petals |
🌸 Run large language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
630 |
colmap/colmap |
COLMAP - Structure-from-Motion and Multi-View Stereo |
|
🔥🔥modelscope/agentscope |
Start building LLM-empowered multi-agent applications in an easier way. |
|
RayTracing/raytracing.github.io |
Main Web Site (Online Books) |
|
google/benchmark |
A microbenchmark support library |
|
modelscope/swift |
ms-swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs |
|
Const-me/Whisper |
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model |
|
LouisShark/chatgpt_system_prompt |
store all agent's system prompt |
|
lightpanda-io/browser |
Lightpanda: the headless browser designed for AI and automation |
638 |
facebookresearch/nougat |
Implementation of Nougat Neural Optical Understanding for Academic Documents |
639 |
mshumer/gpt-prompt-engineer |
|
640 |
dataelement/bisheng |
Bisheng is an open LLM devops platform for next generation AI applications. |
641 |
Visualize-ML/Book4_Power-of-Matrix |
Book_4 'Power of Matrix' |
642 |
BlinkDL/ChatRWKV |
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. |
643 |
BloopAI/bloop |
A fast code search engine written in Rust |
644 |
modelscope/facechain |
FaceChain is a deep-learning toolchain for generating your Digital-Twin. |
645 |
e2b-dev/e2b |
Vercel for AI agents. We help developers to build, deploy, and monitor AI agents. Focusing on specialized AI agents that build software for you - your personal software developers. |
646 |
espnet/espnet |
End-to-End Speech Processing Toolkit |
647 |
kedacore/keda |
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes |
648 |
WongKinYiu/yolov9 |
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information |
649 |
open-mmlab/Amphion |
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development. |
650 |
goldmansachs/gs-quant |
Python toolkit for quantitative finance |
651 |
brexhq/prompt-engineering |
Tips and tricks for working with Large Language Models like OpenAI's GPT-4. |
652 |
Portkey-AI/gateway |
A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API. |
653 |
puckeditor/puck |
The visual editor for React |
654 |
open-mmlab/mmsegmentation |
OpenMMLab Semantic Segmentation Toolbox and Benchmark. |
655 |
pymc-devs/pymc |
Bayesian Modeling and Probabilistic Programming in Python |
656 |
jina-ai/reader |
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ |
657 |
baptisteArno/typebot.io |
💬 Typebot is a powerful chatbot builder that you can self-host. |
658 |
erincatto/box2d |
Box2D is a 2D physics engine for games |
659 |
dice2o/BingGPT |
Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux) |
660 |
huggingface/chat-ui |
Open source codebase powering the HuggingChat app |
661 |
voicepaw/so-vits-svc-fork |
so-vits-svc fork with realtime support, improved interface and more features. |
662 |
gorse-io/gorse |
Gorse open source recommender system engine |
663 |
niedev/RTranslator |
RTranslator is the world's first open source real-time translation app. |
664 |
Acly/krita-ai-diffusion |
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required. |
665 |
Stability-AI/StableStudio |
Community interface for generative AI |
666 |
codexu/note-gen |
A cross-platform Markdown note-taking application dedicated to using AI to bridge recording and writing, organizing fragmented knowledge into a readable note. |
667 |
OpenGVLab/InternVL |
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型 |
668 |
togethercomputer/OpenChatKit |
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots for various applications |
669 |
bunkerity/bunkerweb |
🛡️ Make your web services secure by default ! |
670 |
rany2/edge-tts |
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key |
671 |
bytedance/monolith |
ByteDance's Recommendation System |
672 |
IDEA-Research/GroundingDINO |
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" |
|
emcie-co/parlant |
The behavior guidance framework for customer-facing LLM agents |
|
meetecho/janus-gateway |
Janus WebRTC Server |
|
google/magika |
Detect file content types with deep learning |
|
openvinotoolkit/openvino |
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference |
677 |
facebookresearch/ImageBind |
ImageBind One Embedding Space to Bind Them All |
678 |
zmh-program/chatnio |
🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案,支持 OpenAI,Midjourney,Claude,讯飞星火,Stable Diffusion,DALL·E,ChatGLM,通义千问,腾讯混元,360 智脑,百川 AI,火山方舟,新必应,Gemini,Moonshot 等模型,支持对话分享,自定义预设,云端同步,模型市场,支持弹性计费和订阅计划模式,支持图片解析,支持联网搜索,支持模型缓存,丰富美观的后台管理与仪表盘数据统计。 |
679 |
jzhang38/TinyLlama |
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. |
680 |
nashsu/FreeAskInternet |
FreeAskInternet is a completely free, private and locally running search aggregator & answer generate using LLM, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to the ChatGPT3.5 LLM and generate the answer based on search results. |
681 |
elie222/inbox-zero |
AI personal assistant for email. Open source app to help you reach inbox zero fast. |
682 |
GreyDGL/PentestGPT |
A GPT-empowered penetration testing tool |
683 |
ashawkey/stable-dreamfusion |
A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion. |
684 |
adam-maj/tiny-gpu |
A minimal GPU design in Verilog to learn how GPUs work from the ground up |
685 |
wangrongding/wechat-bot |
🤖一个基于 WeChaty 结合 DeepSeek / ChatGPT / Kimi / 讯飞等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等... |
686 |
adams549659584/go-proxy-bingai |
A Microsoft New Bing demo site built with Vue3 and Go, providing a consistent UI experience, supporting ChatGPT prompts, and accessible within China |
687 |
skypilot-org/skypilot |
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface. |
688 |
fishaudio/Bert-VITS2 |
vits2 backbone with multilingual-bert |
689 |
microsoft/TypeChat |
TypeChat is a library that makes it easy to build natural language interfaces using types. |
690 |
KoljaB/RealtimeSTT |
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription. |
691 |
friuns2/BlackFriday-GPTs-Prompts |
List of free GPTs that doesn't require plus subscription |
692 |
o3de/o3de |
Open 3D Engine (O3DE) is an Apache 2.0-licensed multi-platform 3D engine that enables developers and content creators to build AAA games, cinema-quality 3D worlds, and high-fidelity simulations without any fees or commercial obligations. |
693 |
xorbitsai/inference |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
694 |
deepseek-ai/DeepEP |
DeepEP: An efficient expert parallel communication library |
695 |
OptimalScale/LMFlow |
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All. |
696 |
oumi-ai/oumi |
Everything you need to build state-of-the-art foundation models, end-to-end. |
697 |
OpenBMB/XAgent |
An Autonomous LLM Agent for Complex Task Solving |
698 |
ai-collection/ai-collection |
The Generative AI Landscape - A Collection of Awesome Generative AI Applications |
|
krillinai/KrillinAI |
A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具,专业级翻译,一键部署全流程,可以生成适配抖音,小红书,哔哩哔哩,视频号,TikTok,Youtube Shorts等形态的内容 |
|
jrouwe/JoltPhysics |
A multi core friendly rigid body physics and collision detection library. Written in C++. Suitable for games and VR applications. Used by Horizon Forbidden West. |
701 |
vikhyat/moondream |
tiny vision language model |
702 |
FoundationVision/VAR |
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction" |
703 |
NVIDIA/cutlass |
CUDA Templates for Linear Algebra Subroutines |
704 |
jasonppy/VoiceCraft |
Zero-Shot Speech Editing and Text-to-Speech in the Wild |
705 |
ollama/ollama-python |
Ollama Python library |
706 |
SJTU-IPADS/PowerInfer |
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs |
707 |
roboflow/notebooks |
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM. |
708 |
netease-youdao/EmotiVoice |
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
709 |
intel-analytics/ipex-llm |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc. |
710 |
sebastianstarke/AI4Animation |
Bringing Characters to Life with Computer Brains in Unity |
711 |
OpenBMB/MiniCPM |
MiniCPM-2B: An end-side LLM outperforms Llama2-13B. |
712 |
delta-io/delta |
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs |
713 |
reorproject/reor |
AI note-taking app that runs models locally. |
714 |
lllyasviel/IC-Light |
More relighting! |
715 |
firmai/financial-machine-learning |
A curated list of practical financial machine learning tools and applications. |
716 |
leptonai/search_with_lepton |
Building a quick conversation-based search demo with Lepton AI. |
717 |
digitalinnovationone/dio-lab-open-source |
Repositório do lab "Contribuindo em um Projeto Open Source no GitHub" da Digital Innovation One. |
718 |
graviraja/MLOps-Basics |
|
719 |
langchain-ai/ollama-deep-researcher |
Fully local web research and report writing assistant |
720 |
git-ecosystem/git-credential-manager |
Secure, cross-platform Git credential storage with authentication to GitHub, Azure Repos, and other popular Git hosting services. |
721 |
dair-ai/ML-Papers-Explained |
Explanation to key concepts in ML |
722 |
miurla/morphic |
An AI-powered answer engine with a generative UI |
723 |
CASIA-IVA-Lab/FastSAM |
Fast Segment Anything |
724 |
abhishekkrthakur/approachingalmost |
Approaching (Almost) Any Machine Learning Problem |
725 |
meta-llama/llama-stack |
Composable building blocks to build Llama Apps |
726 |
anse-app/chatgpt-demo |
A demo repo based on OpenAI API (gpt-3.5-turbo) |
727 |
acheong08/EdgeGPT |
Reverse engineered API of Microsoft's Bing Chat |
728 |
lencx/Noi |
🦄 AI + Tools + Plugins + Community |
729 |
Plachtaa/VALL-E-X |
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. The demo is available at https://plachtaa.github.io |
730 |
pipecat-ai/pipecat |
Open Source framework for voice and multimodal conversational AI |
731 |
udlbook/udlbook |
Understanding Deep Learning - Simon J.D. Prince |
732 |
deep-floyd/IF |
A novel state-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding |
733 |
01-ai/Yi |
A series of large language models trained from scratch by developers @01-ai |
|
chiphuyen/aie-book |
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025) |
|
OpenSPG/KAG |
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model. |
|
WooooDyy/LLM-Agent-Paper-List |
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al. |
737 |
ml-explore/mlx-examples |
Examples in the MLX framework |
738 |
facebookresearch/DiT |
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" |
739 |
xiangsx/gpt4free-ts |
Providing a free OpenAI GPT-4 API ! This is a replication project for the typescript version of xtekky/gpt4free |
740 |
kroma-network/tachyon |
Modular ZK(Zero Knowledge) backend accelerated by GPU |
741 |
zilliztech/GPTCache |
GPTCache is a library for creating semantic cache to store responses from LLM queries. |
742 |
meshery/meshery |
Meshery, the cloud native manager |
743 |
LiheYoung/Depth-Anything |
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data |
744 |
Future-House/paper-qa |
High accuracy RAG for answering questions from scientific documents with citations |
745 |
MotiaDev/motia |
Unified Backend Framework for APIs, Events, and AI Agents |
746 |
lllyasviel/Omost |
Your image is almost there! |
747 |
jaywalnut310/vits |
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
748 |
onuratakan/gpt-computer-assistant |
gpt-4o for windows, macos and ubuntu |
749 |
THUDM/CodeGeeX2 |
CodeGeeX2: A More Powerful Multilingual Code Generation Model |
750 |
microsoft/magentic-ui |
A research prototype of a human-centered web agent |
751 |
openstatusHQ/openstatus |
🏓 The open-source synthetic monitoring platform 🏓 |
752 |
sweepai/sweep |
Sweep is an AI junior developer |
753 |
PWhiddy/PokemonRedExperiments |
Playing Pokemon Red with Reinforcement Learning |
754 |
ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide |
Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation. |
755 |
tmc/langchaingo |
LangChain for Go, the easiest way to write LLM-based programs in Go |
756 |
openlm-research/open_llama |
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
757 |
bytedance/UI-TARS |
|
758 |
nadermx/backgroundremover |
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source. |
759 |
bigcode-project/starcoder |
Home of StarCoder: fine-tuning & inference! |
760 |
open-metadata/OpenMetadata |
OpenMetadata is a unified platform for discovery, observability, and governance powered by a central metadata repository, in-depth lineage, and seamless team collaboration. |
761 |
enso-org/enso |
Hybrid visual and textual functional programming. |
762 |
PaddlePaddle/ERNIE |
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle. |
763 |
microsoft/RD-Agent |
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automate these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which let AI drive data-driven AI. |
764 |
livekit/agents |
Build real-time multimodal AI applications 🤖🎙️📹 |
765 |
geekyutao/Inpaint-Anything |
Inpaint anything using Segment Anything and inpainting models. |
766 |
Zackriya-Solutions/meeting-minutes |
A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) |
767 |
gunnarmorling/1brc |
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java |
768 |
k2-fsa/sherpa-onnx |
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift |
769 |
TEN-framework/TEN-Agent |
TEN Agent is a realtime conversational AI agent powered by TEN. It seamlessly integrates the OpenAI Realtime API, RTC capabilities, and advanced features like weather updates, web search, computer vision, and Retrieval-Augmented Generation (RAG). |
770 |
PKU-YuanGroup/ChatLaw |
Chinese Legal Large Model |
771 |
jianchang512/ChatTTS-ui |
一个简单的本地网页界面,直接使用ChatTTS将文字合成为语音,同时支持对外提供API接口。 |
772 |
weaviate/Verba |
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate |
773 |
SciPhi-AI/R2R |
A framework for rapid development and deployment of production-ready RAG systems |
774 |
open-mmlab/mmagic |
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox |
775 |
Azure-Samples/azure-search-openai-demo |
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences. |
776 |
lastmile-ai/mcp-agent |
Build effective agents using Model Context Protocol and simple workflow patterns |
777 |
ymcui/Chinese-LLaMA-Alpaca-2 |
Chinese LLaMA-2 & Alpaca-2 LLMs |
778 |
dependency-check/DependencyCheck |
OWASP dependency-check is a software composition analysis utility that detects publicly disclosed vulnerabilities in application dependencies. |
779 |
bhaskatripathi/pdfGPT |
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your pdf files in a chatbot! |
780 |
QuivrHQ/MegaParse |
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. |
781 |
OpenTalker/video-retalking |
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild |
782 |
AbdullahAlfaraj/Auto-Photoshop-StableDiffusion-Plugin |
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using Automatic1111-sd-webui as a backend. |
783 |
cloudwego/eino |
The ultimate LLM/AI application development framework in Golang. |
784 |
HariSekhon/DevOps-Bash-tools |
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux.. |
785 |
n4ze3m/page-assist |
Use your locally running AI models to assist you in your web browsing |
786 |
InternLM/InternLM |
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system. |
787 |
mit-han-lab/streaming-llm |
Efficient Streaming Language Models with Attention Sinks |
788 |
microsoft/TinyTroupe |
LLM-powered multiagent persona simulation for imagination enhancement and business insights. |
789 |
apple/corenet |
CoreNet: A library for training deep neural networks |
790 |
AutoMQ/automq |
AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency. |
791 |
react-native-webview/react-native-webview |
React Native Cross-Platform WebView |
792 |
k8sgpt-ai/k8sgpt |
Giving Kubernetes Superpowers to everyone |
793 |
topoteretes/cognee |
Memory for AI Agents in 5 lines of code |
794 |
mylxsw/aidea |
AIdea is a versatile app that supports GPT and domestic large language models,also supports "Stable Diffusion" text-to-image generation, image-to-image generation, SDXL 1.0, super-resolution, and image colorization |
795 |
threestudio-project/threestudio |
A unified framework for 3D content generation. |
796 |
civitai/civitai |
Build a platform where people can share their stable diffusion models |
797 |
microsoft/Mastering-GitHub-Copilot-for-Paired-Programming |
A 6 Lesson course teaching everything you need to know about harnessing GitHub Copilot and an AI Paired Programing resource. |
798 |
gaomingqi/Track-Anything |
A flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI. |
799 |
PixarAnimationStudios/OpenUSD |
Universal Scene Description |
|
RayVentura/ShortGPT |
🚀🎬 ShortGPT - An experimental AI framework for automated short/video content creation. Enables creators to rapidly produce, manage, and deliver content using AI and automation. |
|
Project-MONAI/MONAI |
AI Toolkit for Healthcare Imaging |
802 |
GoogleCloudPlatform/kubectl-ai |
AI powered Kubernetes Assistant |
803 |
snakers4/silero-vad |
Silero VAD: pre-trained enterprise-grade Voice Activity Detector |
804 |
myshell-ai/MeloTTS |
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. |
805 |
langchain-ai/opengpts |
This is an open source effort to create a similar experience to OpenAI's GPTs and Assistants API |
806 |
ai-boost/awesome-prompts |
Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers. |
807 |
spdustin/ChatGPT-AutoExpert |
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding). |
808 |
microsoft/DeepSpeedExamples |
Example models using DeepSpeed |
809 |
OpenPipe/ART |
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more! |
810 |
Tohrusky/Final2x |
2^x Image Super-Resolution |
811 |
yihong0618/xiaogpt |
Play ChatGPT with xiaomi ai speaker |
812 |
spring-projects/spring-ai |
An Application Framework for AI Engineering |
813 |
enricoros/big-AGI |
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. |
814 |
vladmandic/automatic |
SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models |
815 |
google/gemma.cpp |
lightweight, standalone C++ inference engine for Google's Gemma models. |
816 |
yangjianxin1/Firefly |
Firefly: Chinese conversational large language model (full-scale fine-tuning + QLoRA), supporting fine-tuning of Llma2, Llama, Baichuan, InternLM, Ziya, Bloom, and other large models |
817 |
Licoy/ChatGPT-Midjourney |
🎨 Own your own ChatGPT+Midjourney web service with one click |
818 |
deepseek-ai/DeepSeek-LLM |
DeepSeek LLM: Let there be answers |
819 |
WhiskeySockets/Baileys |
Lightweight full-featured typescript/javascript WhatsApp Web API |
820 |
clovaai/donut |
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 |
821 |
run-llama/rags |
Build ChatGPT over your data, all with natural language |
822 |
qunash/chatgpt-advanced |
A browser extension that augments your ChatGPT prompts with web results. |
|
awslabs/agent-squad |
Flexible and powerful framework for managing multiple AI agents and handling complex conversations |
|
linyiLYi/street-fighter-ai |
This is an AI agent for Street Fighter II Champion Edition. |
825 |
mufeedvh/code2prompt |
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting. |
826 |
Alibaba-NLP/WebAgent |
🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor https://arxiv.org/pdf/2507.02592 |
827 |
lipku/metahuman-stream |
Real time interactive streaming digital human |
828 |
crestalnetwork/intentkit |
An open and fair framework for everyone to build AI agents equipped with powerful skills. Launch your agent, improve the world, your wallet, or both! |
829 |
Dooy/chatgpt-web-midjourney-proxy |
chatgpt web, midjourney, gpts,tts, whisper 一套ui全搞定 |
830 |
google/mesop |
|
831 |
levihsu/OOTDiffusion |
Official implementation of OOTDiffusion |
832 |
openai/consistency_models |
Official repo for consistency models. |
833 |
aptos-labs/aptos-core |
Aptos is a layer 1 blockchain built to support the widespread use of blockchain through better technology and user experience. |
834 |
Moonvy/OpenPromptStudio |
AIGC Hint Word Visualization Editor |
835 |
firebase/firebase-ios-sdk |
Firebase SDK for Apple App Development |
836 |
HVision-NKU/StoryDiffusion |
Create Magic Story! |
837 |
alibaba/higress |
🤖 AI Gateway |
|
QuentinFuxa/WhisperLiveKit |
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI. |
|
leanprover/lean4 |
Lean 4 programming language and theorem prover |
|
vespa-engine/vespa |
The open big data serving engine. https://vespa.ai |
841 |
MineDojo/Voyager |
An Open-Ended Embodied Agent with Large Language Models |
842 |
google-deepmind/graphcast |
|
843 |
apify/crawlee-python |
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation. |
844 |
sczhou/ProPainter |
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting |
845 |
648540858/wvp-GB28181-pro |
WEB VIDEO PLATFORM是一个基于GB28181-2016标准实现的网络视频平台,支持NAT穿透,支持海康、大华、宇视等品牌的IPC、NVR、DVR接入。支持国标级联,支持rtsp/rtmp等视频流转发到国标平台,支持rtsp/rtmp等推流转发到国标平台。 |
846 |
wenda-LLM/wenda |
Wenda: An LLM invocation platform. Its objective is to achieve efficient content generation tailored to specific environments while considering the limited computing resources of individuals and small businesses, as well as knowledge security and privacy concerns |
847 |
simular-ai/Agent-S |
Agent S: an open agentic framework that uses computers like a human |
848 |
awslabs/mcp |
AWS MCP Servers — helping you get the most out of AWS, wherever you use MCP. |
849 |
Yonom/assistant-ui |
React Components for AI Chat 💬 🚀 |
850 |
BasedHardware/omi |
AI wearables. Put it on, speak, transcribe, automatically |
851 |
fr0gger/Awesome-GPT-Agents |
A curated list of GPT agents for cybersecurity |
852 |
Shaunwei/RealChar |
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖 |
853 |
ostris/ai-toolkit |
Various AI scripts. Mostly Stable Diffusion stuff. |
854 |
ethereum-optimism/optimism |
Optimism is Ethereum, scaled. |
855 |
SkalskiP/courses |
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI) |
856 |
lavague-ai/LaVague |
Automate automation with Large Action Model framework |
857 |
rustformers/llm |
Run inference for Large Language Models on CPU, with Rust |
858 |
mnotgod96/AppAgent |
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. |
859 |
jdepoix/youtube-transcript-api |
This is a Python API that enables you to retrieve the transcript/subtitles for a specified YouTube video. It supports automatically generated subtitles without needing an API key or a headless browser, unlike other Selenium-based solutions. |
860 |
homanp/superagent |
🥷 Superagent - Build, deploy, and manage LLM-powered agents |
861 |
pytorch-labs/gpt-fast |
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. |
862 |
asg017/sqlite-vec |
Work-in-progress vector search SQLite extension that runs anywhere. |
863 |
deepseek-ai/DeepSeek-Coder-V2 |
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence |
864 |
EricLBuehler/mistral.rs |
Blazingly fast LLM inference. |
865 |
SevaSk/ecoute |
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation. |
866 |
josStorer/RWKV-Runner |
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use. |
867 |
pengxiao-song/LaWGPT |
Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge |
868 |
open-compass/opencompass |
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. |
869 |
mishushakov/llm-scraper |
Turn any webpage into structured data using LLMs |
870 |
fullstackhero/dotnet-starter-kit |
Production Grade Cloud-Ready .NET 8 Starter Kit (Web API + Blazor Client) with Multitenancy Support, and Clean/Modular Architecture that saves roughly 200+ Development Hours! All Batteries Included. |
871 |
kuafuai/DevOpsGPT |
Multi agent system for AI-driven software development. Convert natural language requirements into working software. Supports any development language and extends the existing base code. |
872 |
fastrepl/hyprnote |
Local-first AI Notepad for Private Meetings |
873 |
allenai/OLMo |
Modeling, training, eval, and inference code for OLMo |
874 |
nilsherzig/LLocalSearch |
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed. |
|
albertan017/LLM4Decompile |
Reverse Engineering: Decompiling Binary Code with Large Language Models |
|
HumanAIGC/OutfitAnyone |
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person |
877 |
apache/hudi |
Upserts, Deletes And Incremental Processing on Big Data. |
878 |
PrefectHQ/marvin |
A batteries-included library for building AI-powered software |
879 |
ToonCrafter/ToonCrafter |
a research paper for generative cartoon interpolation |
880 |
microsoft/TaskWeaver |
A code-first agent framework for seamlessly planning and executing data analytics tasks. |
881 |
OpenGVLab/LLaMA-Adapter |
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters |
882 |
a16z-infra/companion-app |
AI companions with memory: a lightweight stack to create and host your own AI companions |
883 |
Azure/azure-sdk-for-net |
This repository is for active development of the Azure SDK for .NET. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/dotnet/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-net. |
884 |
AILab-CVC/YOLO-World |
Real-Time Open-Vocabulary Object Detection |
885 |
PawanOsman/ChatGPT |
OpenAI API Free Reverse Proxy |
886 |
e2b-dev/fragments |
Open-source Next.js template for building apps that are fully generated by AI. By E2B. |
887 |
Anxcye/anx-reader |
Featuring powerful AI capabilities and supporting various e-book formats, it makes reading smarter and more focused. 集成多种 AI 能力,支持丰富的电子书格式,让阅读更智能、更专注。 |
888 |
nsarrazin/serge |
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API |
889 |
Zipstack/unstract |
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents |
890 |
aiwaves-cn/agents |
An Open-source Framework for Autonomous Language Agents |
891 |
alibaba/spring-ai-alibaba |
Agentic AI Framework for Java Developers |
892 |
baichuan-inc/baichuan-7B |
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
893 |
microsoft/promptbase |
All things prompt engineering |
894 |
ramonvc/freegpt-webui |
GPT 3.5/4 with a Chat Web UI. No API key is required. |
895 |
Akegarasu/lora-scripts |
LoRA training scripts use kohya-ss's trainer, for diffusion model. |
896 |
yetone/bob-plugin-openai-translator |
A Bob Plugin base ChatGPT API |
897 |
HKUDS/DeepCode |
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)" |
898 |
Permify/permify |
An open-source authorization as a service inspired by Google Zanzibar, designed to build and manage fine-grained and scalable authorization systems for any application. |
899 |
google/gemma_pytorch |
The official PyTorch implementation of Google's Gemma models |
900 |
espeak-ng/espeak-ng |
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents. |
901 |
alpkeskin/mosint |
An automated e-mail OSINT tool |
|
santinic/audiblez |
Generate audiobooks from e-books |
|
pytorch/torchtune |
A Native-PyTorch Library for LLM Fine-tuning |
904 |
google-research/timesfm |
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. |
905 |
X-PLUG/MobileAgent |
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception |
906 |
lxfater/inpaint-web |
A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。 |
907 |
imoneoi/openchat |
OpenChat: Advancing Open-source Language Models with Imperfect Data |
908 |
langchain-ai/rag-from-scratch |
|
909 |
mosaicml/composer |
Train neural networks up to 7x faster |
910 |
huggingface/parler-tts |
Inference and training library for high-quality TTS models. |
911 |
microsoft/LLMLingua |
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss. |
912 |
ChaoningZhang/MobileSAM |
This is the official code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight |
913 |
souzatharsis/podcastfy |
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI |
914 |
huggingface/alignment-handbook |
Robust recipes for to align language models with human and AI preferences |
915 |
timescale/pgai |
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL |
916 |
MervinPraison/PraisonAI |
PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration. |
917 |
TaskingAI/TaskingAI |
The open source platform for AI-native application development. |
918 |
anti-work/shortest |
QA via natural language AI tests |
919 |
openchatai/OpenChat |
Run and create custom ChatGPT-like bots with OpenChat, embed and share these bots anywhere, the open-source chatbot console. |
920 |
dsdanielpark/Bard-API |
The python package that returns a response of Google Bard through API. |
921 |
OpenBMB/ToolBench |
An open platform for training, serving, and evaluating large language model for tool learning. |
922 |
Fanghua-Yu/SUPIR |
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild |
923 |
xxlong0/Wonder3D |
A cross-domain diffusion model for 3D reconstruction from a single image |
924 |
kyegomez/swarms |
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503 |
925 |
aigc-apps/sd-webui-EasyPhoto |
📷 EasyPhoto |
926 |
SuperDuperDB/superduperdb |
🔮 SuperDuperDB: Bring AI to your database: Integrate, train and manage any AI models and APIs directly with your database and your data. |
927 |
microsoft/SynapseML |
Simple and Distributed Machine Learning |
928 |
olimorris/codecompanion.nvim |
✨ AI-powered coding, seamlessly in Neovim. Supports Anthropic, Copilot, Gemini, Ollama, OpenAI and xAI LLMs |
929 |
qodo-ai/qodo-cover |
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞 |
930 |
biobootloader/wolverine |
Automatically repair python scripts through GPT-4 to give them regenerative abilities. |
931 |
NVIDIA/garak |
the LLM vulnerability scanner |
932 |
openchatai/OpenCopilot |
🤖 🔥 Let your users chat with your product features and execute things by text - open source Shopify sidekick |
933 |
funstory-ai/BabelDOC |
Yet Another Document Translator |
934 |
OpenInterpreter/01 |
The open-source language model computer |
935 |
LMCache/LMCache |
Supercharge Your LLM with the Fastest KV Cache Layer |
⭐ 936 | 🔥bytebot-ai/bytebot |
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment. |
|
NVIDIA/NeMo-Guardrails |
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. |
|
dnhkng/GlaDOS |
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve. |
|
landing-ai/vision-agent |
Vision agent |
940 |
argmaxinc/WhisperKit |
Swift native speech recognition on-device for iOS and macOS applications. |
941 |
BuilderIO/ai-shell |
A CLI that converts natural language to shell commands. |
942 |
Zejun-Yang/AniPortrait |
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation |
943 |
GaiaNet-AI/gaianet-node |
Install and run your own AI agent service |
944 |
google/highway |
Performance-portable, length-agnostic SIMD with runtime dispatch |
945 |
mpociot/chatgpt-vscode |
A VSCode extension that allows you to use ChatGPT |
946 |
OpenGVLab/DragGAN |
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux) |
947 |
Plachtaa/VITS-fast-fine-tuning |
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion |
948 |
XRPLF/rippled |
Decentralized cryptocurrency blockchain daemon implementing the XRP Ledger protocol in C++ |
949 |
AILab-CVC/VideoCrafter |
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation |
950 |
alibaba-damo-academy/FunClip |
Open-source, accurate and easy-to-use video clipping tool, LLM based AI clipping intergrated |
951 |
deepseek-ai/DeepSeek-V2 |
|
952 |
MahmoudAshraf97/whisper-diarization |
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper |
953 |
open-edge-platform/anomalib |
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference. |
954 |
developersdigest/llm-answer-engine |
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper |
955 |
potpie-ai/potpie |
Prompt-To-Agent : Create custom engineering agents for your codebase |
956 |
lvwzhen/law-cn-ai |
⚖️ AI Legal Assistant |
957 |
Deci-AI/super-gradients |
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS. |
958 |
snailyp/gemini-balance |
Gemini polling proxy service |
959 |
PathOfBuildingCommunity/PathOfBuilding |
Offline build planner for Path of Exile. |
960 |
ant-research/CoDeF |
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing |
961 |
SCIR-HI/Huatuo-Llama-Med-Chinese |
Repo for HuaTuo (华驼), Llama-7B tuned with Chinese medical knowledge |
962 |
Nukem9/dlssg-to-fsr3 |
Adds AMD FSR3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg). |
963 |
google/adk-samples |
A collection of sample agents built with Agent Development (ADK) |
964 |
1rgs/jsonformer |
A Bulletproof Way to Generate Structured JSON from Language Models |
965 |
togethercomputer/RedPajama-Data |
code for preparing large datasets for training large language models |
966 |
github/CopilotForXcode |
Xcode extension for GitHub Copilot |
967 |
Speykious/cve-rs |
Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀 |
968 |
isaac-sim/IsaacLab |
Unified framework for robot learning built on NVIDIA Isaac Sim |
969 |
terraform-aws-modules/terraform-aws-eks |
Terraform module to create AWS Elastic Kubernetes (EKS) resources 🇺🇦 |
970 |
hengyoush/kyanos |
Visualize the time packets spend in the kernel, watch & analyze in command line. |
971 |
OpenBMB/AgentVerse |
🤖 AgentVerse 🪐 provides a flexible framework that simplifies the process of building custom multi-agent environments for large language models (LLMs). |
972 |
InternLM/xtuner |
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM) |
973 |
UX-Decoder/Segment-Everything-Everywhere-All-At-Once |
Official implementation of the paper "Segment Everything Everywhere All at Once" |
974 |
philz1337x/clarity-upscaler |
Clarity-Upscaler: Reimagined image upscaling for everyone |
975 |
yisol/IDM-VTON |
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild |
976 |
argilla-io/argilla |
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency. |
977 |
facebookincubator/AITemplate |
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference. |
978 |
LLM-Red-Team/kimi-free-api |
🚀 KIMI AI 长文本大模型白嫖服务,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。 |
979 |
google-deepmind/alphageometry |
Solving Olympiad Geometry without Human Demonstrations |
980 |
sanchit-gandhi/whisper-jax |
Optimised JAX code for OpenAI's Whisper Model, largely built on the Hugging Face Transformers Whisper implementation |
981 |
krishnaik06/Roadmap-To-Learn-Generative-AI-In-2024 |
Roadmap To Learn Generative AI In 2024 |
982 |
aipotheosis-labs/aci |
ACI.dev is the open source platform that connects your AI agents to 600+ tool integrations with multi-tenant auth, granular permissions, and access through direct function calling or a unified MCP server. |
983 |
sjvasquez/handwriting-synthesis |
Handwriting Synthesis with RNNs ✏️ |
984 |
agiresearch/AIOS |
AIOS: LLM Agent Operating System |
985 |
build-trust/ockam |
Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale. |
986 |
fchollet/ARC-AGI |
The Abstraction and Reasoning Corpus |
987 |
luosiallen/latent-consistency-model |
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference |
988 |
Physical-Intelligence/openpi |
|
989 |
facebookresearch/co-tracker |
CoTracker is a model for tracking any point (pixel) on a video. |
990 |
NVlabs/neuralangelo |
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023) |
991 |
kyegomez/tree-of-thoughts |
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70% |
992 |
NVlabs/Sana |
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer |
993 |
Blealtan/efficient-kan |
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). |
994 |
Helicone/helicone |
🧊 Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, Anthropic SDK, LiteLLM, LLamaIndex, LangChain, and more. 🍓 YC W23 |
995 |
madawei2699/myGPTReader |
A slack bot that can read any webpage, ebook or document and summarize it with chatGPT |
996 |
xlang-ai/OpenAgents |
OpenAgents: An Open Platform for Language Agents in the Wild |
997 |
raspberrypi/pico-sdk |
|
998 |
collabora/WhisperSpeech |
An Open Source text-to-speech system built by inverting Whisper. |
999 |
pytorch/torchtitan |
A native PyTorch Library for large model training |
1000 |
SkyworkAI/SkyReels-V2 |
SkyReels-V2: Infinite-length Film Generative model |
1001 |
ragapp/ragapp |
The easiest way to use Agentic RAG in any enterprise |
1002 |
smol-ai/GodMode |
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day. |
1003 |
DataTalksClub/llm-zoomcamp |
LLM Zoomcamp - a free online course about building an AI bot that can answer questions about your knowledge base |
1004 |
intelowlproject/IntelOwl |
IntelOwl: manage your Threat Intelligence at scale |
1005 |
luban-agi/Awesome-AIGC-Tutorials |
Curated tutorials and resources for Large Language Models, AI Painting, and more. |
1006 |
Doriandarko/maestro |
A framework for Claude Opus to intelligently orchestrate subagents. |
1007 |
ray-project/llm-numbers |
Numbers every LLM developer should know |
|
appcypher/awesome-mcp-servers |
Awesome MCP Servers - A curated list of Model Context Protocol servers |
|
Marker-Inc-Korea/AutoRAG |
AutoML tool for RAG |
|
openai/plugins-quickstart |
Get a ChatGPT plugin up and running in under 5 minutes! |
1011 |
fudan-generative-vision/champ |
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance |
1012 |
Tencent/HunyuanDiT |
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding |
1013 |
microsoft/poml |
Prompt Orchestration Markup Language |
1014 |
leetcode-mafia/cheetah |
Whisper & GPT-based app for passing remote SWE interviews |
1015 |
transformerlab/transformerlab-app |
Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer. |
1016 |
MarkFzp/mobile-aloha |
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation |
1017 |
truefoundry/cognita |
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry |
1018 |
OpenDriveLab/UniAD |
[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving |
1019 |
damo-vilab/AnyDoor |
Official implementations for paper: Anydoor: zero-shot object-level image customization |
1020 |
openai/grok |
|
1021 |
fixie-ai/ultravox |
A fast multimodal LLM for real-time voice |
1022 |
Ironclad/rivet |
The open-source visual AI programming environment and TypeScript library |
1023 |
google/security-research |
This project hosts security advisories and their accompanying proof-of-concepts related to research conducted at Google which impact non-Google owned code. |
1024 |
CrazyBoyM/llama3-Chinese-chat |
Llama3 Chinese Repository with modified versions, and training and deployment resources |
1025 |
metavoiceio/metavoice-src |
AI for human-level speech intelligence |
1026 |
mshumer/gpt-llm-trainer |
|
1027 |
Facico/Chinese-Vicuna |
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model |
1028 |
CanadaHonk/porffor |
A from-scratch experimental AOT JS engine, written in JS |
1029 |
tencentmusic/supersonic |
SuperSonic is the next-generation BI platform that integrates Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms. |
1030 |
zigtools/zls |
A language server for Zig supporting developers with features like autocomplete and goto definition |
1031 |
nucleuscloud/neosync |
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments. |
1032 |
a16z-infra/ai-getting-started |
A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs |
1033 |
keijiro/AICommand |
ChatGPT integration with Unity Editor |
1034 |
lfnovo/open-notebook |
An Open Source implementation of Notebook LM with more flexibility and features |
1035 |
unit-mesh/auto-dev |
🧙AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀 |
1036 |
SysCV/sam-hq |
Segment Anything in High Quality |
1037 |
AI4Finance-Foundation/FinRobot |
FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀 |
1038 |
cvg/LightGlue |
LightGlue: Local Feature Matching at Light Speed (ICCV 2023) |
1039 |
openai/simple-evals |
|
1040 |
joanrod/star-vector |
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision. |
1041 |
jackMort/ChatGPT.nvim |
ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API |
1042 |
deepseek-ai/DeepSeek-VL |
DeepSeek-VL: Towards Real-World Vision-Language Understanding |
1043 |
huggingface/distil-whisper |
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. |
1044 |
manycore-research/SpatialLM |
SpatialLM: Training Large Language Models for Structured Indoor Modeling |
1045 |
LLMBook-zh/LLMBook-zh.github.io |
《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣 |
1046 |
mongodb-developer/GenAI-Showcase |
GenAI Cookbook |
1047 |
dtyq/magic |
The first open-source all-in-one AI productivity platform (Generalist AI Agent Workflow Engine IM Online collaborative office system) |
1048 |
apple/ml-mgie |
|
1049 |
Codium-ai/AlphaCodium |
code generation tool that surpasses most human competitors in CodeContests |
1050 |
neo4j-labs/llm-graph-builder |
Neo4j graph construction from unstructured data using LLMs |
1051 |
Significant-Gravitas/Auto-GPT-Plugins |
Plugins for Auto-GPT |
1052 |
FlagAI-Open/FlagAI |
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model. |
1053 |
llSourcell/DoctorGPT |
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private. |
1054 |
shroominic/codeinterpreter-api |
Open source implementation of the ChatGPT Code Interpreter 👾 |
1055 |
ravenscroftj/turbopilot |
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU |
1056 |
deanxv/coze-discord-proxy |
代理Discord-Bot对话Coze-Bot,实现API形式请求GPT4对话模型/微调模型 |
⭐ 1057 |
coleam00/ottomator-agents |
All the open source AI Agents hosted on the oTTomator Live Agent Studio platform! |
1058 |
jupyterlab/jupyter-ai |
A generative AI extension for JupyterLab |
1059 |
billmei/every-chatgpt-gui |
Every front-end GUI client for ChatGPT |
1060 |
BasedHardware/OpenGlass |
Turn any glasses into AI-powered smart glasses |
1061 |
meta-llama/PurpleLlama |
Set of tools to assess and improve LLM security. |
1062 |
ageerle/ruoyi-ai |
RuoYi AI 是一个全栈式 AI 开发平台,旨在帮助开发者快速构建和部署个性化的 AI 应用。 |
1063 |
0hq/WebGPT |
Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~2000 lines of vanilla Javascript. |
1064 |
hiyouga/ChatGLM-Efficient-Tuning |
Fine-tuning ChatGLM-6B with PEFT |
1065 |
srush/Tensor-Puzzles |
Solve puzzles. Improve your pytorch. |
1066 |
Yue-Yang/ChatGPT-Siri |
Shortcuts for Siri using ChatGPT API gpt-3.5-turbo model |
1067 |
yuruotong1/autoMate |
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves |
1068 |
ant-design/x |
Craft AI-driven interfaces effortlessly 🤖 |
1069 |
google-deepmind/gemma |
Open weights LLM from Google DeepMind. |
1070 |
Kent0n-Li/ChatDoctor |
A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge |
1071 |
hemansnation/God-Level-Data-Science-ML-Full-Stack |
A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI |
1072 |
mlabonne/llm-datasets |
High-quality datasets, tools, and concepts for LLM fine-tuning. |
1073 |
microsoft/torchgeo |
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data |
1074 |
pytorch/torchchat |
Run PyTorch LLMs locally on servers, desktop and mobile |
1075 |
unslothai/notebooks |
Fine-tune LLMs for free with guided Notebooks on Google Colab, Kaggle, and more. |
1076 |
panaversity/learn-agentic-ai |
Learn Agentic AI using Dapr Agentic Cloud Ascent (DACA) Design Pattern and Agent-Native Cloud Technologies: OpenAI Agents SDK, Memory, MCP, A2A, Knowledge Graphs, Dapr, Rancher Desktop, and Kubernetes. |
1077 |
whoiskatrin/chart-gpt |
AI tool to build charts based on text input |
1078 |
amazon-science/chronos-forecasting |
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting |
1079 |
ricklamers/gpt-code-ui |
An open source implementation of OpenAI's ChatGPT Code interpreter |
1080 |
hustvl/Vim |
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model |
1081 |
microsoft/PromptWizard |
Task-Aware Agent-driven Prompt Optimization Framework |
1082 |
NExT-GPT/NExT-GPT |
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model |
1083 |
Dataherald/dataherald |
Interact with your SQL database, Natural Language to SQL using LLMs |
1084 |
Lightning-AI/LitServe |
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale. |
1085 |
nerfstudio-project/gsplat |
CUDA accelerated rasterization of gaussian splatting |
1086 |
xtekky/chatgpt-clone |
ChatGPT interface with better UI |
1087 |
JoshuaC215/agent-service-toolkit |
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit |
1088 |
NVlabs/VILA |
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops) |
1089 |
minimaxir/simpleaichat |
Python package for easily interfacing with chat apps, with robust features and minimal code complexity. |
1090 |
Netflix/maestro |
Maestro: Netflix’s Workflow Orchestrator |
1091 |
microsoft/Phi-3CookBook |
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks. |
1092 |
aliasrobotics/cai |
Cybersecurity AI (CAI), an open Bug Bounty-ready Artificial Intelligence |
1093 |
SLAM-Handbook-contributors/slam-handbook-public-release |
Release repo for our SLAM Handbook |
1094 |
run-llama/llama-hub |
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain |
1095 |
iryna-kondr/scikit-llm |
Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks. |
1096 |
denizsafak/abogen |
Generate audiobooks from EPUBs, PDFs and text with synchronized captions. |
1097 |
ToTheBeginning/PuLID |
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment |
1098 |
rowboatlabs/rowboat |
AI-powered multi-agent builder |
1099 |
rashadphz/farfalle |
🔍 AI search engine - self-host with local or cloud LLMs |
1100 |
huggingface/safetensors |
Simple, safe way to store and distribute tensors |
1101 |
princeton-nlp/SWE-bench |
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues? |
1102 |
TideDra/zotero-arxiv-daily |
Recommend new arxiv papers of your interest daily according to your Zotero library. |
1103 |
openai/openai-realtime-console |
React app for inspecting, building and debugging with the Realtime API |
1104 |
Stability-AI/stable-audio-tools |
Generative models for conditional audio generation |
1105 |
ethen8181/machine-learning |
🌎 machine learning tutorials (mainly in Python3) |
|
apple/embedding-atlas |
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata. |
|
MarkFzp/act-plus-plus |
Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN |
1108 |
opengeos/segment-geospatial |
A Python package for segmenting geospatial data with the Segment Anything Model (SAM) |
1109 |
NVIDIA/GenerativeAIExamples |
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. |
1110 |
pashpashpash/vault-ai |
OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, etc) using a simple React frontend. |
1111 |
sugarforever/chat-ollama |
ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management. |
1112 |
PKU-YuanGroup/Video-LLaVA |
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection |
1113 |
MrForExample/ComfyUI-3D-Pack |
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.) |
1114 |
iusztinpaul/hands-on-llms |
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴 |
1115 |
ed-donner/llm_engineering |
Repo to accompany my mastering LLM engineering course |
1116 |
iree-org/iree |
A retargetable MLIR-based machine learning compiler and runtime toolkit. |
1117 |
dvlab-research/MiniGemini |
Official implementation for Mini-Gemini |
1118 |
ai-boost/Awesome-GPTs |
Curated list of awesome GPTs 👍. |
1119 |
Azure-Samples/cognitive-services-speech-sdk |
Sample code for the Microsoft Cognitive Services Speech SDK |
1120 |
AprilNEA/ChatGPT-Admin-Web |
ChatGPT WebUI with user management and admin dashboard system |
1121 |
adamcohenhillel/ADeus |
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI. |
1122 |
Luodian/Otter |
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability. |
1123 |
smallcloudai/refact |
AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result. |
1124 |
gptscript-ai/gptscript |
Natural Language Programming |
1125 |
girafe-ai/ml-course |
Open Machine Learning course |
1126 |
TracecatHQ/tracecat |
😼 The AI-native, open source alternative to Tines / Splunk SOAR. |
1127 |
OpenGVLab/InternGPT |
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统) |
1128 |
pytorch/executorch |
On-device AI across mobile, embedded and edge for PyTorch |
1129 |
facebookresearch/jepa |
PyTorch code and models for V-JEPA self-supervised learning from video. |
1130 |
linyiLYi/bilibot |
A local chatbot fine-tuned by bilibili user comments. |
1131 |
lanqian528/chat2api |
A service that can convert ChatGPT on the web to OpenAI API format. |
1132 |
project-baize/baize-chatbot |
Let ChatGPT teach your own chatbot in hours with a single GPU! |
1133 |
neuralmagic/deepsparse |
Sparsity-aware deep learning inference runtime for CPUs |
1134 |
allenai/open-instruct |
|
1135 |
facebookresearch/habitat-sim |
A flexible, high-performance 3D simulator for Embodied AI research. |
1136 |
intuitem/ciso-assistant-community |
CISO Assistant is a one-stop-shop for GRC, covering Risk, AppSec and Audit Management and supporting +70 frameworks worldwide with auto-mapping: NIST CSF, ISO 27001, SOC2, CIS, PCI DSS, NIS2, CMMC, PSPF, GDPR, HIPAA, Essential Eight, NYDFS-500, DORA, NIST AI RMF, 800-53, 800-171, CyFun, CJIS, AirCyber, NCSC, ECC, SCF and so much more |
1137 |
damo-vilab/i2vgen-xl |
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models |
1138 |
microsoft/torchscale |
Foundation Architecture for (M)LLMs |
1139 |
jqnatividad/qsv |
CSVs sliced, diced & analyzed. |
1140 |
cohere-ai/cohere-toolkit |
Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. |
1141 |
daveshap/OpenAI_Agent_Swarm |
HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!" |
1142 |
xenova/whisper-web |
ML-powered speech recognition directly in your browser |
1143 |
Josh-XT/AGiXT |
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions. |
1144 |
CVI-SZU/Linly |
Chinese-LLaMA basic model; ChatFlow Chinese conversation model; NLP pre-training/command fine-tuning dataset |
1145 |
salesforce/CodeT5 |
Home of CodeT5: Open Code LLMs for Code Understanding and Generation |
1146 |
facebookresearch/ijepa |
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture." |
1147 |
NVIDIA/trt-llm-rag-windows |
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM |
1148 |
CodingChallengesFYI/SharedSolutions |
Publicly shared solutions to Coding Challenges |
1149 |
eureka-research/Eureka |
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" |
1150 |
epicenter-so/epicenter |
Press shortcut → speak → get text. Free and open source. More local-first apps soon ❤️ |
1151 |
coleam00/ai-agents-masterclass |
Follow along with my AI Agents Masterclass videos! All of the code I create and use in this series on YouTube will be here for you to use and even build on top of! |
1152 |
nus-apr/auto-code-rover |
A project structure aware autonomous software engineer aiming for autonomous program improvement |
1153 |
williamyang1991/Rerender_A_Video |
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation |
1154 |
daveebbelaar/ai-cookbook |
Examples and tutorials to help developers build AI systems |
1155 |
li-plus/chatglm.cpp |
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs |
1156 |
baichuan-inc/Baichuan-13B |
A 13B large language model developed by Baichuan Intelligent Technology |
1157 |
deepseek-ai/DreamCraft3D |
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior |
1158 |
NVIDIA/cuda-python |
CUDA Python: Performance meets Productivity |
1159 |
cvlab-columbia/zero123 |
Zero-1-to-3: Zero-shot One Image to 3D Object: https://zero123.cs.columbia.edu/ |
1160 |
gmpetrov/databerry |
The no-code platform for building custom LLM Agents |
1161 |
Doubiiu/DynamiCrafter |
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors |
1162 |
hegelai/prompttools |
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate). |
1163 |
facebookresearch/Pearl |
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta. |
1164 |
Azure/azure-rest-api-specs |
The source for REST API specifications for Microsoft Azure. |
1165 |
willccbb/verifiers |
Verifiers for LLM Reinforcement Learning |
1166 |
Kludex/fastapi-tips |
FastAPI Tips by The FastAPI Expert! |
1167 |
JetBrains/koog |
Koog is the official Kotlin framework for building and running robust, scalable and production-ready AI agents across all platforms – from backend services to Android and iOS, JVM, and even in-browser environments. Koog is based on our AI products expertise and provides proven solutions for complex LLM and AI problems |
1168 |
muellerberndt/mini-agi |
A minimal generic autonomous agent based on GPT3.5/4. Can analyze stock prices, perform network security tests, create art, and order pizza. |
1169 |
mazzzystar/Queryable |
Run CLIP on iPhone to Search Photos. |
1170 |
deepseek-ai/DeepSeek-Math |
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models |
1171 |
airweave-ai/airweave |
Airweave lets agents search any app |
1172 |
elastic/otel-profiling-agent |
The production-scale datacenter profiler |
1173 |
Azure/PyRIT |
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems. |
1174 |
johnma2006/mamba-minimal |
Simple, minimal implementation of the Mamba SSM in one file of PyTorch. |
1175 |
facebookresearch/audio2photoreal |
Code and dataset for photorealistic Codec Avatars driven from audio |
1176 |
SamurAIGPT/privateGPT |
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks |
1177 |
SCUTlihaoyu/open-chat-video-editor |
Open source short video automatic generation tool |
1178 |
Alpha-VLLM/LLaMA2-Accessory |
An Open-source Toolkit for LLM Development |
1179 |
firebase/genkit |
An open source framework for building AI-powered apps with familiar code-centric patterns. Genkit makes it easy to develop, integrate, and test AI features with observability and evaluations. Genkit works with various models and platforms. |
1180 |
kubernetes-sigs/controller-runtime |
Repo for the controller-runtime subproject of kubebuilder (sig-apimachinery) |
1181 |
leptonai/leptonai |
A Pythonic framework to simplify AI service building |
1182 |
microsoft/genaiscript |
Automatable GenAI Scripting |
1183 |
TMElyralab/MuseV |
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising |
1184 |
Eladlev/AutoPrompt |
A framework for prompt tuning using Intent-based Prompt Calibration |
1185 |
janhq/nitro |
A fast, lightweight, embeddable inference engine to supercharge your apps with local AI. OpenAI-compatible API |
1186 |
FranxYao/chain-of-thought-hub |
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting |
1187 |
NVIDIA/nv-ingest |
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems. |
1188 |
src-d/hercules |
Gaining advanced insights from Git repository history. |
1189 |
OpenPipe/OpenPipe |
Turn expensive prompts into cheap fine-tuned models |
1190 |
microsoft/promptbench |
A unified evaluation framework for large language models |
1191 |
SuperTux/supertux |
SuperTux source code |
1192 |
georgia-tech-db/eva |
AI-Relational Database System |
1193 |
ishan0102/vimGPT |
Browse the web with GPT-4V and Vimium |
1194 |
Peterande/D-FINE |
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight] |
1195 |
gptlink/gptlink |
Build your own free commercial ChatGPT environment in 10 minutes. The setup is simple and includes features such as user management, orders, tasks, and payments |
1196 |
abi/secret-llama |
Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3. |
1197 |
Blaizzy/mlx-audio |
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon. |
1198 |
ashishpatel26/LLM-Finetuning |
LLM Finetuning with peft |
1199 |
iterative/datachain |
AI-data warehouse to enrich, transform and analyze data from cloud storages |
1200 |
liou666/polyglot |
Desktop AI Language Practice Application |
1201 |
TMElyralab/MusePose |
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation |
1202 |
baaivision/Painter |
Painter & SegGPT Series: Vision Foundation Models from BAAI |
1203 |
databricks/dbrx |
Code examples and resources for DBRX, a large language model developed by Databricks |
1204 |
IDEA-Research/T-Rex |
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy |
1205 |
lamini-ai/lamini |
Official repo for Lamini's data generator for generating instructions to train instruction-following LLMs |
1206 |
bghira/SimpleTuner |
A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL. |
1207 |
mshumer/gpt-author |
|
1208 |
6drf21e/ChatTTS_colab |
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。 |
1209 |
kevmo314/magic-copy |
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard. |
1210 |
naver/mast3r |
Grounding Image Matching in 3D with MASt3R |
1211 |
langwatch/langwatch |
The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨ |
1212 |
paulpierre/RasaGPT |
💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram |
1213 |
OpenBMB/CPM-Bee |
A bilingual large-scale model with trillions of parameters |
1214 |
THUDM/CogVLM2 |
GPT4V-level open-source multi-modal model based on Llama3-8B |
1215 |
SoraWebui/SoraWebui |
SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model. |
1216 |
semanser/codel |
✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor. |
1217 |
Azure-Samples/graphrag-accelerator |
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure |
1218 |
IBM/mcp-context-forge |
A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API endpoints to MCP, composes virtual MCP servers with added security and observability, and converts between protocols (stdio, SSE, Streamable HTTP). |
1219 |
tencent-ailab/V-Express |
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. |
1220 |
cloudflare/agents |
Build and deploy AI Agents on Cloudflare |
1221 |
SpecterOps/BloodHound |
Six Degrees of Domain Admin |
1222 |
dvmazur/mixtral-offloading |
Run Mixtral-8x7B models in Colab or consumer desktops |
1223 |
pytorch/ao |
PyTorch native quantization and sparsity for training and inference |
|
unitreerobotics/unitree_rl_gym |
|
|
NVIDIA/gpu-operator |
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes |
|
aixcoder-plugin/aiXcoder-7B |
official repository of aiXcoder-7B Code Large Language Model |
|
darrenburns/elia |
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. |
1228 |
BAAI-Agents/Cradle |
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements. |
1229 |
mshumer/gpt-investor |
|
1230 |
huggingface/cookbook |
Open-source AI cookbook |
1231 |
apple/axlearn |
An Extensible Deep Learning Library |
1232 |
PKU-YuanGroup/MoE-LLaVA |
Mixture-of-Experts for Large Vision-Language Models |
|
google/mangle |
|
|
Alpha-VLLM/Lumina-T2X |
Lumina-T2X is a unified framework for Text to Any Modality Generation |
|
hncboy/chatgpt-web-java |
ChatGPT project developed in Java, based on Spring Boot 3 and JDK 17, supports both AccessToken and ApiKey modes |
1236 |
exa-labs/exa-mcp-server |
Claude can perform Web Search |
1237 |
GaParmar/img2img-turbo |
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more |
1238 |
openai/consistencydecoder |
Consistency Distilled Diff VAE |
1239 |
emcf/engshell |
An English-language shell for any OS, powered by LLMs |
1240 |
cgpotts/cs224u |
Code for Stanford CS224u |
1241 |
JiauZhang/DragGAN |
Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold |
|
humanlayer/humanlayer |
HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and more. Bring your LLM and Framework of choice and start giving your AI agents safe access to the world. Agentic Workflows, human in the loop, tool calling |
|
The-OpenROAD-Project/OpenROAD |
OpenROAD's unified application implementing an RTL-to-GDS Flow. Documentation at https://openroad.readthedocs.io/en/latest/ |
1244 |
xlang-ai/OSWorld |
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments |
1245 |
ai-christianson/RA.Aid |
Develop software autonomously. |
1246 |
snap-stanford/Biomni |
Biomni: a general-purpose biomedical AI agent |
1247 |
ed-donner/agents |
Repo for the Complete Agentic AI Engineering Course |
1248 |
nunchaku-tech/ComfyUI-nunchaku |
ComfyUI Plugin of Nunchaku |
1249 |
microsoft/aici |
AICI: Prompts as (Wasm) Programs |
1250 |
supabase-community/supabase-mcp |
Connect Supabase to your AI assistants |
1251 |
idaholab/moose |
Multiphysics Object Oriented Simulation Environment |
1252 |
PRIS-CV/DemoFusion |
Let us democratise high-resolution generation! (arXiv 2023) |
1253 |
ml-explore/mlx-swift-examples |
Examples using MLX Swift |
1254 |
Nutlope/notesGPT |
Record voice notes & transcribe, summarize, and get tasks |
1255 |
Niek/chatgpt-web |
ChatGPT web interface using the OpenAI API |
1256 |
InstantStyle/InstantStyle |
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 |
1257 |
liltom-eth/llama2-webui |
Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference. |
1258 |
ymcui/Chinese-LLaMA-Alpaca-3 |
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3 |
1259 |
google/maxtext |
A simple, performant and scalable Jax LLM! |
1260 |
microsoft/sample-app-aoai-chatGPT |
[PREVIEW] Sample code for a simple web chat experience targeting chatGPT through AOAI. |
1261 |
kyegomez/BitNet |
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch |
1262 |
eloialonso/diamond |
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. |
1263 |
landing-ai/agentic-doc |
Python library for Agentic Document Extraction from LandingAI |
1264 |
ytongbai/LVM |
Sequential Modeling Enables Scalable Learning for Large Vision Models |
1265 |
YangLing0818/RPG-DiffusionMaster |
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG) |
1266 |
SharifiZarchi/Introduction_to_Machine_Learning |
دورهی مقدمهای بر یادگیری ماشین، برای دانشجویان |
1267 |
skills/copilot-codespaces-vscode |
Develop with AI-powered code suggestions using GitHub Copilot and VS Code |
1268 |
AI-Citizen/SolidGPT |
Chat everything with your code repository, ask repository level code questions, and discuss your requirements. AI Scan and learning your code repository, provide you code repository level answer🧱 🧱 |
1269 |
flowdriveai/flowpilot |
flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines. |
1270 |
kevmo314/scuda |
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines. |
1271 |
linyiLYi/snake-ai |
An AI agent that beats the classic game "Snake". |
1272 |
baaivision/Emu |
Emu Series: Generative Multimodal Models from BAAI |
1273 |
microsoft/MoGe |
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision |
1274 |
TencentARC/BrushNet |
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" |
1275 |
microsoft/mcp |
Catalog of official Microsoft MCP (Model Context Protocol) server implementations for AI-powered data access and tool integration |
1276 |
stephansturges/WALDO |
Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones! |
1277 |
collabora/WhisperFusion |
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI. |
1278 |
TeamNewPipe/NewPipeExtractor |
NewPipe's core library for extracting data from streaming sites |
1279 |
sunnypilot/sunnypilot |
sunnypilot is an open source driver assistance system. sunnypilot offers the user a unique driving experience for over 300 supported car makes and models with modified behaviors of driving assist engagements. sunnypilot complies with the safety policy from comma.ai's openpilot as accurately as possible. |
1280 |
mhamilton723/FeatUp |
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024 |
1281 |
lichao-sun/Mora |
Mora: More like Sora for Generalist Video Generation |
1282 |
deepsense-ai/ragbits |
Building blocks for rapid development of GenAI applications |
1283 |
GoogleCloudPlatform/localllm |
Run LLMs locally on Cloud Workstations |
1284 |
microsoft/vscode-ai-toolkit |
|
1285 |
aws/amazon-q-developer-cli |
✨ Agentic chat experience in your terminal. Build applications using natural language. |
1286 |
AnswerDotAI/fsdp_qlora |
Training LLMs with QLoRA + FSDP |
1287 |
andrewnguonly/Lumos |
A RAG LLM co-pilot for browsing the web, powered by local LLMs |
1288 |
jgravelle/AutoGroq |
|
1289 |
zju3dv/EasyVolcap |
[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research |
1290 |
OpenGenerativeAI/llm-colosseum |
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM |
1291 |
goatcorp/Dalamud |
FFXIV plugin framework and API |
1292 |
TraceMachina/nativelink |
NativeLink is an open source high-performance build cache and remote execution server, compatible with Bazel, Buck2, Reclient, and other RBE-compatible build systems. It offers drastically faster builds, reduced test flakiness, and significant infrastructure cost savings. |
1293 |
Akkudoktor-EOS/EOS |
This repository features an Energy Optimization System (EOS) that optimizes energy distribution, usage for batteries, heat pumps& household devices. It includes predictive models for electricity prices (planned), load forecasting& dynamic optimization to maximize energy efficiency & minimize costs. Founder Dr. Andreas Schmitz (YouTube @akkudoktor) |
1294 |
McGill-NLP/webllama |
Llama-3 agents that can browse the web by following instructions and talking to you |
1295 |
Lightning-AI/lightning-thunder |
Make PyTorch models Lightning fast! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once. |
1296 |
lucidrains/self-rewarding-lm-pytorch |
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI |
1297 |
ishaan1013/sandbox |
A cloud-based code editing environment with an AI copilot and real-time collaboration. |
1298 |
scottpetrovic/mesh2motion-app |
Import a 3D Model and automatically assign and export animations |
1299 |
MLSysOps/MLE-agent |
🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Ollama, etc supported. 🎆 Code RAG |
1300 |
SakanaAI/evolutionary-model-merge |
Official repository of Evolutionary Optimization of Model Merging Recipes |
1301 |
elixir-lang/expert |
Official Elixir Language Server Protocol implementation |
1302 |
PKU-YuanGroup/MagicTime |
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators |
1303 |
dataease/SQLBot |
Text-to-SQL Generation via LLMs using RAG. |
1304 |
a-real-ai/pywinassistant |
The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models. |
1305 |
aws-samples/amazon-bedrock-samples |
This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models |
1306 |
wpilibsuite/allwpilib |
Official Repository of WPILibJ and WPILibC |
1307 |
langchain-ai/langchain-extract |
🦜⛏️ Did you say you like data? |
1308 |
elfvingralf/macOSpilot-ai-assistant |
Voice + Vision powered AI assistant that answers questions about any application, in context and in audio. |
1309 |
FoundationVision/GLEE |
【CVPR2024】GLEE: General Object Foundation Model for Images and Videos at Scale |
1310 |
Profluent-AI/OpenCRISPR |
AI-generated gene editing systems |
1311 |
ikawrakow/ik_llama.cpp |
llama.cpp fork with additional SOTA quants and improved performance |
1312 |
tenstorrent/tt-metal |
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model. |
1313 |
PaddlePaddle/PaddleHelix |
Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning “螺旋桨”生物计算工具集 |
⭐ 1314 |
socfortress/Wazuh-Rules |
Advanced Wazuh Rules for more accurate threat detection. Feel free to implement within your own Wazuh environment, contribute, or fork! |
1315 |
HITsz-TMG/FilmAgent |
Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the making! |
1316 |
likejazz/llama3.np |
llama3.np is pure NumPy implementation for Llama 3 model. |
1317 |
myshell-ai/JetMoE |
Reaching LLaMA2 Performance with 0.1M Dollars |
1318 |
mustafaaljadery/gemma-2B-10M |
Gemma 2B with 10M context length using Infini-attention. |
1319 |
NVIDIA-NeMo/RL |
Scalable toolkit for efficient model reinforcement |
Tip:
symbol | rule |
---|---|
🔥 | 256 < stars today <= 512 |
🔥🔥 | 512 < stars today <= 1k |
🔥🔥🔥 | stars today > 1k |
|
ranking up / down |
⭐ | on trending page today |
No. |
Tool | Description |
---|---|---|
1 | ChatGPT | A sibling model to InstructGPT, which is trained to follow instructions in a prompt and provide a detailed response |
2 | DALL·E 2 | Create original, realistic images and art from a text description |
3 | Murf AI | AI enabled, real people's voices |
4 | Midjourney | An independent research lab that produces an artificial intelligence program under the same name that creates images from textual descriptions, used in Discord |
5 | Make-A-Video | Make-A-Video is a state-of-the-art AI system that generates videos from text |
6 | Creative Reality™ Studio by D-ID | Use generative AI to create future-facing videos |
7 | chat.D-ID | The First App Enabling Face-to-Face Conversations with ChatGPT |
8 | Notion AI | Access the limitless power of AI, right inside Notion. Work faster. Write better. Think bigger. |
9 | Runway | Text to Video with Gen-2 |
10 | Resemble AI | Resemble’s AI voice generator lets you create human–like voice overs in seconds |
11 | Cursor | Write, edit, and chat about your code with a powerful AI |
12 | Hugging Face | Build, train and deploy state of the art models powered by the reference open source in machine learning |
13 |
Claude |
A next-generation AI assistant for your tasks, no matter the scale |
14 | Poe | Poe lets you ask questions, get instant answers, and have back-and-forth conversations with AI. Gives access to GPT-4, gpt-3.5-turbo, Claude from Anthropic, and a variety of other bots |
No. |
WebSite |
Description |
---|---|---|
1 | OpenAI | An artificial intelligence research lab |
2 | Bard | Base Google's LaMDA chatbots and pull from internet |
3 | ERNIE Bot | Baidu’s new generation knowledge-enhanced large language model is a new member of the Wenxin large model family |
4 | DALL·E 2 | An AI system that can create realistic images and art from a description in natural language |
5 | Whisper | A general-purpose speech recognition model |
6 | CivitAI | A platform that makes it easy for people to share and discover resources for creating AI art |
7 | D-ID | D-ID’s Generative AI enables users to transform any picture or video into extraordinary experiences |
8 | Nvidia eDiff-I | Text-to-Image Diffusion Models with Ensemble of Expert Denoisers |
9 | Stability AI | The world's leading open source generative AI company which opened source Stable Diffusion |
10 | Meta AI | Whether it be research, product or infrastructure development, we’re driven to innovate responsibly with AI to benefit the world |
11 | ANTHROPIC | AI research and products that put safety at the frontier |
No. |
Report&Paper |
Description |
---|---|---|
1 | GPT-4 Technical Report | GPT-4 Technical Report |
2 | mli/paper-reading | Deep learning classics and new papers are read carefully paragraph by paragraph. |
3 | labmlai/annotated_deep_learning_paper_implementations | A collection of simple PyTorch implementations of neural networks and related algorithms, which are documented with explanations |
4 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Talking, Drawing and Editing with Visual Foundation Models |
5 | OpenAI Research | The latest research report and papers from OpenAI |
6 | Make-A-Video: Text-to-Video Generation without Text-Video Data | Meta's Text-to-Video Generation |
7 | eDiff-I: Text-to-Image Diffusion Models with Ensemble of Expert Denoisers | Nvidia eDiff-I - New generation of generative AI content creation tool |
8 | Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo | 2023 GPT4All Technical Report |
9 | Segment Anything | Meta Segment Anything |
10 | LLaMA: Open and Efficient Foundation Language Models | LLaMA: a collection of foundation language models ranging from 7B to 65B parameters |
11 | papers-we-love/papers-we-love | Papers from the computer science community to read and discuss |
12 | CVPR 2023 papers | The most exciting and influential CVPR 2023 papers |
No. |
Tutorial | Description |
---|---|---|
1 | Coursera - Machine Learning | The Machine Learning Specialization Course taught by Dr. Andrew Ng |
2 | microsoft/ML-For-Beginners | 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all |
3 | ChatGPT Prompt Engineering for Developers | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI) will teach how to use a large language model (LLM) to quickly build new and powerful applications |
4 | Dive into Deep Learning | Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries |
5 | AI Expert Roadmap | Roadmap to becoming an Artificial Intelligence Expert in 2022 |
6 | Computer Science courses | List of Computer Science courses with video lectures |
7 | Machine Learning with Python | Machine Learning with Python Certification on freeCodeCamp |
8 | Building Systems with the ChatGPT API | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI), you will learn how to automate complex workflows using chain calls to a large language model |
9 | LangChain for LLM Application Development | This short course taught by Harrison Chase (Co-Founder and CEO at LangChain) and Andrew Ng. you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework |
10 | How Diffusion Models Work | This short course taught by Sharon Zhou (CEO, Co-founder, Lamini). you will gain a deep familiarity with the diffusion process and the models which carry it out. More than simply pulling in a pre-built model or using an API, this course will teach you to build a diffusion model from scratch |
11 | Free Programming Books For AI | 📚 Freely available programming books for AI |
12 | microsoft/AI-For-Beginners | 12 Weeks, 24 Lessons, AI for All! |
13 | hemansnation/God-Level-Data-Science-ML-Full-Stack | A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI |
14 | datawhalechina/prompt-engineering-for-developers | Chinese version of Andrew Ng's Big Model Series Courses, including "Prompt Engineering", "Building System", and "LangChain" |
15 | ossu/computer-science | 🎓 Path to a free self-taught education in Computer Science! |
16 | microsoft/Data-Science-For-Beginners | 10 Weeks, 20 Lessons, Data Science for All! |
17 |
jwasham/coding-interview-university |
A complete computer science study plan to become a software engineer. |
If this project has been helpful to you in any way, please give it a ⭐️ by clicking on the star.
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for AITreasureBox
Similar Open Source Tools

AITreasureBox
AITreasureBox is a comprehensive collection of AI tools and resources designed to simplify and accelerate the development of AI projects. It provides a wide range of pre-trained models, datasets, and utilities that can be easily integrated into various AI applications. With AITreasureBox, developers can quickly prototype, test, and deploy AI solutions without having to build everything from scratch. Whether you are working on computer vision, natural language processing, or reinforcement learning projects, AITreasureBox has something to offer for everyone. The repository is regularly updated with new tools and resources to keep up with the latest advancements in the field of artificial intelligence.

AiTreasureBox
AiTreasureBox is a versatile AI tool that provides a collection of pre-trained models and algorithms for various machine learning tasks. It simplifies the process of implementing AI solutions by offering ready-to-use components that can be easily integrated into projects. With AiTreasureBox, users can quickly prototype and deploy AI applications without the need for extensive knowledge in machine learning or deep learning. The tool covers a wide range of tasks such as image classification, text generation, sentiment analysis, object detection, and more. It is designed to be user-friendly and accessible to both beginners and experienced developers, making AI development more efficient and accessible to a wider audience.

LLM-FineTuning-Large-Language-Models
This repository contains projects and notes on common practical techniques for fine-tuning Large Language Models (LLMs). It includes fine-tuning LLM notebooks, Colab links, LLM techniques and utils, and other smaller language models. The repository also provides links to YouTube videos explaining the concepts and techniques discussed in the notebooks.

david-ai
David UI is a free and open-source collection of customizable, production-ready UI components built with Tailwind CSS. It is designed to be developer-friendly and performance-focused, streamlining the creation of modern, visually appealing interfaces to help deliver high-quality user experiences faster.

osm-ai-helper
OSM-AI-helper is a Blueprint by Mozilla.ai designed to assist users in mapping features in OpenStreetMap using object detection and image segmentation models. It provides tools for identifying and mapping various features, such as swimming pools, in OpenStreetMap. Users can also create custom datasets and fine-tune models for different use cases. The project is licensed under the AGPL-3.0 License and welcomes contributions from the community.

vectordb-recipes
This repository contains examples, applications, starter code, & tutorials to help you kickstart your GenAI projects. * These are built using LanceDB, a free, open-source, serverless vectorDB that **requires no setup**. * It **integrates into python data ecosystem** so you can simply start using these in your existing data pipelines in pandas, arrow, pydantic etc. * LanceDB has **native Typescript SDK** using which you can **run vector search** in serverless functions! This repository is divided into 3 sections: - Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! - Applications - Ready to use Python and web apps using applied LLMs, VectorDB and GenAI tools - Tutorials - A curated list of tutorials, blogs, Colabs and courses to get you started with GenAI in greater depth.

swift
SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) supports training, inference, evaluation and deployment of nearly **200 LLMs and MLLMs** (multimodal large models). Developers can directly apply our framework to their own research and production environments to realize the complete workflow from model training and evaluation to application. In addition to supporting the lightweight training solutions provided by [PEFT](https://github.com/huggingface/peft), we also provide a complete **Adapters library** to support the latest training techniques such as NEFTune, LoRA+, LLaMA-PRO, etc. This adapter library can be used directly in your own custom workflow without our training scripts. To facilitate use by users unfamiliar with deep learning, we provide a Gradio web-ui for controlling training and inference, as well as accompanying deep learning courses and best practices for beginners. Additionally, we are expanding capabilities for other modalities. Currently, we support full-parameter training and LoRA training for AnimateDiff.

UMOE-Scaling-Unified-Multimodal-LLMs
Uni-MoE is a MoE-based unified multimodal model that can handle diverse modalities including audio, speech, image, text, and video. The project focuses on scaling Unified Multimodal LLMs with a Mixture of Experts framework. It offers enhanced functionality for training across multiple nodes and GPUs, as well as parallel processing at both the expert and modality levels. The model architecture involves three training stages: building connectors for multimodal understanding, developing modality-specific experts, and incorporating multiple trained experts into LLMs using the LoRA technique on mixed multimodal data. The tool provides instructions for installation, weights organization, inference, training, and evaluation on various datasets.

mnn-llm
MNN-LLM is a high-performance inference engine for large language models (LLMs) on mobile and embedded devices. It provides optimized implementations of popular LLM models, such as ChatGPT, BLOOM, and GPT-3, enabling developers to easily integrate these models into their applications. MNN-LLM is designed to be efficient and lightweight, making it suitable for resource-constrained devices. It supports various deployment options, including mobile apps, web applications, and embedded systems. With MNN-LLM, developers can leverage the power of LLMs to enhance their applications with natural language processing capabilities, such as text generation, question answering, and dialogue generation.

Awesome-Colorful-LLM
Awesome-Colorful-LLM is a meticulously assembled anthology of vibrant multimodal research focusing on advancements propelled by large language models (LLMs) in domains such as Vision, Audio, Agent, Robotics, and Fundamental Sciences like Mathematics. The repository contains curated collections of works, datasets, benchmarks, projects, and tools related to LLMs and multimodal learning. It serves as a comprehensive resource for researchers and practitioners interested in exploring the intersection of language models and various modalities for tasks like image understanding, video pretraining, 3D modeling, document understanding, audio analysis, agent learning, robotic applications, and mathematical research.

databend
Databend is an open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake. With its focus on fast query execution and data ingestion, it's designed for complex analysis of the world's largest datasets.

LLaVA-pp
This repository, LLaVA++, extends the visual capabilities of the LLaVA 1.5 model by incorporating the latest LLMs, Phi-3 Mini Instruct 3.8B, and LLaMA-3 Instruct 8B. It provides various models for instruction-following LMMS and academic-task-oriented datasets, along with training scripts for Phi-3-V and LLaMA-3-V. The repository also includes installation instructions and acknowledgments to related open-source contributions.

Weekly-Top-LLM-Papers
This repository provides a curated list of weekly published Large Language Model (LLM) papers. It includes top important LLM papers for each week, organized by month and year. The papers are categorized into different time periods, making it easy to find the most recent and relevant research in the field of LLM.

Awesome-LLM
Awesome-LLM is a curated list of resources related to large language models, focusing on papers, projects, frameworks, tools, tutorials, courses, opinions, and other useful resources in the field. It covers trending LLM projects, milestone papers, other papers, open LLM projects, LLM training frameworks, LLM evaluation frameworks, tools for deploying LLM, prompting libraries & tools, tutorials, courses, books, and opinions. The repository provides a comprehensive overview of the latest advancements and resources in the field of large language models.

speakeasy
Speakeasy is a tool that helps developers create production-quality SDKs, Terraform providers, documentation, and more from OpenAPI specifications. It supports a wide range of languages, including Go, Python, TypeScript, Java, and C#, and provides features such as automatic maintenance, type safety, and fault tolerance. Speakeasy also integrates with popular package managers like npm, PyPI, Maven, and Terraform Registry for easy distribution.

VideoLLaMA2
VideoLLaMA 2 is a project focused on advancing spatial-temporal modeling and audio understanding in video-LLMs. It provides tools for multi-choice video QA, open-ended video QA, and video captioning. The project offers model zoo with different configurations for visual encoder and language decoder. It includes training and evaluation guides, as well as inference capabilities for video and image processing. The project also features a demo setup for running a video-based Large Language Model web demonstration.
For similar tasks

Azure-Analytics-and-AI-Engagement
The Azure-Analytics-and-AI-Engagement repository provides packaged Industry Scenario DREAM Demos with ARM templates (Containing a demo web application, Power BI reports, Synapse resources, AML Notebooks etc.) that can be deployed in a customer’s subscription using the CAPE tool within a matter of few hours. Partners can also deploy DREAM Demos in their own subscriptions using DPoC.

sorrentum
Sorrentum is an open-source project that aims to combine open-source development, startups, and brilliant students to build machine learning, AI, and Web3 / DeFi protocols geared towards finance and economics. The project provides opportunities for internships, research assistantships, and development grants, as well as the chance to work on cutting-edge problems, learn about startups, write academic papers, and get internships and full-time positions at companies working on Sorrentum applications.

tidb
TiDB is an open-source distributed SQL database that supports Hybrid Transactional and Analytical Processing (HTAP) workloads. It is MySQL compatible and features horizontal scalability, strong consistency, and high availability.

zep-python
Zep is an open-source platform for building and deploying large language model (LLM) applications. It provides a suite of tools and services that make it easy to integrate LLMs into your applications, including chat history memory, embedding, vector search, and data enrichment. Zep is designed to be scalable, reliable, and easy to use, making it a great choice for developers who want to build LLM-powered applications quickly and easily.

telemetry-airflow
This repository codifies the Airflow cluster that is deployed at workflow.telemetry.mozilla.org (behind SSO) and commonly referred to as "WTMO" or simply "Airflow". Some links relevant to users and developers of WTMO: * The `dags` directory in this repository contains some custom DAG definitions * Many of the DAGs registered with WTMO don't live in this repository, but are instead generated from ETL task definitions in bigquery-etl * The Data SRE team maintains a WTMO Developer Guide (behind SSO)

mojo
Mojo is a new programming language that bridges the gap between research and production by combining Python syntax and ecosystem with systems programming and metaprogramming features. Mojo is still young, but it is designed to become a superset of Python over time.

pandas-ai
PandasAI is a Python library that makes it easy to ask questions to your data in natural language. It helps you to explore, clean, and analyze your data using generative AI.

databend
Databend is an open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake. With its focus on fast query execution and data ingestion, it's designed for complex analysis of the world's largest datasets.
For similar jobs

weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.