Awesome-LLM-Resources-List

A Curated Collection of LLM resources (work in progress).

Stars: 126


Awesome LLM Resources is a curated collection of resources for Large Language Models (LLMs) covering various aspects such as serverless hosting, accessing off-the-shelf models via API, local inference, LLM serving frameworks, open-source LLM web chat UIs, renting GPUs for fine-tuning, fine-tuning with no-code UI, fine-tuning frameworks, OS agentic/AI workflow, AI agents, co-pilots, voice API, open-source TTS models, OS RAG frameworks, research papers on chain-of-thought prompting, CoT implementations, CoT fine-tuned models & datasets, and more.

README:

🌟 Awesome LLM Resources

A Curated Collection of LLM resources. 💡✨

🌐 Updated: 12th of March 2025

'Serverless' Hosting of Private/OS Models

Platform/Tool	Rel.	Scale Down	OS 🔓	Start	GPU Machine	One-Click	Dev Exp.	Free-Tier
Beam.Cloud	2021	> 1 min		Helpers	❌	❌	👍	🆓 15h
Baseten	2019	> 15 min	🔴	Guide	❌	🟡	👍	$30
Modal	2021	< 1 min	🔴	Helpers	❌	❌	👍	$30/m
HF Endpoints	2023	> 15 min	🔴	None Needed	❌	✅	😓	❌
Replicate	2019	< 1 min	🔴	Guide	✅	🟡	🤷	❌
Sagemaker (Serverless)	2017	N/A	🔴	N/A	🔵	❌	❌	300,000s
Lambda w/ EFS (AWS)	2014	< 1 min	🔴	Guide	🟡	❌	❌	✅
RunPod Serverless	2022	> 30s	🔴	N/A	🟡	❌	🤷	❌
BentoML	2019	> 5 min		Gallery	🟡	🟡	👍	🆓 $10

Note: these platforms can usually do more than LLM serving.

Access Off-the-Shelf OS Models (via API)

Platform/Tool	Released	GitHub
Together.ai	N/A	🔴
Fireworks.ai	N/A
Replicate	2019
Groq	N/A
DeepInfra	N/A
Bedrock	N/A
Lepton	N/A
Fal.ai	N/A
VertexAI	N/A

Local Inference

Framework	Browser Chat 🖥️	Organization	Open Source
Llama.cpp	❌	ggerganov
Ollama	❌	Ollama
gpt4all	✅	Nomic.ai
LMStudio	✅	LMStudio AI	🔴
OpenLLM	✅	BentoML

LLM Serving Frameworks

Framework	Open Source	GitHub
vLLM
OpenLLM
TGI (Text Generation Inference)
TensorRT LLM
Ray Serve
LMDeploy
Ollama
MLC-LLM

Building Open-Source LLM Web Chat UIs

Tool	Organization	Description
Text Generation WebUI	oobabooga	A Gradio web UI for Large Language Models.
Jan AI	Jan HQ	An open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM).
AnythingLLM	Mintplex Labs	The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Superagent	Superagent AI	Allows developers to add powerful AI assistants to their applications using LLMs and RAG.
Bionic-GPT	Bionic GPT	A ChatGPT replacement offering generative AI advantages while maintaining strict data confidentiality.
Open WebUI	Open WebUI	A user-friendly web interface for interacting with Large Language Models (LLMs).
Xyne	xynehq	A sleek, minimal web chat interface for interacting with Large Language Models.
Assistant UI	assistant-ui	An open-source ChatGPT-like interface with a clean and responsive design.
Scira	zaidmukaddam	An AI-powered search interface that leverages LLMs for intelligent search results.
Onyx	onyx-dot-app	A customizable and extendable web chat UI for interacting with large language models.
NextChat	ChatGPTNextWeb	A Next.js-based, open-source ChatGPT clone for seamless web interaction.

Rent GPUs (Fine-Tuning, Deploying, Training)

Platform	Templates	Beginner Friendly
Brev.dev	Fine-tuning	❌
Modal	Fine-tuning	❌
Hyperbolic AI	None	❌
RunPod	None	❌
Paperspace	Fine-tuning	✅
Colab	Small models only	✅

Fine-Tuning with No-Code UI

Tool	Beginner Friendly	Open Source	GitHub
Together.ai	✅	❌	N/A
Hugging Face AutoTrain	✅	❌
AutoML	❌	✅
LLaMA-Factory	❌	✅
H2O LLM Studio	✅	✅

Fine-Tuning Frameworks

Framework	Open Source	GitHub
Axolotl
Unsloth

OS Agentic/AI Workflow

Framework	Beginner Friendly	Released
LangChain	✅	2022
LlamaIndex	❌	2023
Swarms	❌	2023
CrewAI	✅	2023
Autogen	✅	2023
AutoChain	❌	2023
SuperAGI	❌	2023
AILegion	❌	2023
MemGPT (Letta)	❌	2023
uAgents	❌	2023
AGiXT	❌	2023
Dify	✅	2024
TaskingAI	✅	2024
Bee Agent Framework	❌	2024
Swarms	❌	2024
IoA	❌	2024
Atomic Agents	❌	2024
Upsonic	❌	2024
Parlant	❌	2024
Rig	❌	2024
smolagents	✅	2023
eliza	✅	2024

Top Agentic Frameworks

Framework	Beginner Friendly	Released
LangGraph	❌	2023
Flowise	✅	2023
Langroid	❌	2023
smolagents	✅	2023
Semantic Kernel	✅	2023
PydanticAI	❌	2024
Mastra	❌	2025

Visual AI Agent Builders

Tool	Organization	Description	Open Source
Rivet	Ironclad	A visual builder to design and deploy AI agent workflows.
PySpur	PySpur-Dev	A tool to build and visualize AI agents seamlessly.
Flowise	FlowiseAI	A no‑code, visual platform for designing AI agent workflows.
Agno		❌	2023

Agentic Tools (for “building”)

Tool	Organization	Description
browser-use	browser-use	Integrates browser functionalities into agentic workflows.
code2prompt	mufeedvh	Converts code snippets into actionable prompts for development.
note-gen	codexu	Automatically generates notes and documentation from your code.
refly	refly-ai	Automates code refactoring and prompt generation tasks.
potpie	potpie-ai	A toolkit for prototyping and building AI agent pipelines.
AgentStack	AgentOps-AI	A comprehensive stack for constructing and deploying AI agents.
browser	lightpanda-io	A browser‑based tool designed for integrating agentic functionalities.
Memary	kingjulio8238	A memory module for retaining context in agent workflows.
open-canvas	langchain-ai	A visual interface for designing agent workflows with LangChain.
agent-service-toolkit	JoshuaC215	A toolkit for building and deploying agent-based services.

Virtual Brains

Tool	Organization	Description	Open Source	GitHub
Leon	leon-ai	An open‑source personal assistant and automation platform powered by AI.
Khoj	khoj-ai	A virtual brain for organizing and retrieving your knowledge using AI.

AI Agents

Framework	Organization	Open Source	Released
GPT Engineer	GPT Engineer Org		2023
XAgent	OpenBMB		2023
Bolt.new	StackBlitz		2023
Goose	Block		2023
AI Hedge Fund	virattt		2023
FinRobot	AI4Finance Foundation		2024
STORM	Stanford OVAL		2024
Multion	MULTI-ON	🔴	N/A
Minion	Minion AI	🔴	N/A

Co-Pilots

Framework	Open Source	GitHub
Aider
Cursor
Continue

Voice API

Framework	Open Source	GitHub
VAPI.ai	🔴
Bland.ai	🔴	N/A
CallAnnie	🔴	N/A
RealtimeTTS
RealtimeSTT
Coqui TTS

Open Source TTS Models

Model	License	Stars/Likes	Downloads (Last Month)	Repository
Kokoro-82M	Apache 2.0	⭐ 3.16k (HF)	📥 557,392	Hugging Face
Zonos-v0.1-transformer	Apache 2.0	⭐ 249 (HF)	📥 24,240	Hugging Face
XTTS-v2	Non-Commercial	❤️ 368 (HF)	📥 2,545,850	Hugging Face
ChatTTS	AGPL-3.0	N/A	N/A	GitHub
MeloTTS	MIT	N/A	N/A	GitHub

For more TTS models and rankings, check out the TTS Leaderboard.

LLM Application Frameworks

Tool	Organization	Description
Eino	CloudWeGo	A lightweight LLM application framework for scalable AI solutions.
Conversation Knowledge Mining Solution Accelerator	Microsoft	A solution accelerator for integrating conversation intelligence and knowledge mining using LLMs.
Olmocr	AllenAI	An OCR framework optimized for integration with language models.
PDFMathTranslate	Byaidu	A tool for converting and translating mathematical content in PDFs using LLMs.
Podcastfy	souzatharsis	A tool to generate podcasts from written content using LLMs.
Pandas AI	sinaptik-ai	Brings LLM-powered analytics to pandas dataframes.
Ramalama	containers	An LLM application framework for containerized deployment of AI solutions.
Robyn	facebookexperimental	A scalable framework for building LLM applications from Facebook Experimental.
ExtractThinker	enoch3712	A tool for extracting and synthesizing insights from textual data using LLMs.

OS RAG Frameworks

Framework	Organization	Released
Haystack	deepset.ai	2023
RAGflow	Infiniflow	2024
txtai	Neuml	2022
LLM App	Pathway	2023
Cognita	Truefoundry	2024
R2R	SciPhi AI	2024
Raptor	Parth Sarthi	2024
LightRAG	HKUDS	2023
PIKE-RAG	Microsoft	2024
KAG	OpenSPG	2024
MemoRAG	qhjqhj00	2023

See RAG_Techniques if you get stuck (not always needed)

AI Tools (for “using”)

Tool	Organization	Description
magic-resume	JOYCEQL	An AI-powered tool for generating resumes.
VideoCaptioner	WEIFENG2333	An AI tool for automatically generating video captions.
DeepSeekAI	DeepLifeStudio	Browser extension for invoking the DeepSeek AI large model.
logocreator	Nutlope	A tool for creating logos using AI.
blinkshot	Nutlope	An AI-powered tool for capturing and enhancing screenshots.
pollinations	pollinations	A tool for generating creative images and artwork using AI.
PromptWizard	microsoft	A tool to generate, manage, and optimize prompts for AI models.
Open-Interface	AmberSahdev	Control Any Computer Using LLMs.
wut	shobrook	LLM for the terminal

Training/Optimization

Tool	Organization	Description
transformerlab-app	transformerlab	An application for training and optimizing transformer models.
fluxgym	cocktailpeanut	A gym environment for reinforcement learning training and optimization.
AutoGPTQ	AutoGPTQ	A tool for automating GPT quantization and optimization.

AI Models

Tool	Organization	Description
WALDO	stephansturges	An AI model for visual reasoning and object detection.
Janus	deepseek-ai	A multi-modal AI model for advanced data processing.
ModernBERT	AnswerDotAI	A modernized version of BERT for natural language processing tasks.
Magma	microsoft	A scalable AI model for large-scale data analysis.
Cosmos-Nemotron	NVlabs	An AI model for advanced image and video processing.
Paints-UNDO	lllyasviel	An interactive AI model for image generation and editing.

Monitoring / Evaluation

Tool	Organization	Description
helicone	Helicone	A platform for monitoring and analyzing AI model performance.
langwatch	langwatch	A tool for monitoring outputs and performance of language models.
shortest	antiwork	A tool for evaluating and optimizing AI-generated content.
deepeval	confident-ai	A framework for deep evaluation of AI models.

Infrastructure

Tool	Organization	Description	Open Source	GitHub
gpustack	gpustack	A toolkit for managing GPU infrastructure for AI workloads.
harbor	av	A repository for containerized AI infrastructure management.

Research Papers on Chain-of-Thought Prompting

Publication Date	Title	🔗	Authors	Organization	Technique
January 28, 2022	Chain-of-Thought Prompting Elicits Reasoning in Large Language Models	🔗	Jason Wei, et al.	Google Research	CoT Prompting
March 21, 2022	Self-Consistency Improves Chain of Thought Reasoning in Language Models	🔗	Xuezhi Wang et al.	Google Research	CoT with Self-Consistency
May 21, 2022	Least-to-Most Prompting Enables Complex Reasoning in Large Language Models	🔗	Denny Zhou et al.	Google Research	Least-to-Most Prompting
May 21, 2022	Large Language Models are Zero-Shot Reasoners	🔗	Takeshi Kojima, et al.	The University of Tokyo, Google Research	Zero-shot-CoT
October 6, 2022	ReAct: Synergizing Reasoning and Acting in Language Models	🔗	Shunyu Yao et al.	Princeton University	ReAct
April 1, 2023	Teaching Large Language Models to Self-Debug	🔗	Xinyun Chen, et al.	Google DeepMind, UC Berkeley	Self-Debugging
May 6, 2023	Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models	🔗	Lei Wang et al.	Singapore Management University, Southwest Jiaotong University, Singapore University of Technology and Design, East China Normal University	Plan-and-Solve Prompting
May 23, 2023	Let’s Verify Step by Step	🔗	Hunter Lightman, et al.	OpenAI	Verification for CoT
October 3, 2023	Large Language Models Cannot Self-Correct Reasoning Yet	🔗	Jie Huang, et al.	Google DeepMind, University of Illinois Urbana-Champaign	Self-Correction in LLMs
November 2023	Universal Self-Consistency for Large Language Model Generation	🔗	Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou	DeepMind	Universal Self-Consistency
May 17, 2023	Tree of Thoughts: Deliberate Problem Solving with Large Language Models	🔗	Shunyu Yao, et al.	Princeton University, DeepMind	Tree-of-Thought
February 15, 2024	Chain-of-Thought Reasoning Without Prompting	🔗	Xuezhi Wang, Denny Zhou	DeepMind	Chain-of-Thought Decoding
March 21, 2024	ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting	🔗	Xiaoxue Cheng et al.	Renmin University of China	CoTGenius
June 2024	Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models	🔗	Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang		Language Agent Tree Search (LATS)
May 2024	Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning	🔗	Yuxi Xie, et al.	National University of Singapore, DeepMind	MCTS
September 18, 2024	To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning	🔗	Zayne Sprague, et al.	The University of Texas at Austin, Johns Hopkins University, Princeton University	Meta-analysis of CoT
September 25, 2024	Chain-of-Thoughtlessness? An Analysis of CoT in Planning	🔗	Kaya Stechly, et al.	Arizona State University	Analysis of CoT in Planning
October 18, 2024	Supervised Chain of Thought	🔗	Xiang Zhang, Dujian Ding	University of British Columbia	Supervised Chain of Thought
October 24, 2024	A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration	🔗	Yingqian Cui, et al.	Amazon, Michigan State University	Theoretical Analysis of CoT
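
Two of the techniques named in the table above, Zero-shot-CoT and CoT with Self-Consistency, are small enough to sketch in a few lines. The snippet below is only an illustration, not the papers' reference code: the function names, the "Answer:" marker used to parse a completion, and the hard-coded sample completions (standing in for several outputs sampled from a model) are all assumptions made for the example.

```python
from collections import Counter


def zero_shot_cot_prompt(question: str) -> str:
    """Build a Zero-shot-CoT prompt by appending the trigger phrase."""
    return f"Q: {question}\nA: Let's think step by step."


def extract_answer(completion: str) -> str:
    """Hypothetical parser: keep the text after the last 'Answer:' marker."""
    return completion.rsplit("Answer:", 1)[-1].strip()


def self_consistency_vote(completions: list[str]) -> str:
    """Majority vote over the final answers of several reasoning paths."""
    answers = [extract_answer(c) for c in completions]
    return Counter(answers).most_common(1)[0][0]


# Stand-in for completions sampled from a model (assumed, not real output).
sampled = [
    "3 + 4 = 7, then 7 * 2 = 14. Answer: 14",
    "Double 3 is 6, double 4 is 8, total 14. Answer: 14",
    "3 + 4 * 2 = 11. Answer: 11",
]

prompt = zero_shot_cot_prompt("What is (3 + 4) * 2?")
best = self_consistency_vote(sampled)
print(prompt)
print(best)
```

In practice the completions would come from sampling the same prompt several times at a non-zero temperature; the vote then keeps the answer that most reasoning paths agree on.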

CoT Implementations

Implementation	Link	Author
CoT	chain-of-thought-hub	Franx Yao
CoT	optillm	Codelion
CoT	auto-cot	Amazon Science
CoT	g1	BKlieger Groq
Decoding CoT	optillm/cot_decoding.py	Codelion
Tree of Thoughts	tree-of-thought-llm	Princeton NLP
Tree of Thoughts	tree-of-thoughts	Kye Gomez
Tree of Thoughts	saplings	Shobrook
MCTS	optillm/mcts.py	Codelion
Graph of Thoughts	graph-of-thoughts	SPCL
Other	CPO	SAIL SG
Other	Everything-of-Thoughts-XoT	Microsoft

CoT Fine-Tuned Models & Datasets

Models

Model Name	Author	Size	Link
CoT-T5-3B	KAIST AI	3B	🔗
CoT-T5-11B	KAIST AI	11B	🔗
Llama-3.2V-11B-cot	Xkev	11B	🔗
Llama-3.1-8B-Instruct-Reasoner-1o1_v0.3	Lyte	8B	🔗

Datasets

Dataset Name	Author	Data Size	Likes	Link
chain-of-thought-sharegpt	Isaiah Bjork	7.14k rows	🌟 8	🔗
CoT-Collection	KAIST AI	1.84 million rows	🌟 122	🔗
Reasoner-1o1-v0.3-HQ	Lyte	370 rows	🌟 7	🔗
OpenLongCoT-Pretrain	qq8933	103k rows	🌟 86	🔗

Learning Resources

Tool	Organization	Description
awesome-cursorrules	PatrickJS	A curated list of resources and guides on cursorrules.
ai-engineering-hub	patchy631	A hub of AI engineering learning resources, tutorials, and best practices.
GenAI_Agents	NirDiamant	Resources and examples for building Generative AI Agents.
learn-agentic-ai	panaversity	Learning materials for understanding and building agentic AI.
awesome-generative-ai	steven2358	A curated list of generative AI resources and projects.
awesome-mcp-servers	punkpeye	A curated collection of awesome MCP servers resources.
GenAI-Showcase	mongodb-developer	A showcase of innovative Generative AI projects.
well-architected-iac-analyzer	aws-samples	A tool to analyze and ensure well-architected Infrastructure as Code practices.
llama-cookbook	meta-llama	A collection of recipes and guides for working with LLaMA models.
optillm	codelion	Resources for optimizing LLM usage and performance.
cursor.directory	pontusab	A directory of tools and resources related to cursor-based workflows.
GenAI_Agents	NirDiamant	A curated collection of generative AI agents and related tools.

