AI-windows-whl

Pre-compiled Python whl for Flash-attention, SageAttention, NATTEN, xFormer etc

Stars: 147

Visit

AI-windows-whl is a curated collection of pre-compiled Python wheels for difficult-to-install AI/ML libraries on Windows. It addresses the common pain point of building complex Python packages from source on Windows by providing direct links to pre-compiled `.whl` files for essential libraries like PyTorch, Flash Attention, xformers, SageAttention, NATTEN, Triton, bitsandbytes, and other packages. The goal is to save time for AI enthusiasts and developers on Windows, allowing them to focus on creating amazing things with AI.

README:

Windows AI Wheels

A curated collection of pre-compiled Python wheels for difficult-to-install AI/ML libraries on Windows.

Report a Broken Link · Request a New Wheel

Table of Contents

About The Project
Getting Started
- Prerequisites
- Installation
Available Wheels

About The Project

This repository was created to address a common pain point for AI enthusiasts and developers on the Windows platform: building complex Python packages from source. Libraries like flash-attention, xformers are essential for high-performance AI tasks but often lack official pre-built wheels for Windows, forcing users into a complicated and error-prone compilation process.

The goal here is to provide a centralized, up-to-date collection of direct links to pre-compiled .whl files for these libraries, primarily for the ComfyUI community and other PyTorch users on Windows. This saves you time and lets you focus on what's important: creating amazing things with AI.

(back to top)

Getting Started

Follow these simple steps to use the wheels from this repository.

Prerequisites

Python for Windows: Ensure you have a compatible Python version installed (PyTorch currently supports Python 3.9 - 3.12 on Windows). You can get it from the official Python website.

Installation

To install a wheel, use pip with the direct URL to the .whl file. Make sure to enclose the URL in quotes.

# Example of installing a specific flash-attention wheel
pip install "https://huggingface.co/lldacing/flash-attention-windows-wheel/blob/main/flash_attn-2.7.4.post1+cu128torch2.7.0cxx11abiFALSE-cp312-cp312-win_amd64.whl"

[!TIP] Find the package you need in the Available Wheels section below, find the row that matches your environment (Python, PyTorch, CUDA version), and copy the link for the pip install command.

(back to top)

Available Wheels

Here is the list of tracked packages.

PyTorch

The foundation of everything. Install this first from the official source.

Official Install Page: https://pytorch.org/get-started/locally/

For convenience, here are direct installation commands for specific versions on Linux/WSL with an NVIDIA GPU. For other configurations (CPU, macOS, ROCm), please use the official install page.

Stable Version (2.8.0)

This is the recommended version for most users.

CUDA Version	Pip Install Command
CUDA 12.9	`pip install torch torchvision --index-url https://download.pytorch.org/whl/cu129`
CUDA 12.8	`pip install torch torchvision --index-url https://download.pytorch.org/whl/cu128`
CUDA 12.6	`pip install torch torchvision --index-url https://download.pytorch.org/whl/cu126`
CPU only	`pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu`

Previous Stable Version (2.7.1)

CUDA Version	Pip Install Command
CUDA 12.8	`pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu128`
CUDA 12.6	`pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu126`
CUDA 11.8	`pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu118`
CPU only	`pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cpu`

Nightly Versions

Use these for access to the latest features, but expect potential instability.

PyTorch 2.9 (Nightly)

CUDA Version	Pip Install Command
CUDA 12.9	`pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu130`
CUDA 12.8	`pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu128`
CUDA 12.6	`pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu126`

Torchaudio

Package Version	PyTorch Ver	CUDA Ver	Download Link
`2.8.0`	`2.9.0`	`12.8`	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

Flash Attention

High-performance attention implementation.

Official Repo: Dao-AILab/flash-attention
Pre-built Sources: lldacing's HF, Wildminder's HF, mjun0812 GitHub

Package Version	PyTorch Ver	Python Ver	CUDA Ver	CXX11 ABI	Download Link
`2.8.3`	`2.9.0`	`3.12`	`12.8`	✓	Link
`2.8.3`	`2.8.0`	`3.12`	`12.8`	✓	Link
`2.8.2`	`2.9.0`	`3.12`	`12.8`	✓	Link
`2.8.2`	`2.8.0`	`3.10`	`12.8`	✓	Link
`2.8.2`	`2.8.0`	`3.11`	`12.8`	✓	Link
`2.8.2`	`2.8.0`	`3.12`	`12.8`	✓	Link
`2.8.2`	`2.7.0`	`3.10`	`12.8`	✗	Link
`2.8.2`	`2.7.0`	`3.11`	`12.8`	✗	Link
`2.8.2`	`2.7.0`	`3.12`	`12.8`	✗	Link
`2.8.1`	`2.8.0`	`3.12`	`12.8`	✓	Link
`2.8.0.post2`	`2.8.0`	`3.12`	`12.8`	✓	Link
`2.7.4.post1`	`2.8.0`	`3.10`	`12.8`	✓	Link
`2.7.4.post1`	`2.8.0`	`3.12`	`12.8`	✓	Link
`2.7.4.post1`	`2.7.0`	`3.10`	`12.8`	✗	Link
`2.7.4.post1`	`2.7.0`	`3.11`	`12.8`	✗	Link
`2.7.4.post1`	`2.7.0`	`3.12`	`12.8`	✗	Link
`2.7.4`	`2.8.0`	`3.10`	`12.8`	✓	Link
`2.7.4`	`2.8.0`	`3.11`	`12.8`	✓	Link
`2.7.4`	`2.8.0`	`3.12`	`12.8`	✓	Link
`2.7.4`	`2.7.0`	`3.10`	`12.8`	✗	Link
`2.7.4`	`2.7.0`	`3.11`	`12.8`	✗	Link
`2.7.4`	`2.7.0`	`3.12`	`12.8`	✗	Link
`2.7.4`	`2.6.0`	`3.10`	`12.6`	✗	Link
`2.7.4`	`2.6.0`	`3.11`	`12.6`	✗	Link
`2.7.4`	`2.6.0`	`3.12`	`12.6`	✗	Link
`2.7.4`	`2.6.0`	`3.10`	`12.4`	✗	Link
`2.7.4`	`2.6.0`	`3.11`	`12.4`	✗	Link
`2.7.4`	`2.6.0`	`3.12`	`12.4`	✗	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

xformers

Another library for memory-efficient attention and other optimizations.

Official Repo: facebookresearch/xformers
PyTorch Pre-built Index: https://download.pytorch.org/whl/xformers/

[!NOTE] PyTorch provides official pre-built wheels for xformers. You can often install it with pip install xformers if you installed PyTorch correctly. If that fails, find your matching wheel at the index link above.

ABI3 version, any Python 3.9-3.12

Package Version	PyTorch Ver	CUDA Ver	Download Link
`0.0.32.post2`	`2.8.0`	`12.8`	Link
`0.0.32.post2`	`2.8.0`	`12.9`	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

SageAttention

Official Repo: thu-ml/SageAttention
Pre-built Sources: woct0rdho's Releases, Wildminder's HF

Package Version	PyTorch Ver	Python Ver	CUDA Ver	Download Link
`2.1.1`	`2.5.1`	`3.9`	`12.4`	Link
`2.1.1`	`2.5.1`	`3.10`	`12.4`	Link
`2.1.1`	`2.5.1`	`3.11`	`12.4`	Link
`2.1.1`	`2.5.1`	`3.12`	`12.4`	Link
`2.1.1`	`2.6.0`	`3.9`	`12.6`	Link
`2.1.1`	`2.6.0`	`3.10`	`12.6`	Link
`2.1.1`	`2.6.0`	`3.11`	`12.6`	Link
`2.1.1`	`2.6.0`	`3.12`	`12.6`	Link
`2.1.1`	`2.6.0`	`3.12`	`12.6`	Link
`2.1.1`	`2.6.0`	`3.13`	`12.6`	Link
`2.1.1`	`2.7.0`	`3.10`	`12.8`	Link
`2.1.1`	`2.8.0`	`3.12`	`12.8`	Link

◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇

SageAttention 2.2 (SageAttention2++)

[!NOTE] Only supports CUDA >= 12.8, therefore PyTorch >= 2.7.

Package Version	PyTorch Ver	Python Ver	CUDA Ver	Download Link
`2.2.0.post2`	`2.5.1`	`>3.9`	`12.4`	Link
`2.2.0.post2`	`2.6.0`	`>3.9`	`12.6`	Link
`2.2.0.post2`	`2.7.1`	`>3.9`	`12.8`	Link
`2.2.0.post2`	`2.8.0`	`>3.9`	`12.8`	Link
`2.2.0.post2`	`2.9.0`	`>3.9`	`12.8`	Link
`2.2.0`	`2.7.1`	`3.9`	`12.8`	Link
`2.2.0`	`2.7.1`	`3.10`	`12.8`	Link
`2.2.0`	`2.7.1`	`3.11`	`12.8`	Link
`2.2.0`	`2.7.1`	`3.12`	`12.8`	Link
`2.2.0`	`2.7.1`	`3.13`	`12.8`	Link
`2.2.0`	`2.8.0`	`3.9`	`12.8`	Link
`2.2.0`	`2.8.0`	`3.10`	`12.8`	Link
`2.2.0`	`2.8.0`	`3.11`	`12.8`	Link
`2.2.0`	`2.8.0`	`3.12`	`12.8`	Link
`2.2.0`	`2.8.0`	`3.13`	`12.8`	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

SpargeAttn

Official Repo: thu-ml/SpargeAttn
Pre-built Sources: woct0rdho's Releases

Package Version	PyTorch Ver	CUDA Ver	Download Link
`0.1.0.post1`	`2.7.1`	`12.8`	Link
`0.1.0.post1`	`2.8.0`	`12.8`	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

Nunchaku

Official Repo: : mit-han-lab/nunchaku

Package Version	PyTorch Ver	Python Ver	Download Link
`1.0.0`	`2.5`	`3.10`	Link
`1.0.0`	`2.5`	`3.11`	Link
`1.0.0`	`2.5`	`3.12`	Link
`1.0.0`	`2.6`	`3.10`	Link
`1.0.0`	`2.6`	`3.11`	Link
`1.0.0`	`2.6`	`3.12`	Link
`1.0.0`	`2.6`	`3.13`	Link
`1.0.0`	`2.7`	`3.10`	Link
`1.0.0`	`2.7`	`3.11`	Link
`1.0.0`	`2.7`	`3.12`	Link
`1.0.0`	`2.7`	`3.13`	Link
`1.0.0`	`2.8`	`3.10`	Link
`1.0.0`	`2.8`	`3.11`	Link
`1.0.0`	`2.8`	`3.12`	Link
`1.0.0`	`2.8`	`3.13`	Link
`1.0.0`	`2.9`	`3.10`	Link
`1.0.0`	`2.9`	`3.11`	Link
`1.0.0`	`2.9`	`3.12`	Link
`1.0.0`	`2.9`	`3.13`	Link
`0.3.2`	`2.5`	`3.10`	Link
`0.3.2`	`2.5`	`3.11`	Link
`0.3.2`	`2.5`	`3.12`	Link
`0.3.2`	`2.6`	`3.10`	Link
`0.3.2`	`2.6`	`3.11`	Link
`0.3.2`	`2.6`	`3.12`	Link
`0.3.2`	`2.7`	`3.10`	Link
`0.3.2`	`2.7`	`3.11`	Link
`0.3.2`	`2.7`	`3.12`	Link
`0.3.2`	`2.8`	`3.10`	Link
`0.3.2`	`2.8`	`3.11`	Link
`0.3.2`	`2.8`	`3.12`	Link
`0.3.2`	`2.9`	`3.12`	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

NATTEN

Neighborhood Attention Transformer.

Official Repo: SHI-Labs/NATTEN
Pre-built Source: lldacing's HF

Package Version	PyTorch Ver	Python Ver	CUDA Ver	Download Link
`0.17.5`	`2.6.0`	`3.10`	`12.6`	Link
`0.17.5`	`2.6.0`	`3.11`	`12.6`	Link
`0.17.5`	`2.6.0`	`3.12`	`12.6`	Link
`0.17.5`	`2.7.0`	`3.10`	`12.8`	Link
`0.17.5`	`2.7.0`	`3.11`	`12.8`	Link
`0.17.5`	`2.7.0`	`3.12`	`12.8`	Link
`0.17.3`	`2.4.0`	`3.10`	`12.4`	Link
`0.17.3`	`2.4.0`	`3.11`	`12.4`	Link
`0.17.3`	`2.4.0`	`3.12`	`12.4`	Link
`0.17.3`	`2.4.1`	`3.10`	`12.4`	Link
`0.17.3`	`2.4.1`	`3.11`	`12.4`	Link
`0.17.3`	`2.4.1`	`3.12`	`12.4`	Link
`0.17.3`	`2.5.0`	`3.10`	`12.4`	Link
`0.17.3`	`2.5.0`	`3.11`	`12.4`	Link
`0.17.3`	`2.5.0`	`3.12`	`12.4`	Link
`0.17.3`	`2.5.1`	`3.10`	`12.4`	Link
`0.17.3`	`2.5.1`	`3.11`	`12.4`	Link
`0.17.3`	`2.5.1`	`3.12`	`12.4`	Link

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

Triton (Windows Fork)

Triton is a language and compiler for writing highly efficient custom deep-learning primitives. Not officially supported on Windows, but a fork provides pre-built wheels.

Windows Fork: woct0rdho/triton-windows
Installation: pip install -U "triton-windows<3.5"

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

bitsandbytes

A lightweight wrapper around CUDA custom functions, particularly for 8-bit optimizers, matrix multiplication (LLM.int8()), and quantization functions.

Official Repo: bitsandbytes-foundation/bitsandbytes

▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲

RadialAttention for ComfyUI

Nodes: ComfyUI-RadialAttn

(back to top)

▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

Accessing Data Programmatically (wheels.json)

All wheel information in this repository is managed in the wheels.json file, which serves as the single source of truth. The tables in this README are automatically generated from this file.

This provides a stable, structured JSON endpoint for any external tool or application that needs to access this data without parsing Markdown.

How to Use

You can access the raw JSON file directly via the following URL:

https://raw.githubusercontent.com/wildminder/AI-windows-whl/main/wheels.json

Example using curl:

curl -L -o wheels.json https://raw.githubusercontent.com/wildminder/AI-windows-whl/main/wheels.json

The file contains a list of packages, each with its metadata and an array of wheels, where each wheel object contains version details and a direct download url.

(back to top)

▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have found a new pre-built wheel or a reliable source, please fork the repo and create a pull request, or simply open an issue with the link.

(back to top)

Acknowledgments

This repository is simply a collection of links. Huge thanks to the individuals and groups who do the hard work of building and hosting these wheels for the community:

For Tasks:

Click tags to check more tools for each tasks

install libraries compile packages access pre-built wheels save time on setup focus on ai tasks

For Jobs:

data scientist machine learning engineer ai researcher python developer software engineer

Alternative AI tools for AI-windows-whl

Similar Open Source Tools

AI-windows-whl

github

: 147

we-mp-rss

We-MP-RSS is a tool for subscribing to and managing WeChat official account content, providing RSS subscription functionality. It allows users to fetch and parse WeChat official account content, generate RSS feeds, manage subscriptions via a user-friendly web interface, automatically update content on a schedule, support multiple databases (default SQLite, optional MySQL), various fetching methods, multiple RSS clients, and expiration reminders for authorizations.

github

: 1.1k

search2ai

S2A allows your large model API to support networking, searching, news, and web page summarization. It currently supports OpenAI, Gemini, and Moonshot (non-streaming). The large model will determine whether to connect to the network based on your input, and it will not connect to the network for searching every time. You don't need to install any plugins or replace keys. You can directly replace the custom address in your commonly used third-party client. You can also deploy it yourself, which will not affect other functions you use, such as drawing and voice.

github

: 1.1k

apidash

API Dash is an open-source cross-platform API Client that allows users to easily create and customize API requests, visually inspect responses, and generate API integration code. It supports various HTTP methods, GraphQL requests, and multimedia API responses. Users can organize requests in collections, preview data in different formats, and generate code for multiple languages. The tool also offers dark mode support, data persistence, and various customization options.

github

: 2.4k

devops-gpt

DevOpsGPT is a revolutionary tool designed to streamline your workflow and empower you to build systems and automate tasks with ease. Tired of spending hours on repetitive DevOps tasks? DevOpsGPT is here to help! Whether you're setting up infrastructure, speeding up deployments, or tackling any other DevOps challenge, our app can make your life easier and more productive. With DevOpsGPT, you can expect faster task completion, simplified workflows, and increased efficiency. Ready to experience the DevOpsGPT difference? Visit our website, sign in or create an account, start exploring the features, and share your feedback to help us improve. DevOpsGPT will become an essential tool in your DevOps toolkit.

github

: 52

xiaomi_airpurifier

This repository contains a custom component for Home Assistant that integrates various Xiaomi Mi Air Purifier and Xiaomi Mi Air Humidifier models. It provides detailed support for different devices, including power control, preset modes, child lock, LED control, favorite level adjustment, and various attributes monitoring. The custom component offers a more extensive range of supported devices compared to the official Home Assistant component, with additional features and device compatibility. Users can easily set up and configure their Xiaomi air purifiers and humidifiers within Home Assistant for enhanced control and monitoring.

github

: 446

xiaogpt

xiaogpt is a tool that allows you to play ChatGPT and other LLMs with Xiaomi AI Speaker. It supports ChatGPT, New Bing, ChatGLM, Gemini, Doubao, and Tongyi Qianwen. You can use it to ask questions, get answers, and have conversations with AI assistants. xiaogpt is easy to use and can be set up in a few minutes. It is a great way to experience the power of AI and have fun with your Xiaomi AI Speaker.

github

: 6.5k

OneClickLLAMA

OneClickLLAMA is a tool designed to run local LLM models such as Qwen2.5 and SakuraLLM with ease. It can be used in conjunction with various OpenAI format translators and analyzers, including LinguaGacha and KeywordGacha. By following the setup guides provided on the page, users can optimize performance and achieve a 3-5 times speed improvement compared to default settings. The tool requires a minimum of 8GB dedicated graphics memory, preferably NVIDIA, and the latest version of graphics drivers installed. Users can download the tool from the release page, choose the appropriate model based on usage and memory size, and start the tool by selecting the corresponding launch script.

github

: 175

ChatTTS-Forge

ChatTTS-Forge is a powerful text-to-speech generation tool that supports generating rich audio long texts using a SSML-like syntax and provides comprehensive API services, suitable for various scenarios. It offers features such as batch generation, support for generating super long texts, style prompt injection, full API services, user-friendly debugging GUI, OpenAI-style API, Google-style API, support for SSML-like syntax, speaker management, style management, independent refine API, text normalization optimized for ChatTTS, and automatic detection and processing of markdown format text. The tool can be experienced and deployed online through HuggingFace Spaces, launched with one click on Colab, deployed using containers, or locally deployed after cloning the project, preparing models, and installing necessary dependencies.

github

: 692

beet

Beet is a collection of crates for authoring and running web pages, games and AI behaviors. It includes crates like `beet_flow` for scenes-as-control-flow bevy library, `beet_spatial` for spatial behaviors, `beet_ml` for machine learning, `beet_sim` for simulation tooling, `beet_rsx` for authoring tools for html and bevy, and `beet_router` for file-based router for web docs. The `beet` crate acts as a base crate that re-exports sub-crates based on feature flags, similar to the `bevy` crate structure.

github

: 80

no-cost-ai

No-cost-ai is a repository dedicated to providing a comprehensive list of free AI models and tools for developers, researchers, and curious builders. It serves as a living index for accessing state-of-the-art AI models without any cost. The repository includes information on various AI applications such as chat interfaces, media generation, voice and music tools, AI IDEs, and developer APIs and platforms. Users can find links to free models, their limits, and usage instructions. Contributions to the repository are welcome, and users are advised to use the listed services at their own risk due to potential changes in models, limitations, and reliability of free services.

github

: 74

AI-Guide-and-Demos-zh_CN

This is a Chinese AI/LLM introductory project that aims to help students overcome the initial difficulties of accessing foreign large models' APIs. The project uses the OpenAI SDK to provide a more compatible learning experience. It covers topics such as AI video summarization, LLM fine-tuning, and AI image generation. The project also offers a CodePlayground for easy setup and one-line script execution to experience the charm of AI. It includes guides on API usage, LLM configuration, building AI applications with Gradio, customizing prompts for better model performance, understanding LoRA, and more.

github

: 2.9k

hume-api-examples

This repository contains examples of how to use the Hume API with different frameworks and languages. It includes examples for Empathic Voice Interface (EVI) and Expression Measurement API. The EVI examples cover custom language models, modal, Next.js integration, Vue integration, Hume Python SDK, and React integration. The Expression Measurement API examples include models for face, language, burst, and speech, with implementations in Python and Typescript using frameworks like Next.js.

github

: 164

dbhub

DBHub is a universal database gateway that implements the Model Context Protocol (MCP) server interface. It allows MCP-compatible clients to connect to and explore different databases. The gateway supports various database resources and tools, providing capabilities such as executing queries, listing connectors, generating SQL, and explaining database elements. Users can easily configure their database connections and choose between different transport modes like stdio and sse. DBHub also offers a demo mode with a sample employee database for testing purposes.

github

: 76

Free-LLM-Collection

Free-LLM-Collection is a curated list of free resources for mastering the Legal Language Model (LLM) technology. It includes datasets, research papers, tutorials, and tools to help individuals learn and work with LLM models. The repository aims to provide a comprehensive collection of materials to support researchers, developers, and enthusiasts interested in exploring and leveraging LLM technology for various applications in the legal domain.

github

: 64

XiaoXinAir14IML_2019_hackintosh

XiaoXinAir14IML_2019_hackintosh is a repository dedicated to enabling macOS installation on Lenovo XiaoXin Air-14 IML 2019 laptops. The repository provides detailed information on the hardware specifications, supported systems, BIOS versions, related models, installation methods, updates, patches, and recommended settings. It also includes tools and guides for BIOS modifications, enabling high-resolution display settings, Bluetooth synchronization between macOS and Windows 10, voltage adjustments for efficiency, and experimental support for YogaSMC. The repository offers solutions for various issues like sleep support, sound card emulation, and battery information. It acknowledges the contributions of developers and tools like OpenCore, itlwm, VoodooI2C, and ALCPlugFix.

github

: 140

For similar tasks

AI-windows-whl

github

: 147

azhpc-images

This repository contains scripts for installing HPC and AI libraries and tools to build Azure HPC/AI images. It streamlines the process of provisioning compute-intensive workloads and crafting advanced AI models in the cloud, ensuring efficiency and reliability in deployments.

github

: 95

Aidan-Bench

Aidan Bench is a tool that rewards creativity, reliability, contextual attention, and instruction following. It is weakly correlated with Lmsys, has no score ceiling, and aligns with real-world open-ended use. The tool involves giving LLMs open-ended questions and evaluating their answers based on novelty scores. Users can set up the tool by installing required libraries and setting up API keys. The project allows users to run benchmarks for different models and provides flexibility in threading options.

github

: 71

llm-chatbot-python

This repository provides resources for building a chatbot backed by Neo4j using Python. It includes instructions on running the application, setting up tests, and installing necessary libraries. The chatbot is designed to interact with users and provide recommendations based on data stored in a Neo4j database. The repository is part of the Neo4j GraphAcademy course on building chatbots with Python.

github

: 79

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

github

: 980

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

github

: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

github

: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

github

: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

github

: 2.9k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

github

: 32.1k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

github

: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

github

: 675