are-copilots-local-yet

Are Copilots Local Yet? The frontier of local LLM Copilots for code completion, project generation, shell assistance, and more. Find tools shaping tomorrow's developer experience, today!

Stars: 511


Current trends and state of the art for using open & local LLM models as copilots to complete code, generate projects, act as shell assistants, automatically fix bugs, and more. This document is a curated list of local Copilots, shell assistants, and related projects, intended to be a resource for those interested in a survey of the existing tools and to help developers discover the state of the art for projects like these.

README:

🛠️ Are Copilots Local Yet?

Current trends and state of the art for using open & local LLM models as copilots to complete code, generate projects, act as shell assistants, automatically fix bugs, and more.

📝 Help keep this list relevant and up-to-date by making edits!

Summary
Background
Editor Extensions
Tools
Chat Interfaces
Models
Datasets
Misc Tools
Suggested Setup
History
Stats

📋 Summary

Local Copilots are now fully functional, although their output quality is still not on par with cloud-based services like GitHub Copilot.

This document is a curated list of local Copilots, shell assistants, and related projects. It is intended to be a resource for those interested in a survey of the existing tools, and to help developers discover the state of the art for projects like these.

📚 Background

In 2021, GitHub released Copilot, which quickly became popular among developers. Since then, the flurry of AI developments around LLMs has produced local models that can run on consumer machines, and it has seemed only a matter of time before Copilots go local.

Many perceived limitations of GitHub's Copilot are related to its closed and cloud-hosted nature.

As an alternative, local Copilots enable:

🌐 Offline & private use
⚡ Improved responsiveness
📚 Better project/context awareness
🎯 The ability to run models specialized for a particular language/task
🔒 The ability to constrain LLM output to a particular format/syntax
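
To make the last point concrete, here is a minimal sketch of constrained decoding. It is a toy illustration and not how any tool in this list implements it: `constrained_greedy_decode` and `toy_scores` are hypothetical names, the "model" is a fixed score table, and tokens are single characters. At each step, tokens that would take the output outside the allowed set are masked out before the best-scoring one is chosen.

```python
from typing import Callable, Dict, List


def constrained_greedy_decode(
    score_next: Callable[[str], Dict[str, float]],
    allowed: List[str],
    max_steps: int = 32,
) -> str:
    """Greedy decoding restricted to the strings listed in `allowed`.

    At every step, tokens that would leave the set of valid prefixes are
    masked out, and the best-scoring remaining token is appended.
    """
    output = ""
    for _ in range(max_steps):
        if output in allowed:
            return output  # stop at the first complete allowed string
        scores = score_next(output)
        valid = {
            token: score
            for token, score in scores.items()
            if token and any(option.startswith(output + token) for option in allowed)
        }
        if not valid:
            raise ValueError(f"no valid continuation after {output!r}")
        output += max(valid, key=valid.get)
    raise ValueError("max_steps reached before completing an allowed string")


def toy_scores(prefix: str) -> Dict[str, float]:
    """Hypothetical stand-in for a model: fixed scores, ignoring the prefix."""
    return {"m": 0.9, "y": 0.5, "n": 0.4, "e": 0.3, "s": 0.2, "o": 0.1}


if __name__ == "__main__":
    # Unconstrained greedy decoding would start with "m"; the mask rules it out.
    print(constrained_greedy_decode(toy_scores, ["yes", "no"]))  # prints "yes"
```

With a cloud-hosted model, this kind of per-token masking is only possible if the provider exposes it; a local model gives direct access to the scores at every step.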

🧩 Editor Extensions

Editor extensions that complete code using LLMs:

Name	Editor	⭐	Released	Notes
GitHub Copilot	VSCode, vim	9125	2021-6-29	The GitHub Original, not local or open-source.
Cursor	VSCode	27112	2023-3-14	Fork of VSCode, not open-source
Fauxpilot	VSCode	14645	2022-9-3	Early local PoC. Stale?
Tabby	VSCode, vim, IntelliJ	29074	2023-9-30	Completes the cursor selection
turbopilot	VSCode	3818	2023-4-10	Completions with FIM support, inspired by fauxpilot
HuggingFace-vscode	VSCode	1255	2023-6-19	Fork of Tabnine, supports Starcoder
localpilot	VSCode	3369	2023-10-2	Utility for easily hosting models locally, for use with official Copilot extension using custom API endpoint.
StarcoderEx	VSCode	101	2023-5-5	Completes the cursor selection
WizardCoder-VSC	VSCode	145	2023-6-19	PoC, article available
KoboldAIConnect	VSCode		2023-10-7	Copilot clone using local KoboldAI backend
gen.nvim	vim	1323	2023-10-1	Edit selection using custom prompts
uniteai	VSCode, emacs, lsp	309	2023-8-27
Privy	VSCode	916	2024-1-8	A privacy-first coding assistant.
twinny	VSCode	3279	2024-1-24	The most no-nonsense locally hosted AI code completion plugin for VS Code
continue	VSCode, JetBrains	21966	2023-5-24	Extension with chat, autocomplete, and actions.

🛠️ Tools

Tools that try to generate projects/features from specification:

Name	⭐	Released	Notes
gpt-engineer	52940	2023-6-6	Specify what you want it to build, the AI asks for clarification, and then builds it.
gpt-pilot	32250	2023-7-18	Very similar to gpt-engineer
aider	25618	2023-6-8	AI pair programming in your terminal, works well with pre-existing, larger codebases
rift	3051	2023-6-20	VSCode extension that lets you write code by chatting and makes your IDE agentic: an AI engineer that works alongside you.
mentat	2583	2023-7-25	Mentat coordinates edits across multiple locations and files.
clippinator	364	2023-4-15	Uses a team of agents to plan, write, debug, and test
Refact.AI	1660	2023-10-6	Fully self-hostable code completion, chat, and training service, complete with VSCode extension.
LocalCompletion	27	2023-11-15	Inline completion with support for any OpenAI compatible backend

🗨️ Chat Interfaces

Chat interfaces with shell/REPL/notebook access. Similar to/inspired by ChatGPT's "Advanced Data Analysis" feature (previously "Code Interpreter").

Name	⭐	Notes
open-interpreter	57982	open-source, locally running implementation of OpenAI's Code Interpreter
gptme	3131	Supporting open models. Developed by me, @ErikBjare
octogen	256	Local Code Interpreter executing in Docker environment.
terminal-x	34	Very early prototype that converts natural language into shell commands, unmaintained since Sept. 2021
DODA	>50	Electron based GUI for a local OpenAI Dev Assistant

🤖 Models

Models relevant for local Copilot use. Ordered by most recent first.

Name	Size	Languages	⭐	Released	Notes
Deepseek R1	671B	Many	3052	2025-01
Qwen 2.5 Coder	32B	92	3998	2024-11
Phind CodeLlama v2	34B	Many	829	2023-8-27
WizardCoder-Python	7/13/34B	Python	765	2023-8
CodeLlama	7/13/34B	Many	16165	2023-8
replit-glaive	3B	1?	88	2023-7	Small model fine-tuned on high-quality data with impressive performance.
WizardCoder	15B	80+	750	2023-6	Fine-tuning of Starcoder
Starcoder	15B	80+	7351	2023-5
replit-v1-3b	3B	20+	724	2023-5
SantaCoder	1.1B	Python, Java, JavaScript	331	2023-4	Tiny model selectively trained on 3 languages from 'The Stack'

Note: due to the pace of new model releases, this section is doomed to be out of date.
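
The Size column above largely determines whether a model can run on a given machine. As a rough sketch (not taken from any of the listed projects), the memory needed for the weights alone is approximately parameter count times bits per weight. The function names and the `headroom` factor below are illustrative assumptions; real quantized files use somewhat more than their nominal bit width, and the context cache and runtime need additional memory on top.

```python
def estimate_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    if params_billion <= 0 or bits_per_weight <= 0:
        raise ValueError("params_billion and bits_per_weight must be positive")
    # (params_billion * 1e9 weights) * (bits / 8 bytes) / 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


def fits_in_memory(
    params_billion: float,
    bits_per_weight: float,
    memory_gb: float,
    headroom: float = 0.7,  # assumed share of memory usable for weights
) -> bool:
    """Rough check: do the weights fit in `headroom` of the available memory?"""
    return estimate_weight_gb(params_billion, bits_per_weight) <= memory_gb * headroom


if __name__ == "__main__":
    # Sizes from the table above at 16-bit, 8-bit, and 4-bit precision.
    for size in (7, 13, 34):
        for bits in (16, 8, 4):
            gb = estimate_weight_gb(size, bits)
            verdict = "fits" if fits_in_memory(size, bits, memory_gb=32) else "too large"
            print(f"{size}B @ {bits}-bit: ~{gb:.1f} GB ({verdict} in 32 GB)")
```

Treat the result as a lower bound when choosing a model size and quantization level; actual speed still has to be measured on your own hardware.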

📚 Datasets

Datasets relevant for training models.

Name	Size	Languages	⭐	Released	Notes
The Stack	3TB/6TB	358	760	2022-10	Excludes weak-copyleft licenses (MPL, LGPL, EPL) since v1.1

Misc Tools

Miscellaneous useful tools.

Name	⭐	Released	Notes
ollama	111009	2023-8-27	Easily get up and running with large language models locally.

Suggested setup

As you can see above, there are many options for models and editor extensions. If you use VS Code or JetBrains and want to get started straight away, you can use the following setup:

Install LM Studio.
Install the Continue.dev extension.
Download one or several models in LM Studio. As of January 2025, Qwen 2.5 Coder is a good choice for autocomplete and Deepseek R1 is a good choice for chat. Depending on your hardware, you'll have to experiment to find which model size and quantization level gives you sufficient speed. For example, on a MacBook Pro M2 with 32GB RAM, Qwen2.5-Coder-7B-Instruct-Q4_K_M works well for autocomplete and DeepSeek-R1-Distill-Qwen-14B-Q4_0 works well for chat.
Go to the Developer tab in LM Studio and start the server.

Configure the Continue.dev extension by adding your selected models. For example:

{
    "models": [
        {
            "apiBase": "http://localhost:1234/v1/",
            "title": "Deepseek R1",
            "model": "bartowski/deepseek-r1-distill-qwen-14b",
            "provider": "lmstudio"
        }
    ],
    "tabAutocompleteModel": {
        "provider": "lmstudio",
        "apiBase": "http://localhost:1234/v1/",
        "title": "Qwen 2.5 Coder",
        "model": "qwen2.5-coder-7b-instruct"
    }
}
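
If completions do not appear, it can help to confirm that the server is reachable and that the configured model names match what it reports. The sketch below assumes the LM Studio server exposes the OpenAI-compatible `GET /v1/models` endpoint at the `apiBase` from the config above. The helper names are hypothetical, and the identifiers your server reports may differ from the ones shown here.

```python
import json
import urllib.request
from typing import Iterable, List

API_BASE = "http://localhost:1234/v1/"  # same apiBase as in the config above


def models_url(api_base: str) -> str:
    """Build the model-listing URL, tolerating a trailing slash in api_base."""
    return api_base.rstrip("/") + "/models"


def parse_model_ids(payload: dict) -> List[str]:
    """Extract model ids from an OpenAI-style {"data": [{"id": ...}]} payload."""
    return [entry["id"] for entry in payload.get("data", [])]


def missing_models(configured: Iterable[str], available: Iterable[str]) -> List[str]:
    """Return the configured model names that the server did not report."""
    available_set = set(available)
    return [name for name in configured if name not in available_set]


def list_local_models(api_base: str = API_BASE, timeout: float = 5.0) -> List[str]:
    """Ask the local server which models it currently offers."""
    with urllib.request.urlopen(models_url(api_base), timeout=timeout) as response:
        return parse_model_ids(json.load(response))


if __name__ == "__main__":
    configured = [
        "bartowski/deepseek-r1-distill-qwen-14b",
        "qwen2.5-coder-7b-instruct",
    ]
    try:
        available = list_local_models()
    except (OSError, ValueError) as error:  # server not running or bad response
        print(f"Could not query {models_url(API_BASE)}: {error}")
    else:
        print("Server reports:", available)
        print("Configured but not reported:", missing_models(configured, available))
```

If a configured name is listed as missing, copy the identifier that the server reports into the "model" field of the config.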

📰 History

🐦 Tweet announcing this repo

📈 Stats

Stargazers over time:


