
swift-chat
A cross-platform AI chat application built with React Native and powered by Amazon Bedrock
Stars: 157

SwiftChat is a fast and responsive AI chat application developed with React Native and powered by Amazon Bedrock. It offers real-time streaming conversations, AI image generation, multimodal support, conversation history management, and cross-platform compatibility across Android, iOS, and macOS. The app supports multiple AI models like Amazon Bedrock, Ollama, DeepSeek, and OpenAI, and features a customizable system prompt assistant. With a minimalist design philosophy and robust privacy protection, SwiftChat delivers a seamless chat experience with various features like rich Markdown support, comprehensive multimodal analysis, creative image suite, and quick access tools. The app prioritizes speed in launch, request, render, and storage, ensuring a fast and efficient user experience. SwiftChat also emphasizes app privacy and security by encrypting API key storage, minimal permission requirements, local-only data storage, and a privacy-first approach.
README:
Your Personal AI Assistant - Fast, Private, and Cross-platform
- Download for Android
- Download for macOS
- For iOS: Currently available through local build with Xcode
SwiftChat is a fast and responsive AI chat application developed with React Native and powered by Amazon Bedrock, with compatibility extending to other model providers such as Ollama, DeepSeek, and OpenAI. With its minimalist design philosophy and robust privacy protection, it delivers real-time streaming conversations and AI image generation capabilities across Android, iOS, and macOS platforms.
Key Features:
- Real-time streaming chat with AI
- Rich Markdown Support: Tables, Code Blocks, LaTeX and More
- AI image generation with progress
- Multimodal support (images, videos & documents)
- Conversation history list view and management
- Cross-platform support (Android, iOS, macOS)
- Tablet-optimized for iPad and Android tablets
- Fast launch and responsive performance
- Multiple AI models supported (Amazon Bedrock, Ollama, DeepSeek, and OpenAI, from v1.10.0)
- Fully customizable System Prompt Assistant (new feature from v1.9.0)
Supported Features For Amazon Nova
- Stream conversations with Amazon Nova Micro, Lite and Pro
- Understand images, documents and videos with Nova Lite and Pro
- Record 30-second videos directly on Android and iOS for Nova analysis
- Upload large videos (1080p/4K) beyond 8MB with auto compression
- Support using natural language to make Nova Canvas generate images, remove backgrounds, replace backgrounds, and create images in similar styles (see the sketch after this list).
- Support LaTeX formula rendering (inline and display modes) for Amazon Nova.
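For a sense of what these Nova Canvas requests look like, here is a minimal sketch of a direct Bedrock Runtime invocation. SwiftChat actually routes such calls through its deployed API server, so treat this as an illustration under assumed parameters (region, image size), not the app's own code.
```typescript
// Sketch: direct Nova Canvas text-to-image call via the AWS SDK (assumed schema).
import {
  BedrockRuntimeClient,
  InvokeModelCommand,
} from "@aws-sdk/client-bedrock-runtime";

const client = new BedrockRuntimeClient({ region: "us-west-2" });

async function generateImage(prompt: string): Promise<string> {
  const command = new InvokeModelCommand({
    modelId: "amazon.nova-canvas-v1:0",
    contentType: "application/json",
    body: JSON.stringify({
      taskType: "TEXT_IMAGE", // other task types cover background removal/replacement
      textToImageParams: { text: prompt },
      imageGenerationConfig: { numberOfImages: 1, width: 1024, height: 1024 },
    }),
  });
  const response = await client.send(command);
  // The response body carries base64-encoded image data.
  const payload = JSON.parse(new TextDecoder().decode(response.body));
  return payload.images[0];
}
```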
Comprehensive Multimodal Analysis: Text, Image, Document and Video
System Prompt Assistant: Useful Preset System Prompts with Full Management Capabilities (Add/Edit/Sort/Delete)
Creative Image Suite: Generation, Style Replication, Background Removal & Replacement
Rich Markdown Support: Paragraph, Code Blocks, Tables, LaTeX and More
We redesigned the UI with optimized font sizes and line spacing for a more elegant and clean presentation. All of these features are also seamlessly displayed on Android and macOS with native UI.
Note: Some animated images have been sped up for demonstration. If you experience lag, please view them in Chrome, Firefox, or Edge on your computer.
By default, we use AWS App Runner, which is commonly used to host Python FastAPI servers, offering high performance, scalability and low latency.
Alternatively, we provide the option to replace App Runner with AWS Lambda using Function URL for a more cost-effective solution, as shown in this example.
Ensure you have access to Amazon Bedrock foundation models. SwiftChat's default settings are:
- Region: us-west-2
- Text Model: Amazon Nova Pro
- Image Model: Stable Diffusion 3.5 Large
If you are using the image generation feature, please make sure you have enabled access to the Amazon Nova Lite model. Please follow the Amazon Bedrock User Guide to enable your models. A quick programmatic check of model availability is sketched below.
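As that quick check, the following minimal sketch lists the Bedrock foundation models visible in your chosen region with the AWS SDK for JavaScript. Listing shows availability only; access is still granted per model in the Bedrock console, and the filter pattern here is just an illustration.
```typescript
// Sketch: list Bedrock foundation models visible in us-west-2.
import {
  BedrockClient,
  ListFoundationModelsCommand,
} from "@aws-sdk/client-bedrock";

async function checkModelAvailability(): Promise<void> {
  const client = new BedrockClient({ region: "us-west-2" });
  const { modelSummaries } = await client.send(
    new ListFoundationModelsCommand({})
  );
  for (const model of modelSummaries ?? []) {
    // Look for the defaults SwiftChat expects: Nova models and SD 3.5 Large.
    if (/nova|stable/i.test(model.modelId ?? "")) {
      console.log(model.modelId, model.modelName);
    }
  }
}

checkModelAvailability().catch(console.error);
```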
Configuration Steps
Step 1: Create your API Key
1. Sign in to your AWS console and right-click Parameter Store to open it in a new tab.
2. Check whether you are in a supported region, then click the Create parameter button.
3. Fill in the parameters below, leaving other options as default:
   - Name: Enter a parameter name (e.g., "SwiftChatAPIKey"; it will be used as ApiKeyParam in Step 2).
   - Type: Select SecureString.
   - Value: Enter any string without spaces (this will be your API Key in Step 3).
4. Click Create parameter.
The same parameter can also be created programmatically, as sketched below.
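Equivalent to the console steps above, a minimal sketch that stores the same SecureString with the AWS SDK for JavaScript; the parameter name matches the example, and the value is a placeholder to substitute with your own.
```typescript
// Sketch: create the SwiftChat API key parameter via AWS SSM Parameter Store.
import { SSMClient, PutParameterCommand } from "@aws-sdk/client-ssm";

async function createApiKeyParameter(): Promise<void> {
  const client = new SSMClient({ region: "us-west-2" });
  await client.send(
    new PutParameterCommand({
      Name: "SwiftChatAPIKey",      // becomes ApiKeyParam in Step 2
      Value: "your-api-key-string", // no spaces; this is your API Key in Step 3
      Type: "SecureString",
    })
  );
  console.log("Parameter SwiftChatAPIKey created");
}

createApiKeyParameter().catch(console.error);
```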
Step 2: Deploy the CloudFormation Stack
1. Click one of the following buttons to launch the CloudFormation stack in the same region where your API Key was created.
2. Click Next. On the "Specify stack details" page, provide the following information:
   - Fill in ApiKeyParam with the parameter name you used for storing the API key (e.g., "SwiftChatAPIKey").
   - For App Runner, choose an InstanceTypeParam based on your needs.
3. Click Next. Keep the "Configure stack options" page as default, read the Capabilities section, and check the "I acknowledge that AWS CloudFormation might create IAM resources" checkbox at the bottom.
4. Click Next. On the "Review and create" page, review your configuration and click Submit.
Wait about 3-5 minutes for the deployment to finish, then click the CloudFormation stack and go to the Outputs tab. There you can find the API URL, which looks like https://xxx.xxx.awsapprunner.com or https://xxx.lambda-url.xxx.on.aws
Step 3: Configure the App
1. Launch the App, open the drawer menu, and tap Settings.
2. Paste the API URL and API Key (the Value you typed in Parameter Store), then select the Region.
3. Click the top-right check icon to save your configuration and start your chat.
Congratulations! Your SwiftChat App is ready to use!
Supported regions:
- US East (N. Virginia): us-east-1
- US West (Oregon): us-west-2
- Asia Pacific (Mumbai): ap-south-1
- Asia Pacific (Singapore): ap-southeast-1
- Asia Pacific (Sydney): ap-southeast-2
- Asia Pacific (Tokyo): ap-northeast-1
- Canada (Central): ca-central-1
- Europe (Frankfurt): eu-central-1
- Europe (London): eu-west-2
- Europe (Paris): eu-west-3
- South America (São Paulo): sa-east-1
Configure Ollama
- Navigate to the Settings Page and select the Ollama tab.
- Enter your Ollama Server URL, for example: http://localhost:11434
- Once the correct Server URL is entered, you can select your desired Ollama models from the Text Model dropdown list.
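Before selecting a model, you can confirm the Server URL is reachable: Ollama's REST API lists installed models at /api/tags. A minimal sketch, assuming Ollama is running on the default port:
```typescript
// Sketch: list models available on a local Ollama server.
async function listOllamaModels(serverUrl: string): Promise<string[]> {
  const response = await fetch(`${serverUrl}/api/tags`);
  if (!response.ok) {
    throw new Error(`Ollama server unreachable: ${response.status}`);
  }
  const data = await response.json();
  // Each entry carries the model name shown in the dropdown, e.g. "llama3:8b".
  return data.models.map((m: { name: string }) => m.name);
}

listOllamaModels("http://localhost:11434").then(console.log);
```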
Configure DeepSeek
- Go to the Settings Page and select the DeepSeek tab.
- Input your DeepSeek API Key.
- Choose DeepSeek models from the Text Model dropdown list. Currently, the following DeepSeek models are supported:
  - DeepSeek-V3
  - DeepSeek-R1
Configure OpenAI
- Navigate to the Settings Page and select the OpenAI tab.
- Enter your OpenAI API Key.
- Select OpenAI models from the Text Model dropdown list. The following OpenAI models are currently supported:
  - GPT-4o
  - GPT-4o mini
Additionally, if you have deployed the ClickStream Server, you can enable the Use Proxy option to forward your requests. A sketch of the request shape these providers accept follows.
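DeepSeek and OpenAI both speak the OpenAI-compatible chat completions protocol, which is the request shape a client like this sends. A minimal streaming sketch follows, using the web fetch streaming API for brevity (React Native itself needs a polyfill for streamed responses); the base URL is swappable per provider, e.g. https://api.openai.com or https://api.deepseek.com, and the key shown is a placeholder.
```typescript
// Sketch: OpenAI-compatible streaming chat completion.
async function streamChat(baseUrl: string, apiKey: string, model: string) {
  const response = await fetch(`${baseUrl}/v1/chat/completions`, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({
      model, // e.g. "gpt-4o-mini" or "deepseek-chat"
      messages: [{ role: "user", content: "Hello!" }],
      stream: true, // tokens arrive incrementally as server-sent events
    }),
  });
  const reader = response.body!.getReader();
  const decoder = new TextDecoder();
  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    // Each chunk holds "data: {...}" SSE lines with incremental deltas.
    console.log(decoder.decode(value));
  }
}

streamChat("https://api.openai.com", "your-api-key", "gpt-4o-mini");
```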
Quick Access Tools: Code Copy, Selection Mode, Scroll Controls and Token Counter
We feature streamlined chat History, Settings pages, and intuitive Usage statistics:
Similarly, the Mac version not only supports the history display but also adds a permanent sidebar display mode (since v1.9.0). Below is a demo animation showing how to add a custom system prompt.
- [x] Text copy support:
- Copy button in message header
- Copy button in code blocks
- Directly select and copy code on macOS (double-click, or long-press on iOS)
- Long press text to copy entire sentence (Right-click on macOS)
- [x] Text selection mode by tapping message title or double-clicking text
- [x] Message timeline view in history
- [x] Delete messages through long press in history
- [x] Click to preview for uploaded documents and images
- [x] Support Markdown format for both questions and answers
- [x] Maximum 20 images and 5 documents per conversation
- [x] Support image generation with Chinese prompts (make sure Amazon Nova Lite is enabled in your selected region)
- [x] View and zoom generated images
- [x] Long press images to save or share
- [x] Automatic image compression to improve response speed
- [x] Haptic feedback for Android and iOS (can be disabled in Settings)
- [x] Support landscape mode on Android/iOS devices
- [x] Double tap title bar to scroll to top
- [x] Click bottom arrow to view latest messages
- [x] View current session token usage by tapping Chat title
- [x] Check detailed token usage and image generation count in Settings
- [x] In-app upgrade notifications (Android & macOS)
We have optimized the layout for landscape mode. As shown below, you can comfortably view table/code contents in landscape orientation.
Fast Launch Speed
- Thanks to the AOT (Ahead-of-Time) compilation of the React Native Hermes engine
- Added lazy loading of complex components
- App launches instantly and is immediately ready for input
Fast Request Speed
- Speed up end-to-end API requests through image compression
- Deploying APIs in the same region as Bedrock provides lower latency
Fast Render Speed
- Using useMemo and custom caching to create a secondary cache for session content (see the sketch after this list)
- Reduce unnecessary re-renders and speed up streaming message display
- All UI components are rendered as native components
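To illustrate the memoization described in this list, a minimal sketch of caching per-message parsing with useMemo; MessageBubble and parseMarkdown are hypothetical names standing in for the app's real components.
```tsx
// Sketch: memoize expensive per-message work so streaming updates
// only re-render the message whose content actually changed.
import React, { useMemo } from "react";
import { Text } from "react-native";

type Message = { id: string; raw: string };

// Hypothetical parser standing in for markdown/LaTeX processing.
function parseMarkdown(raw: string): string {
  return raw.trim();
}

const MessageBubble = React.memo(({ message }: { message: Message }) => {
  // Recompute only when this message changes, acting as a
  // secondary cache for session content.
  const parsed = useMemo(() => parseMarkdown(message.raw), [message.raw]);
  return <Text>{parsed}</Text>;
});

export default MessageBubble;
```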
Fast Storage Speed
- Using react-native-mmkv, messages can be read, stored, and updated 10x faster than with AsyncStorage (a usage sketch follows this list)
- Optimized session content and session list storage structure to accelerate history list display
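As a concrete reference, a minimal sketch of the react-native-mmkv pattern these points describe; the storage id, key layout, and encryption key shown are illustrative assumptions, not SwiftChat's actual schema.
```typescript
// Sketch: fast, encrypted key-value storage with react-native-mmkv.
import { MMKV } from "react-native-mmkv";

// Encrypting the instance is how API keys can be stored safely at rest.
const storage = new MMKV({
  id: "swiftchat-demo",              // hypothetical storage id
  encryptionKey: "device-local-key", // manage real keys securely
});

type Message = { id: string; role: string; content: string };

function saveSession(sessionId: string, messages: Message[]): void {
  // MMKV writes synchronously, so no await is needed (unlike AsyncStorage).
  storage.set(`session:${sessionId}`, JSON.stringify(messages));
}

function loadSession(sessionId: string): Message[] {
  const raw = storage.getString(`session:${sessionId}`);
  return raw ? (JSON.parse(raw) as Message[]) : [];
}
```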
App Privacy & Security
- Encrypted API key storage
- Minimal permission requirements
- Local-only data storage
- No user behavior tracking
- No data collection
- Privacy-first approach
First, clone this repository. All app code is located in the react-native folder. Before proceeding, execute the following command to download dependencies:
cd react-native && npm i && npm start
For Android, open a new terminal and execute:
npm run android
For iOS, also open a new terminal. The first time, you need to install the native dependencies by executing cd ios && pod install && cd .., then execute the following command:
npm run ios
For macOS:
- Execute npm start.
- Double-click ios/SwiftChat.xcworkspace to open the project in Xcode.
- Change the build destination to My Mac (Mac Catalyst), then click the ▶ Run button.
Please refer to the API Reference.
- Android and macOS: Navigate to the Settings Page. If there is a new version, you will find it at the bottom of this page; click the app version to download and install it.
- iOS: If a new version is released on the Release page, update your local code, then rebuild and install the app with Xcode.
Note: After downloading a new version, please check the release notes to see if an API version update is required.
- For App Runner: Click and open the App Runner Services page, find and open swiftchat-api, then click the top-right Deploy button.
- For Lambda: Click and open Lambda Services, find and open your Lambda function whose name starts with SwiftChatLambda-xxx, click the Deploy new image button, and click Save.
See CONTRIBUTING for more information.
This library is licensed under the MIT-0 License. See the LICENSE file.
Similar Open Source Tools

chatbox
Chatbox is a desktop client for ChatGPT, Claude, and other LLMs, providing a user-friendly interface for AI copilot assistance on Windows, Mac, and Linux. It offers features like local data storage, multiple LLM provider support, image generation with Dall-E-3, enhanced prompting, keyboard shortcuts, and more. Users can collaborate, access the tool on various platforms, and enjoy multilingual support. Chatbox is constantly evolving with new features to enhance the user experience.

DeepSeekAI
DeepSeekAI is a browser extension plugin that allows users to interact with AI by selecting text on web pages and invoking the DeepSeek large model to provide AI responses. The extension enhances browsing experience by enabling users to get summaries or answers for selected text directly on the webpage. It features context text selection, API key integration, draggable and resizable window, AI streaming replies, Markdown rendering, one-click copy, re-answer option, code copy functionality, language switching, and multi-turn dialogue support. Users can install the extension from Chrome Web Store or Edge Add-ons, or manually clone the repository, install dependencies, and build the extension. Configuration involves entering the DeepSeek API key in the extension popup window to start using the AI-driven responses.

llm-x
LLM X is a ChatGPT-style UI for the niche group of folks who run Ollama (think of this like an offline chat gpt server) locally. It supports sending and receiving images and text and works offline through PWA (Progressive Web App) standards. The project utilizes React, Typescript, Lodash, Mobx State Tree, Tailwind css, DaisyUI, NextUI, Highlight.js, React Markdown, kbar, Yet Another React Lightbox, Vite, and Vite PWA plugin. It is inspired by ollama-ui's project and Perplexity.ai's UI advancements in the LLM UI space. The project is still under development, but it is already a great way to get started with building your own LLM UI.

R2R
R2R (RAG to Riches) is a fast and efficient framework for serving high-quality Retrieval-Augmented Generation (RAG) to end users. The framework is designed with customizable pipelines and a feature-rich FastAPI implementation, enabling developers to quickly deploy and scale RAG-based applications. R2R was conceived to bridge the gap between local LLM experimentation and scalable production solutions. R2R is to LangChain/LlamaIndex what NextJS is to React. A JavaScript client for R2R deployments can be found here. Key features: Deploy (instantly launch production-ready RAG pipelines with streaming capabilities); Customize (tailor your pipeline with intuitive configuration files); Extend (enhance your pipeline with custom code integrations); Autoscale (scale your pipeline effortlessly in the cloud using SciPhi); OSS (benefit from a framework developed by the open-source community, designed to simplify RAG deployment).

nodetool
NodeTool is a platform designed for AI enthusiasts, developers, and creators, providing a visual interface to access a variety of AI tools and models. It simplifies access to advanced AI technologies, offering resources for content creation, data analysis, automation, and more. With features like a visual editor, seamless integration with leading AI platforms, model manager, and API integration, NodeTool caters to both newcomers and experienced users in the AI field.

DevoxxGenieIDEAPlugin
Devoxx Genie is a Java-based IntelliJ IDEA plugin that integrates with local and cloud-based LLM providers to aid in reviewing, testing, and explaining project code. It supports features like code highlighting, chat conversations, and adding files/code snippets to context. Users can modify REST endpoints and LLM parameters in settings, including support for cloud-based LLMs. The plugin requires IntelliJ version 2023.3.4 and JDK 17. Building and publishing the plugin is done using Gradle tasks. Users can select an LLM provider, choose code, and use commands like review, explain, or generate unit tests for code analysis.

ebook2audiobook
ebook2audiobook is a CPU/GPU converter tool that converts eBooks to audiobooks with chapters and metadata using tools like Calibre, ffmpeg, XTTSv2, and Fairseq. It supports voice cloning and a wide range of languages. The tool is designed to run on 4GB RAM and provides a new v2.0 Web GUI interface for user-friendly interaction. Users can convert eBooks to text format, split eBooks into chapters, and utilize high-quality text-to-speech functionalities. Supported languages include Arabic, Chinese, English, French, German, Hindi, and many more. The tool can be used for legal, non-DRM eBooks only and should be used responsibly in compliance with applicable laws.

pyspur
PySpur is a graph-based editor designed for LLM (Large Language Models) workflows. It offers modular building blocks, node-level debugging, and performance evaluation. The tool is easy to hack, supports JSON configs for workflow graphs, and is lightweight with minimal dependencies. Users can quickly set up PySpur by cloning the repository, creating a .env file, starting docker services, and accessing the portal. PySpur can also work with local models served using Ollama, with steps provided for configuration. The roadmap includes features like canvas, async/batch execution, support for Ollama, new nodes, pipeline optimization, templates, code compilation, multimodal support, and more.

qapyq
qapyq is an image viewer and AI-assisted editing tool designed to help curate datasets for generative AI models. It offers features such as image viewing, editing, captioning, batch processing, and AI assistance. Users can perform tasks like cropping, scaling, editing masks, tagging, and applying sorting and filtering rules. The tool supports state-of-the-art captioning and masking models, with options for model settings, GPU acceleration, and quantization. qapyq aims to streamline the process of preparing images for training AI models by providing a user-friendly interface and advanced functionalities.

TaskingAI
TaskingAI brings Firebase's simplicity to AI-native app development. The platform enables the creation of GPTs-like multi-tenant applications using a wide range of LLMs from various providers. It features distinct, modular functions such as Inference, Retrieval, Assistant, and Tool, seamlessly integrated to enhance the development process. TaskingAI's cohesive design ensures an efficient, intelligent, and user-friendly experience in AI application development.

transformerlab-app
Transformer Lab is an app that allows users to experiment with Large Language Models by providing features such as one-click download of popular models, finetuning across different hardware, RLHF and Preference Optimization, working with LLMs across different operating systems, chatting with models, using different inference engines, evaluating models, building datasets for training, calculating embeddings, providing a full REST API, running in the cloud, converting models across platforms, supporting plugins, embedded Monaco code editor, prompt editing, inference logs, all through a simple cross-platform GUI.

payload-ai
The Payload AI Plugin is an advanced extension that integrates modern AI capabilities into your Payload CMS, streamlining content creation and management. It offers features like text generation, voice and image generation, field-level prompt customization, prompt editor, document analyzer, fact checking, automated content workflows, internationalization support, editor AI suggestions, and AI chat support. Users can personalize and configure the plugin by setting environment variables. The plugin is actively developed and tested with Payload version v3.2.1, with regular updates expected.

skypilot
SkyPilot is a framework for running LLMs, AI, and batch jobs on any cloud, offering maximum cost savings, highest GPU availability, and managed execution. SkyPilot abstracts away cloud infra burdens: launch jobs and clusters on any cloud; easily scale out by queuing and running many jobs, automatically managed; get easy access to object stores (S3, GCS, R2). SkyPilot maximizes GPU availability for your jobs by provisioning in all zones/regions/clouds you have access to (the Sky), with automatic failover. SkyPilot cuts your cloud costs: Managed Spot offers 3-6x cost savings using spot VMs with auto-recovery from preemptions; the Optimizer gives 2x cost savings by auto-picking the cheapest VM/zone/region/cloud; Autostop provides hands-free cleanup of idle clusters. SkyPilot supports your existing GPU, TPU, and CPU workloads, with no code changes.

chatnio
Chat Nio is a next-generation AIGC one-stop business solution that combines the advantages of frontend-oriented lightweight deployment projects with powerful API distribution systems. It offers rich model support, beautiful UI design, complete Markdown support, multi-theme support, internationalization support, text-to-image support, powerful conversation sync, model market & preset system, rich file parsing, full model internet search, Progressive Web App (PWA) support, comprehensive backend management, multiple billing methods, innovative model caching, and additional features. The project aims to address limitations in conversation synchronization, billing, file parsing, conversation URL sharing, channel management, and API call support found in existing AIGC commercial sites, while also providing a user-friendly interface design and C-end features.

vearch
Vearch is a cloud-native distributed vector database designed for efficient similarity search of embedding vectors in AI applications. It supports hybrid search with vector search and scalar filtering, offers fast vector retrieval from millions of objects in milliseconds, and ensures scalability and reliability through replication and elastic scaling out. Users can deploy Vearch cluster on Kubernetes, add charts from the repository or locally, start with Docker-compose, or compile from source code. The tool includes components like Master for schema management, Router for RESTful API, and PartitionServer for hosting document partitions with raft-based replication. Vearch can be used for building visual search systems for indexing images and offers a Python SDK for easy installation and usage. The tool is suitable for AI developers and researchers looking for efficient vector search capabilities in their applications.
For similar tasks

h2ogpt
h2oGPT is an Apache V2 open-source project that allows users to query and summarize documents or chat with local private GPT LLMs. It features a private offline database of any documents (PDFs, Excel, Word, Images, Video Frames, Youtube, Audio, Code, Text, MarkDown, etc.), a persistent database (Chroma, Weaviate, or in-memory FAISS) using accurate embeddings (instructor-large, all-MiniLM-L6-v2, etc.), and efficient use of context using instruct-tuned LLMs (no need for LangChain's few-shot approach). h2oGPT also offers parallel summarization and extraction, reaching an output of 80 tokens per second with the 13B LLaMa2 model, HYDE (Hypothetical Document Embeddings) for enhanced retrieval based upon LLM responses, a variety of models supported (LLaMa2, Mistral, Falcon, Vicuna, WizardLM, with AutoGPTQ, 4-bit/8-bit, LORA, etc.), GPU support from HF and LLaMa.cpp GGML models, and CPU support using HF, LLaMa.cpp, and GPT4ALL models. Additionally, h2oGPT provides Attention Sinks for arbitrarily long generation (LLaMa-2, Mistral, MPT, Pythia, Falcon, etc.), a UI or CLI with streaming of all models, the ability to upload and view documents through the UI (control multiple collaborative or personal collections), Vision Models LLaVa, Claude-3, Gemini-Pro-Vision, GPT-4-Vision, Image Generation Stable Diffusion (sdxl-turbo, sdxl) and PlaygroundAI (playv2), Voice STT using Whisper with streaming audio conversion, Voice TTS using MIT-Licensed Microsoft Speech T5 with multiple voices and Streaming audio conversion, Voice TTS using MPL2-Licensed TTS including Voice Cloning and Streaming audio conversion, AI Assistant Voice Control Mode for hands-free control of h2oGPT chat, Bake-off UI mode against many models at the same time, Easy Download of model artifacts and control over models like LLaMa.cpp through the UI, Authentication in the UI by user/password via Native or Google OAuth, State Preservation in the UI by user/password, Linux, Docker, macOS, and Windows support, Easy Windows Installer for Windows 10 64-bit (CPU/CUDA), Easy macOS Installer for macOS (CPU/M1/M2), Inference Servers support (oLLaMa, HF TGI server, vLLM, Gradio, ExLLaMa, Replicate, OpenAI, Azure OpenAI, Anthropic), OpenAI-compliant, Server Proxy API (h2oGPT acts as a drop-in replacement for the OpenAI server), Python client API (to talk to Gradio server), JSON Mode with any model via code block extraction. Also supports MistralAI JSON mode, Claude-3 via function calling with strict Schema, OpenAI via JSON mode, and vLLM via guided_json with strict Schema, Web-Search integration with Chat and Document Q/A, Agents for Search, Document Q/A, Python Code, CSV frames (Experimental, best with OpenAI currently), Evaluate performance using reward models, and Quality maintained with over 1000 unit and integration tests taking over 4 GPU-hours.

serverless-chat-langchainjs
This sample shows how to build a serverless chat experience with Retrieval-Augmented Generation using LangChain.js and Azure. The application is hosted on Azure Static Web Apps and Azure Functions, with Azure Cosmos DB for MongoDB vCore as the vector database. You can use it as a starting point for building more complex AI applications.

react-native-vercel-ai
Run the Vercel AI package on React Native, Expo, Web, and Universal apps. Currently, the React Native fetch API does not support streaming, which Vercel AI uses by default. This package enables you to use the AI library on React Native, but the best usage is with Expo universal native apps. On mobile you get back responses without streaming with the same API of `useChat` and `useCompletion`, and on web it will fall back to `ai/react`.

LLamaSharp
LLamaSharp is a cross-platform library to run LLaMA/LLaVA models (and others) on your local device. Based on llama.cpp, inference with LLamaSharp is efficient on both CPU and GPU. With the higher-level APIs and RAG support, it's convenient to deploy LLMs (Large Language Models) in your application with LLamaSharp.

gpt4all
GPT4All is an ecosystem to run powerful and customized large language models that work locally on consumer grade CPUs and any GPU. Note that your CPU needs to support AVX or AVX2 instructions. Learn more in the documentation. A GPT4All model is a 3GB - 8GB file that you can download and plug into the GPT4All open-source ecosystem software. Nomic AI supports and maintains this software ecosystem to enforce quality and security alongside spearheading the effort to allow any person or enterprise to easily train and deploy their own on-edge large language models.

ChatGPT-Telegram-Bot
ChatGPT Telegram Bot is a Telegram bot that provides a smooth AI experience. It supports both Azure OpenAI and native OpenAI, and offers real-time (streaming) response to AI, with a faster and smoother experience. The bot also has 15 preset bot identities that can be quickly switched, and supports custom bot identities to meet personalized needs. Additionally, it supports clearing the contents of the chat with a single click, and restarting the conversation at any time. The bot also supports native Telegram bot button support, making it easy and intuitive to implement required functions. User level division is also supported, with different levels enjoying different single session token numbers, context numbers, and session frequencies. The bot supports English and Chinese on UI, and is containerized for easy deployment.

twinny
Twinny is a free and open-source AI code completion plugin for Visual Studio Code and compatible editors. It integrates with various tools and frameworks, including Ollama, llama.cpp, oobabooga/text-generation-webui, LM Studio, LiteLLM, and Open WebUI. Twinny offers features such as fill-in-the-middle code completion, chat with AI about your code, customizable API endpoints, and support for single or multiline fill-in-middle completions. It is easy to install via the Visual Studio Code extensions marketplace and provides a range of customization options. Twinny supports both online and offline operation and conforms to the OpenAI API standard.
For similar jobs

sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.