ZcChat

一个模仿Galgame效果的AI桌宠 | An AI desktop pet that mimics the effects of a Galgame

Stars: 420

Visit

ZcChat is an AI desktop pet suitable for Galgame characters, featuring long-term memory, expressive actions, control over the computer, and voice functions. It utilizes Letta for AI long-term memory, Galgame-style character illustrations for more actions and expressions, and voice interaction with support for various voice synthesis tools like Vits. Users can configure characters, install Letta, set up voice synthesis and input, and control the pet to interact with the computer. The tool enhances visual and auditory experiences for users interested in AI desktop pets.

README:

ZcChat

🌟 一个有长期记忆、表情动作立绘显示、操控电脑和语音功能，更适合Galgame角色的AI桌宠 🌟

中文 English

🎯 项目介绍

视频演示

✨ 特色功能

长期记忆：使用Letta实现AI长期记忆，并且发展拓展性格和个性。
表情动作：采用Galgame立绘的方式而不是Live2d，使得角色可以有更多的动作和表情
立绘动画：采用类似Galgame立绘的动画演出，提升视觉体验。
语音交互：支持语音输入、直接对话、唤醒和回复打断功能。
语音合成：支持Vits等多种语音合成工具，让声音更真实。

🛠️ 技术概览

一图看懂ZcChat的工作原理：

🚀 快速入门

📺 视频教程

点击这里观看视频教程

📋 Step 1 - 准备工作

在Release页面下载并安装ZcChat。请尽量避免中文路径，以免产生一些列问题！

如果出现尝试创建文件时出错请检查是否被杀毒软件拦截，并尝试使用管理员权限启动安装包。

讨论区找到现成的角色数据，或者自定义你的角色。

🎭 Step 2 - 配置角色

解压角色数据放入 {文档}/ZcChat/characters/{名称} 文件夹。

最终的项目结构应为 {文档}/ZcChat/characters/{名称}/正常.png...
在 ZcChat > 通用设置 > 角色选择 中选中角色。

Tips：右键立绘可以打开和关闭对话框，在对话框滚轮可以打开和关闭历史。

🤖 Step 3- 安装配置LLM模型

方式1：Openai通用接口

将BaseUrl、Key填入 ZcChat > AI模型配置 > OpenAI API设定。

BaseUrl应该类似于为 https://api.openai.com/v1/completions
在 ZcChat > 角色设置> 接口设置 中选择OpenAI并填入角色提示词。

（部分提示词可能已经内置在了角色数据中，无需额外配置）

配置格式提示词或者开启输出格式增强

默认格式提示词：

以下是我必须遵循的事项：
我精通中文和日语，在与用户交谈时，无论他们使用何种语言，我都必须始终提供双语回复。
我会在每句话之前添加我的心情和动作，我的心情必须选择以下单词之一：{emo}
我应该严格按照这个格式回答{心情}|{中文}|{日语}

方式2：Letta

Letta 是一个可以提供长期记忆和学习功能的AI代理

安装Letta 并启动。
创建agent，并在 LettaADE > agent > Persona 中填入角色提示词+格式提示词。
将Letta地址填入 ZcChat > AI模型配置 > Letta地址。

Letta默认地址为 http://localhost:8283
在 ZcChat > 角色设置> 接口设置 中选择Letta。
将agentID填入 ZcChat > 角色设置 > 接口设置 > agentID。

🎉🎉🎉至此，ZcChat已经可以基本使用了，以下是一些进阶功能的使用：🎉🎉🎉

🎙️ （可选）安装配置语音合成

方式1：vits-simple-api

安装vits-simple-api 并启动。
将URL填入 ZcChat > 语言合成设置 > vits-simple-api地址。

vits-simple-api默认地址为 http://localhost:23456
ZcChat > 角色配置 > 语音合成 中配置相关项（更多参数可以直接在id里以url的形式添加，比如4&length=1.20）

方式2：其他语音合成工具

开启自定义模式。
填入API调用地址，确保输出为mp3文件，并使用 {msg} 作为文本占位符，例如：
```
http://127.0.0.1:23456/voice/vits?id=0&text={msg}&format=mp3
```
ZcChat > 角色配置 > 语音合成 中配置相关项

🎤 （可选）配置语音输入

方式1：whisper-asr-webservice

安装whisper-asr-webservice 并启动。
将URL填入 ZcChat > 语言输入设置 > whisper-asr-webservice地址。

默认地址为 http://localhost:9000

方式2：百度语音识别

在百度智能云控制台创建短语音识别应用。
将API Key和Secret Key填入 ZcChat > 语言输入设置 > API Key/Secret Key。

🔔 （可选）配置语音唤醒和打断

先配置好语音输入
观察 语言输入设置 > 语音能量赋值，不说话和说话的能量应该会有较大区别，寻找的适合自己的阈值填入设置。

由于VAD算法较为简单，语音唤醒和打断对环境要求较高，请仔细配置。

💻 （可选）给予桌宠操控电脑的接口

将文档内提示词追加到角色提示词即可。

🦊 （可选）制作自己的角色

见角色制作文档

❓ 常见问题 (F&Q)

🤔 如何使用其他LLM，如OpenAI、Gemini、DeepSeek等？

你可以使用Openai通用接口，市面上几乎所有AI都支持这个协议。

如果你使用Letta，可以看 Letta文档

📝 对话不久后会出现格式错误？

可能是AI忘记了或受到误导性词语的影响。尝试改进提示词，或用命令的语气告诉AI使用正确的格式回复。效果取决于所使用的模型。

或者你可以开启回复格式增强功能

⁉️ 为什么不使用Live2d？

本项目的特色就是使用了类Galgame的立绘方式，使得拥有表情和动作变化，并且降低了角色制作成本。如果你在找一个Live2d桌宠，我推荐LogChat。

🌏 如何纯本地运行

Letta支持纯本地部署，当然你也可以使用如Ollama等工具并使用OpenAI通用接口。

🤗 如何贡献

ZcChat是一个开源项目。参与ZcChat项目的方法有很多！

为项目做出贡献：有兴趣做出贡献吗？欢迎Pull Request！~~（如果能读懂我的垃圾代码的话）~~
报告BUG、建议：有BUG或有功能请求？请通过Issues提交它们。
分享自己制作的模板：制作了自己的角色？不妨在Discussions分享一下！
或者……：给项目来一个star⭐怎么样？

🔗 相关链接

For Tasks:

Click tags to check more tools for each tasks

customize characters install letta configure voice synthesis set up voice input control desktop pet

For Jobs:

game developer ai engineer software developer ui/ux designer voice technology specialist

Alternative AI tools for ZcChat

Similar Open Source Tools

ZcChat

github

: 420

wechat-robot-client

The Wechat Robot Client is an intelligent robot management system that provides rich interactive experiences. It includes features such as AI chat, drawing, voice, group chat functionalities, song requests, daily summaries, friend circle viewing, friend adding, group chat management, file messaging, multiple login methods support, and more. The system also supports features like sending files, various login methods, and integration with other apps like '王者荣耀' and '吃鸡'. It offers a comprehensive solution for managing Wechat interactions and automating various tasks.

github

: 73

AivisSpeech

AivisSpeech is a Japanese text-to-speech software based on the VOICEVOX editor UI. It incorporates the AivisSpeech Engine for generating emotionally rich voices easily. It supports AIVMX format voice synthesis model files and specific model architectures like Style-Bert-VITS2. Users can download AivisSpeech and AivisSpeech Engine for Windows and macOS PCs, with minimum memory requirements specified. The development follows the latest version of VOICEVOX, focusing on minimal modifications, rebranding only where necessary, and avoiding refactoring. The project does not update documentation, maintain test code, or refactor unused features to prevent conflicts with VOICEVOX.

github

: 325

xiaozhi-client

Xiaozhi Client is a tool that supports integration with Xiaozhi official servers, acts as a regular MCP Server integrated into various clients, allows configuration of multiple Xiaozhi access points for shared MCP configuration, aggregates multiple MCP Servers in a standard way, dynamically controls MCP Server tool visibility, supports local deployment of open-source server integration, provides web-based visual configuration allowing customization of IP and port, integrates ModelScope remote MCP services, creates Xiaozhi Client projects through templates, and supports running in the background.

github

: 159

spring-boot-init-template

github

: 446

chatgpt-webui

ChatGPT WebUI is a user-friendly web graphical interface for various LLMs like ChatGPT, providing simplified features such as core ChatGPT conversation and document retrieval dialogues. It has been optimized for better RAG retrieval accuracy and supports various search engines. Users can deploy local language models easily and interact with different LLMs like GPT-4, Azure OpenAI, and more. The tool offers powerful functionalities like GPT4 API configuration, system prompt setup for role-playing, and basic conversation features. It also provides a history of conversations, customization options, and a seamless user experience with themes, dark mode, and PWA installation support.

github

: 79

Long-Novel-GPT

Long-Novel-GPT is a long novel generator based on large language models like GPT. It utilizes a hierarchical outline/chapter/text structure to maintain the coherence of long novels. It optimizes API calls cost through context management and continuously improves based on self or user feedback until reaching the set goal. The tool aims to continuously refine and build novel content based on user-provided initial ideas, ultimately generating long novels at the level of human writers.

github

: 396

ERNIE-SDK

ERNIE SDK repository contains two projects: ERNIE Bot Agent and ERNIE Bot. ERNIE Bot Agent is a large model intelligent agent development framework based on the Wenxin large model orchestration capability introduced by Baidu PaddlePaddle, combined with the rich preset platform functions of the PaddlePaddle Star River community. ERNIE Bot provides developers with convenient interfaces to easily call the Wenxin large model for text creation, general conversation, semantic vectors, and AI drawing basic functions.

github

: 321

XYBotV2

XYBot V2 is a feature-rich WeChat robot framework that supports various interactive functions and gameplays. It provides AI chat, daily news updates, song requests, weather queries, and gaming functionalities like Gomoku and Warthunder player lookup. The tool is open-source and intended for learning and research purposes only, not for commercial or illegal activities. Users must comply with relevant laws and respect WeChat's copyrights and privacy. The tool's functionalities can be extended through a plugin system, allowing for dynamic loading/unloading of plugins.

github

: 367

prompt-optimizer

Prompt Optimizer is a powerful AI prompt optimization tool that helps you write better AI prompts, improving AI output quality. It supports both web application and Chrome extension usage. The tool features intelligent optimization for prompt words, real-time testing to compare before and after optimization, integration with multiple mainstream AI models, client-side processing for security, encrypted local storage for data privacy, responsive design for user experience, and more.

github

: 15.5k

new-api

New API is a next-generation large model gateway and AI asset management system that provides a wide range of features, including a new UI interface, multi-language support, online recharge function, key query for usage quota, compatibility with the original One API database, model charging by usage count, channel weighted randomization, data dashboard, token grouping and model restrictions, support for various authorization login methods, support for Rerank models, OpenAI Realtime API, Claude Messages format, reasoning effort setting, content reasoning, user-specific model rate limiting, request format conversion, cache billing support, and various model support such as gpts, Midjourney-Proxy, Suno API, custom channels, Rerank models, Claude Messages format, Dify, and more.

github

: 10.9k

ChatPilot

ChatPilot is a chat agent tool that enables AgentChat conversations, supports Google search, URL conversation (RAG), and code interpreter functionality, replicates Kimi Chat (file, drag and drop; URL, send out), and supports OpenAI/Azure API. It is based on LangChain and implements ReAct and OpenAI Function Call for agent Q&A dialogue. The tool supports various automatic tools such as online search using Google Search API, URL parsing tool, Python code interpreter, and enhanced RAG file Q&A with query rewriting support. It also allows front-end and back-end service separation using Svelte and FastAPI, respectively. Additionally, it supports voice input/output, image generation, user management, permission control, and chat record import/export.

github

: 523

new-api

New API is an open-source project based on One API with additional features and improvements. It offers a new UI interface, supports Midjourney-Proxy(Plus) interface, online recharge functionality, model-based charging, channel weight randomization, data dashboard, token-controlled models, Telegram authorization login, Suno API support, Rerank model integration, and various third-party models. Users can customize models, retry channels, and configure caching settings. The deployment can be done using Docker with SQLite or MySQL databases. The project provides documentation for Midjourney and Suno interfaces, and it is suitable for AI enthusiasts and developers looking to enhance AI capabilities.

github

: 6.5k

langchain4j-aideepin-web

The langchain4j-aideepin-web repository is the frontend project of langchain4j-aideepin, an open-source, offline deployable retrieval enhancement generation (RAG) project based on large language models such as ChatGPT and application frameworks such as Langchain4j. It includes features like registration & login, multi-sessions (multi-roles), image generation (text-to-image, image editing, image-to-image), suggestions, quota control, knowledge base (RAG) based on large models, model switching, and search engine switching.

github

: 59

MarkMap-OpenAi-ChatGpt

MarkMap-OpenAi-ChatGpt is a Vue.js-based mind map generation tool that allows users to generate mind maps by entering titles or content. The application integrates the markmap-lib and markmap-view libraries, supports visualizing mind maps, and provides functions for zooming and adapting the map to the screen. Users can also export the generated mind map in PNG, SVG, JPEG, and other formats. This project is suitable for quickly organizing ideas, study notes, project planning, etc. By simply entering content, users can get an intuitive mind map that can be continuously expanded, downloaded, and shared.

github

: 77

EduChat

EduChat is a large-scale language model-based chatbot system designed for intelligent education by the EduNLP team at East China Normal University. The project focuses on developing a dialogue-based language model for the education vertical domain, integrating diverse education vertical domain data, and providing functions such as automatic question generation, homework correction, emotional support, course guidance, and college entrance examination consultation. The tool aims to serve teachers, students, and parents to achieve personalized, fair, and warm intelligent education.

github

: 672

For similar tasks

ZcChat

github

: 420

fount

fount is a character card frontend page that decouples AI sources, AI characters, user personas, dialogue environments, and AI plugins, allowing them to be freely combined to spark infinite possibilities. It serves as a bridge connecting imagination and reality, a lighthouse guiding characters and stories, and a free garden for AI sources, characters, personas, dialogue environments, and plugins to grow and bloom. It integrates AI sources without the need for reverse proxy servers, improves web experience with features like multi-device synchronization and unfiltered HTML rendering, and extends companionship beyond the web by connecting characters to Discord groups and providing gentle reminders through fount-pwsh. For character creators, fount offers infinite possibilities with JavaScript or TypeScript code customization, execution of code without filtering, loading npm packages, and creating custom HTML pages. It encourages extension through modularization and community contributions.

github

: 838

Facial-Data-Extractor

Facial Data Extractor is a software designed to extract facial data from images using AI, specifically to assist in character customization for Illusion series games. Currently, it only supports AI Shoujo and Honey Select2. Users can open images, select character card templates, extract facial data, and apply it to character cards in the game. The tool provides measurements for various facial features and allows for some customization, although perfect replication of faces may require manual adjustments.

github

: 81

GMTalker

GMTalker is an interactive digital human rendered by Unreal Engine, developed by the Media Intelligence Team at Bright Laboratory. The system integrates speech recognition, speech synthesis, natural language understanding, and lip-sync animation driving. It supports rapid deployment on Windows with only 2GB of VRAM required. The project showcases two 3D cartoon digital human avatars suitable for presentations, expansions, and commercial integration.

github

: 385

For similar jobs

sweep

Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

github

: 7.1k

teams-ai

The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

github

: 502

ai-guide

This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

github

: 159

classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

github

: 668

chatbot-ui

Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

github

: 27.7k

BricksLLM

BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

github

: 953

uAgents

uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

github

: 1.3k

griptape

Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.

github

: 2.2k