
ZcChat
一个模仿Galgame效果的AI桌宠 | An AI desktop pet that mimics the effects of a Galgame
Stars: 209

ZcChat is an AI desktop pet suitable for Galgame characters, featuring long-term memory, expressive actions, control over the computer, and voice functions. It utilizes Letta for AI long-term memory, Galgame-style character illustrations for more actions and expressions, and voice interaction with support for various voice synthesis tools like Vits. Users can configure characters, install Letta, set up voice synthesis and input, and control the pet to interact with the computer. The tool enhances visual and auditory experiences for users interested in AI desktop pets.
README:
🌟 一个有长期记忆、表情动作立绘显示、操控电脑和语音功能,更适合Galgame角色的AI桌宠 🌟
中文 English
- 长期记忆:使用Letta实现AI长期记忆,并且发展拓展性格和个性。
- 表情动作:采用Galgame立绘的方式而不是Live2d,使得角色可以有更多的动作和表情
- 立绘动画:采用类似Galgame立绘的动画演出,提升视觉体验。
- 语音交互:支持语音输入、直接对话、唤醒和回复打断功能。
- 语音合成:支持Vits等多种语音合成工具,让声音更真实。
一图看懂ZcChat的工作原理:
- 在Release页面下载并安装ZcChat。请尽量避免中文路径,以免产生一些列问题!
如果出现
尝试创建文件时出错
请检查是否被杀毒软件拦截,并尝试使用管理员权限启动安装包。
- 讨论区找到现成的角色数据,或者自定义你的角色。
-
将角色数据中的立绘放入
{文档}/ZcChat/characters/{名称}
文件夹。最终的项目结构应为
{文档}/ZcChat/characters/{名称}/正常.png...
-
在
ZcChat > 通用设置 > 角色选择
中选中角色。 -
在
ZcChat > 角色配置
设置立绘大小。 -
右键托盘重置立绘位置。
Tips:右键立绘可以打开和关闭对话框,在对话框滚轮可以打开和关闭历史。
-
安装Letta 并启动。
-
创建agent,并在
LettaADE > agent > Persona
中填入角色数据的提示词(核心记忆)。 -
将Letta地址填入
ZcChat > AI模型配置 > Letta地址
。Letta默认地址为
http://localhost:8283
-
将agentID填入
ZcChat > 角色配置 > AI模型 > agentID
。
- 将BaseUrl、Key填入
ZcChat > AI模型配置 > Openai设定
。 - 在
ZcChat > 角色配置 > AI模型
中选择Openai并填入Persona角色数据的提示词(核心记忆)。
🎉🎉🎉至此,ZcChat已经可以基本使用了,以下是一些进阶功能的使用:🎉🎉🎉
-
安装vits-simple-api 并启动。
-
将URL填入
ZcChat > 语言合成设置 > vits-simple-api地址
。vits-simple-api默认地址为
http://localhost:23456
-
ZcChat > 角色配置 > 语音合成
中配置相关项
-
开启自定义模式。
-
填入API调用地址,确保输出为mp3文件,并使用
{msg}
作为文本占位符,例如:http://127.0.0.1:23456/voice/vits?id=0&text={msg}&format=mp3
-
ZcChat > 角色配置 > 语音合成
中配置相关项
-
将URL填入
ZcChat > 语言输入设置 > whisper-asr-webservice地址
。默认地址为
http://localhost:9000
- 在百度智能云控制台创建短语音识别应用。
- 将API Key和Secret Key填入
ZcChat > 语言输入设置 > API Key/Secret Key
。
-
先配置好语音输入
-
观察
语言输入设置 > 语音能量赋值
,不说话和说话的能量应该会有较大区别,寻找的适合自己的阈值填入设置。由于VAD算法较为简单,语音唤醒和打断对环境要求较高,请仔细配置。
- 将文档内提示词追加到角色提示词即可。
Letta使用不同LLM文档,设置环境变量即可。
添加环境变量 OPENAI_API_BASE
即可。
通常是由于Letta版本更新导致数据库格式变化。如果不介意数据丢失,可以删除用户文件夹下的 .letta
文件夹后重新启动。
可能是AI忘记了或受到误导性词语的影响。尝试改进提示词,或用命令的语气告诉AI使用正确的格式回复。效果取决于所使用的模型。
本项目的特色就是使用了类Galgame的立绘方式,使得拥有表情和动作变化,并且降低了角色制作成本。如果你在找一个Live2d桌宠,我推荐LogChat。
你需要使用Letta Desktop而不是Docker部署,Letta Desktop是一个纯本地运行的ADE。然后连接如ollama等本地LLM。
ZcChat是一个开源项目。参与ZcChat项目的方法有很多!
-
为项目做出贡献:有兴趣做出贡献吗?欢迎Pull Request!
(如果能读懂我的垃圾代码的话) - 报告BUG、建议:有BUG或有功能请求?请通过Issues提交它们。
- 分享自己制作的模板:制作了自己的角色?不妨在Discussions分享一下!
- 或者……:给项目来一个star⭐怎么样?
- log159/LogChat: 一个开源免费的AI聊天客户端!
- letta-ai/letta: Letta (formerly MemGPT) is a framework for creating LLM services with memory.
- Artrajz/vits-simple-api: A simple VITS HTTP API, developed by extending Moegoe with additional features.
- ahmetoner/whisper-asr-webservice: OpenAI Whisper ASR Webservice API
- Liniyous/ElaWidgetTools: Fluent-UI For QT-Widget
- nlohmann/json: JSON for Modern C++
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for ZcChat
Similar Open Source Tools

ZcChat
ZcChat is an AI desktop pet suitable for Galgame characters, featuring long-term memory, expressive actions, control over the computer, and voice functions. It utilizes Letta for AI long-term memory, Galgame-style character illustrations for more actions and expressions, and voice interaction with support for various voice synthesis tools like Vits. Users can configure characters, install Letta, set up voice synthesis and input, and control the pet to interact with the computer. The tool enhances visual and auditory experiences for users interested in AI desktop pets.

FisherAI
FisherAI is a Chrome extension designed to improve learning efficiency. It supports automatic summarization, web and video translation, multi-turn dialogue, and various large language models such as gpt/azure/gemini/deepseek/mistral/groq/yi/moonshot. Users can enjoy flexible and powerful AI tools with FisherAI.

AivisSpeech
AivisSpeech is a Japanese text-to-speech software based on the VOICEVOX editor UI. It incorporates the AivisSpeech Engine for generating emotionally rich voices easily. It supports AIVMX format voice synthesis model files and specific model architectures like Style-Bert-VITS2. Users can download AivisSpeech and AivisSpeech Engine for Windows and macOS PCs, with minimum memory requirements specified. The development follows the latest version of VOICEVOX, focusing on minimal modifications, rebranding only where necessary, and avoiding refactoring. The project does not update documentation, maintain test code, or refactor unused features to prevent conflicts with VOICEVOX.

AI-Codereview-Gitlab
AI-Codereview-Gitlab is an automated code review tool based on large models, designed to help development teams conduct intelligent code reviews quickly during code merging or submission. It supports multiple large models including DeepSeek, ZhipuAI, OpenAI, and Ollama. The tool can automatically push review results to DingTalk, WeChat Work, and Feishu, generate daily reports based on GitLab commit records, and provide a visual dashboard to display code review records. The tool works by triggering webhook events on GitLab when users submit code, calling third-party large models to review the code, and recording the review results in corresponding Merge Requests or Commit Notes.

chatgpt-webui
ChatGPT WebUI is a user-friendly web graphical interface for various LLMs like ChatGPT, providing simplified features such as core ChatGPT conversation and document retrieval dialogues. It has been optimized for better RAG retrieval accuracy and supports various search engines. Users can deploy local language models easily and interact with different LLMs like GPT-4, Azure OpenAI, and more. The tool offers powerful functionalities like GPT4 API configuration, system prompt setup for role-playing, and basic conversation features. It also provides a history of conversations, customization options, and a seamless user experience with themes, dark mode, and PWA installation support.

Long-Novel-GPT
Long-Novel-GPT is a long novel generator based on large language models like GPT. It utilizes a hierarchical outline/chapter/text structure to maintain the coherence of long novels. It optimizes API calls cost through context management and continuously improves based on self or user feedback until reaching the set goal. The tool aims to continuously refine and build novel content based on user-provided initial ideas, ultimately generating long novels at the level of human writers.

xlings
Xlings is a developer tool for programming learning, development, and course building. It provides features such as software installation, one-click environment setup, project dependency management, and cross-platform language package management. Additionally, it offers real-time compilation and running, AI code suggestions, tutorial project creation, automatic code checking for practice, and demo examples collection.

wiseflow
Wiseflow is an agile information mining tool that utilizes the thinking and analysis capabilities of large models to accurately extract specific information from various given sources, without the need for manual intervention. The tool focuses on filtering noise from a vast amount of information to reveal valuable insights. It is recommended to use normal language models for information extraction tasks to optimize speed and cost, rather than complex reasoning models. The tool is designed for continuous information gathering based on specified focus points from various sources.

XYBotV2
XYBot V2 is a feature-rich WeChat robot framework that supports various interactive functions and gameplays. It provides AI chat, daily news updates, song requests, weather queries, and gaming functionalities like Gomoku and Warthunder player lookup. The tool is open-source and intended for learning and research purposes only, not for commercial or illegal activities. Users must comply with relevant laws and respect WeChat's copyrights and privacy. The tool's functionalities can be extended through a plugin system, allowing for dynamic loading/unloading of plugins.

ERNIE-SDK
ERNIE SDK repository contains two projects: ERNIE Bot Agent and ERNIE Bot. ERNIE Bot Agent is a large model intelligent agent development framework based on the Wenxin large model orchestration capability introduced by Baidu PaddlePaddle, combined with the rich preset platform functions of the PaddlePaddle Star River community. ERNIE Bot provides developers with convenient interfaces to easily call the Wenxin large model for text creation, general conversation, semantic vectors, and AI drawing basic functions.

MarkMap-OpenAi-ChatGpt
MarkMap-OpenAi-ChatGpt is a Vue.js-based mind map generation tool that allows users to generate mind maps by entering titles or content. The application integrates the markmap-lib and markmap-view libraries, supports visualizing mind maps, and provides functions for zooming and adapting the map to the screen. Users can also export the generated mind map in PNG, SVG, JPEG, and other formats. This project is suitable for quickly organizing ideas, study notes, project planning, etc. By simply entering content, users can get an intuitive mind map that can be continuously expanded, downloaded, and shared.

llm_note
LLM notes repository contains detailed analysis on transformer models, language model compression, inference and deployment, high-performance computing, and system optimization methods. It includes discussions on various algorithms, frameworks, and performance analysis related to large language models and high-performance computing. The repository serves as a comprehensive resource for understanding and optimizing language models and computing systems.

ai-paint-today-BE
AI Paint Today is an API server repository that allows users to record their emotions and daily experiences, and based on that, AI generates a beautiful picture diary of their day. The project includes features such as generating picture diaries from written entries, utilizing DALL-E 2 model for image generation, and deploying on AWS and Cloudflare. The project also follows specific conventions and collaboration strategies for development.

HiveChat
HiveChat is an AI chat application designed for small and medium teams. It supports various models such as DeepSeek, Open AI, Claude, and Gemini. The tool allows easy configuration by one administrator for the entire team to use different AI models. It supports features like email or Feishu login, LaTeX and Markdown rendering, DeepSeek mind map display, image understanding, AI agents, cloud data storage, and integration with multiple large model service providers. Users can engage in conversations by logging in, while administrators can configure AI service providers, manage users, and control account registration. The technology stack includes Next.js, Tailwindcss, Auth.js, PostgreSQL, Drizzle ORM, and Ant Design.

LLM-And-More
LLM-And-More is a one-stop solution for training and applying large models, covering the entire process from data processing to model evaluation, from training to deployment, and from idea to service. In this project, users can easily train models through this project and generate the required product services with one click.
For similar tasks

ZcChat
ZcChat is an AI desktop pet suitable for Galgame characters, featuring long-term memory, expressive actions, control over the computer, and voice functions. It utilizes Letta for AI long-term memory, Galgame-style character illustrations for more actions and expressions, and voice interaction with support for various voice synthesis tools like Vits. Users can configure characters, install Letta, set up voice synthesis and input, and control the pet to interact with the computer. The tool enhances visual and auditory experiences for users interested in AI desktop pets.

fount
fount is a character card frontend page that decouples AI sources, AI characters, user personas, dialogue environments, and AI plugins, allowing them to be freely combined to spark infinite possibilities. It serves as a bridge connecting imagination and reality, a lighthouse guiding characters and stories, and a free garden for AI sources, characters, personas, dialogue environments, and plugins to grow and bloom. It integrates AI sources without the need for reverse proxy servers, improves web experience with features like multi-device synchronization and unfiltered HTML rendering, and extends companionship beyond the web by connecting characters to Discord groups and providing gentle reminders through fount-pwsh. For character creators, fount offers infinite possibilities with JavaScript or TypeScript code customization, execution of code without filtering, loading npm packages, and creating custom HTML pages. It encourages extension through modularization and community contributions.

Facial-Data-Extractor
Facial Data Extractor is a software designed to extract facial data from images using AI, specifically to assist in character customization for Illusion series games. Currently, it only supports AI Shoujo and Honey Select2. Users can open images, select character card templates, extract facial data, and apply it to character cards in the game. The tool provides measurements for various facial features and allows for some customization, although perfect replication of faces may require manual adjustments.
For similar jobs

sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.