
CradleAI
CradleAI Discord Channel-答疑,反馈,建议-Q&A,Issue,Advise
Stars: 181

CradleAI is an open-source front-end tool designed for non-commercial purposes. It allows users to create and manage characters, engage in AI roleplay chats, publish dynamic content in a social circle, participate in group chats, and manage memories and knowledge. The tool supports features like author notes, voice interactions, multimedia messaging, visual novel mode, rich text formatting, image generation, TTS enhancement, and more. Users can deploy the tool using Github Action for APK builds or EAS Build for Android and iOS platforms. The project is licensed under CC BY-NC 4.0, prohibiting commercial use and emphasizing proper attribution.
README:
|
|
|
|
|
|
- QQ 群: (答疑、bug 反馈、意见反馈)
二群:570343684 一群:1017292082
避免键盘弹出收回时的UI抖动闪烁
修复插图类角色卡闪退和Gemini官渠空回复问题。
支持从压缩包导入正则
注意含html的消息都统一放在了视觉小说模式,在常规模式看不到消息是正常的
oii~Cradle现在支持真 视觉小说了,配有编辑器,即使不懂代码也可以做自己的乙游/GAL(文件系统有破坏性更新,注意先备份卡和聊天记录)
Demo:
|
|
|
视觉小说功能使用的图床需要使用魔法,否则立绘/背景会加载失败。 加载一次后会缓存。
- 在Cradle-VN-Editor可以自己设置立绘,背景,变量,世界书(如果自己懒得设置素材可以滴滴我,用我的demo尝鲜。感觉还没做完善,就不拿出来献丑了hhhh)
- 导出项目的zip文件
- 将zip文件导入Cradle
- 在主页会自动创建视觉小说的对话窗口
需要node环境(装过SillyTavern可以忽略) 注意命名:立绘和背景注意标准命名:用角色-状态;时段-背景来做文件名!例如张三-站立;夜晚-客厅
视觉小说功能可以开启自动总结,避免上下文过长掉格式和各种其他问题。
众所周知AI在输出长文上面有各种逻辑混乱和记忆丢失问题;另外AI互动游戏有一个大毛病,就是怎么平衡用户自由度和作者的剧情设计,所以我做了和剧情输出AI共享上下文的体验管理器你可以取消它,因为它会在剧情输出后紧接着再调一次API,生成速度会很慢,你也可以设置它每x轮自动启动。
它的作用是根据当前的上下文,变量状态,和你设定的剧情大纲(如果有)来适时插入作者注释纠偏,
使用方法:在编辑器的提示词中加入${guidanceCurrentScript}
宏,开启体验管理器就行了
- 跑图:可以用midjourney跑图,豆包也可以;如果对画风Lora,姿势有要求,或者就是想还原某个同人角色,建议直接上Alice佬的comfyui工作流,不过我的demo是用mj跑的
- 重点:做立绘的差分直接用nano banana就行。建议给它两张图,一张是你想要的立绘的构图,一张是你跑出来的图,然后多跑几次,直到你对立绘比较满意(背景图同理),让它生成白底立绘。
- 美图秀秀抠成透明底(我还会顺便调下脸型身体比例,因为nano banana出的图有时候人物比例很怪)
关于模型:只推荐gemini 2.5 pro,2.5 flash有严重丢格式/重复问题
总结之后背景会消失,自主行动切换一次场景恢复。 可能出现重复渲染一轮对话的情况。
- 支持手动、自动创建和导入 SillyTavern 角色卡,世界书和预设(PNG/JSON 格式)。
- 角色可自定义头像、背景、动态立绘视频。
- 管理模式下可批量删除、导出角色数据,支持角色图库和图片生成。
- 独立实现了与 SillyTavern 功能等效的世界书、预设和角色信息系统:
- 世界书条目管理:自动注入相关世界设定条目到对话,根据对话关键词智能匹配。
- 预设条目处理:自定义位置,深度,角色的预设条目。
- 角色信息融合:无缝整合角色卡、世界书和预设。
- 记忆管理系统:
- 长对话自动总结。
- 智能记忆检索。
- 支持保存点系统,可随时恢复任意历史对话状态。
- 高级交互功能:
- 作者注释:支持自定义作者注释和插入深度设置。
- 语音交互:通过 TTS 功能让角色说话,带语音增强效果。
- 多媒体消息:在对话中发送和接收图片,支持图像分析和上下文理解。
- 图片管理:内置图片缓存系统,方便保存和管理所有对话图片。
- 视觉小说模式:提供类似视觉小说的沉浸式对话界面。
- 高级消息格式化:
- 富文本支持:处理常见的 HTML 标签和特殊标签,开发者可自定义需要渲染的标签。
- 自定义标签:支持思考标签、状态块、记忆标签等特殊格式:
- 图片嵌入:支持内联图片显示和全屏查看。
- 可视化辅助功能:
- 完整对话历史查看。
- 自动滚动和手动位置保存。
- 角色和用户可发布动态、图片,支持点赞、评论、转发。
- 朋友圈互动支持 AI 自动生成角色评论、点赞及内心想法展示。
- 支持用户自定义动态发布与图片选择。
- 多模态支持:
- 图片分析:AI 可以理解并回应带有图片的帖子内容。
- 智能回复:基于图像内容生成相关评论和反馈。
- 定时发布系统:
- 角色可设置多个时间点自动发布朋友圈。
- 基于当前时间生成适合的动态内容。
- 发布后可通过通知提醒用户。
- 关系系统集成:
- 角色互动会动态影响角色间关系。
- 帖子评论和点赞可提升角色间好感度。
- 支持通过朋友圈互动更新角色关系类型(如朋友、熟人、对手等)。
- 对话流连接:
- 朋友圈互动会自动同步到角色的聊天历史。
- 用户在朋友圈的互动可无缝延续到私聊对话。
- 互动频率控制:
- 可自定义角色朋友圈活跃度(低、中、高)。
- 智能管理角色互动频率,避免过度回复。
- 可持久化存储所有朋友圈内容,支持帖子管理和删除功能。
- 支持创建/加入群聊,与多个角色/用户共同对话。
- 群聊支持自定义背景、成员管理、消息同步。
- 集成向量记忆系统,自动存储和检索对话相关事实。
- 支持表格插件扩展角色知识。
- 记忆面板可查看和管理角色记忆内容。
- 双系统记忆增强:
- 结构化表格记忆:基于模板创建自定义表格,存储角色关系、喜好、事件等结构化信息。
- 向量存储记忆:自动提取对话要点,存储为向量记忆以供后续对话检索。
- 表格记忆管理功能:
- 多种预设模板:角色关系、事件记录、喜好清单等专用模板。
- 自定义编辑:可添加、删除、修改表格行列数据。
- 与对话深度整合:对话信息自动更新相关表格。
- 向量记忆管理功能:
- 检索优化:智能匹配当前对话最相关的历史记忆。
- 记忆编辑:增加、修改、删除具体记忆条目。
- 记忆导出/导入:支持备份和恢复角色记忆数据。
- 记忆优先级控制:自定义记忆处理频率,调节对话中记忆更新和检索的频次。
- 支持文本转语音(TTS)功能,可增强语音互动体验。
- TTS 增强器可自定义语音参数。
- 当前版本需开发者自行配置 TTS 服务端点(后续更新将支持前端自主配置 TTS 端点)。
- 支持多种语音控制功能:
- 语音播放/暂停控制。
- 增强模式音质提升。
- 多角色不同语音模板。
- 自动缓存已生成语音,减少重复请求。
- 支持 NovelAI 图片生成,自动根据对话上下文生成角色场景背景。
- 角色图库侧边栏可管理、收藏、设置图片为头像/背景。
- 高级标签系统:
- 支持智能分类的标签选择器,包含上百种分类标签。
- 支持标签加权与降权。
- 固定标签功能,可锁定重要特征标签。
- 角色标签从预设库中选择,确保精确的角色识别。
- 艺术家风格参考,可选择不同艺术家样式。
- 自定义提示词输入,支持复杂描述。
- "Roll" 功能可随机生成标签组合获得灵感。
- 高级生成设置:
- 多种图像尺寸预设:肖像、横向、方形、大型。
- 可自定义步数、采样器、噪声调度等专业参数。
- 支持 Seed 值管理,可使用相同 Seed 重现图像。
- 多角色场景控制(NovelAI):
- 位置控制系统,可精确定位多个角色在图像中的位置。
- 支持为场景中的每个角色设置独立提示词。
- 多角色互动姿势和表情控制。
- 生成结果处理:
- 一键设置为角色头像或背景图。
- 自动保存到角色图库。
- 可查看和复制生成参数,方便再次使用相同配置。
- 支持 API 设置、模型预算、存储管理、社区入口等扩展功能。
- 支持多种主题与界面自定义。
使用`main.yml`文件作为Github Action的工作流文件,进行构建即可,包含x86,x86_64,arm64的apk
-
Android 平台打包 在项目根目录下执行以下命令,通过 EAS 构建 Android 安装包(APK 或 AAB):
eas build --platform android
构建完成后,可在 Expo 控制台下载生成的安装包进行分发或安装。
-
iOS 平台打包 亦可通过如下命令为 iOS 平台生成 IPA 包:
eas build --platform ios
但目前尚未在苹果设备真机环境下充分测试,建议谨慎用于生产环境。
注意:请确保已正确配置 Expo 账户、EAS CLI 及相关平台证书。详细配置与分发流程请参考 Expo 官方文档。
本项目基于 CC BY-NC 4.0 许可开源,需遵守以下规则:
- 署名 — 您必须给出适当的署名,提供指向本协议的链接,并指明是否(对原作)作了修改。您可以以任何合理方式进行,但不得以任何方式暗示许可方认可您或您的使用。
- 非商业性使用 — 您不得将本作品用于商业目的,包括但不限于任何形式的商业倒卖、SaaS、API 付费接口、二次销售、打包出售、收费分发或其他直接或间接盈利行为。
- 本项目是一个非商业的开源前端工具,任何人可自行上传代码在 EXPO 平台打包和安装,或进行原生的 apk 构建。
- 现在和将来都不会提供任何角色卡上传或分享的功能或平台,如角色卡作者声明了不可以在 SillyTavern 之外的软件上导入角色卡,请勿在项目中导入角色卡。
- 禁止任何形式的商业倒卖、SaaS、API 付费接口、二次销售、打包出售、收费分发或其他直接或间接盈利行为。
- 禁止将本项目打包上传任何应用商店。
- 表格记忆:
https://github.com/muyoou/st-memory-enhancement
,在此感谢作者的代码分享,请大家多多支持原作者。 - mem0:
https://github.com/mem0ai/mem0
基于 Apache-2.0 license 许可。 - OpenAI-adapter:感谢 hajimi 作者提供测试渠道,项目地址:
https://github.com/wyeeeee/hajimi
。
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for CradleAI
Similar Open Source Tools

CradleAI
CradleAI is an open-source front-end tool designed for non-commercial purposes. It allows users to create and manage characters, engage in AI roleplay chats, publish dynamic content in a social circle, participate in group chats, and manage memories and knowledge. The tool supports features like author notes, voice interactions, multimedia messaging, visual novel mode, rich text formatting, image generation, TTS enhancement, and more. Users can deploy the tool using Github Action for APK builds or EAS Build for Android and iOS platforms. The project is licensed under CC BY-NC 4.0, prohibiting commercial use and emphasizing proper attribution.

vpnfast.github.io
VPNFast is a lightweight and fast VPN service provider that offers secure and private internet access. With VPNFast, users can protect their online privacy, bypass geo-restrictions, and secure their internet connection from hackers and snoopers. The service provides high-speed servers in multiple locations worldwide, ensuring a reliable and seamless VPN experience for users. VPNFast is easy to use, with a user-friendly interface and simple setup process. Whether you're browsing the web, streaming content, or accessing sensitive information, VPNFast helps you stay safe and anonymous online.

LabelQuick
LabelQuick_V2.0 is a fast image annotation tool designed and developed by the AI Horizon team. This version has been optimized and improved based on the previous version. It provides an intuitive interface and powerful annotation and segmentation functions to efficiently complete dataset annotation work. The tool supports video object tracking annotation, quick annotation by clicking, and various video operations. It introduces the SAM2 model for accurate and efficient object detection in video frames, reducing manual intervention and improving annotation quality. The tool is designed for Windows systems and requires a minimum of 6GB of memory.

ChatGPT-airport-tizi-fanqiang
This repository provides a curated list of recommended airport proxies for accessing ChatGPT and other AI tools while bypassing internet restrictions. The proxies are tested and verified to ensure reliability and stability. The readme includes detailed instructions on how to set up and use the proxies with various devices and platforms. Additionally, the repository offers advanced tutorials on upgrading to GPT-4/Plus, deploying a 24/7 ChatGPT微信机器人 server, and using Claude-3 securely and for free.

douyin-chatgpt-bot
Douyin ChatGPT Bot is an AI-driven system for automatic replies on Douyin, including comment and private message replies. It offers features such as comment filtering, customizable robot responses, and automated account management. The system aims to enhance user engagement and brand image on the Douyin platform, providing a seamless experience for managing interactions with followers and potential customers.

AI-Drug-Discovery-Design
AI-Drug-Discovery-Design is a repository focused on Artificial Intelligence-assisted Drug Discovery and Design. It explores the use of AI technology to accelerate and optimize the drug development process. The advantages of AI in drug design include speeding up research cycles, improving accuracy through data-driven models, reducing costs by minimizing experimental redundancies, and enabling personalized drug design for specific patients or disease characteristics.

MaiMBot
MaiMBot is an intelligent QQ group chat bot based on a large language model. It is developed using the nonebot2 framework, utilizes LLM for conversation abilities, MongoDB for data persistence, and NapCat for QQ protocol support. The bot features keyword-triggered proactive responses, dynamic prompt construction, support for images and message forwarding, typo generation, multiple replies, emotion-based emoji responses, daily schedule generation, user relationship management, knowledge base, and group impressions. Work-in-progress features include personality, group atmosphere, image handling, humor, meme functions, and Minecraft interactions. The tool is in active development with plans for GIF compatibility, mini-program link parsing, bug fixes, documentation improvements, and logic enhancements for emoji sending.

Snap-Solver
Snap-Solver is a revolutionary AI tool for online exam solving, designed for students, test-takers, and self-learners. With just a keystroke, it automatically captures any question on the screen, analyzes it using AI, and provides detailed answers. Whether it's complex math formulas, physics problems, coding issues, or challenges from other disciplines, Snap-Solver offers clear, accurate, and structured solutions to help you better understand and master the subject matter.

DocTranslator
DocTranslator is a document translation tool that supports various file formats, compatible with OpenAI format API, and offers batch operations and multi-threading support. Whether for individual users or enterprise teams, DocTranslator helps efficiently complete document translation tasks. It supports formats like txt, markdown, word, csv, excel, pdf (non-scanned), and ppt for AI translation. The tool is deployed using Docker for easy setup and usage.

gez
Gez is a high-performance micro frontend framework based on ESM. It uses Rspack compilation and maps modules to URLs with strong caching and content-based hashing. Gez embraces modern micro frontend architecture by leveraging ESM and importmap for dependency management, providing reliable isolation with module scope, seamless integration with any modern frontend framework, intuitive development experience, and optimal performance with zero runtime overhead and reliable caching strategies.

InterPilot
InterPilot is an AI-based assistant tool that captures audio from Windows input/output devices, transcribes it into text, and then calls the Large Language Model (LLM) API to provide answers. The project includes recording, transcription, and AI response modules, aiming to provide support for personal legitimate learning, work, and research. It may assist in scenarios like interviews, meetings, and learning, but it is strictly for learning and communication purposes only. The tool can hide its interface using third-party tools to prevent screen recording or screen sharing, but it does not have this feature built-in. Users bear the risk of using third-party tools independently.

Daily-DeepLearning
Daily-DeepLearning is a repository that covers various computer science topics such as data structures, operating systems, computer networks, Python programming, data science packages like numpy, pandas, matplotlib, machine learning theories, deep learning theories, NLP concepts, machine learning practical applications, deep learning practical applications, and big data technologies like Hadoop and Hive. It also includes coding exercises related to '剑指offer'. The repository provides detailed explanations and examples for each topic, making it a comprehensive resource for learning and practicing different aspects of computer science and data-related fields.

chatwiki
ChatWiki is an open-source knowledge base AI question-answering system. It is built on large language models (LLM) and retrieval-augmented generation (RAG) technologies, providing out-of-the-box data processing, model invocation capabilities, and helping enterprises quickly build their own knowledge base AI question-answering systems. It offers exclusive AI question-answering system, easy integration of models, data preprocessing, simple user interface design, and adaptability to different business scenarios.

get_jobs
Get Jobs is a tool designed to help users find and apply for job positions on various recruitment platforms in China. It features AI job matching, automatic cover letter generation, multi-platform job application, automated filtering of inactive HR and headhunter positions, real-time WeChat message notifications, blacklisted company updates, driver adaptation for Win11, centralized configuration, long-lasting cookie login, XPathHelper plugin, global logging, and more. The tool supports platforms like Boss直聘, 猎聘, 拉勾, 51job, and 智联招聘. Users can configure the tool for customized job searches and applications.

py-xiaozhi
py-xiaozhi is a Python-based XiaoZhi voice client designed for learning through code and experiencing AI XiaoZhi's voice functions without hardware conditions. The repository is based on the xiaozhi-esp32 port. It supports AI voice interaction, visual multimodal capabilities, IoT device integration, online music playback, voice wake-up, automatic conversation mode, graphical user interface, command-line mode, cross-platform support, volume control, session management, encrypted audio transmission, automatic captcha handling, automatic MAC address retrieval, code modularization, and stability optimization.

TypeTale
TypeTale is an AIGC creation software designed specifically for content creators, primarily used for novel promotion. It offers a wide range of AI capabilities such as image, video, and audio generation, as well as text processing and story extraction. The tool also provides workflow customization, AI assistant support, and a vast library of creative materials. With a user-friendly interface and system requirements compatible with Windows operating systems, TypeTale aims to streamline the content creation process for writers and creators.
For similar tasks

SillyTavern
SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters you or the community create. SillyTavern is a fork of TavernAI 1.2.8 which is under more active development and has added many major features. At this point, they can be thought of as completely independent programs.

agnai
Agnaistic is an AI roleplay chat tool that allows users to interact with personalized characters using their favorite AI services. It supports multiple AI services, persona schema formats, and features such as group conversations, user authentication, and memory/lore books. Agnaistic can be self-hosted or run using Docker, and it provides a range of customization options through its settings.json file. The tool is designed to be user-friendly and accessible, making it suitable for both casual users and developers.

ragdoll-studio
Ragdoll Studio is a platform offering web apps and libraries for interacting with Ragdoll, enabling users to go beyond fine-tuning and create flawless creative deliverables, rich multimedia, and engaging experiences. It provides various modes such as Story Mode for creating and chatting with characters, Vector Mode for producing vector art, Raster Mode for producing raster art, Video Mode for producing videos, Audio Mode for producing audio, and 3D Mode for producing 3D objects. Users can export their content in various formats and share their creations on the community site. The platform consists of a Ragdoll API and a front-end React application for seamless usage.

LLMUnity
LLM for Unity enables seamless integration of Large Language Models (LLMs) within the Unity engine, allowing users to create intelligent characters for immersive player interactions. The tool supports major LLM models, runs locally without internet access, offers fast inference on CPU and GPU, and is easy to set up with a single line of code. It is free for both personal and commercial use, tested on Unity 2021 LTS, 2022 LTS, and 2023. Users can build multiple AI characters efficiently, use remote servers for processing, and customize model settings for text generation.

character-factory
Character Factory is a Python script designed to generate detailed character cards for SillyTavern, TavernAI, TextGenerationWebUI, and more using Large Language Model (LLM) and Stable Diffusion. It streamlines character generation by leveraging deep learning models to create names, summaries, personalities, greeting messages, and avatars for characters. The tool provides an easy way to create unique and imaginative characters for storytelling, chatting, and other purposes.

ai-anime-art-generator
AI Anime Art Generator is an AI-driven cutting-edge tool for anime arts creation. Perfect for beginners to easily create stunning anime art without any prior experience. It allows users to create detailed character designs, custom avatars for social media, and explore new artistic styles and ideas. Built on Next.js, TailwindCSS, Google Analytics, Vercel, Replicate, CloudFlare R2, and Clerk.

TavernAI
TavernAI is an atmospheric frontend tool for chat and storywriting, compatible with various backends. It offers features like character creation, online character database, group chat, story mode, world info, message swiping, configurable settings, interface themes, backgrounds, message editing, GPT-4.5, and Claude picture recognition. The tool supports backends like Kobold series, Oobabooga's Text Generation Web UI, OpenAI, NovelAI, and Claude. Users can easily install TavernAI on different operating systems and start using it for interactive storytelling and chat experiences.

Character-Engine-Discord
Character Engine is a Discord bot that aggregates various online platforms to create AI-driven characters using Discord Webhooks and LLM chatbots. It allows users to bring life and joy to their server by spawning characters, exploring embedded characters, and configuring settings on a per-server, per-channel, and per-character basis.
For similar jobs

weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.