
AiNiee
A tool focused on AI translation that can automatically translate RPG and SLG games, Epub/Word/TXT novels, Srt/Vtt/Lrc subtitles, and more in one click.
Stars: 1594

AiNiee is a tool focused on AI translation, capable of automatically translating RPG SLG games, Epub TXT novels, Srt Lrc subtitles, and more. It provides features for configuring AI platforms, proxies, and translation settings. Users can utilize this tool for translating game scripts, novels, and subtitles efficiently. The tool supports multiple AI platforms and offers tutorials for beginners. It also includes functionalities for extracting and translating game text, with options for customizing translation projects and managing translation tasks effectively.
README:
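AiNiee is a tool focused on AI translation. It can automatically translate long, complex texts such as games, novels, and subtitles in one click.
- Multiple formats: json/xlsx export files, Epub/TXT novels, Srt/Vtt/Lrc subtitles, Word documents, and more.
- Multiple platforms: supports mainstream AI platforms inside and outside China, with quick access to the OpenAI, Google, Anthropic, and DeepSeek model families.
- Multiple languages: translates between many languages, such as Chinese, English, Japanese, Korean, and Russian.
- Flexible configuration: customizable request format, platform, model, lines per request, thread count, and more.
- Efficient translation: batch translation of multiple files, multi-threaded translation, multi-key rotation, and mixed-platform translation.
- Translation optimization: tuned for long, complex texts, with an AI glossary, CoT translation, dynamic few-shot, prompt-book authoring, context linking, adaptive text processing, and reply checking.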
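- Download the application from the release page, unzip it, and run it.
- Prepare the source text files you want to translate and put them in the input folder (by default, the input folder inside the application directory).
  - Most novel, subtitle, and document files can be translated directly.
  - Game text must first be extracted with a third-party tool such as MTool (see the tutorial).
- Choose one of the following ways to translate:
  - Online API: paid but very cost-effective, no GPU required, all languages supported. API setup guide: DeepSeek.
  - Local API: free, requires an Nvidia GPU with 8 GB or more of VRAM, Japanese to Chinese only. API setup guide: SakuraLLM.
  - In the vast majority of cases, these two options are the best fit for your translation needs. Unless you have special requirements, there is no need to tinker with other APIs.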
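- 📖 Text extraction tools

  | Tool | Description |
  | --- | --- |
  | Mtool | Easy to pick up; recommended for newcomers |
  | Translator++ | Moderate learning curve, powerful; recommended for experienced users |
  | SExtractor | Steep learning curve, powerful; recommended for experienced users |

- 🤖 AI platforms

  | Platform | Models | Free tier | Price | Restrictions |
  | --- | --- | --- | --- | --- |
  | OpenAI | ChatGPT series | No free quota at present | Expensive | General purpose |
  | Google | Gemini series | Free accounts can use it for free; slow | Expensive | General purpose |
  | Cohere | Command series | Free accounts can use it for free; average speed | Moderate | General purpose |
  | Anthropic | Claude series | Free accounts with a linked card get a small free quota; slow | Expensive | General purpose |
  | Moonshot AI (月之暗面) | Moonshot series | Small free quota on sign-up | Moderate | General purpose |
  | 01.AI (零一万物) | Yi series | Small free quota on sign-up | Moderate | Safety restrictions |
  | Zhipu (智谱) | GLM series | Small free quota on sign-up | Moderate | Safety restrictions |
  | Deepseek | Deepseek series | Small free quota on sign-up; very fast | Cheap | General purpose |
  | Dashscope | Qwen (千问) series | Large free quota on sign-up | Cheap | Safety restrictions |
  | Volcengine | Doubao (豆包) series | Large free quota on sign-up; very fast | Cheap | General purpose |
  | SakuraLLM | Sakura series | Local model, requires a GPU (click to view the one-click package) | Free | General purpose |

- 📺 Video tutorials

  | Video | Notes |
  | --- | --- |
  | Mtool tutorial | Recommended for first-time users |
  | Translator++ tutorial | Recommended for first-time users |

- 📖 Character extraction tool: KeywordGacha, a translation aid that uses AI to automatically generate a glossary of entity terms
- 📡 Download: AiNiee download page
- 🟪 Proxy tool: we strongly recommend a reliable, stable proxy tool; otherwise the API will report Connection error or never respond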
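- OpenAI official configuration example:
  - Account type: newly registered accounts are free accounts with various limits, and a single account is not fast.
  - Model selection: learn the differences between the models before changing this.
  - API KEY: enter the api_key generated by your OpenAI account.
- Proxy platform configuration example:
  - Request address: enter the request address provided by the domestic proxy platform, for example https://api.XXXXX.com, with no trailing /.
  - Auto-complete: automatically appends "v1" to the request address entered above.
  - Request format: choose according to the request formats the relay supports; this is usually the openai format.
  - Model selection: pick from the drop-down list or type a model name yourself.
  - API KEY: enter the API KEY that the domestic proxy platform generated for you.
  - Max text per request: limits the size of the text sent in each request, measured in tokens.
  - Requests per minute: RPM, the number of translation tasks sent to the model API each minute.
  - Tokens per minute: TPM, the total number of tokens (roughly comparable to a character count) sent to the model API each minute.
- SakuraLLM configuration:
  - For model deployment and application settings, see SakuraLLMServer, a one-click way to get free, high-quality Japanese translation.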
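The request-address rules above (no trailing /, optional "v1" completion) can be sketched in a few lines. This is only an illustration: `build_base_url` is a hypothetical helper, not AiNiee's own code, and it assumes that auto-complete means appending "/v1" when the address does not already end with it.

```python
def build_base_url(address: str, auto_complete: bool = True) -> str:
    """Normalize a relay request address (hypothetical helper)."""
    # Drop surrounding whitespace and any trailing slash the user typed.
    url = address.strip().rstrip("/")
    # Assumption: auto-complete appends "/v1" unless it is already present.
    if auto_complete and not url.endswith("/v1"):
        url += "/v1"
    return url


print(build_base_url("https://api.XXXXX.com"))
```

Stripping the trailing slash first means an address typed with a stray / still produces a single, well-formed path.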
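- Configuration example:
  - Project type: the kind of source file you want to translate.
  - API name: the platform you want to use for translation.
  - Source language: choose the source language that matches the game you are translating.
  - Target language: the language you want to translate into.
  - Input folder: select the source files to translate. Keep them in a clean folder with no other files, because every relevant file in the folder is read, including files in subfolders.
  - Output folder: select the folder in which to store the translated files. Do not use the same path as the input folder.
  - Line-count split mode: the number of text lines sent in each request. A larger value gives better overall quality and smoother context, but each reply is slower and more likely to contain errors. Set it according to the model type.
  - Token-count split mode: the number of text tokens sent in each request. The overall effect is similar to line-count mode, but the request size can be controlled more precisely, which improves efficiency.
  - Concurrent translation tasks: set this according to the platform's rate limits. More threads saturate the rate limit more easily and translate faster. Surplus threads do not affect the translation but do increase the load on your computer.
  - Preceding context lines: carrying preceding context is not recommended for weak models, and carrying too many lines is not recommended either.
  - Maximum translation rounds: text that could not be translated in the previous round is split and sent to the next round, and so on. This setting caps the number of split-and-retry rounds.
  - Chain-of-thought translation: works together with the prompt-book feature so that the AI actively reasons about the provided context, characters, background, and so on. Consumption doubles, so it is recommended for high-performance models.
  - Chinese prompts: switches the prompt to an all-Chinese structure; some large models perform better with Chinese prompts.
  - Preserve in-sentence line breaks: replaces line breaks with special characters before translating. The AI may still drop symbols, so preservation is not guaranteed.
  - Preserve leading and trailing non-text characters: mainly for text exported from T++, which contains a lot of code. Leading and trailing placeholder code is cut off, the text is translated, and the code is then restored.
  - Chinese glyph conversion: converts the translated Chinese to Simplified, Traditional, Hong Kong variants, and so on. For the configuration files, see https://github.com/BYVoid/OpenCC
  - First-round platform: text is first translated with the configured line count. If the number of erroneous replies reaches the limit, the text moves on to the next round.
  - Second-round platform: text that failed earlier is split and translated again, with the line count recalculated automatically and the platform switched. If unset, the previous round's platform is reused.
  - Final-round platform: all later rounds use the platform set here. If unset, the previous round's platform is reused.
  - Do not split when changing rounds: text is not split when the round changes, and translation continues with the configured line count.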
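The multi-round flow described above (translate with the configured line count, then split whatever failed and retry, up to a maximum number of rounds) can be sketched as follows. This is a minimal illustration under stated assumptions, not AiNiee's implementation: `translate_in_rounds` and `fake_model` are hypothetical names, and halving a failed batch is an assumed stand-in for the tool's own line-count recalculation.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical signature: one translation per input line, or None on failure.
TranslateFn = Callable[[List[str]], Optional[List[str]]]


def translate_in_rounds(
    lines: List[str],
    translate_batch: TranslateFn,
    lines_per_request: int,
    max_rounds: int,
) -> Dict[int, str]:
    """Translate in batches; failed batches are split for the next round."""
    results: Dict[int, str] = {}
    # Round 1 uses the configured line count.
    pending = [
        list(range(start, min(start + lines_per_request, len(lines))))
        for start in range(0, len(lines), lines_per_request)
    ]
    for _ in range(max_rounds):
        if not pending:
            break
        failed: List[List[int]] = []
        for indexes in pending:
            reply = translate_batch([lines[i] for i in indexes])
            if reply is not None and len(reply) == len(indexes):
                results.update(zip(indexes, reply))
            elif len(indexes) > 1:
                mid = len(indexes) // 2  # assumption: halve the failed batch
                failed += [indexes[:mid], indexes[mid:]]
            else:
                failed.append(indexes)  # a single line cannot be split further
        pending = failed
    return results


# Stand-in "model" that only copes with batches of at most two lines.
def fake_model(batch: List[str]) -> Optional[List[str]]:
    return [s.upper() for s in batch] if len(batch) <= 2 else None


done = translate_in_rounds(["a", "b", "c", "d", "e"], fake_model, 4, 3)
print(sorted(done.items()))
```

Lines that are still untranslated when the round limit is reached are simply absent from the result, which mirrors the advice elsewhere in this README to look for empty rows and translate them by hand.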
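- 1. 🖱️ Open 🔴Translator++, choose "start a new project", and pick the game engine that matches your game's icon.
- 2. Select your game file and create a new project. The software unpacks and imports the game data automatically.
  When a dialog asks "Do you also want to load JavaScript files", choose Cancel, because editing text loaded from scripts is error-prone.
- 3. 🖱️ Click the "Options" button, choose "Preferences", then "UI Language", and select Simplified Chinese to make the later steps easier to follow.
- 4. Click Export Project in the upper-left corner, choose XML as the export format, and export to a folder of your choice. This generates a data folder.
  When a dialog asks how to handle tagged columns, click red and choose "Do not process row with selected tag", or export without setting anything, because the tool currently has a bug and cannot filter tagged content.
- 5. On the Translation Settings page, set Translation Project to 🔴T++ export file and configure the translation settings. Configuration example:
  - Project folder: select the data folder exported earlier by 🔴Translator++.
  - Output folder: select the folder in which to store the translated project folder.
- 6. 🖱️ Go to the Start Translation page, click the Start Translation button, and wait for the progress to reach 100%. The translated data folder is generated in the output folder.
  1. Return to 🔴Translator++, click Import Project, choose Import translations from spreadsheet, click "Import Folder", select the data folder inside the output folder, and click Import.
  2. 🖱️ Right-click the left-hand area, hover over "全部选择" (Select All), choose "Create Automation", choose "对每行" (For each row), then copy, paste, and run the code below.
- 7. Fix the red-tagged content. This content must not be translated, or the game scripts may break.
  if (this.tags) { if (this.tags.includes("red")) this.cells[1]=this.cells[0]; }
  3. Check the file list on the left for any file that has not reached 100%, find the empty rows, and translate them yourself.
- 8. Finally, choose Export Project, then Export to folder, and specify the parent folder of the data folder in your game directory. The original files will be overwritten, so back up the original game first.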
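- 0. Tool details and features: see the original tool author's page.
- 1. Run the extraction on the Extraction page. It currently works only for RPG Maker MV/MZ games and extracts the game's source text and character names.
  - Translate note-type text: when translating ACT games, try turning this option off; otherwise attacks will very likely fail or have no effect.
  - Game folder: the game root directory.
  - Source text folder: where the extracted source text is stored.
  - Project folder: where this game's project data is stored. It is needed again later for injection.
- 2. On the Translation Settings page, set Translation Project to 🔵T++ export file and configure the translation settings.
- 3. Inject the translation back into the game:
  - Game folder: the game root directory.
  - Translation folder: the source files that have been translated.
  - Project folder: where this game's project data was stored earlier.
  - Output folder: where the result is stored after the translation is injected.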
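- 0. Tool details: official website. Paratranz is a site dedicated to amateur translation work. The AiNiee integration is mainly used to machine-translate the text in advance so that it can be proofread afterwards.
- 1. On the project's File Management page, run Download Raw Data for the source text you want to translate, then copy the downloaded data into the input folder set in Translation Settings.
- 2. On the Translation Settings page, set Translation Project to 🔵Paratranz export file and configure the translation settings.
- 3. 🖱️ Go to the Start Translation page, click the Start Translation button, and watch the console log or the progress bar. Wait for the progress to reach 100%; the translated files are generated automatically in the output folder.
- 4. Return to Paratranz and, still on the File Management page, choose Import Translation and select the translated json file to import.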
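- Multi-key rotation
  To spread usage across several keys and speed up translation in proportion to the number of keys, use keys from accounts of the same type. Separate the keys with English (half-width) commas and do not insert line breaks. For example: key1,key2,key3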
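As a rough sketch of what rotating over a comma-separated key list can look like, the snippet below parses the key1,key2,key3 format and hands out keys in round-robin order. `parse_keys` is a hypothetical helper and round-robin is an assumed rotation policy; the README does not describe how AiNiee schedules keys internally.

```python
from itertools import cycle
from typing import List


def parse_keys(raw: str) -> List[str]:
    """Split a comma-separated key string, ignoring blanks and stray spaces."""
    return [key.strip() for key in raw.split(",") if key.strip()]


keys = parse_keys("key1,key2,key3")
rotation = cycle(keys)  # endless round-robin iterator over the keys

# Each request would take the next key in turn.
picked = [next(rotation) for _ in range(5)]
print(picked)  # ['key1', 'key2', 'key3', 'key1', 'key2']
```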
- Batch file translation
  Put all files of the same type in the input folder. Nested folder structures are also supported.
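To illustrate what reading one file type from a nested input folder involves, here is a small sketch using pathlib. `collect_files` is a hypothetical helper written for this example, not a function from AiNiee.

```python
from pathlib import Path
from typing import List


def collect_files(input_dir: str, suffix: str) -> List[Path]:
    """Return every file under input_dir (recursively) with the given suffix."""
    root = Path(input_dir)
    return sorted(
        path
        for path in root.rglob("*")  # rglob walks subfolders as well
        if path.is_file() and path.suffix.lower() == suffix.lower()
    )


# Example call; "input" is the default input folder named in this README.
for path in collect_files("input", ".txt"):
    print(path)
```

Filtering on a single suffix reflects the advice to keep only files of the same type in the input folder.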
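- Configuration migration
  All configuration is stored in config.json in the resource folder. After downloading a new version, copy that file into the new version's resource folder.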
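The copy described above can also be scripted. The sketch below assumes both versions are unpacked side by side; `migrate_config` and the folder names in the example call are hypothetical, while the resource/config.json location comes from the text above.

```python
import shutil
from pathlib import Path


def migrate_config(old_app_dir: str, new_app_dir: str) -> Path:
    """Copy resource/config.json from the old install into the new one."""
    source = Path(old_app_dir) / "resource" / "config.json"
    target_dir = Path(new_app_dir) / "resource"
    target_dir.mkdir(parents=True, exist_ok=True)
    # copy2 also preserves timestamps; it overwrites an existing config.json.
    return Path(shutil.copy2(source, target_dir / "config.json"))


# Example call with hypothetical folder names:
# migrate_config("AiNiee-old", "AiNiee-new")
```

Because the copy overwrites the new version's config.json, back that file up first if you want to keep the new defaults.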
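- Custom models
  On the proxy platform page, choose the appropriate request format and type the model name directly into the Model selection drop-down box. This lets you combine request format and model freely. To call a new model through an official API, edit the model information files in the Resource/platform folder yourself.
- Cache file
  If translation runs into a problem, you can later switch Translation Project to Cache file and select the folder containing the cache file as the input folder to resume translating. When resuming an Epub novel, you also need to put the original file in the same folder as the cache file.
- Export translated files
  Exports both the content that has been translated and the content that has not. Mtool and Paratranz projects are split into two files with different suffixes. T++ projects stay in a single file, where translated text has content in the column to its right and untranslated text does not. All other project types are written mixed into a single file.
- Instruction dictionary
  Unifies how terms are translated, so that character names, item names, monster names, and special terms come out the way you want. The note field is optional.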
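One plausible way such a dictionary feeds into a request is to pass along only the entries whose source term appears in the current batch. The sketch below illustrates that idea only; the entry fields (src, dst, note), the output layout, and the `glossary_prompt` helper are all assumptions, not AiNiee's actual data format or prompt.

```python
from typing import Dict, List


def glossary_prompt(entries: List[Dict[str, str]], batch: List[str]) -> str:
    """Format the glossary entries that actually occur in this batch."""
    text = "\n".join(batch)
    rows = []
    for entry in entries:
        if entry["src"] in text:
            note = entry.get("note", "")  # the note is optional
            suffix = f" ({note})" if note else ""
            rows.append(f'{entry["src"]} -> {entry["dst"]}{suffix}')
    return "\n".join(rows)


entries = [
    {"src": "勇者", "dst": "Hero", "note": "protagonist"},
    {"src": "魔王", "dst": "Demon King"},
]
print(glossary_prompt(entries, ["勇者は村を出た。"]))  # 勇者 -> Hero (protagonist)
```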
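- Prompt optimization
  Improves the accuracy and fluency of the translation. Filling in each section carefully and pairing it with a high-performance model raises translation quality.
- Debug mode
  Create a debug.txt file in the software's root directory to enable it.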
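- [How to report a problem you run into] Take a complete screenshot of the CMD window (the black console box), which contains the program's run log, plus a screenshot of the software's settings. Then describe the problem clearly and post it with the screenshots in the group or as an issue. If narrowing the problem down requires the source text or the translated text, compress and upload it.
- [Translation seems "stuck"] If the run log shows no error message, please wait patiently.
- [After importing the translated text into mtool, each source line is followed by a translated line, or source and translation are mixed, or everything is still source text] Update mtool to the latest version, or report the problem to the mtool author.
- [Translated text does not import completely into T++, and some files never reach 100%] This happens mostly with games that are not RPG Maker MV/MZ. Using the latest sponsor version of T++ mitigates it, or you can open the spreadsheet and copy and paste the text in by hand.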
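This AI translation tool is for personal, lawful use only. Any use of the tool for direct or indirect illegal profit-making falls outside the scope of authorization and is neither supported nor endorsed.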
- Community groups: QQ group (primary; answer: github): 8216248九零, backup TG group: https://t.me/+JVHbDSGo8SI2Njhl
Alternative AI tools for AiNiee
Similar Open Source Tools

generative-ai-use-cases-jp
Generative AI (生成 AI) brings revolutionary potential to transform businesses. This repository demonstrates business use cases leveraging Generative AI.

ddddocr
ddddocr is a Rust version of a simple OCR API server that provides easy deployment for captcha recognition without relying on the OpenCV library. It offers a user-friendly general-purpose captcha recognition Rust library. The tool supports recognizing various types of captchas, including single-line text, transparent black PNG images, target detection, and slider matching algorithms. Users can also import custom OCR training models and utilize the OCR API server for flexible OCR result control and range limitation. The tool is cross-platform and can be easily deployed.

Chenyme-AAVT
Chenyme-AAVT is a user-friendly tool that provides automatic video and audio recognition and translation. It leverages the capabilities of Whisper, a powerful speech recognition model, to accurately identify speech in videos and audios. The recognized speech is then translated using ChatGPT or KIMI, ensuring high-quality translations. With Chenyme-AAVT, you can quickly generate subtitle files and merge them with the original video, making video translation a breeze. The tool supports various languages, allowing you to translate videos and audios into your desired language. Additionally, Chenyme-AAVT offers features such as VAD (Voice Activity Detection) to enhance recognition accuracy, GPU acceleration for faster processing, and support for multiple subtitle formats. Whether you're a content creator, translator, or anyone looking to make video translation more efficient, Chenyme-AAVT is an invaluable tool.

Streamer-Sales
Streamer-Sales is a large model for live streamers that can explain products based on their characteristics and inspire users to make purchases. It is designed to enhance sales efficiency and user experience, whether for online live sales or offline store promotions. The model can deeply understand product features and create tailored explanations in vivid and precise language, sparking user's desire to purchase. It aims to revolutionize the shopping experience by providing detailed and unique product descriptions to engage users effectively.

focusany
FocusAny is a desktop toolbar system that supports one-click startup of market plugins and local plugins, quickly expands functionality, and improves work efficiency. It features customizable keyboard shortcuts, plugin management, command management, quick file launching, global shortcut launching, data center for file synchronization, support for dark mode, and various plugins available in the market. The tool is built using Electron, Vue3, and TypeScript.

PureChat
PureChat is a chat application integrated with ChatGPT, featuring efficient application building with Vite5, screenshot generation and copy support for chat records, IM instant messaging SDK for sessions, automatic light and dark mode switching based on system theme, Markdown rendering, code highlighting, and link recognition support, seamless social experience with GitHub quick login, integration of large language models like ChatGPT Ollama for streaming output, preset prompts, and context, Electron desktop app versions for macOS and Windows, ongoing development of more features. Environment setup requires Node.js 18.20+. Clone code with 'git clone https://github.com/Hyk260/PureChat.git', install dependencies with 'pnpm install', start project with 'pnpm dev', and build with 'pnpm build'.

bce-qianfan-sdk
The Qianfan SDK provides best practices for large model toolchains, allowing AI workflows and AI-native applications to access the Qianfan large model platform elegantly and conveniently. The core capabilities of the SDK include three parts: large model reasoning, large model training, and general and extension: * `Large model reasoning`: Implements interface encapsulation for reasoning of Yuyan (ERNIE-Bot) series, open source large models, etc., supporting dialogue, completion, Embedding, etc. * `Large model training`: Based on platform capabilities, it supports end-to-end large model training process, including training data, fine-tuning/pre-training, and model services. * `General and extension`: General capabilities include common AI development tools such as Prompt/Debug/Client. The extension capability is based on the characteristics of Qianfan to adapt to common middleware frameworks.

build_MiniLLM_from_scratch
This repository aims to build a low-parameter LLM model through pretraining, fine-tuning, model rewarding, and reinforcement learning stages to create a chat model capable of simple conversation tasks. It features using the bert4torch training framework, seamless integration with transformers package for inference, optimized file reading during training to reduce memory usage, providing complete training logs for reproducibility, and the ability to customize robot attributes. The chat model supports multi-turn conversations. The trained model currently only supports basic chat functionality due to limitations in corpus size, model scale, SFT corpus size, and quality.

HivisionIDPhotos
HivisionIDPhoto is a practical algorithm for intelligent ID photo creation. It utilizes a comprehensive model workflow to recognize, cut out, and generate ID photos for various user photo scenarios. The tool offers lightweight cutting, standard ID photo generation based on different size specifications, six-inch layout photo generation, beauty enhancement (waiting), and intelligent outfit swapping (waiting). It aims to solve emergency ID photo creation issues.

ipex-llm
IPEX-LLM is a PyTorch library for running Large Language Models (LLMs) on Intel CPUs and GPUs with very low latency. It provides seamless integration with various LLM frameworks and tools, including llama.cpp, ollama, Text-Generation-WebUI, HuggingFace transformers, and more. IPEX-LLM has been optimized and verified on over 50 LLM models, including LLaMA, Mistral, Mixtral, Gemma, LLaVA, Whisper, ChatGLM, Baichuan, Qwen, and RWKV. It supports a range of low-bit inference formats, including INT4, FP8, FP4, INT8, INT2, FP16, and BF16, as well as finetuning capabilities for LoRA, QLoRA, DPO, QA-LoRA, and ReLoRA. IPEX-LLM is actively maintained and updated with new features and optimizations, making it a valuable tool for researchers, developers, and anyone interested in exploring and utilizing LLMs.

Langchain-Chatchat
LangChain-Chatchat is an open-source, offline-deployable retrieval-enhanced generation (RAG) large model knowledge base project based on large language models such as ChatGLM and application frameworks such as Langchain. It aims to establish a knowledge base Q&A solution that is friendly to Chinese scenarios, supports open-source models, and can run offline.

agentica
Agentica is a human-centric framework for building large language model agents. It provides functionalities for planning, memory management, tool usage, and supports features like reflection, planning and execution, RAG, multi-agent, multi-role, and workflow. The tool allows users to quickly code and orchestrate agents, customize prompts, and make API calls to various services. It supports API calls to OpenAI, Azure, Deepseek, Moonshot, Claude, Ollama, and Together. Agentica aims to simplify the process of building AI agents by providing a user-friendly interface and a range of functionalities for agent development.

LLMs-from-scratch-CN
This repository is a Chinese translation of the GitHub project 'LLMs-from-scratch', including detailed markdown notes and related Jupyter code. The translation process aims to maintain the accuracy of the original content while optimizing the language and expression to better suit Chinese learners' reading habits. The repository features detailed Chinese annotations for all Jupyter code, aiding users in practical implementation. It also provides various supplementary materials to expand knowledge. The project focuses on building Large Language Models (LLMs) from scratch, covering fundamental constructions like Transformer architecture, sequence modeling, and delving into deep learning models such as GPT and BERT. Each part of the project includes detailed code implementations and learning resources to help users construct LLMs from scratch and master their core technologies.

chatluna
Chatluna is a machine learning model plugin that provides chat services with large language models. It is highly extensible, supports multiple output formats, and offers features like custom conversation presets, rate limiting, and context awareness. Users can deploy Chatluna under Koishi without additional configuration. The plugin supports various models/platforms like OpenAI, Azure OpenAI, Google Gemini, and more. It also provides preset customization using YAML files and allows for easy forking and development within Koishi projects. However, the project lacks web UI, HTTP server, and project documentation, inviting contributions from the community.
For similar tasks

Senparc.AI
Senparc.AI is an AI extension package for the Senparc ecosystem, focusing on LLM (Large Language Models) interaction. It provides modules for standard interfaces and basic functionalities, as well as interfaces using SemanticKernel for plug-and-play capabilities. The package also includes a library for supporting the 'PromptRange' ecosystem, compatible with various systems and frameworks. Users can configure different AI platforms and models, define AI interface parameters, and run AI functions easily. The package offers examples and commands for dialogue, embedding, and DallE drawing operations.

gpt-subtrans
GPT-Subtrans is an open-source subtitle translator that utilizes large language models (LLMs) as translation services. It supports translation between any language pairs that the language model supports. Note that GPT-Subtrans requires an active internet connection, as subtitles are sent to the provider's servers for translation, and their privacy policy applies.

chatgpt-subtitle-translator
This tool utilizes the OpenAI ChatGPT API to translate text, with a focus on line-based translation, particularly for SRT subtitles. It optimizes token usage by removing SRT overhead and grouping text into batches, allowing for arbitrary length translations without excessive token consumption while maintaining a one-to-one match between line input and output.

TeroSubtitler
Tero Subtitler is an open source, cross-platform, and free subtitle editing software with a user-friendly interface. It offers fully fledged editing with SMPTE and MEDIA modes, support for various subtitle formats, multi-level undo/redo, search and replace, auto-backup, source and transcription modes, translation memory, audiovisual preview, timeline with waveform visualizer, manipulation tools, formatting options, quality control features, translation and transcription capabilities, validation tools, automation for correcting errors, and more. It also includes features like exporting subtitles to MP3, importing/exporting Blu-ray SUP format, generating blank video, generating video with hardcoded subtitles, video dubbing, and more. The tool utilizes powerful multimedia playback engines like mpv, advanced audio/video manipulation tools like FFmpeg, tools for automatic transcription like whisper.cpp/Faster-Whisper, auto-translation API like Google Translate, and ElevenLabs TTS for video dubbing.

video2blog
video2blog is an open-source project aimed at converting videos into textual notes. The tool follows a process of extracting video information using yt-dlp, downloading the video, downloading subtitles if available, translating subtitles if not in Chinese, generating Chinese subtitles using whisper if no subtitles exist, converting subtitles to articles using gemini, and manually inserting images from the video into the article. The tool provides a solution for creating blog content from video resources, enhancing accessibility and content creation efficiency.

auto-subs
Auto-subs is a tool designed to automatically transcribe editing timelines using OpenAI Whisper and Stable-TS for extreme accuracy. It generates subtitles in a custom style, is completely free, and runs locally within Davinci Resolve. It works on Mac, Linux, and Windows, supporting both Free and Studio versions of Resolve. Users can jump to positions on the timeline using the Subtitle Navigator and translate from any language to English. The tool provides a user-friendly interface for creating and customizing subtitles for video content.

Srt-AI-Voice-Assistant
Srt-AI-Voice-Assistant is a convenient tool that generates audio from uploaded .srt subtitle files by calling APIs such as Bert-VITS2 (HiyoriUI), GPT-SoVITS, and Microsoft TTS (online). The code is currently not perfect, and feedback on bugs or suggestions can be provided at https://github.com/YYuX-1145/Srt-AI-Voice-Assistant/issues. Recent updates include adding custom API functionality with a focus on security, support for Microsoft online TTS (requires key configuration), error handling improvements, automatic project path detection, compatibility with API-v1 for limited functionality, and significant feature updates supporting card synthesis.
For similar jobs

sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.