manga-translator-ui

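An open-source manga translation tool built on the manga-image-translator core engine. It automatically translates Japanese manga, Korean manhwa, and American comics, offers 5 translation engines (including AI translators such as OpenAI and Gemini), and includes a visual editor for freely adjusting text boxes and styles. A one-click install script handles environment setup and updates automatically, and the packaged builds work out of the box. If this project helps you, a ⭐ Star is welcome!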


This repository is a manga image translator tool that allows users to translate text in manga images automatically. It supports various types of manga, including Japanese, Korean, and American, in both black and white and color formats. The tool can detect, translate, and embed text, supporting multiple languages such as Japanese, Chinese, and English. It also includes a visual editor for adjusting text boxes. Users can interact with the tool through a Qt interface or command-line mode for batch processing. The tool offers features like intelligent text detection, multi-language OCR, multiple translation engines, high-quality translation using AI models, automatic term extraction, AI sentence segmentation, intelligent typesetting, PSD export, and batch processing. Additionally, it provides a visual editor for region editing, text editing, mask editing, undo/redo functionality, shortcut key support, and mouse wheel shortcuts.

README:

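Translate the text in manga images with one click. Japanese manga, Korean manhwa, and American comics are supported, in both black-and-white and color. The tool automatically detects, translates, and typesets text, supports Japanese, Chinese, English, and other languages, and includes a visual editor for adjusting text boxes.

💬 QQ group: 1079089991 (password: kP9#mB2!vR5*sL1) | 🐛 Submit an Issue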

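📚 Documentation

| Document | Description |
| --- | --- |
| Installation Guide | Detailed installation steps, system requirements, split-archive download instructions |
| Usage Tutorial | Basic operations, translator selection, common settings |
| Command-Line Mode | CLI usage guide, parameter reference, batch processing |
| API Configuration | Applying for API keys, configuration tutorial |
| Features | Full feature list, visual editor in detail |
| Workflows | 7 workflows, AI sentence segmentation, custom templates |
| Settings | Translator configuration, OCR models, parameter details |
| Debugging Guide | Debugging process, tunable parameters, troubleshooting |
| Developer Guide | Project structure, environment setup, build and packaging |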

📸 Showcase

Before translation	After translation

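✨ Core Features

Translation

🔍 Smart text detection - automatically locates text regions in manga
📝 Multi-language OCR - supports Japanese, Chinese, English, and more
🌐 5 translation engines - OpenAI, Gemini (standard + high-quality), Sakura
🎯 High-quality translation - supports multimodal AI translation with GPT-4o and Gemini
📚 Automatic term extraction - AI identifies and accumulates proper nouns to keep translations consistent
🤖 AI sentence segmentation - improves readability and optimizes line breaks automatically
🎨 Smart typesetting - lays out translated text automatically, with support for multiple fonts
📥 PSD export - exports editable PSD files (separate layers for the original image, inpainted image, and text)
📦 Batch processing - processes an entire folder at once

Visual Editor

✏️ Region editing - move, rotate, and reshape text boxes
📐 Text editing - manual translation, style adjustments
🖌️ Mask editing - brush tool, eraser
⏪ Undo/redo - full operation history
⌨️ Keyboard shortcuts - A/D to switch images, Q/W/E to switch tools, Ctrl+Q/W/E for file operations
🖱️ Mouse wheel shortcuts - Ctrl+wheel to scale a text box, Shift+wheel to adjust brush size

Full feature list → doc/FEATURES.md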

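🚀 Quick Start

📥 Installation

Option 1: Install script (⭐ recommended, supports updates)

⚠️ No pre-installed Python required: the script installs Miniconda (a lightweight Python environment) automatically
💡 One-click update: if the tool is already installed, run 步骤4-更新维护.bat to update to the latest version

Download the install script:
- Click to download 步骤1-首次安装.bat
- Save it to the directory where you want the program installed (e.g. D:\manga-translator-ui\)
- ⚠️ This directory becomes the installation root; all program files are installed under it
- ⚠️ Cleanup warning: the cleanup function empties the entire root directory, keeping only the files related to the Python and Git configuration
Run the installer:
- Double-click 步骤1-首次安装.bat
- The script automatically:
  - ✓ Detects and installs Miniconda (if needed)
    - Offers a choice of download source: the Tsinghua University mirror (recommended in mainland China) or the official Anaconda source
    - Downloads and installs automatically (about 50MB)
    - Installs into the project directory, so no space is used on the C: drive
  - ✓ Installs portable Git (if needed)
  - ✓ Clones the code repository
  - ✓ Creates a Conda virtual environment (Python 3.12)
  - ✓ Detects the GPU type (NVIDIA / AMD / integrated)
  - ✓ Selects the matching PyTorch build automatically
    - NVIDIA: CUDA 12.x build (requires driver >= 525.60.13)
    - AMD: ROCm build (experimental; only RX 7000/9000 series are supported, RX 5000/6000 should use the CPU build)
    - Other: CPU build (universal, slower)
  - ✓ Installs all dependencies
Launch the program:
- Double-click 步骤2-启动Qt界面.bat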
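
The GPU-to-build mapping in the list above can be sketched as a small pure function. This is only an illustration of the stated rules, not the install script's actual logic: the function names, the free-form `gpu_name` input, and the choice to fall back to the CPU build when an NVIDIA driver is missing or too old are all assumptions. The requirements file names are the ones used in the from-source instructions.

```python
import re

# Minimum NVIDIA driver for the CUDA 12.x build, as stated in the README.
MIN_NVIDIA_DRIVER = (525, 60, 13)


def parse_version(text):
    """Turn a dotted version string such as '535.104.05' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def choose_requirements(gpu_name, driver_version=None):
    """Return the requirements file matching a GPU description.

    gpu_name is a free-form adapter name (hypothetical input; how the real
    script detects the GPU is not documented here).
    """
    name = gpu_name.lower()
    if "nvidia" in name:
        # Assumption: use the CPU build when the driver is unknown or too old.
        if driver_version and parse_version(driver_version) >= MIN_NVIDIA_DRIVER:
            return "requirements_gpu.txt"
        return "requirements_cpu.txt"
    # Only RX 7000/9000 series are listed as supported for the ROCm build.
    if re.search(r"\brx\s*[79]\d{3}(?!\d)", name):
        return "requirements_amd.txt"
    return "requirements_cpu.txt"


print(choose_requirements("NVIDIA GeForce RTX 3060", "535.104.05"))
print(choose_requirements("AMD Radeon RX 6800 XT"))
```

Keeping the decision in one function makes the fallback order explicit: anything that is not a supported NVIDIA or AMD configuration ends up on the CPU build.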

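Option 2: Download a packaged build

Download the program:
- Go to GitHub Releases
- Choose a build:
  - CPU build: works on any computer
  - GPU build (NVIDIA): requires an NVIDIA GPU that supports CUDA 12.x
  - ⚠️ AMD GPUs are not supported by the packaged builds; use "Option 1: Install script" instead
Extract and run:
- Extract the archive to any directory
- Double-click app.exe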

Option 3: Docker deployment (experimental)

Quick start:

# Windows CMD / PowerShell
docker run -d --name manga-translator -p 8000:8000 hgmzhn/manga-translator:latest-cpu

# Linux / macOS
docker run -d --name manga-translator -p 8000:8000 hgmzhn/manga-translator:latest-cpu

Image registries:

The project's Docker images are published to two registries; use whichever downloads faster:

Docker Hub (recommended):
- CPU build: hgmzhn/manga-translator:latest-cpu
- GPU build: hgmzhn/manga-translator:latest-gpu
GitHub Container Registry (fallback, may be faster in mainland China):
- CPU build: ghcr.io/hgmzhn/manga-translator:latest-cpu
- GPU build: ghcr.io/hgmzhn/manga-translator:latest-gpu
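
As a rough sketch, the registry and build choices above can be turned into a `docker run` argument list programmatically. The helper names and the `registry`/`variant` keys are hypothetical. The `--gpus all` flag is a standard Docker option, but whether the GPU image expects exactly that flag is an assumption, since the quick-start command only shows the CPU image.

```python
# Registry host prefixes for the two registries listed in the README.
REGISTRY_PREFIX = {
    "dockerhub": "",     # Docker Hub images need no host prefix
    "ghcr": "ghcr.io/",  # GitHub Container Registry
}


def image_ref(registry="dockerhub", variant="cpu"):
    """Build the image reference for a registry and a cpu/gpu variant."""
    if variant not in ("cpu", "gpu"):
        raise ValueError(f"unknown variant: {variant!r}")
    return f"{REGISTRY_PREFIX[registry]}hgmzhn/manga-translator:latest-{variant}"


def docker_run_argv(registry="dockerhub", variant="cpu", port=8000):
    """Return the argument list for starting the container in the background."""
    argv = ["docker", "run", "-d", "--name", "manga-translator",
            "-p", f"{port}:8000"]
    if variant == "gpu":
        # Assumption: the GPU image needs the host GPUs exposed this way.
        argv += ["--gpus", "all"]
    argv.append(image_ref(registry, variant))
    return argv


print(" ".join(docker_run_argv()))
```

With the defaults this reproduces the quick-start command; passing the list to `subprocess.run` would start the container.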

Access URLs (default port 8000):

🌐 User interface: http://localhost:8000
🔧 Admin interface: http://localhost:8000/admin.html
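
A container may take a while before it answers on port 8000. The sketch below polls the user-interface URL until it responds; it uses only the Python standard library, and the function names, retry count, and delay are illustrative choices, not part of the project.

```python
import http.client
import time
import urllib.error
import urllib.request


def is_up(url, timeout=2.0):
    """Return True if an HTTP GET to url succeeds with a 2xx or 3xx status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, or an HTTP error status: not ready yet.
        return False


def wait_until_ready(url, attempts=30, delay=1.0, check=is_up):
    """Call check(url) up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if check(url):
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False


if __name__ == "__main__":
    ready = wait_until_ready("http://localhost:8000")
    print("server ready" if ready else "server did not respond")
```

The `check` parameter exists so the retry loop can be exercised without a running server.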

📖 Detailed installation tutorial: Docker deployment documentation
📖 Usage tutorial: command-line usage guide

Option 4: Run from source (developers)

For developers or users who want to customize the tool.

Install Python 3.12: Download

Clone the repository:

git clone https://github.com/hgmzhn/manga-translator-ui.git
cd manga-translator-ui

Install dependencies:

# NVIDIA GPU
pip install -r requirements_gpu.txt

# AMD GPU (RX 7000/9000 series only)
pip install -r requirements_amd.txt

# CPU build
pip install -r requirements_cpu.txt

Run the program:

# Desktop UI
python -m desktop_qt_ui.main

# Web UI (optional)
python -m manga_translator web

📖 Detailed installation tutorial: Installation Guide
📖 Usage tutorial: command-line usage guide

Option 5: Native macOS (Apple Silicon)

A native setup optimized for M1/M2/M3/M4 Macs, with MPS (Metal Performance Shaders) GPU acceleration.

Quick start (recommended):

Download the install script:

curl -O https://raw.githubusercontent.com/hgmzhn/manga-translator-ui/main/macOS_1_首次安装.sh
chmod +x macOS_1_首次安装.sh

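Run the installer:
```
./macOS_1_首次安装.sh
```
The script automatically:
- Checks for and installs required components (Xcode Command Line Tools, Git)
- Clones the project code
- Installs Miniforge and the Python environment
- Configures MPS GPU acceleration
Launch the program:
```
./macOS_2_启动Qt界面.sh
```
Update later:
```
./macOS_4_更新维护.sh
```

Or clone manually: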

git clone https://github.com/hgmzhn/manga-translator-ui.git
cd manga-translator-ui
chmod +x macOS_*.sh
./macOS_1_首次安装.sh

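⚠️ Notes:

Apple Silicon (M1/M2/M3/M4) chips are supported first

Intel Macs also work, but run in CPU mode

The first installation downloads about 2GB of dependencies, so make sure your network connection is stable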
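
To see which accelerator PyTorch reports on a given machine (MPS on Apple Silicon, CPU on Intel Macs), a minimal check could look like the following. It is a sketch using PyTorch's public `torch.cuda.is_available()` and `torch.backends.mps.is_available()` calls; `pick_device` is a hypothetical helper, and the project's own device selection may work differently.

```python
def pick_device():
    """Return 'cuda', 'mps', or 'cpu' depending on what PyTorch reports."""
    try:
        import torch
    except ImportError:
        return "cpu"  # PyTorch is not installed, so no accelerator is usable
    if torch.cuda.is_available():
        return "cuda"
    # The mps backend module is missing on older PyTorch releases.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


print(pick_device())
```

Run it inside the environment created by the install script; on an Apple Silicon Mac with a working setup it would be expected to print `mps`.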

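📖 Usage

🖥️ Qt GUI mode

After installation, see the usage tutorial to learn how to translate images:

Usage tutorial → doc/USAGE.md

Basic steps:

Fill in the API settings (if using an online translator) → API configuration tutorial
Turn off GPU (CPU build only)
Set the output directory
Add images
Choose a translator
- Recommended for first use: High-Quality Translation OpenAI or High-Quality Translation Gemini
- Requires an API key; see the API configuration tutorial
Start translating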

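⌨️ Command-line mode

Suited to batch processing and automation scripts:

CLI guide → doc/CLI_USAGE.md

⚠️ Important: before using the command line, activate the virtual environment from the project directory:
# Windows
conda activate manga-env

# Linux/macOS
conda activate manga-env

Quick start:

# Local mode (recommended, command-line translation)
python -m manga_translator local -i manga.jpg

# Or the short form (Local mode is the default)
python -m manga_translator -i manga.jpg

# Translate an entire folder
python -m manga_translator local -i ./manga_folder/ -o ./output/

# Web server mode (with admin interface and API)
python -m manga_translator web --host 127.0.0.1 --port 8000 --use-gpu

# Show all parameters
python -m manga_translator --help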
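
For automation, the Local-mode command can be wrapped in a short script that translates several folders in turn. This sketch relies only on the `local`, `-i`, and `-o` arguments shown above; the helper names and the one-output-subfolder-per-input layout are assumptions made for illustration. Run it from the project directory with the `manga-env` environment active.

```python
import subprocess
import sys
from pathlib import Path


def local_translate_argv(input_path, output_dir=None, python=sys.executable):
    """Build the Local-mode command line for one image or folder."""
    argv = [python, "-m", "manga_translator", "local", "-i", str(input_path)]
    if output_dir is not None:
        argv += ["-o", str(output_dir)]
    return argv


def translate_folders(folders, output_root):
    """Run one Local-mode job per folder, writing to output_root/<folder name>."""
    for folder in map(Path, folders):
        out_dir = Path(output_root) / folder.name
        # check=True stops the batch at the first job that exits with an error.
        subprocess.run(local_translate_argv(folder, out_dir), check=True)


print(local_translate_argv("./manga_folder/", "./output/", python="python"))
```

Calling `translate_folders(["./vol1", "./vol2"], "./output")` would then run the two jobs sequentially.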

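📋 Workflows

The program supports several workflows:

Normal translation - translate images directly
Export translation - translate, then export to a TXT file
Export original text - detect and recognize only, exporting the original text for manual translation
Import translation and render - import translations from TXT/JSON and re-render

Workflow details → doc/WORKFLOWS.md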

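⚙️ Common translators

Online translators (API key required)

OpenAI - uses GPT-series models
Gemini - uses Google Gemini models
Sakura - a translation model optimized specifically for Japanese

High-quality translators (recommended)

High-Quality Translation OpenAI - uses the GPT-4o multimodal model
High-Quality Translation Gemini - uses Gemini multimodal models
📸 Uses the image as context for more accurate translation

Full settings reference → doc/SETTINGS.md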

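🔍 Running into problems?

Poor translation results

Enable Verbose logging under "Basic settings"
Check the debug files in the result/ directory
Adjust the detector and OCR parameters

Debugging guide → doc/DEBUGGING.md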
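
When checking the debug output, it helps to see which files the last run wrote. The following sketch lists the most recently modified files under `result/`; it assumes nothing about the file names or layout inside that directory, and `recent_files` is a hypothetical helper.

```python
from pathlib import Path


def recent_files(directory="result", limit=10):
    """Return up to `limit` files under directory, newest first."""
    root = Path(directory)
    if not root.is_dir():
        return []
    files = [path for path in root.rglob("*") if path.is_file()]
    # Sort by modification time so the latest run's files come first.
    files.sort(key=lambda path: path.stat().st_mtime, reverse=True)
    return files[:limit]


for path in recent_files():
    print(path)
```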

🙏 Acknowledgements

zyddnys/manga-image-translator - core translation engine
bilibili/ailab - Real-CUGAN super-resolution model
the-database/MangaJaNai - MangaJaNai/IllustrationJaNai super-resolution models
lhj5426/YSG - model support
PaddleOCR - OCR model support
kha-white/manga-ocr - MangaOCR model support
jzhang533/PaddleOCR-VL-For-Manga - PaddleOCR-VL-For-Manga model support
All contributors and users for their support

❤️ Support the author

If this project helps you, feel free to buy the author a milk tea 🧋

💚 WeChat tip

💙 Alipay donation

Thank you for your support ✨

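📝 License

This project is open source under the GPL-3.0 license.

Model license notice

The project's code is licensed under GPL-3.0.

The project supports image super-resolution with the MangaJaNai/IllustrationJaNai models. The weight files for these models are licensed under CC BY-NC 4.0 (Attribution-NonCommercial 4.0 International) and are for non-commercial use only.

Model source: MangaJaNai
Model license: CC BY-NC 4.0
Usage restriction: non-commercial use only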

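⚠️ Special notice

This project is provided solely for technical demonstration and for personal learning and exchange; it does not constitute legal, commercial, or compliance advice.
When installing, configuring, invoking, or distributing this project's functionality, you are responsible for confirming and continuing to comply with local laws and regulations, platform rules, content source licenses, and third-party terms of service.

Disclaimer and limitation of liability

All actions and consequences arising from use of this project (including but not limited to content processing, publication, dissemination, redistribution, and commercial use) are the sole responsibility of the user.
You must ensure that input content, output content, and data sources are lawfully authorized, and must not use the project in ways that infringe copyright, trademark rights, privacy rights, portrait rights, or other lawful rights and interests.
Using this project for any illegal or non-compliant purpose is strictly prohibited, including but not limited to piracy, unauthorized bulk scraping and reposting, circumventing platform restrictions, fraud, defamation, and infringing the lawful rights of others.
This project depends on third-party models, APIs, data, and libraries (including OCR, translation, and super-resolution models); their availability, accuracy, stability, cost, risk controls, and compliance requirements are the responsibility of the respective providers, and users bear the associated risks and costs.
To the extent permitted by applicable law, the project's authors and contributors accept no liability for any direct or indirect loss caused by using or being unable to use this project (including but not limited to data loss, business interruption, loss of revenue, account risk, and third-party claims).
If you use this project in a team or organizational setting, you are responsible for access management, log auditing, content review, and compliance assessment, and for establishing any necessary manual review process.

Please assess the risks carefully before use; continued use means you have read, understood, and agreed to the above.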

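🙏 Final acknowledgements

huyvux3005/manga109-segmentation-bubble - MangaLens manga speech-bubble segmentation and detection model
Thanks to all open-source authors, contributors, and users for their continued feedback and support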
