Saber-Translator
✨ 一款小白也能轻松使用的漫画翻译工具,旨在帮助漫画爱好者轻松跨越语言障碍,畅享原汁原味的日文漫画。 利用先进的 AI 技术,智能检测漫画中的对话气泡,精准识别日文文本,并快速翻译成流畅自然的中文。 ✨ 无论是图片还是 PDF 格式的漫画,Saber-Translator 都能轻松应对,让你无压力阅读心爱的漫画作品。
Stars: 2748
Saber-Translator is your exclusive AI comic translation tool, designed to effortlessly eliminate language barriers and enjoy the original comic fun. It offers features like translating comic images/PDFs, intelligent bubble detection and text recognition, powerful AI translation engine with multiple service providers, highly customizable translation effects, real-time preview and convenient operations, efficient image management and download, model recording and recommendation, and support for language learning with dual prompt word outputs.
README:
✨ 你的专属 AI 漫画翻译与管理神器,集翻译、编辑、书架管理、智能分析于一体!✨
**求职广告:本人是26届应届本科生,走过路过的老板们给个工作机会好不好😭😭😭😭😭😭😭😭**Saber-Translator 提供了一站式的漫画翻译与管理解决方案,集成了从内容导入、翻译编辑、书架管理到智能分析的完整工作流。
📥 导入图片/PDF → 🔍 文本检测 → 📝 OCR识别 → 🌐 AI翻译 → 🎨 图像修复 → ✏️ 文字渲染 → 📤 导出
| 模块 | 功能 | 支持选项 |
|---|---|---|
| 📥 文件导入 | 支持多种格式上传 | JPG, PNG, WEBP, PDF (自动提取图像) |
| 🔍 文本检测 | 自动识别漫画气泡/文本区域 | CTD (默认, 精确掩膜) + 手动标注模式 |
| 📝 OCR 识别 | 多引擎多语言文字识别 | MangaOCR (日语) / PaddleOCR (多语言) / 百度OCR / AI视觉OCR |
| 🌐 AI 翻译 | 多服务商翻译引擎 | SiliconFlow / DeepSeek / Gemini / 火山引擎 / Ollama / Sakura 等 |
| 🎨 图像修复 | 消除原文文字 | LAMA智能修复 / 纯色填充 |
| ✏️ 文字渲染 | 将译文嵌入图片 | 自定义字体/字号/颜色/描边/方向/位置 |
| 📤 导出下载 | 多格式导出 | PNG / ZIP / PDF / CBZ |
💡 高质量翻译模式 (Beta): 使用多模态模型进行多图上下文感知翻译,通过结合多张图片信息提升翻译连贯性和准确性。
| 功能 | 说明 |
|---|---|
| 编辑模式 | 翻译后精细调整:修改文本/字体/颜色/位置,内置标注功能可手动绘制/调整文本框 |
| 会话管理 | 保存/加载完整工作进度 |
| 文本导入导出 | JSON格式导入导出,便于协作校对 |
|
📖 书籍管理 创建/编辑/搜索/批量操作 |
📑 章节管理 创建/排序/自动统计 |
🏷️ 标签系统 分类/筛选/自定义颜色 |
专属阅读界面:翻页阅读 | 原图/译图切换 | 章节导航 | 自适应显示
🔮 基于 AI 的漫画内容深度理解引擎
| 功能 | 说明 |
|---|---|
| 内容分析 | 角色识别、剧情梳理、对话提取 |
| 任务管理 | 全书/增量/章节分析,支持暂停恢复 |
| 智能问答 | 基于分析结果对漫画具体情节进行智能问答 |
| 向量检索 | 语义搜索快速定位内容 |
翻译效果对比:
-
下载最新版本: 前往 Releases 页面,或加入 QQ 群下载对应你操作系统的最新版本压缩包。
- 1群: 1041505784
- 2群: 1047553423
- 3群: 536706173
- 4群: 1074032394
- 解压: 将下载的压缩包解压到你希望存放程序的目录。
-
运行程序: 在解压后的目录中,找到
Saber-Translator.exe(Windows) 或对应名称的可执行文件,双击运行。 -
开始翻译: 程序启动后,通常会自动在你的默认浏览器中打开 Saber-Translator 的 Web 界面 (默认地址为
http://127.0.0.1:5000/)。如果未自动打开,请手动访问该地址。 - 上传与配置: 通过拖拽或点击选择按钮上传你的漫画图片或 PDF 文件。并参考 使用教程 完成翻译服务商配置。
- 翻译: 点击“翻译当前图片”或“翻译所有图片”按钮开始处理。
- 查看与下载: 在主界面查看翻译结果,使用右侧缩略图切换图片,调整显示大小,并通过下载按钮获取翻译后的文件。
- 详细教程: 参考 使用教程 获取更详细的步骤和技巧。
翻译完成后,点击图片上方的“切换编辑模式”按钮进入编辑模式,可进行以下操作:
气泡编辑:
- 选择气泡后可修改文本、字体、字号、颜色、填充色、排版方向、旋转角度、位置偏移
- 所有更改实时预览,支持一键应用样式到全部气泡
标注功能:
- 绘制/移动/调整/删除文本框,解决自动检测不准确的问题
- 支持自动检测辅助、批量检测所有图片
- 可使用手动标注框进行翻译
位于左侧设置栏的 "高质量翻译模式 " 板块提供了一个实验性的多步骤翻译流程。该模式旨在通过多模态模型结合多张图片的上下文信息来提升翻译的连贯性和准确性。
- 配置: 用户可以设置AI服务商、API Key、模型、批处理大小(一次发送给AI的图片数量)、会话重置频率(多少批次后清空AI的短期记忆)、RPM限制、是否使用低推理模式(部分模型支持以降低思考深度换取速度)、是否强制AI返回JSON格式以及自定义的上下文翻译提示词。
-
流程:
- 程序会先对所有已上传的图片进行文字区域的消除和原文提取。
- 然后将所有图片的原文组织成JSON格式。
- 同时收集所有原始图片的Base64数据。
- 接着,它会根据用户设置的“每批次图片数”,分批将图片和对应的JSON文本片段发送给AI进行上下文感知的翻译。
- AI返回的翻译结果会被合并。
- 最后,程序将合并后的翻译结果导入到对应的图片和气泡中,并重新渲染。
- 适用场景: 当漫画对话连贯性要求较高,且单个气泡的独立翻译效果不佳时,可以尝试此模式。
- 注意: 此模式会消耗更多的处理时间和API配额,因为它需要对所有图片进行多次处理和AI调用。请确保你的API Key有足够的额度,并根据服务商的限制合理配置RPM。
书架系统提供了完整的漫画管理功能,让你的翻译项目井井有条:
- 访问书架: 点击导航栏的"书架"按钮进入书架管理界面.
- 创建书籍: 点击"新建书籍",输入书名,可选择上传封面图片和添加标签.
- 管理章节: 进入书籍后,可以创建、编辑、删除和排序章节.
- 翻译章节: 点击章节进入翻译工作区,翻译进度会自动保存到该章节.
- 阅读模式: 完成翻译后,可以进入阅读器查看翻译成果.
- 标签管理: 在书架页面可以创建和管理标签,为书籍分类整理.
Manga Insight 是一个基于 AI 的漫画内容深度理解引擎,它能够分析漫画原图、提取剧情信息、构建知识库,并支持基于分析结果的智能问答。
-
多种分析模式:
- 全书分析: 从头到尾分析整本漫画,适合首次分析。
- 增量分析: 仅分析新增的章节/页面,适合连载漫画的更新。
- 章节分析: 分析指定章节,适合部分内容重新分析。
- 批量页面分析: 分析指定的多个页面,适合精细化调整。
- 任务控制: 分析过程中支持暂停、恢复、取消操作,任务进度实时可见。
- 任务历史: 查看历史分析任务的状态和结果。
采用层级式分析架构,从页面级逐步汇总到全书级,确保分析的完整性和连贯性:
📄 批量分析 (每5页) → 📑 段落总结 → 📖 章节总结 → 📚 全书概述
-
架构预设:
- 简洁模式: 批量分析 → 全书总结(适合100页以内的短篇)
- 标准模式: 批量分析 → 段落总结 → 全书总结(通用)
- 章节模式: 批量分析 → 章节总结 → 全书总结(有明确章节的漫画)
- 完整模式: 批量分析 → 小总结 → 章节总结 → 全书总结(长篇连载)
- 上下文关联: 批量分析时自动引用前N批的分析结果作为上下文,保持剧情理解的连贯性。
基于检索增强生成 (RAG) 技术,支持对漫画内容进行智能问答:
-
两种问答模式:
- 精确模式: 使用向量检索定位相关页面,适合具体问题(如"第15页发生了什么")
- 全局模式: 使用全书摘要作为上下文,适合总结性问题(如"故事的主题是什么")
- 混合检索: 结合向量相似度(70%)和关键词匹配(30%)的双路检索策略。
- Reranker 重排序: 支持 Jina、Cohere、SiliconFlow、BGE 等重排序模型,提升检索精度。
- 推理检索: 自动将复杂问题分解为多个子问题并行检索,整合结果后回答。
- 父子块检索: 返回匹配内容所属批次的完整上下文,便于理解前后剧情。
- 引用定位: 回答时自动标注引用的页码,支持点击跳转到对应页面。
支持多种风格的概述生成,满足不同场景需求:
| 模板 | 说明 |
|---|---|
| 📖 故事概要 | 完整的剧情回顾,包含所有剧透 |
| 🎁 无剧透简介 | 不含关键剧透的故事简介,适合推荐给朋友 |
| ⏪ 前情回顾 | 精炼版剧情回顾,适合接续阅读前快速回忆 |
| 👥 角色图鉴 | 详细的人物介绍和关系梳理 |
| 🌍 世界观设定 | 故事的世界观、势力、规则等背景设定 |
| ✨ 名场面盘点 | 精彩场景和高光时刻回顾,附页码定位 |
| 📝 阅读笔记 | 结构化的阅读笔记,包含要点和思考 |
- 增强模式: 使用 LLM 智能整合事件,生成结构化的剧情弧和角色行动线。
- 简单模式: 直接提取关键事件列表,按页面顺序排列。
- 可视化展示: 时间线数据可在界面中直观查看。
- 双层索引: 页面级向量(page_summary)+ 事件级向量(key_events)
- 语义搜索: 输入关键词或描述,快速定位相关内容。
- 重建向量: 支持在更换 Embedding 模型后重新构建向量索引。
我们计划在未来的版本中加入更多令人兴奋的功能:
- 更多 AI 服务支持: 集成更多云端翻译和OCR服务。
- 更多目标语言: 增加对更多目标语言的翻译支持。
- 术语库/词典: 允许用户导入自定义术语库,保证特定词汇翻译的一致性。
- 性能优化: 持续优化图像处理、模型推理速度和前端交互的流畅性。
- UI/UX 改进: 进一步提升用户界面的易用性和美观度,例如提供更直观的字体预览、颜色选择器等。
- 国际化 (i18n): 支持界面语言切换。
- 更完善的错误恢复机制: 例如,在批量处理失败时,允许用户重试失败的部分。
- Docker 支持: 提供 Docker 镜像,简化部署和环境配置。
- Manga Insight 增强: 进一步完善智能分析功能,增加更多角色识别和剧情分析能力。
感谢你对 Saber-Translator 项目的关注和支持! 如果你觉得本项目对你有所帮助,并且愿意支持我继续开发和维护,可以通过以下方式进行赞助。 你的支持是我前进的最大动力!
赞助方式:
你可以通过微信或支付宝扫描下方二维码进行赞赏。 你的每一份支持都将用于功能开发和持续优化。
再次感谢你的慷慨支持!
欢迎任何形式的贡献,包括但不限于:
- 代码贡献 (Pull Requests): 修复 Bug, 增加新功能, 优化代码性能, 开发新插件等。
- 问题反馈 (Issues): 报告 Bug, 提出功能建议, 分享使用遇到的问题等。
- 文档完善: 改进 README 文档, 编写更详细的使用教程或开发文档。
- 翻译贡献: 将 README 文档或界面翻译成其他语言。
- 推广宣传: 向更多人介绍和推荐 Saber-Translator。
- Star 项目: 如果你觉得 Saber-Translator 对你有帮助,请给项目点个 Star ⭐,这将是对我最大的鼓励和支持!
- 分享推荐: 将 Saber-Translator 分享给你的朋友或有需要的人。
-
后端: Python 3.10+, Flask, Flask-CORS, PyTorch, PaddlePaddle-OCR, MangaOCR, Pillow, OpenCV, NumPy,
openai(SDK),litelama/lama-mpe, PyPDF2, Requests, PyYAML, Colorama. - 前端: Vue 3, Vite, TypeScript, Pinia, Axios, Canvas API, JSZip, jsPDF.
- 文本检测: CTD (Comic Text Detector), YOLOv5.
- 图像修复: LAMA-MPE, LiteLAMA.
- 向量检索: Embedding, Reranker (用于 Manga Insight).
- 打包: PyInstaller.
- Saber-Translator 主要用于学习和技术交流,请在遵守当地法律法规和版权政策的前提下使用。严禁用于非法目的或商业用途。
- 本项目依赖第三方 AI 服务 (如 OCR, 翻译 API, 背景修复模型),其服务质量、可用性和成本由相应服务商决定。请用户自行了解并遵守相关服务商的使用协议和条款。
- 翻译结果由 AI 模型生成,可能存在错误或不准确之处。本项目不对翻译结果的准确性、完整性或及时性做任何保证。
- 用户应对使用本工具处理的任何内容负责,包括确保拥有处理这些内容的合法权利。
- 对于因使用或无法使用本项目而造成的任何直接或间接损失,项目作者不承担任何责任。
感谢使用 Saber-Translator!希望它能帮助你更轻松地跨越语言障碍,享受漫画的乐趣!
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for Saber-Translator
Similar Open Source Tools
Saber-Translator
Saber-Translator is your exclusive AI comic translation tool, designed to effortlessly eliminate language barriers and enjoy the original comic fun. It offers features like translating comic images/PDFs, intelligent bubble detection and text recognition, powerful AI translation engine with multiple service providers, highly customizable translation effects, real-time preview and convenient operations, efficient image management and download, model recording and recommendation, and support for language learning with dual prompt word outputs.
get_jobs
Get Jobs is a tool designed to help users find and apply for job positions on various recruitment platforms in China. It features AI job matching, automatic cover letter generation, multi-platform job application, automated filtering of inactive HR and headhunter positions, real-time WeChat message notifications, blacklisted company updates, driver adaptation for Win11, centralized configuration, long-lasting cookie login, XPathHelper plugin, global logging, and more. The tool supports platforms like Boss直聘, 猎聘, 拉勾, 51job, and 智联招聘. Users can configure the tool for customized job searches and applications.
douyin-chatgpt-bot
Douyin ChatGPT Bot is an AI-driven system for automatic replies on Douyin, including comment and private message replies. It offers features such as comment filtering, customizable robot responses, and automated account management. The system aims to enhance user engagement and brand image on the Douyin platform, providing a seamless experience for managing interactions with followers and potential customers.
chatwiki
ChatWiki is an open-source knowledge base AI question-answering system. It is built on large language models (LLM) and retrieval-augmented generation (RAG) technologies, providing out-of-the-box data processing, model invocation capabilities, and helping enterprises quickly build their own knowledge base AI question-answering systems. It offers exclusive AI question-answering system, easy integration of models, data preprocessing, simple user interface design, and adaptability to different business scenarios.
hongbomiao.com
hongbomiao.com is a personal research and development (R&D) lab that facilitates the sharing of knowledge. The repository covers a wide range of topics including web development, mobile development, desktop applications, API servers, cloud native technologies, data processing, machine learning, computer vision, embedded systems, simulation, database management, data cleaning, data orchestration, testing, ops, authentication, authorization, security, system tools, reverse engineering, Ethereum, hardware, network, guidelines, design, bots, and more. It provides detailed information on various tools, frameworks, libraries, and platforms used in these domains.
NovelForge
NovelForge is an AI-assisted writing tool with the potential for creating long-form content of millions of words. It offers a solution that combines world-building, structured content generation, and consistency maintenance. The tool is built around four core concepts: modular 'cards', customizable 'dynamic output models', flexible 'context injection', and consistency assurance through a 'knowledge graph'. It provides a highly structured and configurable writing environment, inspired by the Snowflake Method, allowing users to create and organize their content in a tree-like structure. NovelForge is highly customizable and extensible, allowing users to tailor their writing workflow to their specific needs.
activepieces
Activepieces is an open source replacement for Zapier, designed to be extensible through a type-safe pieces framework written in Typescript. It features a user-friendly Workflow Builder with support for Branches, Loops, and Drag and Drop. Activepieces integrates with Google Sheets, OpenAI, Discord, and RSS, along with 80+ other integrations. The list of supported integrations continues to grow rapidly, thanks to valuable contributions from the community. Activepieces is an open ecosystem; all piece source code is available in the repository, and they are versioned and published directly to npmjs.com upon contributions. If you cannot find a specific piece on the pieces roadmap, please submit a request by visiting the following link: Request Piece Alternatively, if you are a developer, you can quickly build your own piece using our TypeScript framework. For guidance, please refer to the following guide: Contributor's Guide
Operit
Operit AI is a fully functional AI assistant application for mobile devices, running independently on Android devices with powerful tool invocation capabilities. It offers over 40 built-in tools for file system operations, HTTP requests, system operations, UI automation, and media processing. The app combines these tools with rich plugins to enable a wide range of tasks, from simple to complex, providing a comprehensive experience of a smartphone AI assistant.
chatless
Chatless is a modern AI chat desktop application built on Tauri and Next.js. It supports multiple AI providers, can connect to local Ollama models, supports document parsing and knowledge base functions. All data is stored locally to protect user privacy. The application is lightweight, simple, starts quickly, and consumes minimal resources.
NeuroAI_Course
Neuromatch Academy NeuroAI Course Syllabus is a repository that contains the schedule and licensing information for the NeuroAI course. The course is designed to provide participants with a comprehensive understanding of artificial intelligence in neuroscience. It covers various topics related to AI applications in neuroscience, including machine learning, data analysis, and computational modeling. The content is primarily accessed from the ebook provided in the repository, and the course is scheduled for July 15-26, 2024. The repository is shared under a Creative Commons Attribution 4.0 International License and software elements are additionally licensed under the BSD (3-Clause) License. Contributors to the project are acknowledged and welcomed to contribute further.
Interview-for-Algorithm-Engineer
This repository provides a collection of interview questions and answers for algorithm engineers. The questions are organized by topic, and each question includes a detailed explanation of the answer. This repository is a valuable resource for anyone preparing for an algorithm engineering interview.
godoos
GodoOS is an efficient intranet office operating system that includes various office tools such as word/excel/ppt/pdf/internal chat/whiteboard/mind map, with native file storage support. The platform interface mimics the Windows style, making it easy to operate while maintaining low resource consumption and high performance. It automatically connects to intranet users without registration, enabling instant communication and file sharing. The flexible and highly configurable app store allows for unlimited expansion.
Daily-DeepLearning
Daily-DeepLearning is a repository that covers various computer science topics such as data structures, operating systems, computer networks, Python programming, data science packages like numpy, pandas, matplotlib, machine learning theories, deep learning theories, NLP concepts, machine learning practical applications, deep learning practical applications, and big data technologies like Hadoop and Hive. It also includes coding exercises related to '剑指offer'. The repository provides detailed explanations and examples for each topic, making it a comprehensive resource for learning and practicing different aspects of computer science and data-related fields.
Snap-Solver
Snap-Solver is a revolutionary AI tool for online exam solving, designed for students, test-takers, and self-learners. With just a keystroke, it automatically captures any question on the screen, analyzes it using AI, and provides detailed answers. Whether it's complex math formulas, physics problems, coding issues, or challenges from other disciplines, Snap-Solver offers clear, accurate, and structured solutions to help you better understand and master the subject matter.
easyaiot
EasyAIoT is an AI cloud platform designed to support camera integration, annotation, training, inference, data collection, analysis, alerts, recording, storage, and deployment. It aims to provide a zero-threshold AI experience for everyone, with a focus on cameras below a hundred levels. The platform consists of five core projects: WEB module for frontend management, DEVICE module for device management, VIDEO module for video processing, AI module for AI analysis, and TASK module for high-performance task execution. EasyAIoT combines Java, Python, and C++ to create a versatile and user-friendly AIoT platform.
For similar tasks
Saber-Translator
Saber-Translator is your exclusive AI comic translation tool, designed to effortlessly eliminate language barriers and enjoy the original comic fun. It offers features like translating comic images/PDFs, intelligent bubble detection and text recognition, powerful AI translation engine with multiple service providers, highly customizable translation effects, real-time preview and convenient operations, efficient image management and download, model recording and recommendation, and support for language learning with dual prompt word outputs.
For similar jobs
sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.
teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.
ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.
classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.
chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.
BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students
uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.
griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.














