Srt-AI-Voice-Assistant

Subtitle dubbing with multiple TTS Engines

Stars: 198

Srt-AI-Voice-Assistant is a tool that generates audio from uploaded .srt subtitle files by calling the APIs of TTS projects such as Bert-VITS2 (HiyoriUI), GPT-SoVITS, and Microsoft TTS (online). The code is still a work in progress; bug reports and suggestions can be filed at https://github.com/YYuX-1145/Srt-AI-Voice-Assistant/issues. Recent updates include custom API support with a focus on security, support for Microsoft online TTS (requires key configuration), improved error handling, automatic project path detection, limited-functionality compatibility with API-v1, and a major feature update supporting card synthesis.

README:

Srt-AI-Voice-Assistant

Documents in other languages are translated by AI and are provided for reference only.

This project can use multiple AI TTS engines to dub your subtitle or text files.
It also provides convenient auxiliary functions, including audio/video transcription and subtitle translation.
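As a rough illustration of the input side of subtitle dubbing, the sketch below parses .srt content into timed cues that a TTS engine could then voice one by one. It is not taken from this project's code: the `Cue` class and the `parse_srt` function are hypothetical names introduced here, and the parser only assumes the common SRT layout (index line, `start --> end` line, then text lines, with blocks separated by blank lines).

```python
import re
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle entry (hypothetical structure, not the project's own)."""
    index: int
    start: float  # seconds from the beginning of the media
    end: float    # seconds from the beginning of the media
    text: str


# Matches timestamps such as 00:01:02,345 (a dot is tolerated as separator).
_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _to_seconds(stamp: str) -> float:
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad timestamp: {stamp!r}")
    hours, minutes, seconds, millis = (int(g) for g in match.groups())
    return hours * 3600 + minutes * 60 + seconds + millis / 1000


def parse_srt(content: str) -> list[Cue]:
    """Split SRT text into cues; blocks without a timing line are skipped."""
    cues = []
    normalized = content.replace("\r\n", "\n").strip()
    # Blocks are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        if len(lines) < 2 or "-->" not in lines[1]:
            continue
        start, end = lines[1].split("-->")
        cues.append(
            Cue(
                index=int(lines[0].strip()),
                start=_to_seconds(start),
                end=_to_seconds(end),
                text="\n".join(lines[2:]),
            )
        )
    return cues


sample = (
    "1\n00:00:01,000 --> 00:00:02,500\nHello\n\n"
    "2\n00:00:03,000 --> 00:00:04,000\nWorld\nagain\n"
)
cues = parse_srt(sample)
```

Each cue carries its own start and end time, so the duration available for the synthesized line is simply `cue.end - cue.start`.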

If you encounter problems or want to request a feature, please open an issue at https://github.com/YYuX-1145/Srt-AI-Voice-Assistant/issues.

Features

  • ✅ Open-source, friendly WebUI, runs locally and is accessible via LAN
  • ✅ Supports multiple TTS projects: Bert-VITS2 (BV2), GPT-SoVITS (GSV), CosyVoice2, AzureTTS, and you can even add your own custom APIs
  • ✅ Save personalized settings and presets
  • ✅ Batch mode
  • ✅ Subtitle editing
  • ✅ Subtitle translation
  • ✅ Regenerating specific lines
  • ✅ Multi-speaker dubbing
  • ✅ Re-export subtitles
  • ✅ Extended functions: subtitle transcription for audio/video
  • ✅ Internationalization (i18n)

Installation

From Source Code

git clone https://github.com/YYuX-1145/Srt-AI-Voice-Assistant.git
cd Srt-AI-Voice-Assistant/
pip install -r requirements.txt
python Srt-AI-Voice-Assistant.py

Optional Command Line Arguments

You can customize the behavior of the application with the following command-line arguments:

Argument        Description
-p              Specify the server port
--lan           Enable LAN access
--no_ext        Disable all extensions
--share         Create a publicly shareable link for the Gradio app
--server_mode   Activate server mode
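The arguments listed above could be declared with Python's standard `argparse` module roughly as follows. This is a minimal sketch, not the project's actual parser: the `build_parser` function and the `port` destination name are hypothetical, and it assumes that `-p` takes an integer and that the other four options are simple on/off flags.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options."""
    parser = argparse.ArgumentParser(prog="Srt-AI-Voice-Assistant.py")
    # Assumption: the port is an integer and has no fixed default here.
    parser.add_argument("-p", dest="port", type=int, default=None,
                        help="Specify the server port")
    # Assumption: the remaining options are boolean switches.
    parser.add_argument("--lan", action="store_true",
                        help="Enable LAN access")
    parser.add_argument("--no_ext", action="store_true",
                        help="Disable all extensions")
    parser.add_argument("--share", action="store_true",
                        help="Create a publicly shareable link for the Gradio app")
    parser.add_argument("--server_mode", action="store_true",
                        help="Activate server mode")
    return parser


# Equivalent to running: python Srt-AI-Voice-Assistant.py -p 7860 --lan
args = build_parser().parse_args(["-p", "7860", "--lan"])
```

With this layout, flags that are not passed default to `False`, so the application starts in its ordinary local mode unless an option is given explicitly.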

After that, set up the TTS engines yourself. Windows users can download the packaged version or use the integrated package with GPT-SoVITS.

If the required TTS engine is not on the supported list, you can refer to the documentation to write an extension.


  • Use the packaged version only when there are dependency conflicts or installation issues.
  • The GPT-SoVITS integrated package includes the packaged version, without removing any built-in or pretrained models, and its code for finetuning and inference is the same as in the official repository.
  • Note: the packaged version included in the GPT-SoVITS integrated package may not be the latest; overwrite it to update.
