Best AI tools for< write subtitles for videos >
20 - AI tool Sites
AudioTranscription.ai
AudioTranscription.ai is an AI-powered transcription tool that allows users to quickly and accurately transcribe audio and video files. It supports a variety of file formats, including MP3, MP4, AAC, AIFF, WMA, and WAV. The tool also offers speaker identification and punctuation features. AudioTranscription.ai is a valuable tool for journalists, transcribers, students, and anyone else who needs to transcribe audio or video files.
SpeechFlow
SpeechFlow is a powerful speech-to-text API that transcribes audio and video files into text with high accuracy. It supports 14 languages and offers features such as punctuation, easy deployment, scalability, and fast processing. SpeechFlow is ideal for businesses and individuals who need accurate and timely transcription services.
Ecango
Ecango is an AI-powered audio and video transcription tool that allows users to convert audio and video files into text in over 133 languages. It is easy to use, accurate, and affordable, making it a great choice for businesses and individuals alike.
Immersive Translate
Immersive Translate is a highly-rated AI-powered dual-language webpage translation extension that has helped over a million users break down language barriers since its launch in 2023. With Immersive Translate, you can translate web pages, PDFs, EPUBs, and video subtitles in real-time, completely free of charge. You can also choose to use AI engines like OpenAI (ChatGPT), DeepL, and Gemini to translate the content. Immersive Translate is available as an extension for Edge, Chrome, Firefox, Safari, and as a userscript. It also has a built-in plugin for Android and iOS browsers, making it easy to use on the go. Immersive Translate has revolutionized the way people read and interact with online content by providing a seamless dual-language reading experience. It supports over 10 translation engines, including DeepL, OpenAI, and Google Translate, giving users a wide range of options to choose from. Immersive Translate also offers innovative features like mouse hover translation, input box translation, and one-click ePub ebook translation, making it a versatile tool for various tasks.
SEOlligence
SEOlligence is an AI-powered SEO and translation tool designed to help e-commerce businesses localize their products and boost their online presence globally. It offers a range of services including product title and subtitle enhancement, keyword optimization, product description translation, FAQ schema optimization, and meta title and description optimization. SEOlligence utilizes cutting-edge AI techniques to generate unique, SEO-compliant content that helps businesses improve their search rankings, drive organic traffic, and increase conversions.
Video Tap
Video Tap is an AI-powered tool that helps you transform your videos into a variety of marketing content, including viral clips, articles, blog posts, transcripts, subtitles, and more. With Video Tap, you can easily repurpose your existing videos to reach a wider audience and get more value from your content.
Trancy
Trancy is an immersive AI language learning tool that offers a range of features to help users master new languages. With its bilingual subtitles, AI-powered translation, and personalized learning experience, Trancy makes language learning enjoyable and effective. Users can watch videos, read articles, and practice listening and speaking skills, all while expanding their vocabulary and understanding of grammar. Trancy's AI capabilities provide detailed syntactic analysis, speech recognition, and natural-sounding text-to-speech voices, making it a comprehensive and engaging language learning tool.
Write with LAIKA
Write with LAIKA is an AI-powered writing tool that provides users with a team of virtual companions designed using artificial intelligence. These companions offer support in editing, summarizing, and giving feedback, creating a collaborative and creative environment for writers. The tool aims to enhance the writing process by providing personalized AI peers to assist users in their creative endeavors.
Write.homes
Write.homes is an AI tool designed for real estate professionals, offering a comprehensive suite of features to streamline various tasks such as property listings, room redesign, objection handling, blog post creation, and social media posts. With advanced AI capabilities like GPT-4 integration, multilingual support, and virtual room staging, Write.homes empowers realtors to work more efficiently, save time, and focus on growing their business in today's fast-paced real estate industry.
Clarity Write
Clarity Write is an open-source SaaS script that provides a comprehensive suite of AI-powered tools to transform content creation. With its powerful AI capabilities, users can effortlessly generate high-quality content, create stunning visuals, automate coding tasks, transcribe audio and video files, and engage with AI experts via chatbots. Clarity Write also offers a vast library of over 500 professionally designed templates, a feature-rich editor for refining content, and robust admin tools for streamlined management. By leveraging the capabilities of OpenAI APIs, Clarity Write empowers users to enhance their content creation process, unlock endless creativity, and simplify their operations.
Write Release
Write Release is an AI-powered tool that helps users write press releases in minutes. It is easy to use and free to get started. Users simply answer a few questions, and Write Release will generate a high-quality press release that can be used to promote their company or organization.
write right ai
write right ai is an AI-powered writing coach designed to help users improve their grammar and sentence structure quickly and effectively. It offers features such as AI-powered grammar checking, 200+ practice questions for learning, professional and unique AI suggestions, and a freetext grammar check for assessing emails, assignments, and CVs. The application is suitable for users of all levels of English ability and provides a user-friendly platform for enhancing writing skills.
Write once. Tailor for every role.
This application allows you to write once and tailor your writing for every role. It is a powerful tool that can help you save time and improve the quality of your writing.
Write Conch
Write Conch is an AI-powered writing assistant that offers a range of tools to enhance your writing experience. With over 100 AI writing templates, you can effortlessly create essays, articles, emails, and more. The AI PDF reader simplifies document navigation and provides deeper insights. ChatGOT allows you to access ChatGPT without logins, offering AI solutions for learning and professional needs. Write Conch also includes AI detection and humanization tools to safeguard your AI content and ensure originality. Whether you're a student, marketer, teacher, researcher, or content creator, Write Conch has the tools to streamline your writing process and boost your productivity.
Write Conch
Write Conch is an AI-powered writing assistant that offers a range of tools to enhance your writing experience. With over 100 AI writing templates, you can effortlessly create essays, articles, ad copy, and more. The AI Document Reader simplifies complex texts, providing summaries, definitions, and explanations to improve comprehension. Additionally, Write Conch includes AI detection and humanization tools to ensure the originality and authenticity of your content. Whether you're a student, marketer, teacher, researcher, or content creator, Write Conch has the tools to meet your writing needs.
Lazy Write
Lazy Write is an AI content writing tool that assists users in generating high-quality written content efficiently. The tool utilizes artificial intelligence algorithms to analyze input data and produce well-structured articles, blog posts, or any other written material. With Lazy Write, users can save time and effort by automating the writing process, allowing them to focus on other aspects of their work. The tool is designed to be user-friendly, making it accessible to individuals with varying levels of writing expertise. Lazy Write aims to revolutionize the way content is created by providing a seamless and efficient writing experience.
Frontitude UX Writing Assistant
Frontitude is an AI writing assistant designed specifically for design teams, offering a seamless integration with Figma to help users write engaging and consistent UX content effortlessly. The tool provides copy suggestions based on design elements, character limits, and length, empowering teams to deliver better user experiences and save time on writing tasks. Frontitude also allows users to embed content guidelines into their design components, streamlining design reviews and critiques. With a focus on product copy and UX writing best practices, Frontitude's AI Writing Assistant is a valuable tool for designers looking to maintain a consistent voice and enhance their workflow.
Notion
Notion is a connected workspace that combines wikis, docs, projects, and calendars into a single platform. It is designed to be simple and powerful, with a focus on collaboration and organization. Notion's AI assistant can help you with a variety of tasks, such as answering questions, generating text, and translating languages. With its powerful building blocks, you can customize Notion to fit your specific needs and workflows. Notion is used by millions of people around the world, from individuals and small businesses to large enterprises.
Grammarly
Grammarly is an AI-powered writing assistant that helps users improve their writing. It offers a range of features, including a grammar checker, plagiarism checker, and writing suggestions. Grammarly is available as a desktop app, browser extension, and mobile app.
Kimi.ai
Kimi.ai is an AI-powered writing assistant that helps you create high-quality content quickly and easily. With Kimi.ai, you can generate articles, blog posts, social media content, and more, in just a few clicks. Kimi.ai is the perfect tool for busy professionals, students, and anyone who wants to create great content without spending hours writing and editing.
20 - Open Source AI Tools
Linly-Talker
Linly-Talker is an innovative digital human conversation system that integrates the latest artificial intelligence technologies, including Large Language Models (LLM) 🤖, Automatic Speech Recognition (ASR) 🎙️, Text-to-Speech (TTS) 🗣️, and voice cloning technology 🎤. This system offers an interactive web interface through the Gradio platform 🌐, allowing users to upload images 📷 and engage in personalized dialogues with AI 💬.
FunClip
FunClip is an open-source, locally deployable automated video editing tool that utilizes the FunASR Paraformer series models from Alibaba DAMO Academy for speech recognition in videos. Users can select text segments or speakers from the recognition results and click the clip button to obtain the corresponding video segments. FunClip integrates advanced features such as the Paraformer-Large model for accurate Chinese ASR, SeACo-Paraformer for customized hotword recognition, CAM++ speaker recognition model, Gradio interactive interface for easy usage, support for multiple free edits with automatic SRT subtitles generation, and segment-specific SRT subtitles.
FunClip
FunClip is an open-source, locally deployed automated video clipping tool that leverages Alibaba TONGYI speech lab's FunASR Paraformer series models for speech recognition on videos. Users can select text segments or speakers from recognition results to obtain corresponding video clips. It integrates industrial-grade models for accurate predictions and offers hotword customization and speaker recognition features. The tool is user-friendly with Gradio interaction, supporting multi-segment clipping and providing full video and target segment subtitles. FunClip is suitable for users looking to automate video clipping tasks with advanced AI capabilities.
whispering-ui
Whispering Tiger UI is a Native-UI tool designed to control the Whispering Tiger application, a free and Open-Source tool that can listen/watch to audio streams or in-game images on your machine and provide transcription or translation to a web browser using Websockets or over OSC. It features a Native-UI for Windows, easy access to all Whispering Tiger features including transcription, translation, text-to-speech, and in-game image recognition. The tool supports loopback audio device, configuration saving/loading, plugin support for additional features, and auto-update functionality. Users can create profiles, configure audio devices, select A.I. devices for speech-to-text, and install/manage plugins for extended functionality.
ai-audio-datasets
AI Audio Datasets List (AI-ADL) is a comprehensive collection of datasets consisting of speech, music, and sound effects, used for Generative AI, AIGC, AI model training, and audio applications. It includes datasets for speech recognition, speech synthesis, music information retrieval, music generation, audio processing, sound synthesis, and more. The repository provides a curated list of diverse datasets suitable for various AI audio tasks.
Awesome-AITools
This repo collects AI-related utilities. ## All Categories * All Categories * ChatGPT and other closed-source LLMs * AI Search engine * Open Source LLMs * GPT/LLMs Applications * LLM training platform * Applications that integrate multiple LLMs * AI Agent * Writing * Programming Development * Translation * AI Conversation or AI Voice Conversation * Image Creation * Speech Recognition * Text To Speech * Voice Processing * AI generated music or sound effects * Speech translation * Video Creation * Video Content Summary * OCR(Optical Character Recognition)
ai-notes
Notes on AI state of the art, with a focus on generative and large language models. These are the "raw materials" for the https://lspace.swyx.io/ newsletter. This repo used to be called https://github.com/sw-yx/prompt-eng, but was renamed because Prompt Engineering is Overhyped. This is now an AI Engineering notes repo.
MouseTooltipTranslator
MouseTooltipTranslator is a Chrome extension that allows users to translate any text on a webpage by simply hovering over it. It supports both Google Translate and Bing Translate, and can also be used to listen to the pronunciation of words and phrases. Additionally, the extension can be used to translate text in input boxes and highlighted text, and to display translated tooltips for PDFs and YouTube videos. It also supports OCR, allowing users to translate text in images by holding down the left shift key and hovering over the image.
venom
Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, support for creating any interaction, such as customer service, media sending, sentence recognition based on artificial intelligence and all types of design architecture for WhatsApp.
manga-image-translator
Translate texts in manga/images. Some manga/images will never be translated, therefore this project is born. * Image/Manga Translator * Samples * Online Demo * Disclaimer * Installation * Pip/venv * Poetry * Additional instructions for **Windows** * Docker * Hosting the web server * Using as CLI * Setting Translation Secrets * Using with Nvidia GPU * Building locally * Usage * Batch mode (default) * Demo mode * Web Mode * Api Mode * Related Projects * Docs * Recommended Modules * Tips to improve translation quality * Options * Language Code Reference * Translators Reference * GPT Config Reference * Using Gimp for rendering * Api Documentation * Synchronous mode * Asynchronous mode * Manual translation * Next steps * Support Us * Thanks To All Our Contributors :
VideoLLaMA2
VideoLLaMA 2 is a project focused on advancing spatial-temporal modeling and audio understanding in video-LLMs. It provides tools for multi-choice video QA, open-ended video QA, and video captioning. The project offers model zoo with different configurations for visual encoder and language decoder. It includes training and evaluation guides, as well as inference capabilities for video and image processing. The project also features a demo setup for running a video-based Large Language Model web demonstration.
thepipe
The Pipe is a multimodal-first tool for feeding files and web pages into vision-language models such as GPT-4V. It is best for LLM and RAG applications that require a deep understanding of tricky data sources. The Pipe is available as a hosted API at thepi.pe, or it can be set up locally.
blog
这是一个程序员关于 ChatGPT 学习过程的记录,其中包括了 ChatGPT 的使用技巧、相关工具和资源的整理,以及一些个人见解和思考。 **使用技巧** * **充值 OpenAI API**:可以通过 https://beta.openai.com/account/api-keys 进行充值,支持信用卡和 PayPal。 * **使用专梯**:推荐使用稳定的专梯,可以有效提高 ChatGPT 的访问速度和稳定性。 * **使用魔法**:可以通过 https://my.x-air.app:666/#/register?aff=32853 访问 ChatGPT,无需魔法即可访问。 * **下载各种 apk**:可以通过 https://apkcombo.com 下载各种安卓应用的 apk 文件。 * **ChatGPT 官网**:ChatGPT 的官方网站是 https://ai.com。 * **Midjourney**:Midjourney 是一个生成式 AI 图像平台,可以通过 https://midjourney.com 访问。 * **文本转视频**:可以通过 https://www.d-id.com 将文本转换为视频。 * **国内大模型**:国内也有很多大模型,如阿里巴巴的通义千问、百度文心一言、讯飞星火、阿里巴巴通义听悟等。 * **查看 OpenAI 状态**:可以通过 https://status.openai.com/ 查看 OpenAI 的服务状态。 * **Canva 画图**:Canva 是一个在线平面设计平台,可以通过 https://www.canva.cn 进行画图。 **相关工具和资源** * **文字转语音**:可以通过 https://modelscope.cn/models?page=1&tasks=text-to-speech&type=audio 找到文字转语音的模型。 * **可好好玩玩的项目**: * https://github.com/sunner/ChatALL * https://github.com/labring/FastGPT * https://github.com/songquanpeng/one-api * **个人博客**: * https://baoyu.io/ * https://gorden-sun.notion.site/527689cd2b294e60912f040095e803c5?v=4f6cc12006c94f47aee4dc909511aeb5 * **srt 2 lrc 歌词**:可以通过 https://gotranscript.com/subtitle-converter 将 srt 格式的字幕转换为 lrc 格式的歌词。 * **5 种速率限制**:OpenAI API 有 5 种速率限制:RPM(每分钟请求数)、RPD(每天请求数)、TPM(每分钟 tokens 数量)、TPD(每天 tokens 数量)、IPM(每分钟图像数量)。 * **扣子平台**:coze.cn 是一个扣子平台,可以提供各种扣子。 * **通过云函数免费使用 GPT-3.5**:可以通过 https://juejin.cn/post/7353849549540589587 免费使用 GPT-3.5。 * **不蒜子 统计网页基数**:可以通过 https://busuanzi.ibruce.info/ 统计网页的基数。 * **视频总结和翻译网页**:可以通过 https://glarity.app/zh-CN 总结和翻译视频。 * **视频翻译和配音工具**:可以通过 https://github.com/jianchang512/pyvideotrans 翻译和配音视频。 * **文字生成音频**:可以通过 https://www.cnblogs.com/jijunjian/p/18118366 将文字生成音频。 * **memo ai**:memo.ac 是一个多模态 AI 平台,可以将视频链接、播客链接、本地音视频转换为文字,支持多语言转录后翻译,还可以将文字转换为新的音频。 * **视频总结工具**:可以通过 https://summarize.ing/ 总结视频。 * **可每天免费玩玩**:可以通过 https://www.perplexity.ai/ 每天免费玩玩。 * **Suno.ai**:Suno.ai 是一个 AI 语言模型,可以通过 https://bibigpt.co/ 访问。 * **CapCut**:CapCut 是一个视频编辑软件,可以通过 https://www.capcut.cn/ 下载。 * **Valla.ai**:Valla.ai 是一个多模态 AI 模型,可以通过 https://www.valla.ai/ 访问。 * **Viggle.ai**:Viggle.ai 是一个 AI 视频生成平台,可以通过 https://viggle.ai 访问。 * **使用免费的 GPU 部署文生图大模型**:可以通过 https://www.cnblogs.com/xuxiaona/p/18088404 部署文生图大模型。 * **语音转文字**:可以通过 https://speech.microsoft.com/portal 将语音转换为文字。 * **投资界的 ai**:可以通过 https://reportify.cc/ 了解投资界的 ai。 * **抓取小视频 app 的各种信息**:可以通过 https://github.com/NanmiCoder/MediaCrawler 抓取小视频 app 的各种信息。 * **马斯克 Grok1 开源**:马斯克的 Grok1 模型已经开源,可以通过 https://github.com/xai-org/grok-1 访问。 * **ChatALL**:ChatALL 是一个跨端支持的聊天机器人,可以通过 https://github.com/sunner/ChatALL 访问。 * **零一万物**:零一万物是一个 AI 平台,可以通过 https://www.01.ai/cn 访问。 * **智普**:智普是一个 AI 语言模型,可以通过 https://chatglm.cn/ 访问。 * **memo ai 下载**:可以通过 https://memo.ac/ 下载 memo ai。 * **ffmpeg 学习**:可以通过 https://www.ruanyifeng.com/blog/2020/01/ffmpeg.html 学习 ffmpeg。 * **自动生成文章小工具**:可以通过 https://www.cognition-labs.com/blog 生成文章。 * **简易商城**:可以通过 https://www.cnblogs.com/whuanle/p/18086537 搭建简易商城。 * **物联网**:可以通过 https://www.cnblogs.com/xuxiaona/p/18088404 学习物联网。 * **自定义表单、自定义列表、自定义上传和下载、自定义流程、自定义报表**:可以通过 https://www.cnblogs.com/whuanle/p/18086537 实现自定义表单、自定义列表、自定义上传和下载、自定义流程、自定义报表。 **个人见解和思考** * ChatGPT 是一个强大的工具,可以用来提高工作效率和创造力。 * ChatGPT 的使用门槛较低,即使是非技术人员也可以轻松上手。 * ChatGPT 的发展速度非常快,未来可能会对各个行业产生深远的影响。 * 我们应该理性看待 ChatGPT,既要看到它的优点,也要意识到它的局限性。 * 我们应该积极探索 ChatGPT 的应用场景,为社会创造价值。
aiolauncher_scripts
AIO Launcher Scripts is a collection of Lua scripts that can be used with AIO Launcher to enhance its functionality. These scripts can be used to create widget scripts, search scripts, and side menu scripts. They provide various functions such as displaying text, buttons, progress bars, charts, and interacting with app widgets. The scripts can be used to customize the appearance and behavior of the launcher, add new features, and interact with external services.
gpt-subtrans
GPT-Subtrans is an open-source subtitle translator that utilizes large language models (LLMs) as translation services. It supports translation between any language pairs that the language model supports. Note that GPT-Subtrans requires an active internet connection, as subtitles are sent to the provider's servers for translation, and their privacy policy applies.
cody
Cody is a free, open-source AI coding assistant that can write and fix code, provide AI-generated autocomplete, and answer your coding questions. Cody fetches relevant code context from across your entire codebase to write better code that uses more of your codebase's APIs, impls, and idioms, with less hallucination.
aiscript
AiScript is a lightweight scripting language that runs on JavaScript. It supports arrays, objects, and functions as first-class citizens, and is easy to write without the need for semicolons or commas. AiScript runs in a secure sandbox environment, preventing infinite loops from freezing the host. It also allows for easy provision of variables and functions from the host.
20 - OpenAI Gpts
SEOGenius - Craft SEO titles & Effectiveness Score
Crafts SEO-friendly titles, subtitles, summaries, TLDRs, and hashtags for online content. Imagine crafting titles so SEO-friendly that Google sends you a personal thank-you note 😂
Générateur d'articles de blog
Je convertis les sous-titres YouTube en articles de blog, avec un ton sympa et accessible.
Write Better Emails at Work
Create professional, clear, and effective emails to improve team communication
Write a romance novel
Use this GPT to outline your romance novel: design your story, your characters, obstacles, stakes, twists, arena, etc… Then ask GPT to draft the chapters ❤️ (remember: you are the brain, GPT is just the hand. Stay creative, use this GPT as an author!)
Code Like a GOAT 🐐🧙🏻♂️
Unleash Your Inner GOAT in Coding! Be the ultimate full-stack developer with unrivaled skills in all coding languages and platforms. Write elegant, secure code, and more. Excel in cybersecurity and innovate with your comprehensive expertise. Ready to code like never before?
Academic Reports Buddy
Give me the name of a student and what you want to say and I'll help you write your reports. Upload your comments and I will proof read them.
DreamBerd
I can write and interpret code written in Dreamberd, the perfect programming language