Best AI tools for< Improve Voice Acting >
20 - AI tool Sites

Dubbing AI
Dubbing AI is a free real-time AI voice changer that allows you to change your voice in real-time while speaking. It offers a variety of voice effects, including male, female, child, robot, and more. You can also use Dubbing AI to add sound effects and music to your recordings. Dubbing AI is perfect for creating funny videos, voiceovers, and other creative projects.

Atlanta Voiceover Studio
Atlanta Voiceover Studio is a professional voiceover training and recording studio based in Atlanta, GA. They offer a wide range of workshops and classes for voiceover artists of all levels, from beginners to experienced professionals. The studio provides training in various aspects of voiceover work, including animation, commercial voiceover, audiobook narration, and more. In addition to training, they also offer services such as auditions, demos, and business coaching to help voiceover artists succeed in the industry.

AI Interview Answers Generator
AI Interview Answers Generator is an innovative tool designed to assist individuals in acing their job interviews by providing real-time voice transcription, instant optimal solutions, and industry-specific knowledge base. The tool acts as a virtual copilot during interviews, ensuring users have access to relevant and up-to-date information to stand out among other candidates. With cutting-edge AI technology, users can confidently navigate through technical questions and showcase their skills effectively.

Voice Crush
Voice Crush is an AI-powered recording application designed to enhance audio quality by eliminating background noise and stuttering. It offers a user-friendly interface for individuals looking to improve their voice recordings in challenging acoustic environments. With state-of-the-art denoising AI technology, Voice Crush ensures that your voice stands out clearly in every recording. Whether you are a language learner or a professional seeking to deliver articulate messages, Voice Crush provides the tools to boost your confidence and improve the flow of your voice messages. Say goodbye to noisy backgrounds and stuttering with Voice Crush, your ultimate solution for high-quality audio recordings.

Vocal Image
Vocal Image is an AI-powered coaching app that offers speech and communication lessons to help speakers and singers boost confidence and enhance the attractiveness of their voice. The app provides voice evaluations, educational content, specialized programs, and challenges designed to improve voice quality and communication skills. Users can record their voice, receive feedback from a community of voice enthusiasts, and engage with AI coach recommendations to achieve their voice goals.

Voam
Voam is a productive AI platform that helps you to automate your tasks and improve your productivity. With Voam, you can create custom AI models to automate any task, from simple data entry to complex decision-making. Voam is easy to use and requires no coding experience. You can create an AI model in minutes and start automating your tasks right away.

Yoodli
Yoodli is a private, real-time, and judgment-free communication coaching tool powered by AI. It helps users improve their communication skills by providing feedback on speech, similar to Grammarly but for spoken language. Trusted by top companies like Google, Uber, and Accenture, Yoodli offers personalized coaching experiences to enhance public speaking, sales pitches, negotiations, and crucial conversations. With features like AI-powered follow-up questions, real-time feedback, and customizable scenarios, Yoodli aims to be the go-to platform for individuals and enterprises seeking to enhance their communication abilities.

AI Voice Studio
AI Voice Studio is an innovative online tool that allows users to convert text into lifelike speech using advanced AI technology. With AI Voice Studio, users can easily create high-quality voiceovers for various purposes such as videos, podcasts, and presentations. The tool offers a user-friendly interface and a wide range of customization options to tailor the voice output to specific needs. Whether you are a content creator, marketer, or educator, AI Voice Studio provides a convenient and efficient solution for generating natural-sounding voice content.

Outer Voice AI
Outer Voice AI is a mobile application that provides users with an AI-powered coach. The coach can be used to get advice, support, or information on a variety of topics. The coach's responses are generated using artificial intelligence, and they are tailored to the user's individual needs. The coach's voice can also be customized to sound like the user's own voice.

Writetone
Writetone is an AI-powered writing assistant that helps users write in a variety of tones, from formal to informal, persuasive to informative, and creative to engaging. It offers a range of features to help users improve their writing skills, including a paraphrasing tool, co-writer, summarizer, grammar checker, text-to-voice tool, and subject matter expert. Writetone is available as a Chrome extension and MS Word add-in, and it offers a variety of resources to help users get started, including blogs, guides, tutorials, and free templates.

BoldVoice Accent Oracle
BoldVoice Accent Oracle is an AI-powered application designed to help users improve their American English accent. By analyzing users' speech patterns, it can accurately guess their native language within 30 seconds. The app provides personalized training to enhance pronunciation and intonation, aiming to help users sound more like native English speakers. BoldVoice Accent Oracle is a user-friendly tool that offers a fun and interactive way to work on accent reduction and language proficiency.

VoiceCanvas
VoiceCanvas is an advanced AI-powered multilingual voice synthesis and voice cloning platform that offers instant text-to-speech in over 40 languages. It utilizes cutting-edge AI technology to provide high-quality voice synthesis with natural intonation and rhythm, along with personalized voice cloning for more human-like AI speech. Users can upload voice samples, have AI analyze voice features, generate personalized AI voice models, input text for conversion, and apply the cloned AI voice model to generate natural voice speech. VoiceCanvas is highly praised by language learners, content creators, teachers, business owners, voice actors, and educators for its exceptional voice quality, multiple language support, and ease of use in creating voiceovers, learning materials, and podcast content.

Quant-Tek.AI
Quant-Tek.AI is a premier provider of conversational artificial intelligence tools, empowering businesses with human-like voice AI solutions. Their mission is to revolutionize the way businesses interact with customers by providing intelligent solutions that automate communication and enhance customer experience. They aim to drive efficiency, improve customer satisfaction, and foster growth through cutting-edge AI technology. Quant-Tek.AI values innovation, excellence, integrity, and collaboration in their pursuit of AI innovation and shaping the future of business communication.

Presto
Presto is an AI-driven automation tool designed for drive-thru restaurants to improve staff productivity, increase revenue, and enhance the guest experience. With over 15 years of industry experience, Presto is the most popular automation solution for drive-thru restaurants. It offers a powerful spectrum of Voice AI to optimize staff efficiency, supercharge upselling, improve order accuracy, and accelerate service.

PlayHT
PlayHT is an advanced AI Voice Generator that offers realistic Text to Speech capabilities and AI Voiceover services. With a wide range of features and advantages, PlayHT provides users with ultra-realistic AI voices that are indistinguishable from humans. The platform allows users to generate conversational, expressive, and emotional voice content for various applications, including videos, podcasts, e-learning, gaming, and more. PlayHT's AI Voice Cloning feature encapsulates every accent and dialect, offering a seamless voice generation experience. The platform also provides Voice Generation API for real-time voice cloning and generation. PlayHT is trusted by individuals and teams of all sizes for creating custom AI voices and enhancing projects with high-quality voice content.

Pronounce
Pronounce is an AI-powered English speech checker designed for professionals, educators, language learners, and speech therapists. It offers instant feedback and multiple drills to help users master speaking skills, understand specific communication challenges, and track therapy progress. With features like AI-powered speech feedback, English speaking partner, confident communication tips, pronunciation correction, and vocabulary enhancement, Pronounce aims to improve users' English pronunciation, grammar, and fluency. The application provides a user-friendly interface and visually appealing experience, making it suitable for beginners and advanced speakers alike.

Whisper Memos
Whisper Memos is an application that allows users to record voice memos and have them transcribed into text. The app uses artificial intelligence to generate an emoji or two for the subject of the memo, and to divide the text into paragraphs. Whisper Memos also has a private mode, which allows users to opt-out of storing transcripts in their account.

Hiya
Hiya is an AI-powered caller ID, call blocker, and protection application that enhances voice communication experiences. It helps users identify incoming calls, block spam and fraud, and protect against AI voice fraud and scams. Hiya offers solutions for businesses, carriers, and consumers, with features like branded caller ID, spam detection, call filtering, and more. With a global reach and a user base of over 450 million, Hiya aims to bring trust, identity, and intelligence back to phone calls.

Shook
Shook is an app that allows you to hear your voice in different languages. It is a fun and easy way to learn new languages or to simply hear how your voice sounds in a different language.

MagicLoop
MagicLoop is a voice survey tool designed to enhance customer feedback by replacing written feedback with spoken responses. It allows users to gather higher-quality responses through voice surveys, capturing emotions, tones, and nuances for a deeper understanding of participants' feelings and intentions. The tool aims to improve participant engagement and provide detailed insights by encouraging genuine responses. MagicLoop offers a modern approach to surveys, addressing the limitations of traditional methods and providing tailored solutions for various use cases such as user research, satisfaction surveys, NPS, feedback collection, market research, and data monitoring. With features like AI analysis, speech-to-text transcription, and custom branding, MagicLoop streamlines the process of generating insights from voice recordings.
20 - Open Source AI Tools

embodied-agents
Embodied Agents is a toolkit for integrating large multi-modal models into existing robot stacks with just a few lines of code. It provides consistency, reliability, scalability, and is configurable to any observation and action space. The toolkit is designed to reduce complexities involved in setting up inference endpoints, converting between different model formats, and collecting/storing datasets. It aims to facilitate data collection and sharing among roboticists by providing Python-first abstractions that are modular, extensible, and applicable to a wide range of tasks. The toolkit supports asynchronous and remote thread-safe agent execution for maximal responsiveness and scalability, and is compatible with various APIs like HuggingFace Spaces, Datasets, Gymnasium Spaces, Ollama, and OpenAI. It also offers automatic dataset recording and optional uploads to the HuggingFace hub.

fount
fount is a character card frontend page that decouples AI sources, AI characters, user personas, dialogue environments, and AI plugins, allowing them to be freely combined to spark infinite possibilities. It serves as a bridge connecting imagination and reality, a lighthouse guiding characters and stories, and a free garden for AI sources, characters, personas, dialogue environments, and plugins to grow and bloom. It integrates AI sources without the need for reverse proxy servers, improves web experience with features like multi-device synchronization and unfiltered HTML rendering, and extends companionship beyond the web by connecting characters to Discord groups and providing gentle reminders through fount-pwsh. For character creators, fount offers infinite possibilities with JavaScript or TypeScript code customization, execution of code without filtering, loading npm packages, and creating custom HTML pages. It encourages extension through modularization and community contributions.

home-llm
Home LLM is a project that provides the necessary components to control your Home Assistant installation with a completely local Large Language Model acting as a personal assistant. The goal is to provide a drop-in solution to be used as a "conversation agent" component by Home Assistant. The 2 main pieces of this solution are Home LLM and Llama Conversation. Home LLM is a fine-tuning of the Phi model series from Microsoft and the StableLM model series from StabilityAI. The model is able to control devices in the user's house as well as perform basic question and answering. The fine-tuning dataset is a custom synthetic dataset designed to teach the model function calling based on the device information in the context. Llama Conversation is a custom component that exposes the locally running LLM as a "conversation agent" in Home Assistant. This component can be interacted with in a few ways: using a chat interface, integrating with Speech-to-Text and Text-to-Speech addons, or running the oobabooga/text-generation-webui project to provide access to the LLM via an API interface.

Awesome-LLM-Resources-List
Awesome LLM Resources is a curated collection of resources for Large Language Models (LLMs) covering various aspects such as serverless hosting, accessing off-the-shelf models via API, local inference, LLM serving frameworks, open-source LLM web chat UIs, renting GPUs for fine-tuning, fine-tuning with no-code UI, fine-tuning frameworks, OS agentic/AI workflow, AI agents, co-pilots, voice API, open-source TTS models, OS RAG frameworks, research papers on chain-of-thought prompting, CoT implementations, CoT fine-tuned models & datasets, and more.

voice-pro
Voice-Pro is an integrated solution for subtitles, translation, and TTS. It offers features like multilingual subtitles, live translation, vocal remover, and supports OpenAI Whisper and Open-Source Translator. The tool provides a Studio tab for various functions, Whisper Caption tab for subtitle creation, Translate tab for translation, TTS tab for text-to-speech, Live Translation tab for real-time voice recognition, and Batch tab for processing multiple files. Users can download YouTube videos, improve voice recognition accuracy, create automatic subtitles, and produce multilingual videos with ease. The tool is easy to install with one-click and offers a Web-UI for user convenience.

Synthetic-Voice-Detection-Vocoder-Artifacts
The Synthetic-Voice-Detection-Vocoder-Artifacts repository provides the LibriSeVoc dataset containing self-vocoding samples created with six state-of-the-art vocoders to expose and exploit vocoder artifacts. It also introduces a new approach for detecting synthetic human voices by identifying signal artifacts left by neural vocoders and enhancing the RawNet2 baseline. The repository includes a paper and dataset for further reference and offers instructions for training the model and testing it in the wild.

voicechat2
Voicechat2 is a fast, fully local AI voice chat tool that uses WebSockets for communication. It includes a WebSocket server for remote access, default web UI with VAD and Opus support, and modular/swappable SRT, LLM, TTS servers. Users can customize components like SRT, LLM, and TTS servers, and run different models for voice-to-voice communication. The tool aims to reduce latency in voice communication and provides flexibility in server configurations.

audiobook-creator
Audiobook Creator is an open-source tool that converts books in various text formats into fully voiced audiobooks with intelligent character voice attribution. It utilizes NLP, LLMs, and TTS technologies to provide an engaging audiobook experience. The project includes components for text cleaning and formatting, character identification, and audiobook generation. Key features include a Gradio UI app, M4B audiobook creation, multi-format support, Docker compatibility, customizable narration, progress tracking, and open-source licensing.

ultravox
Ultravox is a fast multimodal Language Model (LLM) that can understand both text and human speech in real-time without the need for a separate Audio Speech Recognition (ASR) stage. By extending Meta's Llama 3 model with a multimodal projector, Ultravox converts audio directly into a high-dimensional space used by Llama 3, enabling quick responses and potential understanding of paralinguistic cues like timing and emotion in human speech. The current version (v0.3) has impressive speed metrics and aims for further enhancements. Ultravox currently converts audio to streaming text and plans to emit speech tokens for direct audio conversion. The tool is open for collaboration to enhance this functionality.

DeepBattler
DeepBattler is a tool designed for Hearthstone Battlegrounds players, providing real-time strategic advice and insights to improve gameplay experience. It integrates with the Hearthstone Deck Tracker plugin and offers voice-assisted guidance. The tool is powered by a large language model (LLM) and can match the strength of top players on EU servers. Users can set up the tool by adding dependencies, configuring the plugin path, and launching the LLM agent. DeepBattler is licensed for personal, educational, and non-commercial use, with guidelines on non-commercial distribution and acknowledgment of external contributions.
20 - OpenAI Gpts

CDR Guru
To master Unified Communications Data across platforms like Cisco, Avaya, Mitel, and Microsoft Teams, by orchestrating a team of expert agents and providing actionable solutions.

Passive to Active Voice Text Converter AI
I convert and rewrite passive voice text into active voice tone and language. Simply put your passive voice text below! Perfect for sentences, paragraphs, daily emails, and longer texts.

Your Lingo AI Coach
Welcome! I'm a voice-focused language teacher for interactive speaking practice. To enable voice, download the app and tap the headphone button next to my chat window. Then choose your preferred voice. When you're ready, tell me what language you'd like to learn. It's FREE!

DateMate
Your friendly AI assistant for voice-based dating, offering personalized tips, safety advice, and fun interactions.

Speak GPT
Voice-centric English role-play tool for speaking practice and offering personalized feedback!

Language Coach
Practice speaking another language like a local without being a local (use ChatGPT Voice via mobile app!)

AI Phonetics and Reading Coach with Speech
Phonetics and reading coach with interactive voice capabilities, tailored for adult beginners.

The Master in Brand Identity - GetMax
Guiding startups to creating unique brand/product voice & tone for content marketing.

Bob's Language Tutor
Language tutor focusing on communication. Responds to voice. Starts with basics.

CaseCracker™: Consultant Case Interview Practice
Crack open the door to your future. (Partner tip: use the iPhone app for voice chat)

📝 Study Guide AI: Spelling 🏆
Transform your spelling study sessions into interactive spelling bees! 🐝 Upload your word list and dive into a voice-activated quiz. Hear the word, spell it out, and get instant feedback before tackling the next challenge. Perfect your spelling skills one word at a time!

Polish your Polish
A bilingual Polish tutor || Learn/ Translate/ Double-check Polish with some support of your native language (try our VOICE chat!)

Marina the Brazilian Portuguese Tutor
More than your average AI Teacher! A Teacher with a REAL personality👋🏻 Hi there! ❤️ Learn with me Brazilian Portuguese ✅ I coach beginner to advanced level 💬 Practice vocabulary, writing, reading, speaking, or learn a new topic 📲 Use voice in mobile for talking