Best AI tools for< Select Voice >
20 - AI tool Sites
![MakePodcast Screenshot](/screenshots/makepodcast.io.jpg)
MakePodcast
MakePodcast is an AI-powered platform that enables users to effortlessly craft professional podcasts in minutes. By leveraging Open AI TTS and Eleven Labs Voices, MakePodcast allows users to generate high-quality podcast episodes with ease. Users can upload scripts, select voices, and create personalized podcasts in various languages. The platform supports multiple use cases, including creating full podcast episodes, incorporating custom voices, generating ad reads, and reaching a global audience with multilingual support. MakePodcast offers a lifetime pricing plan with unlimited episode limits and the option to use custom voice models.
![Voxify Screenshot](/screenshots/voxify.ai.jpg)
Voxify
Voxify is an AI voice generator tool that allows users to effortlessly create immersive audio experiences by converting text to speech. With over 450 voices available in more than 120 languages and accents, users can customize every aspect of the narration, including pitch, speed, and emotion. Ideal for content creators, podcasters, and educators looking to enhance the quality of their voiceovers, Voxify offers a user-friendly interface and a wide range of customization options to bring text to life through realistic and engaging voice generation.
![Ankara AI Screenshot](/screenshots/ankarawebsite.vercel.app.jpg)
Ankara AI
Ankara AI is an automated video narration and commentary application that utilizes AI technology to generate narrations for videos in over 25+ different languages. Users can upload a video, select a voice, and provide a narration prompt to create high-quality narrations. The application ensures user privacy by not retaining user videos but securely storing anonymized prompts and script results to enhance narration quality.
![Podial Screenshot](/screenshots/podial.ai.jpg)
Podial
Podial is an AI-powered platform that allows users to generate podcasts from text documents, making it easy to learn complex topics through engaging discussions. Users can control the podcast topics, select voices and personalities for the discussion, and adjust the podcast length. Podial aims to simplify learning and information sharing by converting text into audio content, catering to various learning styles and preferences.
![Covers.AI Screenshot](/screenshots/covers.ai.jpg)
Covers.AI
Covers.AI is an AI voice generator and AI song generator platform that allows users to create custom AI voices by uploading voice recordings. It offers a wide range of AI voice models for various categories such as anime, cartoons, streamers, gaming, famous personalities, and more. Users can easily generate AI voices and songs in minutes, making it a game-changing tool for music lovers of all levels of expertise. Covers.AI provides a user-friendly experience, empowering users to control and enhance their voices effortlessly.
![Sound of Text Screenshot](/screenshots/soundoftext.app.jpg)
Sound of Text
Sound of Text is a free online text-to-speech converter that uses AI technology to convert written text into spoken words. It supports over 840 different voices in more than 135 languages, and allows users to download the resulting audio files in a variety of formats. Sound of Text is easy to use and can be used for a variety of purposes, such as creating audiobooks, podcasts, and presentations.
![Firebay Studios Screenshot](/screenshots/www.firebaystudios.com.jpg)
Firebay Studios
Firebay Studios is an AI-powered platform that enables users to create high-quality radio ads in seconds. The tool helps companies and organizations of all sizes to automate production processes, streamline ad creation, and ultimately boost revenue. With features like AI & Cloned Voices, Editing & Production, Script Writing, SFX & Music, and support for 29 languages, Firebay Studios offers a comprehensive solution for creating captivating audio-based advertisements effortlessly.
![Poddy.ai Screenshot](/screenshots/poddy.ai.jpg)
Poddy.ai
Poddy.ai is an AI-powered platform that simplifies podcasting by providing end-to-end solutions for creating, publishing, and growing podcasts. With features like automatic episode generation, podcast series creation, and AI voices, Poddy.ai offers a comprehensive toolkit for podcasters to bring their vision to life. The platform is completely free to use and ensures advanced security for podcast data.
![Audyo Screenshot](/screenshots/www.audyo.ai.jpg)
Audyo
Audyo is a text-to-speech tool that allows users to create realistic-sounding audio from text. With over 100 voices to choose from, users can create audio in a variety of languages and accents. Audyo is easy to use, simply type in your text and select a voice. You can then download your audio file or embed it on your website or blog. Audyo is a great tool for creating voiceovers for videos, podcasts, audiobooks, and more.
![Videolulu Screenshot](/screenshots/videolulu.com.jpg)
Videolulu
Videolulu is an AI-powered tool that enables users to generate faceless videos on autopilot. It allows users to turn their ideas into viral shorts in minutes by creating engaging content in popular formats for platforms like TikTok, Instagram, and YouTube. With a simple 4-step process, users can choose a video type, select a voice from a variety of AI voices, add background music, and select a video format using AI images, stock videos, or split screens. Videolulu offers different pricing plans to suit varying needs, from a free plan with limited features to premium plans with more credits and options.
![Trupeer Screenshot](/screenshots/trupeer.ai.jpg)
Trupeer
Trupeer is an AI-powered platform that allows users to effortlessly create professional product videos and detailed documentation in minutes. By leveraging AI technology, Trupeer transforms simple screen recordings into polished videos and guides, eliminating the need for prior experience in video editing, technical writing, or graphic design. The platform offers studio-quality product videos with AI voiceovers, automated zoom effects, and cleaned-up grammar. Trupeer is suitable for product marketing, design walkthroughs, learning and development, sales and operations excellence, customer onboarding, and YouTube content creation. Users can easily record with a Chrome extension, edit scripts, select AI voiceovers, wallpapers, and music, and download the content in various formats. Trupeer is designed to save time and effort in creating how-to-guides and offers pricing options for individuals, hobbyists, professionals, and enterprise users.
![Voice.ai Screenshot](/screenshots/voice.ai.jpg)
Voice.ai
Voice.ai is a free real-time voice changer and the largest ecosystem of free AI voice tools. With Voice.ai, you can change your voice in real-time, clone voices, create soundboards, and more. Voice.ai is perfect for streamers, content creators, gamers, and anyone who wants to have fun with their voice.
![Voicemaker Screenshot](/screenshots/voicemaker.in.jpg)
Voicemaker
Voicemaker is a text-to-speech converter that allows users to create audio files for commercial use. It offers a variety of features, including the ability to select from a range of AI-powered voices, adjust the speed, pitch, and volume of the audio, and add background music. Voicemaker's audio files can be shared on any platform worldwide and are trusted by over 1000 well-known brands.
![Voxdazz Screenshot](/screenshots/voxdazz.com.jpg)
Voxdazz
Voxdazz is a celebrity AI voice generator website that allows users to select a celebrity template, input text, and generate a video with lifelike voices. With famous characters like Donald Trump, Joe Biden, and more, Voxdazz offers a fun and entertaining way to bring words to life through AI-generated voices. The service provides realistic voice cloning technology for creating humorous content, skits, and parodies with friends and family.
![Audyo Screenshot](/screenshots/audyo.ai.jpg)
Audyo
Audyo is an AI tool that allows users to create human-quality AI voices easily by simply typing text. With over 100 voices to choose from, users can select speakers in various languages, accents, and even celebrity impersonators. The tool enables users to edit words, not waveforms, and export audio for use in videos, podcasts, presentations, and more. Audyo also offers features like creating conversations, mixing and matching languages, customizing pronunciations, and utilizing an AI assistant for script tweaking. Users can enjoy 15 minutes of audio generation with a free account and earn additional time by inviting friends. Audyo empowers creators to unleash their imagination and enhance their content with lifelike AI voices.
![VidGenesis Screenshot](/screenshots/vidgenesis.gyata.ai.jpg)
VidGenesis
VidGenesis is an AI-powered video generator that allows users to create engaging videos in minutes. With its user-friendly interface and powerful AI technology, VidGenesis makes it easy for anyone to create high-quality videos for a variety of purposes, including marketing, education, and entertainment. Some of the key features of VidGenesis include the ability to choose from a variety of video templates, add custom text and images, and select from a range of AI-generated voices. VidGenesis also offers a variety of advanced features, such as the ability to add custom branding and download videos in HD quality.
![AutoFeed.ai Screenshot](/screenshots/autofeed.ai.jpg)
AutoFeed.ai
AutoFeed.ai is a generative AI text to video platform that enables users to create viral TikToks, Reels, and Shorts in seconds using advanced AI features. The platform offers a unique AI video generator for YouTube, TikTok, and Reels, allowing users to select from various video categories and generate high-quality videos with hyper-realistic quality. With a one-click feature, users can easily create and customize videos, leveraging trending categories and viral channel ideas. AutoFeed.ai supports multiple languages and provides well-known AI voices for narration, making it a versatile tool for content creators looking to grow their online presence.
![Adori Blog to Video Maker Screenshot](/screenshots/www.adorilabs.com.jpg)
Adori Blog to Video Maker
Adori Blog to Video Maker is an AI-powered tool that helps bloggers convert their written content into engaging and visually appealing videos. With its advanced AI algorithms, Adori analyzes blog content, selects relevant images, suggests transitions, and generates professional voiceovers, transforming blogs into videos that capture attention and drive engagement. The tool offers a range of features, including realistic AI voiceovers, eye-catching visuals, SEO optimization, and social media integration, making it easy for bloggers to create high-quality videos that resonate with their audience.
![xPromo Screenshot](/screenshots/xpromo.tech.jpg)
xPromo
xPromo is a platform that uses AI to help projects with similar audiences launch win-win marketing campaigns that generate views, leads, and customers. It analyzes your project and selects non-competing partners with similar audiences who will be most interested in your solution. You can then integrate a special promo page into your project where AI will recommend partner solutions to your audience and vice versa. AI also balances cross-promotion so that each project gets as many views and clicks as it generates for its partners.
![Sniper AI Screenshot](/screenshots/ixceed.sniperai.com.jpg)
Sniper AI
Sniper AI is an AI-powered platform that serves as a marketplace connecting job candidates with recruiters. The platform streamlines the recruitment process by leveraging artificial intelligence algorithms to match candidates with suitable job openings based on their skills and preferences. With a user-friendly interface, Sniper AI aims to revolutionize the hiring process by providing a seamless and efficient experience for both candidates and recruiters.
20 - Open Source AI Tools
![TEN-Agent Screenshot](/screenshots_githubs/TEN-framework-TEN-Agent.jpg)
TEN-Agent
TEN Agent is an open-source multimodal agent powered by the world’s first real-time multimodal framework, TEN Framework. It offers high-performance real-time multimodal interactions, multi-language and multi-platform support, edge-cloud integration, flexibility beyond model limitations, and real-time agent state management. Users can easily build complex AI applications through drag-and-drop programming, integrating audio-visual tools, databases, RAG, and more.
![openai-edge-tts Screenshot](/screenshots_githubs/travisvn-openai-edge-tts.jpg)
openai-edge-tts
This project provides a local, OpenAI-compatible text-to-speech (TTS) API using `edge-tts`. It emulates the OpenAI TTS endpoint (`/v1/audio/speech`), enabling users to generate speech from text with various voice options and playback speeds, just like the OpenAI API. `edge-tts` uses Microsoft Edge's online text-to-speech service, making it completely free. The project supports multiple audio formats, adjustable playback speed, and voice selection options, providing a flexible and customizable TTS solution for users.
![AICoverGen Screenshot](/screenshots_githubs/SociallyIneptWeeb-AICoverGen.jpg)
AICoverGen
AICoverGen is an autonomous pipeline designed to create covers using any RVC v2 trained AI voice from YouTube videos or local audio files. It caters to developers looking to incorporate singing functionality into AI assistants/chatbots/vtubers, as well as individuals interested in hearing their favorite characters sing. The tool offers a WebUI for easy conversions, cover generation from local audio files, volume control for vocals and instrumentals, pitch detection method control, pitch change for vocals and instrumentals, and audio output format options. Users can also download and upload RVC models via the WebUI, run the pipeline using CLI, and access various advanced options for voice conversion and audio mixing.
![Pandrator Screenshot](/screenshots_githubs/lukaszliniewicz-Pandrator.jpg)
Pandrator
Pandrator is a GUI tool for generating audiobooks and dubbing using voice cloning and AI. It transforms text, PDF, EPUB, and SRT files into spoken audio in multiple languages. It leverages XTTS, Silero, and VoiceCraft models for text-to-speech conversion and voice cloning, with additional features like LLM-based text preprocessing and NISQA for audio quality evaluation. The tool aims to be user-friendly with a one-click installer and a graphical interface.
![WeeaBlind Screenshot](/screenshots_githubs/FlorianEagox-WeeaBlind.jpg)
WeeaBlind
Weeablind is a program that uses modern AI speech synthesis, diarization, language identification, and voice cloning to dub multi-lingual media and anime. It aims to create a pleasant alternative for folks facing accessibility hurdles such as blindness, dyslexia, learning disabilities, or simply those that don't enjoy reading subtitles. The program relies on state-of-the-art technologies such as ffmpeg, pydub, Coqui TTS, speechbrain, and pyannote.audio to analyze and synthesize speech that stays in-line with the source video file. Users have the option of dubbing every subtitle in the video, setting the start and end times, dubbing only foreign-language content, or full-blown multi-speaker dubbing with speaking rate and volume matching.
![orcish-ai-nextjs-framework Screenshot](/screenshots_githubs/TheOrcDev-orcish-ai-nextjs-framework.jpg)
orcish-ai-nextjs-framework
The Orcish AI Next.js Framework is a powerful tool that leverages OpenAI API to seamlessly integrate AI functionalities into Next.js applications. It allows users to generate text, images, and text-to-speech based on specified input. The framework provides an easy-to-use interface for utilizing AI capabilities in application development.
![classifai Screenshot](/screenshots_githubs/10up-classifai.jpg)
classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.
![AI Screenshot](/screenshots_githubs/PreternaturalAI-AI.jpg)
AI
AI is an open-source Swift framework for interfacing with generative AI. It provides functionalities for text completions, image-to-text vision, function calling, DALLE-3 image generation, audio transcription and generation, and text embeddings. The framework supports multiple AI models from providers like OpenAI, Anthropic, Mistral, Groq, and ElevenLabs. Users can easily integrate AI capabilities into their Swift projects using AI framework.
![openedai-speech Screenshot](/screenshots_githubs/matatonic-openedai-speech.jpg)
openedai-speech
OpenedAI Speech is a free, private text-to-speech server compatible with the OpenAI audio/speech API. It offers custom voice cloning and supports various models like tts-1 and tts-1-hd. Users can map their own piper voices and create custom cloned voices. The server provides multilingual support with XTTS voices and allows fixing incorrect sounds with regex. Recent changes include bug fixes, improved error handling, and updates for multilingual support. Installation can be done via Docker or manual setup, with usage instructions provided. Custom voices can be created using Piper or Coqui XTTS v2, with guidelines for preparing audio files. The tool is suitable for tasks like generating speech from text, creating custom voices, and multilingual text-to-speech applications.
![Generative-AI-Pharmacist Screenshot](/screenshots_githubs/kennethleungty-Generative-AI-Pharmacist.jpg)
Generative-AI-Pharmacist
Generative AI Pharmacist is a project showcasing the use of generative AI tools to create an animated avatar named Macy, who delivers medication counseling in a realistic and professional manner. The project utilizes tools like Midjourney for image generation, ChatGPT for text generation, ElevenLabs for text-to-speech conversion, and D-ID for creating a photorealistic talking avatar video. The demo video featuring Macy discussing commonly-prescribed medications demonstrates the potential of generative AI in healthcare communication.
![VoiceStreamAI Screenshot](/screenshots_githubs/alesaccoia-VoiceStreamAI.jpg)
VoiceStreamAI
VoiceStreamAI is a Python 3-based server and JavaScript client solution for near-realtime audio streaming and transcription using WebSocket. It employs Huggingface's Voice Activity Detection (VAD) and OpenAI's Whisper model for accurate speech recognition. The system features real-time audio streaming, modular design for easy integration of VAD and ASR technologies, customizable audio chunk processing strategies, support for multilingual transcription, and secure sockets support. It uses a factory and strategy pattern implementation for flexible component management and provides a unit testing framework for robust development.
![voice-pro Screenshot](/screenshots_githubs/abus-aikorea-voice-pro.jpg)
voice-pro
Voice-Pro is an integrated solution for subtitles, translation, and TTS. It offers features like multilingual subtitles, live translation, vocal remover, and supports OpenAI Whisper and Open-Source Translator. The tool provides a Studio tab for various functions, Whisper Caption tab for subtitle creation, Translate tab for translation, TTS tab for text-to-speech, Live Translation tab for real-time voice recognition, and Batch tab for processing multiple files. Users can download YouTube videos, improve voice recognition accuracy, create automatic subtitles, and produce multilingual videos with ease. The tool is easy to install with one-click and offers a Web-UI for user convenience.
![wit-unity Screenshot](/screenshots_githubs/wit-ai-wit-unity.jpg)
wit-unity
Wit-unity is a Unity C# based wrapper around the rest apis provided by Wit.ai. It is meant to be used as a base library within Voice SDK. We have made it accessible here for contributions and early adoption testing. Wit-unity is ideal for developers looking to do early research with voice and potential expand the core capabilities of Voice SDK.
![dinopal Screenshot](/screenshots_githubs/fatwang2-dinopal.jpg)
dinopal
DinoPal is an AI voice assistant residing in the Mac menu bar, offering real-time voice and video chat, screen sharing, online search, and multilingual support. It provides various AI assistants with unique strengths and characteristics to meet different conversational needs. Users can easily install DinoPal and access different communication modes, with a call time limit of 30 minutes. User feedback can be shared in the Discord community. DinoPal is powered by Google Gemini & Pipecat.
![pipecat Screenshot](/screenshots_githubs/pipecat-ai-pipecat.jpg)
pipecat
Pipecat is an open-source framework designed for building generative AI voice bots and multimodal assistants. It provides code building blocks for interacting with AI services, creating low-latency data pipelines, and transporting audio, video, and events over the Internet. Pipecat supports various AI services like speech-to-text, text-to-speech, image generation, and vision models. Users can implement new services and contribute to the framework. Pipecat aims to simplify the development of applications like personal coaches, meeting assistants, customer support bots, and more by providing a complete framework for integrating AI services.
![talk-to-chatgpt Screenshot](/screenshots_githubs/C-Nedelcu-talk-to-chatgpt.jpg)
talk-to-chatgpt
Talk-To-ChatGPT is a Google Chrome and Microsoft Edge extension that enables users to interact with the ChatGPT AI using voice commands for speech recognition and text-to-speech responses. The tool enhances the conversational experience by allowing users to speak to the AI and receive spoken responses, making interactions more natural and engaging. It also supports ElevenLabs API integration for creating custom voices for text-to-speech. The extension provides settings for voice, language, and more, and can be installed from the Chrome and Edge web stores or manually. While the project has been discontinued due to upcoming desktop apps from OpenAI, it has been used to assist individuals with disabilities and the elderly in interacting with ChatGPT.
![wunjo.wladradchenko.ru Screenshot](/screenshots_githubs/wladradchenko-wunjo.wladradchenko.ru.jpg)
wunjo.wladradchenko.ru
Wunjo AI is a comprehensive tool that empowers users to explore the realm of speech synthesis, deepfake animations, video-to-video transformations, and more. Its user-friendly interface and privacy-first approach make it accessible to both beginners and professionals alike. With Wunjo AI, you can effortlessly convert text into human-like speech, clone voices from audio files, create multi-dialogues with distinct voice profiles, and perform real-time speech recognition. Additionally, you can animate faces using just one photo combined with audio, swap faces in videos, GIFs, and photos, and even remove unwanted objects or enhance the quality of your deepfakes using the AI Retouch Tool. Wunjo AI is an all-in-one solution for your voice and visual AI needs, offering endless possibilities for creativity and expression.
![kobold_assistant Screenshot](/screenshots_githubs/lee-b-kobold_assistant.jpg)
kobold_assistant
Kobold-Assistant is a fully offline voice assistant interface to KoboldAI's large language model API. It can work online with the KoboldAI horde and online speech-to-text and text-to-speech models. The assistant, called Jenny by default, uses the latest coqui 'jenny' text to speech model and openAI's whisper speech recognition. Users can customize the assistant name, speech-to-text model, text-to-speech model, and prompts through configuration. The tool requires system packages like GCC, portaudio development libraries, and ffmpeg, along with Python >=3.7, <3.11, and runs on Ubuntu/Debian systems. Users can interact with the assistant through commands like 'serve' and 'list-mics'.
20 - OpenAI Gpts
![Gift Book Advisor Screenshot](/screenshots_gpts/g-YAFyIlqFE.jpg)
Gift Book Advisor
Help you to select a book as a present for your friend, family member, co-worker, client or business partner
C Programming Pointer Tutor
To get started, please type "menu" and select an option from the menu by typing the corresponding keyword or number. If at any time you need assistance or wish to ask a question, just type "help" for more options.
![401k to Gold IRA Rollover Tool - FREE Screenshot](/screenshots_gpts/g-tHt9nypGs.jpg)
401k to Gold IRA Rollover Tool - FREE
This is a guide on how to do a 401k to gold IRA rollover, and select the best company to work with.
![Astro Light Explorer Screenshot](/screenshots_gpts/g-POyAAfOJs.jpg)
Astro Light Explorer
Your guide through the luminous wonders of the cosmos! Expert-level astronomy research assistant in light phenomena. Select a prompt or type begin to start.
![SuperHero Me | Create a SuperHero Alter Ego Screenshot](/screenshots_gpts/g-1S2PnH52a.jpg)
SuperHero Me | Create a SuperHero Alter Ego
Level up Now. Upload a selfie for some superhero flair. Create a backstory. Select a superpower, arch-villain, and crew. Answer trivia. Pow!
![Logo Creator Pro GPT Screenshot](/screenshots_gpts/g-fehbSh8KZ.jpg)
Logo Creator Pro GPT
Design logos from sketches. Upload a sketch of your logo idea to Logo Creator GPT. Tell it your company name, select the style you like, choose your colors and let Logo Creator GPT do the rest. Then work with Logo Creator GPT to refine and edit it until you have the perfect brand logo.
![Polymer Engineering Advisor Screenshot](/screenshots_gpts/g-Mk1bu5XFC.jpg)
Polymer Engineering Advisor
Guides polymer selection and application in manufacturing processes.
![GMB Listing Category Selector Tool Screenshot](/screenshots_gpts/g-hP51RmPzH.jpg)
GMB Listing Category Selector Tool
Aid in selecting Google Business categories for diverse small businesses.
Orchard
Expert in fruit orchards and cultivation with a focus on agriculture and horticulture.
![Metal Screenshot](/screenshots_gpts/g-1vmYjqIVs.jpg)
Metal
Expert in metals, metalworking, and alloys, providing detailed and informative insights.
![Typography Layout Advisor Screenshot](/screenshots_gpts/g-QqCaqNIdu.jpg)
Typography Layout Advisor
Typography layout design, typeface, consultation regarding font color, modern font layout Help to enhance the brand according to new typography trends.