Best AI tools for <Add Sound>
20 - AI tool Sites

Epidemic Sound
Epidemic Sound is a platform that offers a vast catalog of music and sound effects for videos, with exclusive soundtracking tools and worry-free publishing worldwide. The platform cites over 2.5 billion daily views and organizes its catalog by genre, theme, and mood, covering content types such as ads, vlogs, cinematic videos, corporate projects, workouts, sports, and nature. It also features a plugin for Adobe and DaVinci Resolve Studio, track suggestions based on video frames, music search based on tone and sound, and an app for on-the-go music discovery. Epidemic Sound is known for its royalty-free music and a licensing model that gives users direct licenses with all rights included globally.

Voicemod
Voicemod is a free real-time voice changer and soundboard for Windows and macOS. It lets users modify their voices in real time, create custom voices, and play sound effects at the touch of a button. Voice effects include robot, demon, chipmunk, woman, man, and many others. Voicemod integrates with popular games, chat apps, and streaming software, making it a popular choice for gamers, content creators, and anyone who wants to add some fun and creativity to their voice communications.

Slick
Slick is an AI-powered video editing tool that helps you create and edit viral short videos. With Slick, you can add trendy captions, cut silences and umms, snap b-rolls, use magic zooms, trim and extend clips, and adjust clip duration. You can also add custom background music and sound effects, and remove filler words in one click. Slick supports all aspect ratios and up to 4K resolution, and is available in over 30 languages, including English, French, Spanish, German, and Hindi. New caption styles are added every week, and all captions are 100% customizable. Slick's AI handles these steps automatically.

Dubbing AI
Dubbing AI is a free AI voice changer that changes your voice in real time as you speak. It offers a variety of voice effects, including male, female, child, robot, and more. You can also use Dubbing AI to add sound effects and music to your recordings. Dubbing AI is suited to creating funny videos, voiceovers, and other creative projects.

Tangia
Tangia is an interactive streaming platform that offers custom TTS interactions, alerts, media sharing, monitor overlay, and Discord integration for streamers. It provides streamers with a wide range of tools to engage their audience, including AI TTS in their own voice, memes showing up on stream, hype parties at every level reached, adding soundbites from Twitch clips, and leaderboards with challenges. Tangia aims to enhance the streaming experience by enabling streamers to create interactive and entertaining content effortlessly.

TTSLabs
TTSLabs is an AI-powered text-to-speech service designed specifically for Twitch streamers. It allows streamers to customize their TTS experience with dedicated desktop apps, faster-than-real-time processing, custom voices, sound clips, profanity filters, and more. With TTSLabs, streamers can enhance their viewer engagement and create a more interactive and entertaining streaming experience.

MotionX
MotionX is an AI-driven video creation platform that aims to revolutionize media production in the entertainment industry. By harnessing the power of artificial intelligence, MotionX offers features such as generating videos from scripts or prompts, editing videos through prompts, and adding sound effects. The platform also provides tools to streamline the video creation workflow, collaborate on video projects, and unlock new possibilities with AI-powered tools. MotionX caters to filmmakers, content creators, and media professionals looking to enhance storytelling and accelerate content production.

PromoMix
PromoMix is an AI-powered tool that helps users generate voiceovers for their short videos. It is designed to make it easy for users to create professional-sounding voiceovers, even if they don't have any experience in voiceover work. PromoMix offers a variety of features to help users create the perfect voiceover for their videos, including the ability to choose from a variety of voices, adjust the speed and pitch of the voice, and add music and sound effects. PromoMix is a valuable tool for anyone who wants to create high-quality voiceovers for their videos.

AiGalaxy
AiGalaxy is an all-in-one AI solution that offers a wide range of user-friendly AI tools in a single app. Users can easily generate images, remove backgrounds, change clothing, create hidden messages, generate QR codes, change ages, extract music and vocals, create voice models, convert images to videos, transcribe speech to text, convert text to speech, turn tunes into music tracks, change voices, unblur images, add sound to videos, create slow-motion videos, restore old images, and more. The app is designed to be easy enough for beginners while also offering powerful features for professionals. AiGalaxy constantly adds new AI tools to its platform, making it a versatile and evolving tool for various tasks.

Muzaic
Muzaic is a generative AI Soundtrack-as-a-Service. It lets you automatically add custom soundtracks to your videos, presentations, or even games. Muzaic works with the parameters that describe music: intensity, tempo, rhythm, tone, and variation. It can hold these parameters at preset levels or change them over time on command, while working with high-quality music.

Speechelo
Speechelo is a text-to-speech software that allows users to instantly generate human-sounding voiceovers from text. It offers a wide range of features, including over 30 human-sounding voices, the ability to add breathing sounds and pauses, and the ability to generate voiceovers in over 23 languages. Speechelo is easy to use and can be integrated with any video creation software. It is a great tool for creating voiceovers for sales videos, training videos, educational videos, and more.

Sound Effect Generator
The Sound Effect Generator is an AI-powered tool that allows users to create custom sound effects instantly. It uses AI Text to Sound Effect technology to transform ideas into high-quality sound effects. Aimed at creators, developers, and sound designers, the generator offers a free sound effect library with thousands of AI-generated sound effects. Users can fine-tune duration and audio quality and can upload videos to add AI-generated sound effects; the tool also supports multiple languages. It combines professional sound design with AI technology to provide a unique and creative audio experience.

Noisee AI
Noisee AI is a powerful AI-powered video editing tool that makes it easy to create professional-quality videos in minutes. With Noisee AI, you can quickly and easily remove unwanted noise from your videos, add music and sound effects, and create stunning visual effects. Noisee AI is perfect for anyone who wants to create high-quality videos without having to spend hours learning complex video editing software.

Animaker
Animaker is an online video-making platform that uses artificial intelligence (AI) to help users create animated and live-action videos. The platform offers a variety of features, including a character builder, a library of assets, and a range of video editing tools. Animaker is designed to be easy to use, even for beginners, and it offers a free plan that allows users to create videos without paying a subscription fee.

Voicera
Voicera is a text-to-speech tool that allows users to convert written content into natural-sounding speech. With Voicera, users can create audio versions of their articles, blog posts, and other written content, making it more accessible to a wider audience. Voicera offers a variety of features to help users create high-quality audio content, including a library of natural-sounding voices, advanced audio editing tools, and the ability to add music and sound effects.

Adobe Podcast
Adobe Podcast is an AI-powered audio recording and editing tool that allows users to create and edit podcasts entirely on the web. With Adobe Podcast, users can record high-quality audio, add music and sound effects, and edit their recordings with ease. Adobe Podcast also offers a variety of features to help users promote and distribute their podcasts.

Soundeff
Soundeff is an AI Sound Effects Generator that allows users to create custom sound effects using cutting-edge AI technology. It offers a platform for professionals and enthusiasts in the audio-visual world to enhance their creative projects with unique, professional-grade sound effects in seconds. Users can generate a variety of sound effects for gaming, videos, podcasts, films, music, and user interfaces, improving user engagement and storytelling. Soundeff stands out with its AI-generated effects that cater to a wide range of creative needs, providing a seamless workflow and expanding sound libraries.

SOUNDRAW
SOUNDRAW is an AI music generator that allows users to create music by simply choosing the mood, genre, and length. The AI will then generate a beautiful song that can be customized to the user's needs. SOUNDRAW is perfect for creators and artists who need background music for their content, or for music industry professionals who need to add vocals to beats and make songs.

Narrify AI
Narrify AI is an AI-powered application that transforms your videos by adding sports commentary to them. With Narrify AI, users can upload any video file up to 45 seconds in length and enhance it with personalized commentary, highlighting names and key words. The application allows users to create engaging and fun narrated videos to share with friends and family. Narrify AI is a user-friendly tool that adds a unique touch to your videos, making them more entertaining and memorable.

20 - Open Source AI Tools

WavCraft
WavCraft is an LLM-driven agent for audio content creation and editing. It uses an LLM to connect various audio expert models and DSP functions. With WavCraft, users can edit given audio clips conditioned on text input, create an audio clip from text input, prompt a script setting and let the model write the script and create the sound, and check whether an audio file was synthesized by WavCraft.

CameraChessWeb
Camera Chess Web is a tool that allows you to use your phone camera to replace chess eBoards. With Camera Chess Web, you can broadcast your game to Lichess, play a game on Lichess, or digitize a chess game from a video or live stream. Camera Chess Web is free to download on Google Play.

vnve
VNVE is a Visual Novel Video Editor that allows users to create visual novel videos in their browser with AI-powered rapid creation. It offers a low-cost production solution for converting textual content into videos, creating interactive videos for gaming experiences, and making video teasers for novels and short video dramas. The tool is a pure front-end TypeScript implementation powered by PixiJS and WebCodecs, and users can also create videos programmatically using the npm package. VNVE is tailored specifically for visual novels, focusing on text content and simplifying the video creation process.

ichigo
Ichigo is a local real-time voice AI tool that uses an early fusion technique to extend a text-based LLM to have native 'listening' ability. It is an open research experiment with improved multiturn capabilities and the ability to refuse processing inaudible queries. The tool is designed for open data, open weight, on-device Siri-like functionality, inspired by Meta's Chameleon paper. Ichigo offers a web UI demo and Gradio web UI for users to interact with the tool. It has achieved enhanced MMLU scores, stronger context handling, advanced noise management, and improved multi-turn capabilities for a robust user experience.

bark.cpp
Bark.cpp is a C/C++ implementation of the Bark model, a real-time, multilingual text-to-speech generation model. It supports AVX, AVX2, and AVX512 for x86 architectures, and is compatible with both CPU and GPU backends. Bark.cpp also supports mixed F16/F32 precision and 4-bit, 5-bit, and 8-bit integer quantization. It can be used to generate realistic-sounding audio from text prompts.

ai-audio-startups
The 'ai-audio-startups' repository is a community list of startups working with AI for audio and music tech. It includes a comprehensive collection of tools and platforms that leverage artificial intelligence to enhance various aspects of music creation, production, source separation, analysis, recommendation, health & wellbeing, radio/podcast, hearing, sound detection, speech transcription, synthesis, enhancement, and manipulation. The repository serves as a valuable resource for individuals interested in exploring innovative AI applications in the audio and music industry.

openedai-speech
OpenedAI Speech is a free, private text-to-speech server compatible with the OpenAI audio/speech API. It offers custom voice cloning and supports various models like tts-1 and tts-1-hd. Users can map their own piper voices and create custom cloned voices. The server provides multilingual support with XTTS voices and allows fixing incorrect sounds with regex. Recent changes include bug fixes, improved error handling, and updates for multilingual support. Installation can be done via Docker or manual setup, with usage instructions provided. Custom voices can be created using Piper or Coqui XTTS v2, with guidelines for preparing audio files. The tool is suitable for tasks like generating speech from text, creating custom voices, and multilingual text-to-speech applications.
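
Because the server is described as compatible with the OpenAI audio/speech API, a client call might look like the following minimal sketch. The base URL and port, the `alloy` voice name, the placeholder API key, and the helper names `build_speech_request` and `synthesize` are all assumptions for illustration and are not taken from the project; check the repository's own usage instructions for the actual values.

```python
import json
import urllib.request

# Assumption: the server listens locally on port 8000; adjust to your setup.
BASE_URL = "http://localhost:8000/v1"


def build_speech_request(text, voice="alloy", model="tts-1",
                         response_format="mp3", base_url=BASE_URL):
    """Build (but do not send) a POST request for the audio/speech endpoint."""
    payload = {
        "model": model,            # e.g. tts-1 or tts-1-hd
        "input": text,             # the text to be spoken
        "voice": voice,            # assumed voice name
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=f"{base_url}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Placeholder key (assumption): a local server may ignore it.
            "Authorization": "Bearer sk-placeholder",
        },
        method="POST",
    )


def synthesize(text, out_path="speech.mp3", **kwargs):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, **kwargs)
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

With a server running, `synthesize("Hello from a local TTS server")` would write the returned audio to `speech.mp3`. Building the request separately from sending it makes the payload easy to inspect without a network connection.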

aiode
aiode is a Discord bot that plays Spotify tracks, YouTube videos, or any URL, including SoundCloud links and Twitch streams. It allows users to create cross-platform playlists, customize player commands, create custom command presets, adjust properties for deeper customization, sign in to Spotify to play personal playlists, manage access permissions for commands, customize bot summoning methods, and execute advanced admin commands. The bot also features a scripting sandbox for running and storing custom Groovy scripts and modifying command behavior through interceptors.

awesome-generative-ai
Awesome Generative AI is a curated list of modern Generative Artificial Intelligence projects and services. Generative AI technology creates original content like images, sounds, and texts using machine learning algorithms trained on large data sets. It can produce unique and realistic outputs such as photorealistic images, digital art, music, and writing. The repo covers a wide range of applications in art, entertainment, marketing, academia, and computer science.

AIOC
AIOC is an All-in-one-Cable for Ham Radio enthusiasts, providing a cheap and hackable digital mode USB interface with features like sound-card, virtual tty, and CM108 compatible HID endpoint. It supports various software and tested radios for functions like programming, APRS, and Dual-PTT HTs. Users can fabricate and assemble the AIOC using specific instructions, and program it using STM32CubeIDE. The tool can be used for tasks like programming radios, asserting PTT, and accessing audio data channels. Future work includes configurable AIOC settings, virtual-PTT, and virtual-COS features.

AiR
AiR is an AI tool built entirely in Rust that delivers blazing speed and efficiency. It features accurate translation and seamless text rewriting to supercharge productivity. AiR is designed to assist non-native speakers by automatically fixing errors and polishing language to sound like a native speaker. The tool is under heavy development with more features on the horizon.

ElevenLabs-DotNet
ElevenLabs-DotNet is an unofficial Eleven Labs voice synthesis RESTful client that allows users to convert text to speech. The library targets .NET 8.0 and above, and works in console apps, WinForms, WPF, and ASP.NET on Windows, Linux, and Mac. Users can authenticate using API keys directly, from a configuration file, or from system environment variables. The tool provides functionality for text-to-speech conversion, streaming text to speech, accessing voices, dubbing audio or video files, generating sound effects, managing the history of synthesized audio clips, and accessing user information and subscription status.

comfyui-web-viewer
The ComfyUI Web Viewer by vrch.ai is a real-time AI-generated interactive art framework that integrates realtime streaming into ComfyUI workflows. It supports keyboard control nodes, OSC control nodes, sound input nodes, and more, accessible from any device with a web browser. It enables real-time interaction with AI-generated content, ideal for interactive visual projects and enhancing ComfyUI workflows with efficient content management and display.

stark
STaRK is a large-scale semi-structured retrieval benchmark on Textual and Relational Knowledge Bases. It provides natural-sounding and practical queries crafted to incorporate rich relational information and complex textual properties, closely mirroring real-life scenarios. The benchmark aims to assess how effectively large language models can handle the interplay between textual and relational requirements in queries, using three diverse knowledge bases constructed from public sources.

TFTMuZeroAgent
TFTMuZeroAgent is an implementation of a pure artificial intelligence algorithm for playing Teamfight Tactics, an auto chess game made by Riot. It uses a simulation of TFT Set 4 and the MuZero reinforcement learning algorithm. The project provides a multi-agent PettingZoo environment, with player, pool, and game round classes designed for AI projects. The implementation excludes graphics and sounds but covers all aspects of the game from Set 4. The codebase is open for contributions and improvements, allowing additional models to be added to the environment.

Awesome-AITools
This repo collects AI-related utilities. Categories: ChatGPT and other closed-source LLMs, AI search engine, open source LLMs, GPT/LLM applications, LLM training platform, applications that integrate multiple LLMs, AI agent, writing, programming development, translation, AI conversation or AI voice conversation, image creation, speech recognition, text to speech, voice processing, AI-generated music or sound effects, speech translation, video creation, video content summary, and OCR (Optical Character Recognition).

aiolauncher_scripts
AIO Launcher Scripts is a collection of Lua scripts that can be used with AIO Launcher to enhance its functionality. These scripts can be used to create widget scripts, search scripts, and side menu scripts. They provide various functions such as displaying text, buttons, progress bars, charts, and interacting with app widgets. The scripts can be used to customize the appearance and behavior of the launcher, add new features, and interact with external services.

ai-audio-datasets
AI Audio Datasets List (AI-ADL) is a comprehensive collection of datasets consisting of speech, music, and sound effects, used for Generative AI, AIGC, AI model training, and audio applications. It includes datasets for speech recognition, speech synthesis, music information retrieval, music generation, audio processing, sound synthesis, and more. The repository provides a curated list of diverse datasets suitable for various AI audio tasks.

clapper
Clapper is an open-source AI story visualization tool that can interpret screenplays and render them into storyboards, videos, voice, sound, and music. It is currently in early development stages and not recommended for general use due to some non-functional features and lack of tutorials. A public alpha version is available on Hugging Face's platform. Users can sponsor specific features through bounties and developers can contribute to the project under the GPL v3 license. The tool lacks automated tests and code conventions like Prettier or a Linter.

aitour-interact-with-llms
This repository is for the AI Tour workshop: Interacting with Multimodal models in Azure AI Foundry. The workshop provides a hands-on introduction to core concepts and best practices for interacting with OpenAI models in Azure AI Foundry portal. Participants can innovate with Azure OpenAI's GPT-4o multimodal model to generate text, sound, and images using GPT-4o-mini, DALL-E, and GPT-4o-realtime. The workshop also covers creating AI Agents to enhance user experiences and drive innovation. It includes instructions, resources for continued learning, and information on responsible AI practices.

20 - OpenAI GPTs

MIXING & MASTERING GPT
Your personal audio mixing and mastering engineer assistant for music production

Music Production Teacher
It acts as an instructor guiding you through music production skills, such as fine-tuning parameters in mixing, mastering, and compression. Additionally, it functions as an aide, offering advice for your music production hurdles with just a screenshot of your production or parameter settings.

AIProductGPT: Add AI to your Product and get a PRD
With simple prompts, AIProductGPT instantly crafts detailed AI-powered product requirements documents (PRDs) and mocks so that your team can hit the ground running.

GroceriesGPT
I manage your grocery lists to help you stay organized. 1/ Tell me what to add to a list. 2/ Ask me to add all ingredients for a recipe. 3/ Upload a receipt to remove items from your lists. 4/ Add an item by simply uploading a picture. 5/ Ask me what items I would recommend you add to your lists.

SpintaxGPT
I add spintax to emails for Instantly.ai. For more cold email tips, follow me on Twitter/𝕏 at @kenautoup

Meal Planner + Home Delivery
Find your next favorite recipe and instantly add fresh, affordable ingredients to your Walmart cart. Enjoy the convenience of home delivery or pickup. Delicious, healthy, and budget-friendly.

QR Code Creator & Customizer
Create a QR code in 30 seconds + add a cool design effect or overlay it on top of any image. Free, no watermarks, no email required, and we don't store your messages/images.

WP coding assistant
Friendly WordPress expert that will help you write custom plugins, functions, add custom fields and enhance your WordPress website.

AI Tools Guru
Find the best AI tools. Want to add your tool? Fill the form: https://forms.gle/uqMaC2EFZzh3Y4yT6