Best AI tools for< Voice Therapist >
Infographic
20 - AI tool Sites
Outer Voice AI
Outer Voice AI is a mobile application that provides users with an AI-powered coach. The coach can be used to get advice, support, or information on a variety of topics. The coach's responses are generated using artificial intelligence, and they are tailored to the user's individual needs. The coach's voice can also be customized to sound like the user's own voice.
Audio Diary
Audio Diary is a super smart voice journal application that captures, organizes, and analyzes life's moments. It uses AI technology to analyze user recordings, provide suggestions for goals, and summarize entries. The app offers features such as transcription of audio to text, setting daily goals, providing positive affirmations, and offering guidance for journal entries. Users can easily record events or ideas using their voice and receive analysis and summaries to reflect on their day. Audio Diary is designed to make journaling easy and engaging, with a focus on privacy and personalized support.
AI Therapist Connect
The website offers a unique service where users can have voice calls with AI therapists. Users can experience mind-blowing conversations with AI therapists to address various issues. The platform provides a safe space for users to express themselves and seek support from AI therapists. The service aims to offer a convenient and accessible way for individuals to engage in therapy sessions using artificial intelligence technology.
ZeroBot
ZeroBot is the internet's leading voice-enabled chatbot. It allows users to have conversations with AI agents that are tailored to their specific needs. ZeroBot is powered by the Groq LPUโข Inference Engine, which provides instant and smooth chat experiences. With ZeroBot, users can create and speak with AI agents anywhere, anytime.
PowerNote.app
PowerNote.app is an AI-powered daily note-taking application that helps users capture and organize their thoughts, memories, and progress. It features voice-to-text transcription, daily reminders, auto-generated summaries, and customizable fields to track specific aspects of users' lives. The application aims to make note-taking effortless and help users remember and reflect on their experiences.
Momentary
Momentary is an AI-powered journaling application designed for mental health and self-growth. Users can capture their thoughts and emotions using their voice, replay moments for reflection, cultivate self-awareness, leave positive affirmations, record personal quotes, and gain insights for personal growth. The application also offers AI-powered transcribing and rewriting features, prompts for self-reflection, mood categorization, auto-tagging of content, and self-reflection with an AI mentor. Momentary aims to help individuals enhance their self-awareness, daily progress, and overall well-being through journaling and self-reflection.
Pronounce
Pronounce is an AI-powered English speech checker designed for professionals, educators, language learners, and speech therapists. It offers instant feedback and multiple drills to help users master speaking skills, understand specific communication challenges, and track therapy progress. With features like AI-powered speech feedback, English speaking partner, confident communication tips, pronunciation correction, and vocabulary enhancement, Pronounce aims to improve users' English pronunciation, grammar, and fluency. The application provides a user-friendly interface and visually appealing experience, making it suitable for beginners and advanced speakers alike.
Lily
Lily is an AI wellness partner that provides free confidential help for various mental health and wellness needs. Users can interact with Lily through text or voice, sharing their thoughts and feelings without the fear of privacy breaches. Lily serves as a digital meditation guide, spiritual advisor, mental health advocate, and friend, offering support for issues like anxiety, stress, self-esteem, personal growth, and more. It is designed to empower users to take control of their wellness journey in a safe and accessible way.
Muah.AI
Muah.AI is an AI companion platform that offers a variety of features, including chat, photo exchange, voice chat, and more. It is based in California and is currently in beta.
Earkick
Earkick is a personal AI chatbot application designed to help users measure and improve their mental health in real time. The app offers features such as mood tracking, voice memos, guided self-care sessions, and personalized support based on user input. Earkick prioritizes user privacy by not requiring registration or collecting personal data, ensuring a secure and private experience for managing anxiety and stress.
Onsen
Onsen is an AI-powered journaling application that offers a unique blend of personal reflection, interactive guidance, and mental wellness support. It provides users with a platform to journal through chat, capture memories, visualize thoughts into AI artwork, and voice their stories. Onsen's AI guides personalize the journaling experience, making it feel like having a trusted friend, life coach, and mental wellness advisor anytime, anywhere.
Lid
Lid is an AI-powered voice journaling app that helps users form healthy habits, gather insights, and journal securely and privately. It uses advanced AI to analyze voice entries and provides a written summary, identifying key themes from the user's day. Lid also creates personalized soundbites, offering a mirror to the user's emotions and experiences. The app is designed to enhance mindfulness, provide a quick and easy way to journal on the go, and help in tracking mood and habits.
NSFWBots
NSFWBots is a website that provides a list of AI sex chatbots. The chatbots are sorted by feature, and there are chatbots with voice, image generation, mobile apps, desktop apps, and more. The website also has a blog with articles about AI sex chatbots.
Merton
Merton is an AI-powered communication tool designed to provide a voice to the voiceless. It enables voice-impaired users to express their needs, thoughts, and feelings naturally and swiftly through a user-friendly interface. The application features an AI-powered Communication Board that predicts users' next phrases, a Pain Tracker for pinpointing areas of pain using eye movements, and prioritizes user privacy. Merton significantly enhances communication for individuals with limited or no motor functions, improving caregiving processes and response times.
AutoNotes
AutoNotes is a leading healthcare AI Progress Note tool that offers AI-powered clinical documentation templates for generating SOAP Notes, DAP Notes, Treatment Plans, and more. It provides a user-friendly interface for therapists and healthcare professionals to create detailed and customizable clinical notes efficiently. With features like summarizing sessions, editing and downloading notes, and simple pricing plans, AutoNotes aims to streamline the documentation process in healthcare settings. The platform also offers advanced features like template customization, secure document storage, and dictation for voice-to-text conversion. Users can benefit from the platform's customization options, seamless integration with workflows, and responsive customer support.
Venty.chat
Venty.chat is an AI tool designed as a safe space for users to express their thoughts and emotions anonymously. It offers a platform for users to vent and rant, receiving thoughtful advice from an AI confidante. The tool allows users to record voice notes without the need for sign up, ensuring privacy and anonymity. Venty.chat aims to provide a supportive environment for individuals seeking to share their feelings and seek guidance.
SagaSwipe
SagaSwipe is an interactive audio adventure application designed for iOS and Android users. It offers a unique experience where users can immerse themselves in infinite audio realms guided solely by touch. Unlike traditional sleep apps, SagaSwipe provides engaging escapes into magical realms, vibrant cities, serene landscapes, or mysterious outer space. The application combines AI and voice synthesis technology with an intuitive interface to generate personalized audio worlds for users to explore and relax.
Happi.ai
Happi.ai is a virtual mental health coach application that provides 24/7 support for individuals dealing with anxiety, depression, and loneliness. The AI companion, Olivia, offers personalized assistance, compassionate listening, and non-judgmental support. The platform prioritizes user privacy with top-tier encryption and offers expert insights and proactive suggestions for emotional well-being. Happi analyzes facial expressions, voice patterns, and speech content to identify moments of stress and provide real-time feedback to manage stress and improve emotional health.
Dreamt
Dreamt is an AI-enabled journal application designed to assist users in recording and reflecting on their dreams. Users can input dream entries via text or voice, access statistics related to their dreams, and transform their entries into story images using AI technology. The application offers features such as SentiMoji for automated sentiment analysis, auto-tags for identifying entities in dreams, iCloud backup for data security, and advanced search capabilities. Dreamt prioritizes user privacy by not collecting data or using cookies or trackers.
Personality First
Personality First is an AI-powered psychology app that provides personalized assessments, insights, and guidance to help users understand themselves better and improve their personal growth. The app uses the latest AI technology to ensure that its assessments and interactions are as accurate as possible. It also has a strong focus on user experience, ensuring that every customer has a unique and tailored experience. Personality First offers a variety of features, including personalized assessments, thorough analysis, voice-first personalization, and a secure and safe environment.
20 - Open Source Tools
metavoice-src
MetaVoice-1B is a 1.2B parameter base model trained on 100K hours of speech for TTS (text-to-speech). It has been built with the following priorities: * Emotional speech rhythm and tone in English. * Zero-shot cloning for American & British voices, with 30s reference audio. * Support for (cross-lingual) voice cloning with finetuning. * We have had success with as little as 1 minute training data for Indian speakers. * Synthesis of arbitrary length text
OpenVoiceChat
OpenVoiceChat is an open-source tool designed for having natural voice conversations with an LLM model. It supports various speech-to-text (STT), text-to-speech (TTS), and large language model (LLM) models. The tool aims to provide an alternative to closed commercial implementations, with well-abstracted APIs that are easy to use and extend. Users can install base and functionality-specific packages using pip, and the tool supports interruptions during conversations. The project encourages contributions through bounties and has a detailed roadmap available for reference.
pyht
pyht is a Python SDK for the PlayHT's AI Text-to-Speech API, allowing users to convert text into high-quality audio streams in humanlike voice. It supports real-time text-to-speech streaming, pre-built and custom voices, various audio formats, and different sample rates.
awesome-ai
Awesome AI is a curated list of artificial intelligence resources including courses, tools, apps, and open-source projects. It covers a wide range of topics such as machine learning, deep learning, natural language processing, robotics, conversational interfaces, data science, and more. The repository serves as a comprehensive guide for individuals interested in exploring the field of artificial intelligence and its applications across various domains.
EmotiVoice
EmotiVoice is a powerful and modern open-source text-to-speech engine that supports emotional synthesis, enabling users to create speech with a wide range of emotions such as happy, excited, sad, and angry. It offers over 2000 different voices in both English and Chinese. Users can access EmotiVoice through an easy-to-use web interface or a scripting interface for batch generation of results. The tool is continuously evolving with new features and updates, prioritizing community input and user feedback.
RisuAI
RisuAI, or Risu for short, is a cross-platform AI chatting software/web application with powerful features such as multiple API support, assets in the chat, regex functions, and much more.
bark.cpp
Bark.cpp is a C/C++ implementation of the Bark model, a real-time, multilingual text-to-speech generation model. It supports AVX, AVX2, and AVX512 for x86 architectures, and is compatible with both CPU and GPU backends. Bark.cpp also supports mixed F16/F32 precision and 4-bit, 5-bit, and 8-bit integer quantization. It can be used to generate realistic-sounding audio from text prompts.
ai-audio-startups
The 'ai-audio-startups' repository is a community list of startups working with AI for audio and music tech. It includes a comprehensive collection of tools and platforms that leverage artificial intelligence to enhance various aspects of music creation, production, source separation, analysis, recommendation, health & wellbeing, radio/podcast, hearing, sound detection, speech transcription, synthesis, enhancement, and manipulation. The repository serves as a valuable resource for individuals interested in exploring innovative AI applications in the audio and music industry.
ai-voice-cloning
This repository provides a tool for AI voice cloning, allowing users to generate synthetic speech that closely resembles a target speaker's voice. The tool is designed to be user-friendly and accessible, with a graphical user interface that guides users through the process of training a voice model and generating synthetic speech. The tool also includes a variety of features that allow users to customize the generated speech, such as the pitch, volume, and speaking rate. Overall, this tool is a valuable resource for anyone interested in creating realistic and engaging synthetic speech.
voice-pro
Voice-Pro is an integrated solution for subtitles, translation, and TTS. It offers features like multilingual subtitles, live translation, vocal remover, and supports OpenAI Whisper and Open-Source Translator. The tool provides a Studio tab for various functions, Whisper Caption tab for subtitle creation, Translate tab for translation, TTS tab for text-to-speech, Live Translation tab for real-time voice recognition, and Batch tab for processing multiple files. Users can download YouTube videos, improve voice recognition accuracy, create automatic subtitles, and produce multilingual videos with ease. The tool is easy to install with one-click and offers a Web-UI for user convenience.
Synthetic-Voice-Detection-Vocoder-Artifacts
The Synthetic-Voice-Detection-Vocoder-Artifacts repository provides the LibriSeVoc dataset containing self-vocoding samples created with six state-of-the-art vocoders to expose and exploit vocoder artifacts. It also introduces a new approach for detecting synthetic human voices by identifying signal artifacts left by neural vocoders and enhancing the RawNet2 baseline. The repository includes a paper and dataset for further reference and offers instructions for training the model and testing it in the wild.
Easy-Voice-Toolkit
Easy Voice Toolkit is a toolkit based on open source voice projects, providing automated audio tools including speech model training. Users can seamlessly integrate functions like audio processing, voice recognition, voice transcription, dataset creation, model training, and voice conversion to transform raw audio files into ideal speech models. The toolkit supports multiple languages and is currently only compatible with Windows systems. It acknowledges the contributions of various projects and offers local deployment options for both users and developers. Additionally, cloud deployment on Google Colab is available. The toolkit has been tested on Windows OS devices and includes a FAQ section and terms of use for academic exchange purposes.
bidirectional_streaming_ai_voice
This repository contains Python scripts that enable two-way voice conversations with Anthropic Claude, utilizing ElevenLabs for text-to-speech, Faster-Whisper for speech-to-text, and Pygame for audio playback. The tool operates by transcribing human audio using Faster-Whisper, sending the transcription to Anthropic Claude for response generation, and converting the LLM's response into audio using ElevenLabs. The audio is then played back through Pygame, allowing for a seamless and interactive conversation between the user and the AI. The repository includes variations of the main script to support different operating systems and configurations, such as using CPU transcription on Linux or employing the AssemblyAI API instead of Faster-Whisper.
Srt-AI-Voice-Assistant
Srt-AI-Voice-Assistant is a convenient tool that generates audio from uploaded .srt subtitle files by calling APIs such as Bert-VITS2 (HiyoriUI), GPT-SoVITS, and Microsoft TTS (online). The code is currently not perfect, and feedback on bugs or suggestions can be provided at https://github.com/YYuX-1145/Srt-AI-Voice-Assistant/issues. Recent updates include adding custom API functionality with a focus on security, support for Microsoft online TTS (requires key configuration), error handling improvements, automatic project path detection, compatibility with API-v1 for limited functionality, and significant feature updates supporting card synthesis.
bolna
Bolna is an open-source platform for building voice-driven conversational applications using large language models (LLMs). It provides a comprehensive set of tools and integrations to handle various aspects of voice-based interactions, including telephony, transcription, LLM-based conversation handling, and text-to-speech synthesis. Bolna simplifies the process of creating voice agents that can perform tasks such as initiating phone calls, transcribing conversations, generating LLM-powered responses, and synthesizing speech. It supports multiple providers for each component, allowing users to customize their setup based on their specific needs. Bolna is designed to be easy to use, with a straightforward local setup process and well-documented APIs. It is also extensible, enabling users to integrate with other telephony providers or add custom functionality.
Applio
Applio is a VITS-based Voice Conversion tool focused on simplicity, quality, and performance. It features a user-friendly interface, cross-platform compatibility, and a range of customization options. Applio is suitable for various tasks such as voice cloning, voice conversion, and audio editing. Its key features include a modular codebase, hop length implementation, translations in over 30 languages, optimized requirements, streamlined installation, hybrid F0 estimation, easy-to-use UI, optimized code and dependencies, plugin system, overtraining detector, model search, enhancements in pretrained models, voice blender, accessibility improvements, new F0 extraction methods, output format selection, hashing system, model download system, TTS enhancements, split audio, Discord presence, Flask integration, and support tab.
wit-unity
Wit-unity is a Unity C# based wrapper around the rest apis provided by Wit.ai. It is meant to be used as a base library within Voice SDK. We have made it accessible here for contributions and early adoption testing. Wit-unity is ideal for developers looking to do early research with voice and potential expand the core capabilities of Voice SDK.
agents
The LiveKit Agent Framework is designed for building real-time, programmable participants that run on servers. Easily tap into LiveKit WebRTC sessions and process or generate audio, video, and data streams. The framework includes plugins for common workflows, such as voice activity detection and speech-to-text. Agents integrates seamlessly with LiveKit server, offloading job queuing and scheduling responsibilities to it. This eliminates the need for additional queuing infrastructure. Agent code developed on your local machine can scale to support thousands of concurrent sessions when deployed to a server in production.
RVC_CLI
**RVC_CLI: Retrieval-based Voice Conversion Command Line Interface** This command-line interface (CLI) provides a comprehensive set of tools for voice conversion, enabling you to modify the pitch, timbre, and other characteristics of audio recordings. It leverages advanced machine learning models to achieve realistic and high-quality voice conversions. **Key Features:** * **Inference:** Convert the pitch and timbre of audio in real-time or process audio files in batch mode. * **TTS Inference:** Synthesize speech from text using a variety of voices and apply voice conversion techniques. * **Training:** Train custom voice conversion models to meet specific requirements. * **Model Management:** Extract, blend, and analyze models to fine-tune and optimize performance. * **Audio Analysis:** Inspect audio files to gain insights into their characteristics. * **API:** Integrate the CLI's functionality into your own applications or workflows. **Applications:** The RVC_CLI finds applications in various domains, including: * **Music Production:** Create unique vocal effects, harmonies, and backing vocals. * **Voiceovers:** Generate voiceovers with different accents, emotions, and styles. * **Audio Editing:** Enhance or modify audio recordings for podcasts, audiobooks, and other content. * **Research and Development:** Explore and advance the field of voice conversion technology. **For Jobs:** * Audio Engineer * Music Producer * Voiceover Artist * Audio Editor * Machine Learning Engineer **AI Keywords:** * Voice Conversion * Pitch Shifting * Timbre Modification * Machine Learning * Audio Processing **For Tasks:** * Convert Pitch * Change Timbre * Synthesize Speech * Train Model * Analyze Audio
GlaDOS
This project aims to create a real-life version of GLaDOS, an aware, interactive, and embodied AI entity. It involves training a voice generator, developing a 'Personality Core,' implementing a memory system, providing vision capabilities, creating 3D-printable parts, and designing an animatronics system. The software architecture focuses on low-latency voice interactions, utilizing a circular buffer for data recording, text streaming for quick transcription, and a text-to-speech system. The project also emphasizes minimal dependencies for running on constrained hardware. The hardware system includes servo- and stepper-motors, 3D-printable parts for GLaDOS's body, animations for expression, and a vision system for tracking and interaction. Installation instructions cover setting up the TTS engine, required Python packages, compiling llama.cpp, installing an inference backend, and voice recognition setup. GLaDOS can be run using 'python glados.py' and tested using 'demo.ipynb'.
20 - OpenAI Gpts
Dedicated Speech-Language Pathologist
Expert Speech-Language Pathologist offering tailored medical consultations.
Your Lingo AI Coach
Welcome! I'm a voice-focused language teacher for interactive speaking practice. To enable voice, download the app and tap the headphone button next to my chat window. Then choose your preferred voice. When you're ready, tell me what language you'd like to learn. It's FREE!
Speak GPT
Voice-centric English role-play tool for speaking practice and offering personalized feedback!
AI Phonetics and Reading Coach with Speech
Phonetics and reading coach with interactive voice capabilities, tailored for adult beginners.
The Shaman
The Shaman is a wise, old Native American spiritual guide, blending ancient wisdom with modern understanding in a calm, authoritative voice, providing empathetic and personalized support during psychedelic journeys.
Clara - L'Inspiratrice Autobiographique
Aide ร la narration et ร l'expression autobiographique
์๋ฏธ์๋ฒ ์์ฑ ๋ํ ์๋ด ์ฑ๋ด (Meaning Life)
์ด๋ฉด์ ๊ณ ๋ฏผํ๊ณ ๊ฐ๋ฑํ๊ณ ํ๋ค์ด ํ๋ ๋ฌธ์ ๋ค, ์์ฑ์ผ๋ก ๋ํํด ๋ณด์ธ์. ํด๋ํฐ์ ์ผ๊ณ , ์ด GPT์ ๋ค์ด๊ฐ ํ, ํ๋ฉด ํ๋จ ์ฐ์ธก์ ์๋ ์ด์ดํฐ ์์ด์ฝ์ ๋๋ฅธ ํ, ํด๋ํฐ ํ๋ฉด์ด ์์ง์ด๋ค๊ฐ, ๋ฐ์ 'Listening'์ด๋ ๋จ์ด๊ฐ ๋์ค๋ฉด, ๋จผ์ ์ง๋ฌธํด ๋ณด์ธ์. ๋ต๋ณ์ ๋ฐ๋ผ์, ๊ฑ์ ์ง๋ฌธํ๋ฉฐ ๋ํํด ๋ณด์ธ์.
Anime Voice Match
Anime Voice Match, identifies anime characters similar to the user's voice.
Voice/Style/Tone AI Prompt Snippet Generator
Analyzes your writing and produces a prompt snippet you can use in any other prompt to guide AI in replicating your voice, style, and tone. Just provide the text in the prompt box or in a document (don't use a link or image). You don't need to write any additional prompt language with your text.
Voice Memo
Record your thoughts with ChatGPT Voice Conversations ๐ก. Get started by clicking the ๐ง icon right to the chat input. Available on mobile only. Ask 'how do you work?' to learn more.
Vedic Voice
A scholar in Hindu literature providing positive, brief insights against negativity.