Best AI tools for Speech Therapist
19 - AI tool Sites

HeardThat
HeardThat is a smartphone application that leverages AI technology to assist users in hearing speech more clearly in noisy environments. By separating speech from background noise, HeardThat helps individuals with varying levels of hearing ability to participate in conversations with confidence and ease. The app works with existing Bluetooth earbuds or hearing aids, eliminating the need for additional hardware. HeardThat aims to address the common complaint of struggling to hear in noisy settings, ultimately reducing social isolation and improving communication experiences.

SOAP Note AI
SOAP Note AI is an AI-powered tool designed to generate HIPAA-compliant, fast, and efficient SOAP notes and progress notes for various healthcare specialties such as Telehealth, Physical Therapy, Occupational Therapy, Nursing, Mental Health, SLP, Dentistry, Podiatry, Massage, Acupuncture, Chiropractic, Veterinary, and Pharmacy. The tool helps healthcare professionals convert shorthand notes, audio dictations, AI Scribe session recordings, or telehealth sessions into comprehensive SOAP notes in minutes, reducing daily documentation time. SOAP Note AI is loved by healthcare professionals for its accuracy, time-saving capabilities, and reduction of errors in note-taking. The tool offers specialized SOAP notes for different healthcare professions, ensuring detailed and precise documentation. It also provides features like AI Scribe, Dictation, Telehealth integration, note history access, instant feedback, and various pricing plans to suit different needs.

SLAIT School
SLAIT School is an online education platform that offers the opportunity to learn American Sign Language in an interactive and engaging manner. Users can practice ASL 24/7, receive live feedback on their signing, participate in cool quizzes, and take interactive tests to improve their skills. The platform also provides free lessons, premium subscription options, and a full curriculum for comprehensive learning.

Better Speech Online Speech Therapy
Better Speech Online Speech Therapy is an AI-driven platform that offers convenient, affordable, and effective speech therapy services for children and adults. The platform uses artificial intelligence to provide personalized practice and make speech therapy more engaging. With a team of 250+ licensed and experienced therapists, Better Speech aims to help individuals of all ages improve their communication skills from the comfort of their homes. The platform offers unlimited speech practice between sessions, immediate availability, easy scheduling, and results comparable to in-person therapy.

Pronounce
Pronounce is an AI-powered English speech checker designed for professionals, educators, language learners, and speech therapists. It offers instant feedback and multiple drills to help users master speaking skills, understand specific communication challenges, and track therapy progress. With features like AI-powered speech feedback, English speaking partner, confident communication tips, pronunciation correction, and vocabulary enhancement, Pronounce aims to improve users' English pronunciation, grammar, and fluency. The application provides a user-friendly interface and visually appealing experience, making it suitable for beginners and advanced speakers alike.

Tutor AI
Tutor AI is an AI English-speaking application designed to assist individuals in practicing their spoken English skills with the aid of an artificial intelligence chatbot. The app offers a safe and judgment-free environment for users to engage in free-flowing, natural conversations with diverse AI characters. It provides real-time feedback, suggests better ways to express oneself, and offers adjustable features to enhance the learning experience. Tutor AI aims to improve users' spoken English skills confidently and effectively through personalized lessons and interactive learning.

AppTek
AppTek is a global leader in artificial intelligence (AI) and machine learning (ML) technologies for automatic speech recognition (ASR), neural machine translation (NMT), natural language processing/understanding (NLP/U), and text-to-speech (TTS). The AppTek platform delivers industry-leading solutions for organizations across a breadth of global markets such as media and entertainment, call centers, government, enterprise business, and more. Built by scientists and research engineers who are recognized among the best in the world, AppTek’s solutions cover a wide array of languages/dialects, channels, domains, and demographics.

Rory Tells Stories
Rory Tells Stories is an AI-powered storytelling app that helps parents and teachers create personalized stories for children. With Rory, users can input their own story ideas and the app will generate a unique story in seconds. The app also includes a library of pre-written stories that can be customized to fit any child's interests. Rory Tells Stories is designed to help children develop their imagination, language skills, and listening skills. It can also be used to build a stronger bond between parent/teacher and child.

Once Upon a Bot
Once Upon a Bot is an AI-powered tool that allows users to create children's stories. Users can input their own story ideas, and the AI will generate a complete story based on those ideas. The stories can be edited, exported, and shared. Once Upon a Bot also offers a variety of features, such as the ability to upload photos of yourself into the stories, choose the reading level, and have the stories narrated by a variety of characters.

Speech Studio
Speech Studio is a cloud-based speech-to-text and text-to-speech platform that enables developers to add speech capabilities to their applications. With Speech Studio, developers can easily transcribe audio and video files, generate synthetic speech, and build custom speech models. Speech Studio is a powerful tool that can be used to improve the accessibility, efficiency, and user experience of any application.

Storyleo
Storyleo is an AI-powered bedtime story generator that helps parents create personalized and engaging stories for their children. With a variety of customizable characters, themes, and settings, Storyleo makes it easy to create unique and imaginative stories that will capture your child's attention and help them drift off to sleep. The app also includes a library of pre-written stories, as well as the ability to record your own voice to create truly personalized stories for your child.

BookHero
BookHero is an online platform that provides parents with a library of over 100 books to read to their children. Parents can also create their own books in just minutes using BookHero's WordPics illustrations. WordPics are beautiful illustrations that help children improve their vocabulary and spelling.

Vocalo
Vocalo is an AI-powered language learning platform that helps users become fluent English speakers through personalized, interactive conversations with AI-powered virtual assistants. The platform uses advanced speech recognition and natural language processing technologies to provide real-time feedback and personalized learning experiences. Vocalo offers a variety of features to help users improve their English skills, including interactive lessons, personalized feedback, and a speech recognition engine that helps users improve their pronunciation.

ELSA
ELSA is an AI-powered English speaking coach that helps users improve their pronunciation, fluency, and confidence. With ELSA, users can practice speaking English in short, fun dialogues and get instant feedback from its proprietary artificial intelligence technology. ELSA also offers a variety of other features, such as personalized lesson plans, progress tracking, and games to help users stay motivated.

Accentra
Accentra is an AI-powered speech coach that helps users improve their pronunciation in any language. It provides real-time feedback and personalized exercises tailored to the user's native tongue. Accentra's advanced technology analyzes speech patterns and offers tailored advice to help users retrain the way they move their mouths to make sounds. With Accentra, users can hear native speakers pronounce words and receive instant pronunciation analysis to correct and redefine their skills.

Merton
Merton is an AI-powered communication tool designed to provide a voice to the voiceless. It enables voice-impaired users to express their needs, thoughts, and feelings naturally and swiftly through a user-friendly interface. The application features an AI-powered Communication Board that predicts users' next phrases and a Pain Tracker for pinpointing areas of pain using eye movements, and it prioritizes user privacy. Merton significantly enhances communication for individuals with limited or no motor functions, improving caregiving processes and response times.

Imagine Stories
Imagine Stories is an interactive platform for creating personalized stories for children. Users can choose characters, settings, themes, and illustration styles to create a unique story tailored to the child's interests and developmental needs. The platform utilizes artificial intelligence (AI) to generate personalized stories, supporting traditional storytelling with modern technology. It caters to parents, caregivers, teachers, speech therapists, and anyone looking to enrich children's world with educational and imaginative content.

Tilde.ai
Tilde.ai is a language technology platform that offers a wide range of AI-powered solutions for translation, speech technologies, and conversational AI. It combines human and artificial intelligence to help people connect and work efficiently. The platform provides machine translation, speech-to-text conversion, text-to-speech synthesis, real-time transcription, AI chatbots, internal knowledge assistants, and meeting support services. Tilde.ai aims to bridge language barriers and enhance communication by leveraging advanced language technologies.

Accent Guesser
Accent Guesser is a free online accent test powered by advanced AI analysis. It allows users to record their voice, receive detailed insights about their accent characteristics, and compare their accent to native speakers. The tool is ideal for professionals seeking to improve communication in international business settings, language learners tracking pronunciation progress, and individuals interested in understanding their cultural background through accent analysis.

20 - Open Source Tools

metavoice-src
MetaVoice-1B is a 1.2B parameter base model trained on 100K hours of speech for TTS (text-to-speech). It has been built with the following priorities: emotional speech rhythm and tone in English; zero-shot cloning for American and British voices with 30 s of reference audio; support for (cross-lingual) voice cloning with finetuning, which the authors report has worked with as little as 1 minute of training data for Indian speakers; and synthesis of arbitrary-length text.

EmotiVoice
EmotiVoice is a powerful and modern open-source text-to-speech engine that supports emotional synthesis, enabling users to create speech with a wide range of emotions such as happy, excited, sad, and angry. It offers over 2000 different voices in both English and Chinese. Users can access EmotiVoice through an easy-to-use web interface or a scripting interface for batch generation of results. The tool is continuously evolving with new features and updates, prioritizing community input and user feedback.

OpenVoiceChat
OpenVoiceChat is an open-source tool designed for having natural voice conversations with an LLM. It supports a range of speech-to-text (STT) models, text-to-speech (TTS) models, and large language models (LLMs). The tool aims to provide an alternative to closed commercial implementations, with well-abstracted APIs that are easy to use and extend. Users can install base and functionality-specific packages using pip, and the tool supports interruptions during conversations. The project encourages contributions through bounties and has a detailed roadmap available for reference.

pyht
pyht is a Python SDK for PlayHT's AI Text-to-Speech API, allowing users to convert text into high-quality audio streams in humanlike voices. It supports real-time text-to-speech streaming, pre-built and custom voices, various audio formats, and different sample rates.

orate
Orate is an AI toolkit designed for speech processing tasks. It allows users to generate realistic, human-like speech and transcribe audio using a unified API that integrates with popular AI providers such as OpenAI, ElevenLabs, and AssemblyAI. The toolkit can be easily installed using npm or other package managers. For more details, visit the website.

bark.cpp
Bark.cpp is a C/C++ implementation of the Bark model, a real-time, multilingual text-to-speech generation model. It supports AVX, AVX2, and AVX512 for x86 architectures, and is compatible with both CPU and GPU backends. Bark.cpp also supports mixed F16/F32 precision and 4-bit, 5-bit, and 8-bit integer quantization. It can be used to generate realistic-sounding audio from text prompts.
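To give a feel for what integer quantization means here, the following is a minimal Python sketch of block-wise absolute-maximum quantization. It is an illustration only: the function names `quantize_block` and `dequantize_block` are hypothetical, and the scheme shown is a generic textbook one, not necessarily the storage format Bark.cpp actually uses.

```python
from typing import List, Tuple


def quantize_block(values: List[float], bits: int = 8) -> Tuple[float, List[int]]:
    """Map a block of floats to signed integers that share one scale factor.

    Hypothetical helper for illustration; not part of Bark.cpp.
    """
    qmax = 2 ** (bits - 1) - 1  # e.g. 127 for 8-bit, 7 for 4-bit
    amax = max((abs(v) for v in values), default=0.0)
    scale = amax / qmax if amax > 0 else 1.0  # avoid dividing by zero
    # Round to the nearest integer level and clamp to the representable range.
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return scale, quantized


def dequantize_block(scale: float, quantized: List[int]) -> List[float]:
    """Recover approximate floats from the integers and the shared scale."""
    return [scale * q for q in quantized]


if __name__ == "__main__":
    weights = [0.0, 0.5, -1.0, 0.25]
    scale, q = quantize_block(weights, bits=4)
    print(q, dequantize_block(scale, q))
```

Fewer bits mean coarser levels, so the reconstruction error per value is bounded by roughly half the scale factor; that is the trade-off between model size and audio quality that the 4-bit, 5-bit, and 8-bit options expose.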

voice-chat-ai
Voice Chat AI is a project that allows users to interact with different AI characters using speech. Users can choose from various characters with unique personalities and voices, and have conversations or role play with them. The project supports OpenAI, xAI, or Ollama language models for chat, and provides text-to-speech synthesis using XTTS, OpenAI TTS, or ElevenLabs. Users can seamlessly integrate visual context into conversations by having the AI analyze their screen. The project offers easy configuration through environment variables and can be run via WebUI or Terminal. It also includes a huge selection of built-in characters for engaging conversations.

awesome-ai
Awesome AI is a curated list of artificial intelligence resources including courses, tools, apps, and open-source projects. It covers a wide range of topics such as machine learning, deep learning, natural language processing, robotics, conversational interfaces, data science, and more. The repository serves as a comprehensive guide for individuals interested in exploring the field of artificial intelligence and its applications across various domains.

shellChatGPT
ShellChatGPT is a shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS, featuring integration with LocalAI, Ollama, Gemini, Mistral, Groq, and GitHub Models. It provides text and chat completions, vision, reasoning, and audio models, voice-in and voice-out chatting mode, text editor interface, markdown rendering support, session management, instruction prompt manager, integration with various service providers, command line completion, file picker dialogs, color scheme personalization, stdin and text file input support, and compatibility with Linux, FreeBSD, macOS, and Termux for a responsive experience.

LLM-Agents-Papers
A repository that lists papers related to Large Language Model (LLM) based agents. The repository covers various topics including survey, planning, feedback & reflection, memory mechanism, role playing, game playing, tool usage & human-agent interaction, benchmark & evaluation, environment & platform, agent framework, multi-agent system, and agent fine-tuning. It provides a comprehensive collection of research papers on LLM-based agents, exploring different aspects of AI agent architectures and applications.

PsyDI
PsyDI is a multi-modal and interactive chatbot designed for psychological assessments. It aims to explore users' cognitive styles through interactive analysis of their inputs, ultimately determining their Myers-Briggs Type Indicator (MBTI). The chatbot offers customized feedback and detailed analysis for each user, with upcoming features such as an MBTI gallery. Users can access PsyDI directly online to begin their journey of self-discovery.

Awesome_Mamba
Awesome Mamba is a curated collection of groundbreaking research papers and articles on Mamba Architecture, a pioneering framework in deep learning known for its selective state spaces and efficiency in processing complex data structures. The repository offers a comprehensive exploration of Mamba architecture through categorized research papers covering various domains like visual recognition, speech processing, remote sensing, video processing, activity recognition, image enhancement, medical imaging, reinforcement learning, natural language processing, 3D recognition, multi-modal understanding, time series analysis, graph neural networks, point cloud analysis, and tabular data handling.

ai-audio-startups
The 'ai-audio-startups' repository is a community list of startups working with AI for audio and music tech. It includes a comprehensive collection of tools and platforms that leverage artificial intelligence to enhance various aspects of music creation, production, source separation, analysis, recommendation, health & wellbeing, radio/podcast, hearing, sound detection, speech transcription, synthesis, enhancement, and manipulation. The repository serves as a valuable resource for individuals interested in exploring innovative AI applications in the audio and music industry.

2025-AI-College-Jobs
2025-AI-College-Jobs is a repository containing a comprehensive list of AI/ML & Data Science jobs suitable for college students seeking internships or new graduate positions. The repository is regularly updated with positions posted within the last 120 days, featuring opportunities from various companies in the USA and internationally. The list includes positions in areas such as research scientist internships, quantitative research analyst roles, and other data science-related positions. The repository aims to provide a valuable resource for students looking to kickstart their careers in the field of artificial intelligence and machine learning.

awesome-transformer-nlp
This repository contains a hand-curated list of great machine (deep) learning resources for Natural Language Processing (NLP) with a focus on Generative Pre-trained Transformer (GPT), Bidirectional Encoder Representations from Transformers (BERT), attention mechanism, Transformer architectures/networks, Chatbot, and transfer learning in NLP.

SurveyX
SurveyX is an advanced academic survey automation system that leverages Large Language Models (LLMs) to generate high-quality, domain-specific academic papers and surveys. Users can request comprehensive academic papers or surveys tailored to specific topics by providing a paper title and keywords for literature retrieval. The system streamlines academic research by automating paper creation, saving users time and effort in compiling research content.

openedai-speech
OpenedAI Speech is a free, private text-to-speech server compatible with the OpenAI audio/speech API. It offers custom voice cloning and supports various models like tts-1 and tts-1-hd. Users can map their own Piper voices and create custom cloned voices. The server provides multilingual support with XTTS voices and allows fixing incorrect sounds with regex. Recent changes include bug fixes, improved error handling, and updates for multilingual support. Installation can be done via Docker or manual setup, with usage instructions provided. Custom voices can be created using Piper or Coqui XTTS v2, with guidelines for preparing audio files. The tool is suitable for tasks like generating speech from text, creating custom voices, and multilingual text-to-speech applications.
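Because the server is described as compatible with the OpenAI audio/speech API, a client request might look like the minimal Python sketch below. The helper name `build_speech_request`, the base URL `http://localhost:8000`, and the voice name `alloy` are assumptions for illustration; check the project's readme for the actual host, port, and available voices. The sketch only builds the request and does not send it.

```python
import json
import urllib.request


def build_speech_request(
    text: str,
    base_url: str = "http://localhost:8000",  # assumed local server address
    model: str = "tts-1",
    voice: str = "alloy",  # assumed voice name
    response_format: str = "mp3",
) -> urllib.request.Request:
    """Build a POST request in the shape of the OpenAI audio/speech endpoint."""
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    request = build_speech_request("Say the word 'rabbit' slowly.")
    print(request.get_method(), request.full_url)
    # To actually synthesize audio against a running server:
    # with urllib.request.urlopen(request) as response:
    #     open("speech.mp3", "wb").write(response.read())
```

Keeping request construction separate from sending makes it easy to point the same code at a different base URL.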

speech-to-speech
This repository implements a cascaded speech-to-speech pipeline made up of consecutive stages: Voice Activity Detection (VAD), Speech to Text (STT), Language Model (LM), and Text to Speech (TTS). It aims to provide a fully open and modular approach by leveraging models available through the Transformers library on the Hugging Face Hub. The code is designed for easy modification, with each component implemented as a class. Users can run the pipeline either with a server/client setup or locally, with detailed setup and usage instructions provided in the readme.
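The cascaded, one-class-per-component design can be sketched in a few lines of Python. Everything below is a toy stand-in under stated assumptions: the class names (`EnergyVAD`, `StubSTT`, `StubLM`, `StubTTS`, `CascadedPipeline`) are hypothetical and do not come from the repository, and the stub stages return fixed or trivially derived values instead of running real models.

```python
from typing import Any, List, Optional, Sequence


class EnergyVAD:
    """Toy voice activity detector: forwards audio only if it is loud enough."""

    def __init__(self, threshold: float = 0.1) -> None:
        self.threshold = threshold

    def process(self, samples: List[float]) -> Optional[List[float]]:
        if not samples:
            return None
        energy = sum(s * s for s in samples) / len(samples)
        return samples if energy >= self.threshold else None


class StubSTT:
    """Stand-in for a speech-to-text model; always returns the same transcript."""

    def process(self, samples: List[float]) -> str:
        return "hello"


class StubLM:
    """Stand-in for a language model; echoes the transcript in a reply."""

    def process(self, text: str) -> str:
        return f"You said: {text}"


class StubTTS:
    """Stand-in for a text-to-speech model; emits one pseudo-sample per character."""

    def process(self, text: str) -> List[float]:
        return [ord(ch) / 255.0 for ch in text]


class CascadedPipeline:
    """Feed each stage's output into the next; stop early if a stage returns None."""

    def __init__(self, stages: Sequence[Any]) -> None:
        self.stages = list(stages)

    def run(self, samples: List[float]) -> Optional[Any]:
        item: Any = samples
        for stage in self.stages:
            item = stage.process(item)
            if item is None:  # e.g. the VAD found no speech
                return None
        return item


if __name__ == "__main__":
    pipeline = CascadedPipeline([EnergyVAD(), StubSTT(), StubLM(), StubTTS()])
    audio_out = pipeline.run([0.5, -0.5, 0.5])
    print(None if audio_out is None else len(audio_out))
```

Because every stage exposes the same `process` method, swapping one model for another only means replacing one object in the list, which is the kind of modularity the description points to.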

41 - OpenAI Gpts

FruityChat
Transform your child's stuffed animals into interactive, talking playmates with distinct personalities, enhancing children's play and emotional growth.

English Pronunciation Helper
I assist with English pronunciation using the Turkish alphabet.

Sensory Supporter
A supportive guide for managing sensory dysregulation with tailored advice.

Speak GPT
Voice-centric English role-play tool for speaking practice with personalized feedback!

Children's Storyteller
Crafts engaging children's stories with valuable lessons and interactive elements.

SpeechTherapist GPT
Your very own speech therapy assistant. Completely private and confidential.

AI Phonetics and Reading Coach with Speech
Phonetics and reading coach with interactive voice capabilities, tailored for adult beginners.

Sensory Integration Guide
Your personalized guide to all things sensory, including Sensory Processing Disorder and Sensory Integration Therapy.

Bilingual Storyteller
I narrate bilingual stories for young children, focusing on language and cognitive skills.

Child Literacy Booster
Aids in developing literacy in children with engaging reading activities, storytelling techniques, and parental guidance.

Dedicated Occupational Therapist
Empathetic Occupational Therapist offering tailored medical consultations.

Dedicated Speech-Language Pathologist
Expert Speech-Language Pathologist offering tailored medical consultations.

Reading Tutor
A nurturing tutor dedicated to assisting children in grades K-5, enhancing their reading and literacy skills with patience and encouragement.

TOEFL Speaking Coach
Friendly bot for efficient TOEFL speaking practice, offering direct questions and detailed feedback.

I Spy With My Little Eye
I play a visual guessing game, challenging users to find hidden objects.

AI EDU Phonologie Principe Alphabétique Cycle 1
Teaching assistant for developing phonological awareness and the alphabetic principle.

Your Lingo AI Coach
Welcome! I'm a voice-focused language teacher for interactive speaking practice. To enable voice, download the app and tap the headphone button next to my chat window. Then choose your preferred voice. When you're ready, tell me what language you'd like to learn. It's FREE!

SpeechGPT User Guide
A guide for using SpeechGPT, focusing on its features, setup, and usage.

Dialect Detective
Expert in distinguishing language dialects, such as Castilian vs. Latin American Spanish and Parisian vs. Canadian French.