Best AI tools for< Find Speech Therapists >
20 - AI tool Sites

ToastwithAI
ToastwithAI is an AI-powered tool that helps users create wedding speeches. It asks users a few questions about the event and the people involved, and then generates a speech tailored to the user's tone and style. The speeches are designed to sound natural and personal, and can be edited and finalized by the user until they are satisfied. ToastwithAI is a quick and easy way to create a memorable wedding speech.

Future Tools
Future Tools is a website that collects and organizes AI tools. It provides a comprehensive list of AI tools categorized into various domains, including AI detection, aggregators, avatar chat, copywriting, finance, gaming, generative art, generative code, generative video, image improvement, image scanning, inspiration, marketing, motion capture, music, podcasting, productivity, prompt guides, research, self-improvement, social media, speech-to-text, text-to-speech, text-to-video, translation, video editing, and voice modulation. The website also offers a search bar to help users find specific tools based on their needs.

The Tech Deck
The Tech Deck is a comprehensive directory of AI tools that empowers users to find the perfect AI solution for their specific needs. Whether you're looking to enhance your writing, teaching, SEO, image generation, text-to-speech capabilities, or more, The Tech Deck has got you covered. With a wide range of AI tools available, you can streamline your workflow, boost your productivity, and achieve exceptional results.

babs.ai
babs.ai is an AI-powered job matching platform that connects talent with opportunities. It leverages intelligent matching algorithms to streamline the recruitment process and ensure a seamless experience for both job seekers and employers. The platform caters to a wide range of job roles and industries, making it a versatile solution for all types of users.

BoldVoice Accent Oracle
BoldVoice Accent Oracle is an AI-powered application designed to help users improve their American English accent. By analyzing users' speech patterns, it can accurately guess their native language within 30 seconds. The app provides personalized training to enhance pronunciation and intonation, aiming to help users sound more like native English speakers. BoldVoice Accent Oracle is a user-friendly tool that offers a fun and interactive way to work on accent reduction and language proficiency.

Stenote
Stenote is an AI-powered transcription and summarization tool that provides real-time, highly accurate transcriptions and concise summaries of conversations, lectures, and meetings. With its advanced AI capabilities, Stenote captures every word spoken, identifies key highlights, and generates comprehensive meeting notes, making it an invaluable tool for professionals seeking clarity and efficiency in their communication.

Free AI Tool
The website is a comprehensive directory of free and freemium AI tools in 2024. It showcases the latest artificial intelligence innovations that can enhance work and creativity at no cost. Users can explore a wide range of AI-powered tools for tasks like lead generation, music analysis, image generation, text-to-speech conversion, prompt databases, image processing, and more. The platform aims to provide users with cutting-edge AI solutions to boost productivity and efficiency in various domains.

TopApps.Ai
TopApps.Ai is the #1 AI Apps Directory on the Internet, offering a curated selection of AI-powered web applications across various categories. Users can find innovative tools for tasks such as copywriting, image editing, video generation, and productivity enhancement. The platform showcases cutting-edge AI technologies that cater to content creators, writers, musicians, and artists, enabling them to streamline their workflows and boost creativity. With a focus on user experience and efficiency, TopApps.Ai serves as a valuable resource for individuals seeking to leverage AI for enhanced digital experiences.

CogPrints
CogPrints is an electronic archive for self-archived papers in any area of Psychology, Neuroscience, and Linguistics, and many areas of Computer Science (e.g., artificial intelligence, robotics, vision, learning, speech, neural networks), Philosophy (e.g., mind, language, knowledge, science, logic), Biology (e.g., ethology, behavioral ecology, sociobiology, behavior genetics, evolutionary theory), Medicine (e.g., Psychiatry, Neurology, human genetics, Imaging), Anthropology (e.g., primatology, cognitive ethnology, archeology, paleontology), as well as any other portions of the physical, social and mathematical sciences that are pertinent to the study of cognition.

Audionotes
Audionotes is an AI-powered note-taking app that uses speech-to-text technology to transcribe and summarize audio recordings. It also offers a variety of features to help users organize and manage their notes, including the ability to create to-do lists, set reminders, and share notes with others. Audionotes is available as a web app, a mobile app, and a Chrome extension.

TopTools.ai
The website toptools.ai is the #1 AI Tools Directory, providing a platform for users to discover and access various AI tools and applications. Users can filter tools based on pricing models and categories such as advertising, analysis, chatbots, design, education, marketing, and more. The site offers a wide range of AI-powered tools for different purposes, from content creation and SEO optimization to mental health support and influencer marketing. Users can find tools for free, on a free trial, freemium, or paid basis, catering to diverse needs and preferences in the AI space.

Wavel
Wavel is the #1 AI marketplace offering a comprehensive directory of AI tools for professionals and business owners. With over 5000 AI tools in various categories like marketing, business, finance, and code, Wavel aims to elevate users' professional edge by providing access to cutting-edge AI technology. Users can find both free and paid tools on the platform, catering to a wide range of needs from logo design to 3D modeling and text-to-speech services.

Forit.ai
Forit.ai is a comprehensive directory that connects users with AI tools across various categories to solve specific problems or improve productivity. It provides a curated list of the best AI technologies available today, including creative aids, analytical tools, and privacy solutions. Users can easily find the right solutions for their needs, whether they are developers, marketers, or hobbyists.

Raw Shorts
Raw Shorts is an artificial intelligence video editor and text to video maker that helps businesses make explainer videos, animations and promo videos for the web and social media. It uses AI to create a video draft within seconds, saving you countless hours of video creation. You can then customize the video in the drag and drop editor, or use the text to speech feature to add narration. Raw Shorts has a library of over one million commercially licensed and creative commons videos, photos, icons and animations that you can use in your videos. It's a great tool for businesses that want to create high-quality videos without spending a lot of time or money.

AIToolsly
AIToolsly.com is a free AI Tools Directory that offers a wide range of AI applications for various purposes such as AI ChatBots, AI HR, AI Marketplace, AI SEO, Analytics, Audio Editing, Text to Speech, Business, Customer Support, Finance, Home Design, Legal Assistants, Presentation, Productivity, Real Estate, Sales, Virtual Assistant, Code Assistants, Developer Tools, Data Analysis, Ecommerce, Education, Engineering, Fashion, Fitness, Health, Image Editing, Logo Generators, Project Management, Social Network, Video Editing, Writing, and more. Users can find and explore over 9999 AI tools in one place, catering to different industries and needs.

Five9
Five9 is a leading provider of cloud contact center software. We are driven by a passion to transform call and contact centers into customer engagement centers of excellence. Our AI-powered solutions help businesses deliver exceptional customer experiences, improve operational efficiency, and reduce costs. With Five9, you can: * Empower agents to deliver results anywhere * Improve CX with practical AI * Find efficiency with AI & automation * Scale with AI & digital workforce * Realize results with Five9

Find AI
Find AI is an AI-powered search engine that provides users with advanced search capabilities to unlock contact details and gain more accurate insights. The platform caters to individuals and companies looking to research people, companies, startups, founders, and more. Users can access email addresses and premium search features to explore a wide range of data related to various industries and sectors. Find AI offers a user-friendly interface and efficient search algorithms to deliver relevant results in a timely manner.

Find your next book
Find your next book is an AI-powered librarian that provides personalized book recommendations based on your preferences. It uses advanced algorithms to analyze your reading history, interests, and other factors to suggest books that you're likely to enjoy. The platform offers a wide range of genres and authors to choose from, making it easy to find your next favorite read.

Find Your AIs
Find Your AIs is an AI directory website that showcases a wide range of AI tools and applications. It offers a platform for users to explore and discover various AI-powered solutions across different categories such as digital wellness, marketing, text-to-image generation, resume customization, and more. The website aims to connect users with innovative AI technologies to enhance their daily lives and work efficiency.

Find My Remote
Find My Remote is an AI-powered job search platform that streamlines the job hunting process by leveraging artificial intelligence to find and structure job postings from various ATS platforms. Users can set their job preferences, receive personalized job matches, and save time by applying to curated job listings. The platform offers exclusive job opportunities not typically found on popular job search websites like LinkedIn. With features such as job discovery, application tracking, and faster application process, Find My Remote aims to revolutionize the way job seekers find and apply for jobs.
20 - Open Source AI Tools

pyht
pyht is a Python SDK for the PlayHT's AI Text-to-Speech API, allowing users to convert text into high-quality audio streams in humanlike voice. It supports real-time text-to-speech streaming, pre-built and custom voices, various audio formats, and different sample rates.

EmotiVoice
EmotiVoice is a powerful and modern open-source text-to-speech engine that supports emotional synthesis, enabling users to create speech with a wide range of emotions such as happy, excited, sad, and angry. It offers over 2000 different voices in both English and Chinese. Users can access EmotiVoice through an easy-to-use web interface or a scripting interface for batch generation of results. The tool is continuously evolving with new features and updates, prioritizing community input and user feedback.

voice-chat-ai
Voice Chat AI is a project that allows users to interact with different AI characters using speech. Users can choose from various characters with unique personalities and voices, and have conversations or role play with them. The project supports OpenAI, xAI, or Ollama language models for chat, and provides text-to-speech synthesis using XTTS, OpenAI TTS, or ElevenLabs. Users can seamlessly integrate visual context into conversations by having the AI analyze their screen. The project offers easy configuration through environment variables and can be run via WebUI or Terminal. It also includes a huge selection of built-in characters for engaging conversations.

awesome-ai
Awesome AI is a curated list of artificial intelligence resources including courses, tools, apps, and open-source projects. It covers a wide range of topics such as machine learning, deep learning, natural language processing, robotics, conversational interfaces, data science, and more. The repository serves as a comprehensive guide for individuals interested in exploring the field of artificial intelligence and its applications across various domains.

shellChatGPT
ShellChatGPT is a shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS, featuring integration with LocalAI, Ollama, Gemini, Mistral, Groq, and GitHub Models. It provides text and chat completions, vision, reasoning, and audio models, voice-in and voice-out chatting mode, text editor interface, markdown rendering support, session management, instruction prompt manager, integration with various service providers, command line completion, file picker dialogs, color scheme personalization, stdin and text file input support, and compatibility with Linux, FreeBSD, MacOS, and Termux for a responsive experience.

SurveyX
SurveyX is an advanced academic survey automation system that leverages Large Language Models (LLMs) to generate high-quality, domain-specific academic papers and surveys. Users can request comprehensive academic papers or surveys tailored to specific topics by providing a paper title and keywords for literature retrieval. The system streamlines academic research by automating paper creation, saving users time and effort in compiling research content.

FunClip
FunClip is an open-source, locally deployable automated video editing tool that utilizes the FunASR Paraformer series models from Alibaba DAMO Academy for speech recognition in videos. Users can select text segments or speakers from the recognition results and click the clip button to obtain the corresponding video segments. FunClip integrates advanced features such as the Paraformer-Large model for accurate Chinese ASR, SeACo-Paraformer for customized hotword recognition, CAM++ speaker recognition model, Gradio interactive interface for easy usage, support for multiple free edits with automatic SRT subtitles generation, and segment-specific SRT subtitles.

FunClip
FunClip is an open-source, locally deployed automated video clipping tool that leverages Alibaba TONGYI speech lab's FunASR Paraformer series models for speech recognition on videos. Users can select text segments or speakers from recognition results to obtain corresponding video clips. It integrates industrial-grade models for accurate predictions and offers hotword customization and speaker recognition features. The tool is user-friendly with Gradio interaction, supporting multi-segment clipping and providing full video and target segment subtitles. FunClip is suitable for users looking to automate video clipping tasks with advanced AI capabilities.

openedai-speech
OpenedAI Speech is a free, private text-to-speech server compatible with the OpenAI audio/speech API. It offers custom voice cloning and supports various models like tts-1 and tts-1-hd. Users can map their own piper voices and create custom cloned voices. The server provides multilingual support with XTTS voices and allows fixing incorrect sounds with regex. Recent changes include bug fixes, improved error handling, and updates for multilingual support. Installation can be done via Docker or manual setup, with usage instructions provided. Custom voices can be created using Piper or Coqui XTTS v2, with guidelines for preparing audio files. The tool is suitable for tasks like generating speech from text, creating custom voices, and multilingual text-to-speech applications.

speech-trident
Speech Trident is a repository focusing on speech/audio large language models, covering representation learning, neural codec, and language models. It explores speech representation models, speech neural codec models, and speech large language models. The repository includes contributions from various researchers and provides a comprehensive list of speech/audio language models, representation models, and codec models.

cgft-llm
The cgft-llm repository is a collection of video tutorials and documentation for implementing large models. It provides guidance on topics such as fine-tuning llama3 with llama-factory, lightweight deployment and quantization using llama.cpp, speech generation with ChatTTS, introduction to Ollama for large model deployment, deployment tools for vllm and paged attention, and implementing RAG with llama-index. Users can find detailed code documentation and video tutorials for each project in the repository.

Awesome-Audio-LLM
Awesome-Audio-LLM is a repository dedicated to various models and methods related to audio and language processing. It includes a wide range of research papers and models developed by different institutions and authors. The repository covers topics such as bridging audio and language, speech emotion recognition, voice assistants, and more. It serves as a comprehensive resource for those interested in the intersection of audio and language processing.

MockingBird
MockingBird is a toolbox designed for Mandarin speech synthesis using PyTorch. It supports multiple datasets such as aidatatang_200zh, magicdata, aishell3, and data_aishell. The toolbox can run on Windows, Linux, and M1 MacOS, providing easy and effective speech synthesis with pretrained encoder/vocoder models. It is webserver ready for remote calling. Users can train their own models or use existing ones for the encoder, synthesizer, and vocoder. The toolbox offers a demo video and detailed setup instructions for installation and model training.

LLM-Codec
This repository provides an LLM-driven audio codec model, LLM-Codec, for building multi-modal LLMs (text and audio modalities). The model enables frozen LLMs to achieve multiple audio tasks in a few-shot style without parameter updates. It compresses the audio modality into a well-trained LLMs token space, treating audio representation as a 'foreign language' that LLMs can learn with minimal examples. The proposed approach supports tasks like speech emotion classification, audio classification, text-to-speech generation, speech enhancement, etc., demonstrating feasibility and effectiveness in simple scenarios. The LLM-Codec model is open-sourced to facilitate research on few-shot audio task learning and multi-modal LLMs.

VSP-LLM
VSP-LLM (Visual Speech Processing incorporated with LLMs) is a novel framework that maximizes context modeling ability by leveraging the power of LLMs. It performs multi-tasks of visual speech recognition and translation, where given instructions control the task type. The input video is mapped to the input latent space of a LLM using a self-supervised visual speech model. To address redundant information in input frames, a deduplication method is employed using visual speech units. VSP-LLM utilizes Low Rank Adaptors (LoRA) for computationally efficient training.

MeloTTS
MeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai. It supports various languages including English (American, British, Indian, Australian), Spanish, French, Chinese, Japanese, and Korean. The Chinese speaker also supports mixed Chinese and English. The library is fast enough for CPU real-time inference and offers features like using without installation, local installation, and training on custom datasets. The Python API and model cards are available in the repository and on HuggingFace. The community can join the Discord channel for discussions and collaboration opportunities. Contributions are welcome, and the library is under the MIT License. MeloTTS is based on TTS, VITS, VITS2, and Bert-VITS2.

openai-edge-tts
This project provides a local, OpenAI-compatible text-to-speech (TTS) API using `edge-tts`. It emulates the OpenAI TTS endpoint (`/v1/audio/speech`), enabling users to generate speech from text with various voice options and playback speeds, just like the OpenAI API. `edge-tts` uses Microsoft Edge's online text-to-speech service, making it completely free. The project supports multiple audio formats, adjustable playback speed, and voice selection options, providing a flexible and customizable TTS solution for users.

wunjo.wladradchenko.ru
Wunjo AI is a comprehensive tool that empowers users to explore the realm of speech synthesis, deepfake animations, video-to-video transformations, and more. Its user-friendly interface and privacy-first approach make it accessible to both beginners and professionals alike. With Wunjo AI, you can effortlessly convert text into human-like speech, clone voices from audio files, create multi-dialogues with distinct voice profiles, and perform real-time speech recognition. Additionally, you can animate faces using just one photo combined with audio, swap faces in videos, GIFs, and photos, and even remove unwanted objects or enhance the quality of your deepfakes using the AI Retouch Tool. Wunjo AI is an all-in-one solution for your voice and visual AI needs, offering endless possibilities for creativity and expression.
20 - OpenAI Gpts

SpeechTherapist GPT
Your very own speech therapy assistant. Completely private and confidential.

I Spy With My Little Eye
I play a visual guessing game, challenging users to find hidden objects.

Sensory Integration Guide
Your personalized guide guide to all things sensory, including Sensory Processing Disorder and Sensory Integration Therapy.

Occupational Therapist Expert GPT
A robust assistant for OTs creating patient-focused plans.

Neurodiversity Navigator
Autism and Beyond: Your Daily Companion for Neurodiversity Understanding and Support

Autism Pathway Companion
Expert in autism development, offering guidance on milestones and support strategies.

Detailed Speech Drafting Wizard
Crafts speeches from PowerPoint slides and reference materials, adding depth and context.

AI.EX Wedding Speech Consultant
Your partner in crafting perfect wedding speeches. Let me be your guide to writing impactful, memorable speeches for unforgettable moments.

Topics for TED Talk-style presentations
'TED Talk Presentation Topics' is a prompt expert in discovering and developing captivating TED Talk presentation topics, tailored to the user's experience, interest, and goals, ensuring relevance, diversity, and inspiration.

Oracle of TED
The Oracle of TED uses the top 100 ted talks from 2006-2021 to help you prepare your next presentation.