Best AI tools for< Choose Microphone >
20 - AI tool Sites

Talkatoo
Talkatoo is a dictation software that uses AI to help veterinarians save time and increase productivity. It offers three levels of control, so you can choose how hands-off you want to be. With Verified, you can simply record your notes and our scribes will verify the accuracy and place them in your PMS for you. With Auto-SOAP Records, you can record an entire exam or dictate your notes after and have Talkatoo auto-magically format the recording into a SOAP note, or other template. With Desktop Dictation, you can dictate in any field, in any app, on Mac or Windows. You can even connect your mobile device as a secure microphone to make the process easier.

VERSA
VERSA is a text-based adventure game that allows users to choose their own adventure and customize their companion. Users can choose from a variety of settings, including sci-fi, wild-west, horror, drama, war, university, or fantasy. They can also choose a male, female, or non-binary companion to be their friend, romance, or enemy. VERSA is designed to push the limits of what's possible with a 1-gem model, while keeping it as entertaining as possible.

Clarity AI
Clarity AI is an AI-powered technology platform that offers a Sustainability Tech Kit for sustainable investing, shopping, reporting, and benchmarking. The platform provides built-in sustainability technology with customizable solutions for various needs related to data, methodologies, and tools. It seamlessly integrates into workflows, offering scalable and flexible end-to-end SaaS tools to address sustainability use cases. Clarity AI leverages powerful AI and machine learning to analyze vast amounts of data points, ensuring reliable and transparent data coverage. The platform is designed to empower users to assess, analyze, and report on sustainability aspects efficiently and confidently.

Cardinal
Cardinal is an AI-powered product backlog tool that helps product managers prioritize features and make data-driven decisions. It integrates with your CRM and customer support tools to collect customer feedback and revenue data, which it then uses to identify the most valuable features to build. Cardinal also provides a clear view of your product roadmap and progress, so you can always see what's coming up and how it's aligned with your business goals.

Thabble
Thabble is an AI-powered platform that allows users to create their own adventure stories by making choices. It is designed as a creative activity for parents and kids to engage in together. Users can generate brand new stories of up to 3,000 words in length, with the option for kids to verbally express their desired story outcomes. The platform saves stories to a personal Library for sharing and future reading. Additionally, a feature to have the AI read stories aloud is planned for future release.

TheStoryGPT
TheStoryGPT is an AI-powered interactive storytelling tool that allows users to create personalized interactive stories. With a focus on immersive storytelling, users can engage with a variety of stories that respond to their choices. The tool offers high-quality audio experiences by allowing users to choose from a list of narrators. TheStoryGPT provides both free and paid plans, with the option to purchase credits for advanced choices. Users can contact the team for any questions or feedback via email.

Coloring-Pages.AI
Coloring-Pages.AI is an online platform that utilizes advanced AI technology to create personalized coloring pages for both children and adults. Users can input prompts and choose from various image sizes to generate unique and custom coloring designs quickly and easily. The platform offers both free and premium pricing plans, with the AI algorithm considering factors like image composition, style, and size to produce high-quality designs. Coloring-Pages.AI is user-friendly, intuitive, and suitable for individuals of all ages, providing endless creativity and fun through AI-generated coloring pages.

STELLARWITS
STELLARWITS is an AI solutions and software platform that empowers users to explore cutting-edge technology and innovation. The platform offers AI models with versatile capabilities, ranging from content generation to data analysis to problem-solving. Users can engage directly with the technology, experiencing its power in real-time. With a focus on transforming ideas into technology, STELLARWITS provides tailored solutions in software and AI development, delivering intelligent systems and machine learning models for innovative and efficient solutions. The platform also features a download hub with a curated selection of solutions to enhance the digital experience. Through blogs and company information, users can delve deeper into the narrative of STELLARWITS, exploring its mission, vision, and commitment to reshaping the tech landscape.

Hostinger
Hostinger is a web hosting provider that offers a variety of services, including shared hosting, VPS hosting, cloud hosting, and managed WordPress hosting. They also offer a website builder and a domain name registration service. Hostinger's mission is to bring success to everyone online, and they constantly improve their server technology, provide professional support, and simplify site creation with their AI Website Builder.

AI to Human Text Converter
AI to Human Text Converter is an advanced tool that humanizes AI-generated text to make it sound more natural and authentic. It helps users refine and add a personal touch to their content created using AI tools, bridging the gap between cold AI output and genuine human writing. The tool is beneficial for students, bloggers, marketers, and webmasters who seek to enhance the readability and authenticity of their content without losing the human appeal. AI to Human is equipped with a built-in AI detector to ensure 100% human output, free from errors and plagiarism.

W.A.I.T
W.A.I.T is a web-based AI-powered writing assistant that helps users improve their writing skills. It offers a range of features, including content generation, content enhancement, translation, and social media assistance. W.A.I.T is designed to be user-friendly and accessible to writers of all levels.

Animalspicker
Animalspicker.com is an AI animal generator and blog that offers a wide range of resources related to animals. Users can randomly generate their favorite animals, explore pet care tips, learn about wildlife conservation efforts, delve into animal behavior, and discover information about exotic pets and pet nutrition. The website aims to provide comprehensive information to help users care for their pets and contribute to wildlife conservation.

Whatifi? AI
Whatifi? AI is an AI story generator that allows users to create and read personalized 'choose-your-own-adventure' stories, chapter by chapter. Each chapter, crafted on-the-fly by AI text generation, adapts in real-time to user decisions, offering a dynamic storytelling experience. Users can explore various genres like Sci-Fi, Fantasy, Horror, Mystery, Crime, and Romance. The platform is suitable for casual readers, gamers, families, creative thinkers, and educational purposes, providing a unique and interactive way to engage with storytelling.

MagicShorts.ai
MagicShorts.ai is an AI-powered platform that enables users to create unique faceless short-form videos for social media content. The platform offers AI-generated scripts, life-like voices, stunning artistic images, background music selection, and customization options for creating engaging videos. Users can choose from different subscription plans to access various features and benefits, including video editing, image replacement, and video length customization. MagicShorts.ai ensures that each video created is unique, thanks to generative AI technology.

Fontjoy
Fontjoy is a web application that helps users generate font pairings with just one click. It simplifies the process of creating balanced contrast font combinations using deep learning technology. Users can easily create new font pairings, lock fonts they like, and manually choose fonts. The tool aims to assist users in selecting fonts that complement each other while maintaining a cohesive theme with pleasing contrast.

Weights & Biases
Weights & Biases is an AI tool that offers documentation, guides, tutorials, and support for using AI models in applications. The platform provides two main products: W&B Weave for integrating AI models into code and W&B Models for building custom AI models. Users can access features such as tracing, output evaluation, cost estimates, hyperparameter sweeps, model registry, and more. Weights & Biases aims to simplify the process of working with AI models and improving model reproducibility.

StoryPathGame
The website is an AI story generator tool called StoryPathGame. It allows users to select a story and embark on a unique adventure each time. The AI generates captivating and personalized narratives based on the user's choices, making it perfect for bedtime stories or personalized journeys. Users can craft their own stories and let the AI lead the way in creating engaging content. StoryPathGame aims to bring stories to life through AI technology, providing an enchanting and interactive storytelling experience online.

Choosy Chat
Choosy Chat is an AI-powered chat application that utilizes advanced AI models such as OpenAI GPT-4o and Google Gemini Pro 1.5 to provide intelligent responses and engage in meaningful conversations with users. The application is designed to assist users in various tasks, including answering questions, providing information on recent knowledge, coding assistance, and reasoning puzzles. Choosy Chat aims to enhance user experience through its cutting-edge AI technology and user-friendly interface.

Armchair
Armchair is an AI-powered business partner that can help you launch a consulting side hustle or full-time business. With Armchair, you get access to a proven roadmap, AI tools, personalized coaching, and a supportive community of consultants designed to turn your expertise into a thriving consulting side hustle.

TOP AI Center
TOP AI Center is a comprehensive platform that serves as a trusted resource for accessing the most advanced AI tools globally. It offers a curated selection of top-performing AI tools across various categories, empowering users to find the perfect solution for any task. The platform features expertly curated tools, user-centric design, and advanced search and filtering options to enhance efficiency and innovation in every field. TOP AI Center aims to make AI accessible to everyone, regardless of expertise or industry, by providing a centralized hub of elite AI resources.
20 - Open Source AI Tools

Whisper-WebUI
Whisper-WebUI is a Gradio-based browser interface for Whisper, serving as an Easy Subtitle Generator. It supports generating subtitles from various sources such as files, YouTube, and microphone. The tool also offers speech-to-text and text-to-text translation features, utilizing Facebook NLLB models and DeepL API. Users can translate subtitle files from other languages to English and vice versa. The project integrates faster-whisper for improved VRAM usage and transcription speed, providing efficiency metrics for optimized whisper models. Additionally, users can choose from different Whisper models based on size and language requirements.

gemini-multimodal-playground
Gemini Multimodal Playground is a basic Python app for voice conversations with Google's Gemini 2.0 AI model. It features real-time voice input and text-to-speech responses. Users can configure settings through the GUI and interact with Gemini by speaking into the microphone. The application provides options for voice selection, system prompt customization, and enabling Google search. Troubleshooting tips are available for handling audio feedback loop issues that may occur during interactions.

voice-chat-ai
Voice Chat AI is a project that allows users to interact with different AI characters using speech. Users can choose from various characters with unique personalities and voices, and have conversations or role play with them. The project supports OpenAI, xAI, or Ollama language models for chat, and provides text-to-speech synthesis using XTTS, OpenAI TTS, or ElevenLabs. Users can seamlessly integrate visual context into conversations by having the AI analyze their screen. The project offers easy configuration through environment variables and can be run via WebUI or Terminal. It also includes a huge selection of built-in characters for engaging conversations.

vibe
Vibe is a tool designed to transcribe audio in multiple languages with features such as offline functionality, user-friendly design, support for various file formats, automatic updates, and translation. It is optimized for different platforms and hardware, offering total freedom to customize models easily. The tool is ideal for transcribing audio and video files, with upcoming features like transcribing system audio and audio from microphone. Vibe is a versatile and efficient transcription tool suitable for various users.

RealtimeSTT_LLM_TTS
RealtimeSTT is an easy-to-use, low-latency speech-to-text library for realtime applications. It listens to the microphone and transcribes voice into text, making it ideal for voice assistants and applications requiring fast and precise speech-to-text conversion. The library utilizes Voice Activity Detection, Realtime Transcription, and Wake Word Activation features. It supports GPU-accelerated transcription using PyTorch with CUDA support. RealtimeSTT offers various customization options for different parameters to enhance user experience and performance. The library is designed to provide a seamless experience for developers integrating speech-to-text functionality into their applications.

Open-LLM-VTuber
Open-LLM-VTuber is a project in early stages of development that allows users to interact with Large Language Models (LLM) using voice commands and receive responses through a Live2D talking face. The project aims to provide a minimum viable prototype for offline use on macOS, Linux, and Windows, with features like long-term memory using MemGPT, customizable LLM backends, speech recognition, and text-to-speech providers. Users can configure the project to chat with LLMs, choose different backend services, and utilize Live2D models for visual representation. The project supports perpetual chat, offline operation, and GPU acceleration on macOS, addressing limitations of existing solutions on macOS.

OSHW-SenseCAP-Watcher
SenseCAP Watcher is a monitoring device built on ESP32S3 with Himax WiseEye2 HX6538 AI chip, excelling in image and vector data processing. It features a camera, microphone, and speaker for visual, auditory, and interactive capabilities. With LLM-enabled SenseCraft suite, it understands commands, perceives surroundings, and triggers actions. The repository provides firmware, hardware documentation, and applications for the Watcher, along with detailed guides for setup, task assignment, and firmware flashing.

kobold_assistant
Kobold-Assistant is a fully offline voice assistant interface to KoboldAI's large language model API. It can work online with the KoboldAI horde and online speech-to-text and text-to-speech models. The assistant, called Jenny by default, uses the latest coqui 'jenny' text to speech model and openAI's whisper speech recognition. Users can customize the assistant name, speech-to-text model, text-to-speech model, and prompts through configuration. The tool requires system packages like GCC, portaudio development libraries, and ffmpeg, along with Python >=3.7, <3.11, and runs on Ubuntu/Debian systems. Users can interact with the assistant through commands like 'serve' and 'list-mics'.

meeting-minutes
An open-source AI assistant for taking meeting notes that captures live meeting audio, transcribes it in real-time, and generates summaries while ensuring user privacy. Perfect for teams to focus on discussions while automatically capturing and organizing meeting content without external servers or complex infrastructure. Features include modern UI, real-time audio capture, speaker diarization, local processing for privacy, and more. The tool also offers a Rust-based implementation for better performance and native integration, with features like live transcription, speaker diarization, and a rich text editor for notes. Future plans include database connection for saving meeting minutes, improving summarization quality, and adding download options for meeting transcriptions and summaries. The backend supports multiple LLM providers through a unified interface, with configurations for Anthropic, Groq, and Ollama models. System architecture includes core components like audio capture service, transcription engine, LLM orchestrator, data services, and API layer. Prerequisites for setup include Node.js, Python, FFmpeg, and Rust. Development guidelines emphasize project structure, testing, documentation, type hints, and ESLint configuration. Contributions are welcome under the MIT License.

MITSUHA
OneReality is a virtual waifu/assistant that you can speak to through your mic and it'll speak back to you! It has many features such as: * You can speak to her with a mic * It can speak back to you * Has short-term memory and long-term memory * Can open apps * Smarter than you * Fluent in English, Japanese, Korean, and Chinese * Can control your smart home like Alexa if you set up Tuya (more info in Prerequisites) It is built with Python, Llama-cpp-python, Whisper, SpeechRecognition, PocketSphinx, VITS-fast-fine-tuning, VITS-simple-api, HyperDB, Sentence Transformers, and Tuya Cloud IoT.

openai-chat-api-workflow
**OpenAI Chat API Workflow for Alfred** An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT-3.5/GPT-4 🤖💬 It also allows image generation 🖼️, image understanding 👀, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈 **Features:** * Execute all features using Alfred UI, selected text, or a dedicated web UI * Web UI is constructed by the workflow and runs locally on your Mac 💻 * API call is made directly between the workflow and OpenAI, ensuring your chat messages are not shared online with anyone other than OpenAI 🔒 * OpenAI does not use the data from the API Platform for training 🚫 * Export chat data to a simple JSON format external file 📄 * Continue the chat by importing the exported data later 🔄

Linly-Talker
Linly-Talker is an innovative digital human conversation system that integrates the latest artificial intelligence technologies, including Large Language Models (LLM) 🤖, Automatic Speech Recognition (ASR) 🎙️, Text-to-Speech (TTS) 🗣️, and voice cloning technology 🎤. This system offers an interactive web interface through the Gradio platform 🌐, allowing users to upload images 📷 and engage in personalized dialogues with AI 💬.

Synthalingua
Synthalingua is an advanced, self-hosted tool that leverages artificial intelligence to translate audio from various languages into English in near real time. It offers multilingual outputs and utilizes GPU and CPU resources for optimized performance. Although currently in beta, it is actively developed with regular updates to enhance capabilities. The tool is not intended for professional use but for fun, language learning, and enjoying content at a reasonable pace. Users must ensure speakers speak clearly for accurate translations. It is not a replacement for human translators and users assume their own risk and liability when using the tool.

keras-llm-robot
The Keras-llm-robot Web UI project is an open-source tool designed for offline deployment and testing of various open-source models from the Hugging Face website. It allows users to combine multiple models through configuration to achieve functionalities like multimodal, RAG, Agent, and more. The project consists of three main interfaces: chat interface for language models, configuration interface for loading models, and tools & agent interface for auxiliary models. Users can interact with the language model through text, voice, and image inputs, and the tool supports features like model loading, quantization, fine-tuning, role-playing, code interpretation, speech recognition, image recognition, network search engine, and function calling.

org-ai
org-ai is a minor mode for Emacs org-mode that provides access to generative AI models, including OpenAI API (ChatGPT, DALL-E, other text models) and Stable Diffusion. Users can use ChatGPT to generate text, have speech input and output interactions with AI, generate images and image variations using Stable Diffusion or DALL-E, and use various commands outside org-mode for prompting using selected text or multiple files. The tool supports syntax highlighting in AI blocks, auto-fill paragraphs on insertion, and offers block options for ChatGPT, DALL-E, and other text models. Users can also generate image variations, use global commands, and benefit from Noweb support for named source blocks.

amazon-transcribe-live-call-analytics
The Amazon Transcribe Live Call Analytics (LCA) with Agent Assist Sample Solution is designed to help contact centers assess and optimize caller experiences in real time. It leverages Amazon machine learning services like Amazon Transcribe, Amazon Comprehend, and Amazon SageMaker to transcribe and extract insights from contact center audio. The solution provides real-time supervisor and agent assist features, integrates with existing contact centers, and offers a scalable, cost-effective approach to improve customer interactions. The end-to-end architecture includes features like live call transcription, call summarization, AI-powered agent assistance, and real-time analytics. The solution is event-driven, ensuring low latency and seamless processing flow from ingested speech to live webpage updates.
20 - OpenAI Gpts

Choose Your Own Adventure Housing
Transform Your Home Search into an Epic Journey with Choose Your Own Adventure Housing – Where Every Click is a New Path!

Choose Your Own Adventure Book Generator
Fantasy author crafting a Choose Your Own Adventure book, with interactive storytelling.

The Meme Doctor (GIVE ME A TRY!!)
Choose a topic. Choose a quote out of the many I create for you. Wait for the Magic to Happen!! Kaboozi, got yourself some funny azz memes!

Historicat Illustrator
Choose a year and travel back in our cat powered time machine. See for yourself key events in cat history!

AI.EX: Virtual Pet Adventure
Choose a special pet to tame, care for, adventure with & love. Create illustrations of your adventures together.

PersonAE (American English Dialects)
Choose a target persona and see if ChatGPT correctly impersonates the American

The Ikigai Market Selector
This GPT will help you choose a market you could start creating a business in using Ed Dales 30 Day Challenge version of the Japanese Ikigai Process

Mindful Match
A mental health assistant to help choose a therapist based on needs, insurance, and location.