Best AI tools for< Interact With Voice >
20 - AI tool Sites

Spheroid AI Avatars
Spheroid AI Avatars is a platform that allows users to create and customize interactive digital characters that can see, speak, converse, and understand natural language. These avatars can be used for various purposes, such as customer service, entertainment, education, and more. Spheroid AI Avatars can be placed anywhere in the world using augmented reality, allowing users to interact with them in a more immersive and engaging way.

Vapi
Vapi is a Voice AI tool designed specifically for developers. It enables developers to interact with their code using voice commands, making the coding process more efficient and hands-free. With Vapi, developers can perform various tasks such as writing code, debugging, and running tests simply by speaking. The tool is equipped with advanced natural language processing capabilities to accurately interpret and execute voice commands. Vapi aims to revolutionize the way developers work by providing a seamless and intuitive coding experience.

VoiceGPT
VoiceGPT is an Android app that provides a voice-based interface to interact with AI language models like ChatGPT, Bing AI, and Bard. It offers features such as unlimited free messages, voice input and output in 67+ languages, a floating bubble for easy switching between apps, OCR text recognition, code execution, image generation with DALL-E 2, and support for ChatGPT Plus accounts. VoiceGPT is designed to be accessible for users with visual impairments, dyslexia, or other conditions, and it can be set as the default assistant to be activated hands-free with a custom hotword.

WizAI
WizAI is an AI tool that offers ChatGPT for WhatsApp, Instagram, and the web. It provides users with the ability to engage in text and voice chat, image and video recognition, and more. WizAI is powered by OpenAI's ChatGPT, offering advanced AI capabilities for generating smart replies and interacting with users in a human-like manner.

Hume AI - Octave
Hume AI is an AI application that offers the Octave language model for text-to-speech (TTS) capabilities. It provides a voice-based LLM that understands words in context to predict emotions, cadence, and more. Users can create various AI voices with specific prompts and scripts, adjusting emotional delivery and speaking styles on command. The application aims to generate expressive AI voices for podcasts, voiceovers, audiobooks, and more, with total control over the voice output.

CalenAI
CalenAI is an AI-powered scheduling agent that uses human-like voice technology to qualify leads and schedule appointments. It is designed to sound and feel just like a human, making it easy for customers to interact with and schedule appointments. CalenAI also offers personalized onboarding to help businesses set up the agent for their specific needs.

Soundverse AI
Soundverse AI is an AI music generator and music assistant that allows users to create music instantly from text prompts, interact with a voice assistant for music-related help, chat with the assistant for music recommendations, extend existing tracks with new sections, isolate individual audio tracks from a mix, auto-complete songs using initial ideas, craft lyrics with AI assistance, and more. The platform offers a range of AI tools to help users iterate and personalize their music creation process, making it easy to transform ideas into music in seconds.

Quant-Tek.AI
Quant-Tek.AI is a premier provider of conversational artificial intelligence tools, empowering businesses with human-like voice AI solutions. Their mission is to revolutionize the way businesses interact with customers by providing intelligent solutions that automate communication and enhance customer experience. They aim to drive efficiency, improve customer satisfaction, and foster growth through cutting-edge AI technology. Quant-Tek.AI values innovation, excellence, integrity, and collaboration in their pursuit of AI innovation and shaping the future of business communication.

Spoken AI
Spoken AI is an innovative AI tool that enables users to interact with technology through voice commands. It leverages cutting-edge natural language processing and machine learning algorithms to understand and respond to spoken language. With Spoken AI, users can perform various tasks hands-free, such as setting reminders, sending messages, playing music, and getting weather updates. The application aims to enhance user experience by providing a seamless and intuitive way to engage with devices using voice input.

Native AI
Native AI is an innovative AI tool that aims to revolutionize the way users interact with various applications by providing a unified interface for faster and more efficient work. It eliminates the need for context switching, clunky user interfaces, and manual tasks, offering a seamless experience across different apps. Users can interact with AI through voice commands, typing, or clicking, enabling lightning-fast interactions and effortless automations. The tool simplifies complex tasks by providing automation suggestions and intuitive interfaces based on user intent, ultimately enhancing productivity and streamlining workflows.

VoiceLine
VoiceLine is an AI-based field sales revenue intelligence tool designed to enhance the efficiency and productivity of field sales teams. It allows users to capture touchpoints using voice commands, automate administrative tasks, and gain actionable insights directly from the field. With advanced speech recognition capabilities and offline functionality, VoiceLine aims to revolutionize the way salespeople work and interact with customers, ultimately driving more revenue for businesses.

AiCogni
AiCogni is a multi-lingual voice chat bot and writing assistant, powered by ChatGPT. It is designed to be a versatile AI companion that can help users with a wide range of tasks, from learning and communication to creativity and productivity. AiCogni's advanced ChatGPT technology enables it to understand and respond to user queries in a natural and informative way, making it an ideal tool for anyone looking to enhance their communication and learning experiences.

Twinning
Twinning is an AI application that allows users to create a virtual clone of themselves for their followers to interact with on social media platforms. Users can record an audio of themselves speaking, and the AI twin is generated within minutes. The application offers different pricing tiers based on the number of followers an influencer has, with features like professional voice cloning, audio messaging, and analytics. Twinning provides a unique way for influencers to engage with their audience and potentially monetize their AI twin's interactions.

Netwrck
Netwrck is an AI tool that offers AI Chat, AI Characters, and an AI Art Generator. Users can create unique AI characters, engage in voice chats, and generate art using AI technology. The platform provides a wide range of AI storytellers, scholars, artists, poets, diplomats, and more, allowing users to interact with diverse virtual personalities. Netwrck aims to provide an immersive and creative experience through AI-generated content.

PreCallAI
PreCallAI is a revolutionary Generative AI-powered voice bot designed to proactively engage and empathetically interact with clients. It empowers businesses by providing seamless revenue generation on autopilot. The application addresses issues such as timely support for potential customers, providing pertinent details to leads, sustaining continuous interaction, and plugging leaks in low-converting sales pipelines. PreCallAI offers features like elevating sales game, product education & discovery, lead qualification, lead nurturing, appointment scheduling/meetings, and demand generation.

Uncensored AI
Uncensored AI is a cutting-edge AI platform that prides itself on being 100% uncensored and unfiltered. It offers users a unique experience with no restrictions, filters, or guardrails. With a user base of over 25,000 worldwide, Uncensored AI provides a range of features and model capabilities that cater to various needs. Users can interact with the AI through chat, image processing, and more, making it a versatile tool for a wide range of tasks.

Generrate
Generrate is an AI-powered content creation tool that empowers users to generate high-quality content in their brand voice. It offers a wide range of features including AI Writer, AI Article Wizard, AI Chat, PDF Chat, AI Speech To Text, and AI Voiceover. With Generrate, users can automate their content creation process, customize their content to suit their branding, and interact with AI-powered chatbots for real-time data collection. The tool supports 40 languages, provides unlimited result proposals, and offers 4 levels of creativity. Generrate is a transformative tool for businesses and individuals looking to enhance productivity and efficiency through AI features.

GizAI
GizAI is an AI application that offers a unified platform for AI generators, drive, and notes. Users can generate, enjoy, and share various content types such as stories, images, videos, audios, and games using AI technology. The platform also includes features like AI chat, AI story generator, AI image generator, AI audio generator, and AI video generator. GizAI aims to provide a seamless experience for users to create and interact with AI-generated content.

Personal Voice and Vision Assistant
This AI-powered voice and vision assistant offers a range of features to enhance communication, productivity, and learning. Engage in natural voice conversations, get assistance with daily tasks, manage your schedule, and interact with visuals seamlessly. The assistant adapts to your needs, providing personalized support and advice. With its intuitive interface and affordable pricing, it's an ideal companion for individuals of all ages and interests.

Robot Writers AI
Robot Writers AI is an artificial intelligence tool that automates writing tasks. It offers advanced AI engines like ChatGPT-4o, enabling users to interact with AI personalities, generate content, interpret voice, video, and text in real-time, and more. The tool aims to enhance the writing process by providing faster response times, increased reasoning capabilities, and improved user experience. With features like video interaction, voice-to-voice communication, and a desktop app, Robot Writers AI is revolutionizing the writing industry by leveraging cutting-edge AI technology.
20 - Open Source AI Tools

Open-LLM-VTuber
Open-LLM-VTuber is a project in early stages of development that allows users to interact with Large Language Models (LLM) using voice commands and receive responses through a Live2D talking face. The project aims to provide a minimum viable prototype for offline use on macOS, Linux, and Windows, with features like long-term memory using MemGPT, customizable LLM backends, speech recognition, and text-to-speech providers. Users can configure the project to chat with LLMs, choose different backend services, and utilize Live2D models for visual representation. The project supports perpetual chat, offline operation, and GPU acceleration on macOS, addressing limitations of existing solutions on macOS.

assistant
The WhatsApp AI Assistant repository offers a chatbot named Sydney that serves as an AI-powered personal assistant. It utilizes Language Model (LLM) technology to provide various features such as Google/Bing searching, Google Calendar integration, communication capabilities, group chat compatibility, voice message support, basic text reminders, image recognition, and more. Users can interact with Sydney through natural language queries and voice messages. The chatbot can transcribe voice messages using either the Whisper API or a local method. Additionally, Sydney can be used in group chats by mentioning her username or replying to her last message. The repository welcomes contributions in the form of issue reports, pull requests, and requests for new tools. The creators of the project, Veigamann and Luisotee, are open to job opportunities and can be contacted through their GitHub profiles.

whisper_dictation
Whisper Dictation is a fast, offline, privacy-focused tool for voice typing, AI voice chat, voice control, and translation. It allows hands-free operation, launching and controlling apps, and communicating with OpenAI ChatGPT or a local chat server. The tool also offers the option to speak answers out loud and draw pictures. It includes client and server versions, inspired by the Star Trek series, and is designed to keep data off the internet and confidential. The project is optimized for dictation and translation tasks, with voice control capabilities and AI image generation using stable-diffusion API.

Simulator-Controller
Simulator Controller is a modular administration and controller application for Sim Racing, featuring a comprehensive plugin automation framework for external controller hardware. It includes voice chat capable Assistants like Virtual Race Engineer, Race Strategist, Race Spotter, and Driving Coach. The tool offers features for setup, strategy development, monitoring races, and more. Developed in AutoHotkey, it supports various simulation games and integrates with third-party applications for enhanced functionality.

AiTreasureBox
AiTreasureBox is a versatile AI tool that provides a collection of pre-trained models and algorithms for various machine learning tasks. It simplifies the process of implementing AI solutions by offering ready-to-use components that can be easily integrated into projects. With AiTreasureBox, users can quickly prototype and deploy AI applications without the need for extensive knowledge in machine learning or deep learning. The tool covers a wide range of tasks such as image classification, text generation, sentiment analysis, object detection, and more. It is designed to be user-friendly and accessible to both beginners and experienced developers, making AI development more efficient and accessible to a wider audience.

AITreasureBox
AITreasureBox is a comprehensive collection of AI tools and resources designed to simplify and accelerate the development of AI projects. It provides a wide range of pre-trained models, datasets, and utilities that can be easily integrated into various AI applications. With AITreasureBox, developers can quickly prototype, test, and deploy AI solutions without having to build everything from scratch. Whether you are working on computer vision, natural language processing, or reinforcement learning projects, AITreasureBox has something to offer for everyone. The repository is regularly updated with new tools and resources to keep up with the latest advancements in the field of artificial intelligence.

june
june-va is a local voice chatbot that combines Ollama for language model capabilities, Hugging Face Transformers for speech recognition, and the Coqui TTS Toolkit for text-to-speech synthesis. It provides a flexible, privacy-focused solution for voice-assisted interactions on your local machine, ensuring that no data is sent to external servers. The tool supports various interaction modes including text input/output, voice input/text output, text input/audio output, and voice input/audio output. Users can customize the tool's behavior with a JSON configuration file and utilize voice conversion features for voice cloning. The application can be further customized using a configuration file with attributes for language model, speech-to-text model, and text-to-speech model configurations.

rai
RAI is a framework designed to bring general multi-agent system capabilities to robots, enhancing human interactivity, flexibility in problem-solving, and out-of-the-box AI features. It supports multi-modalities, incorporates an advanced database for agent memory, provides ROS 2-oriented tooling, and offers a comprehensive task/mission orchestrator. The framework includes features such as voice interaction, customizable robot identity, camera sensor access, reasoning through ROS logs, and integration with LangChain for AI tools. RAI aims to support various AI vendors, improve human-robot interaction, provide an SDK for developers, and offer a user interface for configuration.

Ai-Hoshino
Ai Hoshino - MD is a WhatsApp bot tool with features like voice and text interaction, group configuration, anti-delete, anti-link, personalized welcome messages, chatbot functionality, sticker creation, sub-bot integration, RPG game, YouTube music and video downloads, and more. The tool is actively maintained by Starlights Team and offers a range of functionalities for WhatsApp users.

awesome-local-llms
The 'awesome-local-llms' repository is a curated list of open-source tools for local Large Language Model (LLM) inference, covering both proprietary and open weights LLMs. The repository categorizes these tools into LLM inference backend engines, LLM front end UIs, and all-in-one desktop applications. It collects GitHub repository metrics as proxies for popularity and active maintenance. Contributions are encouraged, and users can suggest additional open-source repositories through the Issues section or by running a provided script to update the README and make a pull request. The repository aims to provide a comprehensive resource for exploring and utilizing local LLM tools.

Director
Director is a framework to build video agents that can reason through complex video tasks like search, editing, compilation, generation, etc. It enables users to summarize videos, search for specific moments, create clips instantly, integrate GenAI projects and APIs, add overlays, generate thumbnails, and more. Built on VideoDB's 'video-as-data' infrastructure, Director is perfect for developers, creators, and teams looking to simplify media workflows and unlock new possibilities.

py-xiaozhi
py-xiaozhi is a Python-based XiaoZhi voice client designed for learning code and experiencing AI XiaoZhi's voice functions without hardware conditions. It features voice interaction, graphical interface, volume control, session management, encrypted audio transmission, CLI mode, and automatic copying of verification codes and opening browsers for first-time users. The project aims to optimize and add new features to zhh827's py-xiaozhi based on the original hardware project xiaozhi-esp32 and the Python implementation py-xiaozhi.

voice-chat-ai
Voice Chat AI is a project that allows users to interact with different AI characters using speech. Users can choose from various characters with unique personalities and voices, and have conversations or role play with them. The project supports OpenAI, xAI, or Ollama language models for chat, and provides text-to-speech synthesis using XTTS, OpenAI TTS, or ElevenLabs. Users can seamlessly integrate visual context into conversations by having the AI analyze their screen. The project offers easy configuration through environment variables and can be run via WebUI or Terminal. It also includes a huge selection of built-in characters for engaging conversations.

talk-to-chatgpt
Talk-To-ChatGPT is a Google Chrome and Microsoft Edge extension that enables users to interact with the ChatGPT AI using voice commands for speech recognition and text-to-speech responses. The tool enhances the conversational experience by allowing users to speak to the AI and receive spoken responses, making interactions more natural and engaging. It also supports ElevenLabs API integration for creating custom voices for text-to-speech. The extension provides settings for voice, language, and more, and can be installed from the Chrome and Edge web stores or manually. While the project has been discontinued due to upcoming desktop apps from OpenAI, it has been used to assist individuals with disabilities and the elderly in interacting with ChatGPT.

MonikA.I
MonikA.I. submod is a project that enhances Monika After Story mod with various AI features. It utilizes multiple AI models for text generation, text-to-speech, speech-to-text, emotion detection, and NLI classification. Users can interact with Monika through chatbots, voice commands, and game actions. The project is compatible with MAS v0.12.15 and supports Windows, Linux, and MacOS. It offers a user-friendly installation process and detailed usage instructions for different AI functionalities.

mahilo
Mahilo is a flexible framework for creating multi-agent systems that can interact with humans while sharing context internally. It allows developers to set up complex agent networks for various applications, from customer service to emergency response simulations. Agents can communicate with each other and with humans, making the system efficient by handling context from multiple agents and helping humans stay focused on specific problems. The system supports Realtime API for voice interactions, WebSocket-based communication, flexible communication patterns, session management, and easy agent definition.

ichigo
Ichigo is a local real-time voice AI tool that uses an early fusion technique to extend a text-based LLM to have native 'listening' ability. It is an open research experiment with improved multiturn capabilities and the ability to refuse processing inaudible queries. The tool is designed for open data, open weight, on-device Siri-like functionality, inspired by Meta's Chameleon paper. Ichigo offers a web UI demo and Gradio web UI for users to interact with the tool. It has achieved enhanced MMLU scores, stronger context handling, advanced noise management, and improved multi-turn capabilities for a robust user experience.

Linly-Talker
Linly-Talker is an innovative digital human conversation system that integrates the latest artificial intelligence technologies, including Large Language Models (LLM) 🤖, Automatic Speech Recognition (ASR) 🎙️, Text-to-Speech (TTS) 🗣️, and voice cloning technology 🎤. This system offers an interactive web interface through the Gradio platform 🌐, allowing users to upload images 📷 and engage in personalized dialogues with AI 💬.
20 - OpenAI Gpts

MagicUnprotect
This GPT allows to interact with the Unprotect DB to retrieve knowledge about malware evasion techniques

AI Executive Order Explorer
Interact with President Biden's Executive Order on Artificial Intelligence.

midpage caselaw
Interact with US legal cases and statutes: Searches, summarizes, answers, and checks legal statements.

Genki Assistant Alice
Interact with Alice, your embodied, personality-rich, restless assistant! Uses the story (roleplay) format for the most personalized experience.

MyGoogle
Connect and interact with your Google accounts. Organize, retrieve, and manipulate data with A.I

AstrologyGPT
Dive into the significance of your Sun, Moon, and Rising signs, along with the positions of planets and how they interact with each other. Discover the cosmic blueprint that makes you uniquely you, and embark on a journey of self-awareness and growth

Revelations: Detectives, a text adventure game
Justice hangs in the balance between good and evil. Let me entertain you with this interactive true crime mystery game, lovingly illustrated in the style of the angelic and demonic hosts of Renaissance paintings.

Subcreation
An RPG adventure. Unexplored worlds await your character—are you ready to enter?

Your AI Doctor
This prompt is presented as a virtual health assistant that interacts empathically and efficiently with the user, assuming the role of a doctor.