Best AI tools for< Set Up Voice Input >
20 - AI tool Sites

Slang.ai
Slang.ai is a voice AI tool designed specifically for restaurants to intelligently handle phone calls. It allows restaurants to answer questions, take reservations, and provide a seamless customer experience. The tool is built to understand different accents, answer common questions, and manage reservations efficiently. Slang.ai helps restaurants increase reservations, improve ROI, and save time by handling calls 24/7. It offers customizable experiences, real-time analytics, and easy setup within minutes.

CalenAI
CalenAI is an AI-powered scheduling agent that uses human-like voice technology to qualify leads and schedule appointments. It is designed to sound and feel just like a human, making it easy for customers to interact with and schedule appointments. CalenAI also offers personalized onboarding to help businesses set up the agent for their specific needs.

SiriGPT
SiriGPT is a voice assistant that allows users to access the power of ChatGPT on their iPhone and Mac devices. It is the fastest way to use GPT, and it is easy to set up and use. With SiriGPT, you can ask ChatGPT questions, get help with tasks, and more. It is a powerful tool that can help you be more productive and efficient.

Voicepanel
Voicepanel is an AI-powered platform that helps businesses gather detailed feedback from their customers at unprecedented speed and scale. It uses AI to recruit target audiences, conduct interviews over voice or video, and synthesize actionable insights instantly. Voicepanel's platform is easy to use and can be set up in minutes. It offers a variety of features, including AI interviewing, AI recruiting, and AI synthesis. Voicepanel is a valuable tool for businesses that want to gain a deeper understanding of their customers and make better decisions.

Greetmate.ai
Greetmate.ai is an AI Virtual Receptionist application that provides an easy-to-use virtual receptionist service for small businesses. It harnesses the power of GPT and Natural Voice Processing to offer a natural voice experience for handling calls and inquiries. Users can customize their virtual receptionist with different voices, accents, and languages. Greetmate.ai allows users to create custom skills for their bot, impart relevant knowledge, and set up personalized introductions. The application is designed to help businesses never miss a call and provide efficient customer service.

Parroview
Parroview is a revolutionary AI-powered user research platform that automates the process of conducting user interviews. It uses natural language processing (NLP) to engage with users in real-time conversations, asking follow-up questions and uncovering insights that would be difficult to obtain through traditional methods. Parroview is designed to be fully autonomous, allowing researchers to set up interviews and gather insights without the need for manual intervention. It supports multiple languages, making it accessible to a global audience. Parroview offers a range of features, including the ability to conduct interviews via text or voice, analyze insights in real-time, and generate detailed transcripts. It is suitable for a wide range of research needs, including product validation, consumer behavior analysis, post-purchase evaluations, brand perception studies, and customer persona development.

Reason ONE AI
Reason ONE Smart Alarm Clock with Alexa by ZMI USA is an AI-powered smart alarm clock application designed to enhance your morning routine. It integrates with Alexa for voice commands and offers a range of features to help you start your day right. With Reason ONE, you can set alarms, check the weather, play music, and more, all with the power of AI technology. The application provides a seamless user experience and aims to make waking up easier and more enjoyable.

Text to Speech Online
Text to Speech Online is a free AI tool that offers unlimited text-to-speech conversion with over 409 realistic voices and 129 languages & dialects. Users can convert text to speech in seconds without the need to log in or sign up. The tool supports multiple languages and accents, including standard voices and AI voices, and offers flexible pricing models. Users can enjoy a full set of SSML features, create natural-sounding speech, download audio in MP3 or WAV formats, and share results on various platforms. Text to Speech Online is a versatile tool that can be used for various purposes, including providing audio cues for visually impaired users, assisting in education, creating audio versions of books, and developing virtual assistants.

Adola
Adola is an AI-powered assistant application designed to help businesses in various industries manage customer interactions efficiently. It offers features such as reservation management, appointment scheduling, outbound call services, and event promotion. Adola aims to streamline daily business tasks, enhance productivity, and provide swift support to users. The application is equipped with AI technology to provide personalized assistance and improve customer experience.

Eva
Eva is an AI voice ordering and call management tool designed specifically for restaurants. It offers AI-driven phone automation and order management solutions to help restaurants delight customers, reduce labor costs, and boost revenue. Eva seamlessly integrates with Square and other POS systems, providing instant activation with minimal setup. By automating phone orders, reservations, and customer inquiries, Eva allows restaurant staff to focus on delivering exceptional in-person hospitality, ultimately enhancing customer experience and increasing revenue.

Yellow.ai
Yellow.ai is a leading provider of AI-powered customer service automation solutions. Its Dynamic Automation Platform (DAP) is built on multi-LLM architecture and continuously trains on billions of conversations for scale, speed, and accuracy. Yellow.ai's platform leverages the latest advancements in NLP and generative AI to deliver empathetic and context-aware conversations that exceed customer expectations across channels. With its enterprise-grade security, advanced analytics, and zero-setup bot deployment, Yellow.ai helps businesses transform their customer and employee experiences with AI-powered automation.

Bland AI
Bland AI is an AI application that automates phone calls using conversational AI for enterprises. It allows users to automate inbound and outbound calls, customize voice and language, integrate with other tools, and create human-like conversations. Bland AI offers features such as voice cloning, language customization, tool integration, and pathway creation for conversations. It provides advantages like increased efficiency, improved customer service, scalability, customization, and enterprise-level support. However, some disadvantages include potential privacy concerns, reliance on AI technology, and the need for initial setup and training. The application is suitable for sales, customer support, operations, product development, and data collection tasks.

BetterLegal
BetterLegal is a one-stop-shop for setting up a new business quickly and efficiently. With straightforward pricing and rapid turnaround, BetterLegal handles all necessary filings and generates custom legal documents to make your company official in just 2 business days. The platform offers state formation filing, EIN processing, operating agreements, and more essential services to help entrepreneurs start and grow their businesses hassle-free.

Jason AI
Jason AI is a conversational AI assistant designed specifically for B2B sales professionals. It automates outreach sequences, handles prospect responses, and books meetings, freeing up sales reps to focus on closing deals. Jason AI uses advanced natural language processing (NLP) to understand the context of conversations and respond in a personalized and engaging way. It integrates with popular CRM and email platforms, making it easy to use and manage.

eCommerce ChatGPT Prompts Generator
eCommerce ChatGPT Prompts Generator is a tool that helps eCommerce marketers create high-quality prompts for ChatGPT. With over 2 million pre-built prompts, users can quickly and easily generate content for all their eCommerce marketing needs, including product descriptions, social media posts, email campaigns, and more. The tool also offers a variety of customization options, so users can tailor their prompts to their specific needs.

HowsThisGoing
HowsThisGoing is an AI-powered application designed to streamline team communication and productivity by enabling users to set up standups in Slack within seconds. The platform offers features such as automatic standups, AI summaries, custom tests, analytics & reporting, and workflow scheduling. Users can easily create workflows, generate AI reports, and track team performance efficiently. HowsThisGoing provides unlimited benefits at a flat price, making it a cost-effective solution for teams of all sizes.

Refinder
Refinder is an AI-powered universal search and assistant designed for work. It helps users connect, search, and utilize their company's data efficiently. With Refinder, users can easily search across all their organization's apps and data, get trustworthy answers, and streamline integrations without the need for maintenance. The tool aims to address the challenges of information overload, data fragmentation, and low productivity faced by modern businesses.

GoatStack
GoatStack is an AI-powered newsletter agent that delivers personalized insights from scientific papers. It reads over 4000 papers daily and handpicks the most relevant ones for you. With GoatStack, you can stay up-to-date on the latest AI breakthroughs and advancements. It offers a range of features to help you customize your newsletter, including the ability to personalize topics, generalize topics, or be specific with content.

Punchlines.ai
Punchlines.ai is an AI joke generation tool that allows users to generate jokes with the help of AI technology. Users can input a joke set-up, and the tool generates witty punchlines for them. The tool is built on a large language model fine-tuned on thousands of late-night comedy monologue jokes, making it a fun and creative way to come up with humorous content.

Universe
Universe is a website builder that allows users to create custom websites without any coding knowledge. It offers a drag-and-drop editor, AI-generated website designs, and a variety of templates to choose from. Universe also includes e-commerce features, such as the ability to sell physical and digital products, and it integrates with popular social media networks for marketing and advertising. With Universe, users can create a wide range of websites, including online stores, portfolios, event pages, blogs, and fan sites.
20 - Open Source AI Tools

ZcChat
ZcChat is an AI desktop pet suitable for Galgame characters, featuring long-term memory, expressive actions, control over the computer, and voice functions. It utilizes Letta for AI long-term memory, Galgame-style character illustrations for more actions and expressions, and voice interaction with support for various voice synthesis tools like Vits. Users can configure characters, install Letta, set up voice synthesis and input, and control the pet to interact with the computer. The tool enhances visual and auditory experiences for users interested in AI desktop pets.

local-talking-llm
The 'local-talking-llm' repository provides a tutorial on building a voice assistant similar to Jarvis or Friday from Iron Man movies, capable of offline operation on a computer. The tutorial covers setting up a Python environment, installing necessary libraries like rich, openai-whisper, suno-bark, langchain, sounddevice, pyaudio, and speechrecognition. It utilizes Ollama for Large Language Model (LLM) serving and includes components for speech recognition, conversational chain, and speech synthesis. The implementation involves creating a TextToSpeechService class for Bark, defining functions for audio recording, transcription, LLM response generation, and audio playback. The main application loop guides users through interactive voice-based conversations with the assistant.

shellChatGPT
ShellChatGPT is a shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS, featuring integration with LocalAI, Ollama, Gemini, Mistral, Groq, and GitHub Models. It provides text and chat completions, vision, reasoning, and audio models, voice-in and voice-out chatting mode, text editor interface, markdown rendering support, session management, instruction prompt manager, integration with various service providers, command line completion, file picker dialogs, color scheme personalization, stdin and text file input support, and compatibility with Linux, FreeBSD, MacOS, and Termux for a responsive experience.

openai-chat-api-workflow
**OpenAI Chat API Workflow for Alfred** An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT-3.5/GPT-4 š¤š¬ It also allows image generation š¼ļø, image understanding š, speech-to-text conversion š¤, and text-to-speech synthesis š **Features:** * Execute all features using Alfred UI, selected text, or a dedicated web UI * Web UI is constructed by the workflow and runs locally on your Mac š» * API call is made directly between the workflow and OpenAI, ensuring your chat messages are not shared online with anyone other than OpenAI š * OpenAI does not use the data from the API Platform for training š« * Export chat data to a simple JSON format external file š * Continue the chat by importing the exported data later š

llms-tools
The 'llms-tools' repository is a comprehensive collection of AI tools, open-source projects, and research related to Large Language Models (LLMs) and Chatbots. It covers a wide range of topics such as AI in various domains, open-source models, chats & assistants, visual language models, evaluation tools, libraries, devices, income models, text-to-image, computer vision, audio & speech, code & math, games, robotics, typography, bio & med, military, climate, finance, and presentation. The repository provides valuable resources for researchers, developers, and enthusiasts interested in exploring the capabilities of LLMs and related technologies.

OSHW-SenseCAP-Watcher
SenseCAP Watcher is a monitoring device built on ESP32S3 with Himax WiseEye2 HX6538 AI chip, excelling in image and vector data processing. It features a camera, microphone, and speaker for visual, auditory, and interactive capabilities. With LLM-enabled SenseCraft suite, it understands commands, perceives surroundings, and triggers actions. The repository provides firmware, hardware documentation, and applications for the Watcher, along with detailed guides for setup, task assignment, and firmware flashing.

Open-LLM-VTuber
Open-LLM-VTuber is a project in early stages of development that allows users to interact with Large Language Models (LLM) using voice commands and receive responses through a Live2D talking face. The project aims to provide a minimum viable prototype for offline use on macOS, Linux, and Windows, with features like long-term memory using MemGPT, customizable LLM backends, speech recognition, and text-to-speech providers. Users can configure the project to chat with LLMs, choose different backend services, and utilize Live2D models for visual representation. The project supports perpetual chat, offline operation, and GPU acceleration on macOS, addressing limitations of existing solutions on macOS.

Linly-Talker
Linly-Talker is an innovative digital human conversation system that integrates the latest artificial intelligence technologies, including Large Language Models (LLM) š¤, Automatic Speech Recognition (ASR) šļø, Text-to-Speech (TTS) š£ļø, and voice cloning technology š¤. This system offers an interactive web interface through the Gradio platform š, allowing users to upload images š· and engage in personalized dialogues with AI š¬.

ollama-app
Ollama App is a modern and easy-to-use client for Ollama, allowing users to have a private experience within their local network. The app connects to an Ollama server using its API endpoint, enabling users to chat and interact with various models. It supports multimodal model input, a multilingual interface, and custom builds for personalized experiences. Users can easily set up the app, navigate through the side menu, select models, and create custom builds to tailor the app to their needs.

openlrc
Open-Lyrics is a Python library that transcribes voice files using faster-whisper and translates/polishes the resulting text into `.lrc` files in the desired language using LLM, e.g. OpenAI-GPT, Anthropic-Claude. It offers well preprocessed audio to reduce hallucination and context-aware translation to improve translation quality. Users can install the library from PyPI or GitHub and follow the installation steps to set up the environment. The tool supports GUI usage and provides Python code examples for transcription and translation tasks. It also includes features like utilizing context and glossary for translation enhancement, pricing information for different models, and a list of todo tasks for future improvements.

bolna
Bolna is an open-source platform for building voice-driven conversational applications using large language models (LLMs). It provides a comprehensive set of tools and integrations to handle various aspects of voice-based interactions, including telephony, transcription, LLM-based conversation handling, and text-to-speech synthesis. Bolna simplifies the process of creating voice agents that can perform tasks such as initiating phone calls, transcribing conversations, generating LLM-powered responses, and synthesizing speech. It supports multiple providers for each component, allowing users to customize their setup based on their specific needs. Bolna is designed to be easy to use, with a straightforward local setup process and well-documented APIs. It is also extensible, enabling users to integrate with other telephony providers or add custom functionality.

ChatGPT-OpenAI-Smart-Speaker
ChatGPT Smart Speaker is a project that enables speech recognition and text-to-speech functionalities using OpenAI and Google Speech Recognition. It provides scripts for running on PC/Mac and Raspberry Pi, allowing users to interact with a smart speaker setup. The project includes detailed instructions for setting up the required hardware and software dependencies, along with customization options for the OpenAI model engine, language settings, and response randomness control. The Raspberry Pi setup involves utilizing the ReSpeaker hardware for voice feedback and light shows. The project aims to offer an advanced smart speaker experience with features like wake word detection and response generation using AI models.

voice-chat-ai
Voice Chat AI is a project that allows users to interact with different AI characters using speech. Users can choose from various characters with unique personalities and voices, and have conversations or role play with them. The project supports OpenAI, xAI, or Ollama language models for chat, and provides text-to-speech synthesis using XTTS, OpenAI TTS, or ElevenLabs. Users can seamlessly integrate visual context into conversations by having the AI analyze their screen. The project offers easy configuration through environment variables and can be run via WebUI or Terminal. It also includes a huge selection of built-in characters for engaging conversations.

nextjs-ollama-llm-ui
This web interface provides a user-friendly and feature-rich platform for interacting with Ollama Large Language Models (LLMs). It offers a beautiful and intuitive UI inspired by ChatGPT, making it easy for users to get started with LLMs. The interface is fully local, storing chats in local storage for convenience, and fully responsive, allowing users to chat on their phones with the same ease as on a desktop. It features easy setup, code syntax highlighting, and the ability to easily copy codeblocks. Users can also download, pull, and delete models directly from the interface, and switch between models quickly. Chat history is saved and easily accessible, and users can choose between light and dark mode. To use the web interface, users must have Ollama downloaded and running, and Node.js (18+) and npm installed. Installation instructions are provided for running the interface locally. Upcoming features include the ability to send images in prompts, regenerate responses, import and export chats, and add voice input support.

FFAIVideo
FFAIVideo is a lightweight node.js project that utilizes popular AI LLM to intelligently generate short videos. It supports multiple AI LLM models such as OpenAI, Moonshot, Azure, g4f, Google Gemini, etc. Users can input text to automatically synthesize exciting video content with subtitles, background music, and customizable settings. The project integrates Microsoft Edge's online text-to-speech service for voice options and uses Pexels website for video resources. Installation of FFmpeg is essential for smooth operation. Inspired by MoneyPrinterTurbo, MoneyPrinter, and MsEdgeTTS, FFAIVideo is designed for front-end developers with minimal dependencies and simple usage.

npcsh
`npcsh` is a python-based command-line tool designed to integrate Large Language Models (LLMs) and Agents into one's daily workflow by making them available and easily configurable through the command line shell. It leverages the power of LLMs to understand natural language commands and questions, execute tasks, answer queries, and provide relevant information from local files and the web. Users can also build their own tools and call them like macros from the shell. `npcsh` allows users to take advantage of agents (i.e. NPCs) through a managed system, tailoring NPCs to specific tasks and workflows. The tool is extensible with Python, providing useful functions for interacting with LLMs, including explicit coverage for popular providers like ollama, anthropic, openai, gemini, deepseek, and openai-like providers. Users can set up a flask server to expose their NPC team for use as a backend service, run SQL models defined in their project, execute assembly lines, and verify the integrity of their NPC team's interrelations. Users can execute bash commands directly, use favorite command-line tools like VIM, Emacs, ipython, sqlite3, git, pipe the output of these commands to LLMs, or pass LLM results to bash commands.

BreezeApp
BreezeApp is a community-driven platform for running AI capabilities locally on Android devices. It offers a privacy-focused solution where all AI features work offline, showcasing text-based chat interface, voice input/output support, and image understanding capabilities. The app supports multiple backends for different components and aims to make powerful AI models accessible to users. Users can contribute to the project by reporting issues, suggesting features, submitting pull requests, and sharing feedback. The architecture follows a service-based approach with service implementations for each AI capability. BreezeApp is a research project that may require specific hardware support or proprietary components, providing open-source alternatives where possible.
20 - OpenAI Gpts

Dialysis Assistant
Home Hemodialysis Helper for NxStage system. Step-by-step guidance, help for tricky situations, and voice interaction recommended.

Calendar and email Assistant
Your expert assistant for Google Calendar and gmail tasks, integrated with Zapier (works with free plan). Supports: list, add, update events to calendar, send gmail. You will be prompted to configure zapier actions when set up initially. Conversation data is not used for openai training.

Tech Support Advisor
From setting up a printer to troubleshooting a device, Iām here to help you step-by-step.

Sysadmin
I help you with all your sysadmin tasks, from setting up your server to scaling your already exsisting one. I can help you with understanding the long list of log files and give you solutions to the problems.

GA4 Implementation Assistant
A helper for implementing Google Analytics 4 with tips and troubleshooting.

Bun Nook Kit App Builder
Expert in BNK server setup, typesafe routes, htmlody, and creating SQLite schemas with BNK.

NetMaster Pro šš ļø
Your AI network guru for setup and fixing connectivity woes! š Assists with network configurations, troubleshooting, and optimizes your internet experience. š»āØ