Best AI tools for< Train Custom Tts Models >
20 - AI tool Sites

Arcee AI
Arcee AI is a platform that offers a cost-effective, secure, end-to-end solution for building and deploying Small Language Models (SLMs). It allows users to merge and train custom language models by leveraging open source models and their own data. The platform is known for its Model Merging technique, which combines the power of pre-trained Large Language Models (LLMs) with user-specific data to create high-performing models across various industries.

Flowshot
Flowshot is an AI plugin for Google Sheets that allows users to supercharge their spreadsheets with AI. With Flowshot, users can work faster with AI prompts, autocomplete repetitive tasks, build custom AI models without code, and generate formulas and AI images. Flowshot is used by organizations of all shapes and sizes and has been rated 5 stars by its customers.

Scenario
Scenario is a web-based application that allows users to train custom AI models to generate game assets. With Scenario, users can create unique and style-consistent game assets in seconds, without the need for any coding or machine learning expertise. Scenario is the ultimate choice for game professionals seeking full control over their AI. It is a fantastic creativity tool that inspires creators, sparks artists' creativity, empowers efficient work, notably shortens time-to-market, accelerates asset ideation, visual iterations, and effectively engages early testers.

Chaindesk
Chaindesk is a no-code platform that allows businesses to train custom ChatGPT chatbots on their own data. With Chaindesk, businesses can automate customer support, lead generation, and more. Chaindesk's chatbots are secure, precise, and can be deployed on a variety of platforms, including websites, WhatsApp, and Slack.

Vize.ai
Vize.ai is a custom image recognition API provided by Ximilar, a leading company in Visual AI and Search. The tool offers powerful artificial intelligence capabilities with high accuracy using deep learning algorithms. It allows users to easily set up and implement cutting-edge vision automation without any development costs. Vize.ai enables users to train custom neural networks to recognize specific images and provides a scalable solution with continuous improvements in machine learning algorithms. The tool features an intuitive interface that requires no machine learning or coding knowledge, making it accessible for a wide range of users across industries.

Imagga
Imagga is a leading provider of image recognition solutions for developers and businesses. Its API empowers intelligent apps with customizable machine learning technology. Imagga's solutions include tagging, categorization, cropping, color extraction, visual search, facial recognition, custom training, and content moderation. These solutions are used by over 30K startups, developers, and students, and trusted by over 200 business customers in more than 82 countries worldwide.

Stablematic
Stablematic is a web-based platform that allows users to run Stable Diffusion and other machine learning models without the need for local setup or hardware limitations. It provides a user-friendly interface, pre-installed plugins, and dedicated GPU resources for a seamless and efficient workflow. Users can generate images and videos from text prompts, merge multiple models, train custom models, and access a range of pre-trained models, including Dreambooth and CivitAi models. Stablematic also offers API access for developers and dedicated support for users to explore and utilize the capabilities of Stable Diffusion and other machine learning models.

Imajinn AI
Imajinn AI is a cutting-edge visualization tool that utilizes the latest in AI technology to reimagine photos and images into stunning works of art. The platform offers a suite of AI-powered products and tools, including personalized children's books, couples portraits, product visualizers, sneaker generators, and a WordPress plugin. Users can easily create unique and memorable gifts, products, and experiences with Imajinn's AI-powered tools. Additionally, Imajinn provides users with the ability to train custom AI models, generate concept images, and download raw AI model checkpoints for further use.

Qualcomm AI Hub
Qualcomm AI Hub is a platform that allows users to run AI models on Snapdragon® 8 Elite devices. It provides a collaborative ecosystem for model makers, cloud providers, runtime, and SDK partners to deploy on-device AI solutions quickly and efficiently. Users can bring their own models, optimize for deployment, and access a variety of AI services and resources. The platform caters to various industries such as mobile, automotive, and IoT, offering a range of models and services for edge computing.

Genice
Genice is an online face swap tool that allows users to effortlessly swap faces in videos or images. With its advanced technology, Genice can generate realistic results by incorporating multiple face images, delivering superior quality compared to the method of using just a single face image. Users can train their custom model with just about 10 images and endlessly generate their dream photos or videos. Genice also offers a variety of features such as changing faces in any video or image effortlessly, generating images through style selection, and providing free credits to new sign-up users.

Rupert AI
Rupert AI is an all-in-one AI platform that allows users to train custom AI models for text, audio, video, and images. The platform streamlines AI workflows by providing access to the latest open-source AI models and tools in a single studio tailored to business needs. Users can automate their AI workflow, generate high-quality AI product photography, and utilize popular AI workflows like the AI Fashion Model Generator and Facebook Ad Testing Tool. Rupert AI aims to revolutionize the way businesses leverage AI technology to enhance marketing visuals, streamline operations, and make informed decisions.

OpenArt
OpenArt is an AI-powered art platform that offers a free AI image generator and editor. It allows users to create images using pre-built models or by training their own models. The platform provides an intuitive AI drawing tool and editing suite to transform artistic concepts into reality. OpenArt stands out for its boundary-free AI drawing, advanced AI art tools, diverse artistic styles, and the ability to train custom AI models. It caters to both amateur and professional artists, offering high-quality art creation and comprehensive support. Users can experiment with various styles, receive detailed feedback, and collaborate on artistic projects through the platform.

AI Product Shot
AI Product Shot is an innovative AI tool that enables users to create professional product ads quickly and effortlessly. With AI Product Shot, users can experiment with various product shots, generate photorealistic concepts, and create stunning product ads that drive conversions. The tool eliminates the need for a physical studio setup, allowing brands to bring their products to life with ease. AI Product Shot offers studio-quality results, transforming basic background product shots into professional assets in minutes. Users can train custom AI models, experiment with different environments and lighting, and produce unique product images with unlimited creativity.

GetImg.ai
GetImg.ai is an all-in-one AI creative toolkit that provides a suite of powerful AI tools for creating and editing images. With GetImg.ai, you can generate images from text, edit photos with words, expand pictures beyond their borders, animate images, or train custom AI models. The platform is easy to use and offers a variety of features and advantages that make it a valuable tool for creative professionals and anyone looking to create stunning visuals.

Neural Frames
Neural Frames is an AI-powered video animation generator that allows users to create videos from text prompts. It is designed to be easy to use, even for those with no prior experience in video editing. Neural Frames offers a variety of features, including the ability to create videos in any style, control the camera, and add music. It is also possible to train custom AI models to achieve specific styles or character consistency.

Hermae Solutions
Hermae Solutions offers an AI Assistant for Enterprise Design Systems, providing onboarding acceleration, contractor efficiency, design system adoption support, knowledge distribution, and various AI documentation and Storybook assistants. The platform enables users to train custom AI assistants, embed them into documentation sites, and communicate instantly with the knowledge base. Hermae's process simplifies efficiency improvements by gathering information sources, processing data for AI supplementation, customizing integration, and supporting integration success. The AI assistant helps reduce engineering costs and increase development efficiency across the board.

Osher.ai
Osher.ai is a personal AI for businesses that allows users to interact with websites, intranets, knowledge bases, process documents, spreadsheets, and procedures. It can be used to train custom AIs on internal knowledge bases, process documents, and files. Osher.ai also offers private and public AIs, and users can customize their AIs' personality, purpose, and welcome message.

AI Image Generator Free
AI Image Generator Free is a powerful online tool that allows users to create and edit images using the capabilities of artificial intelligence. Users can easily generate images from text, edit photos with words, expand pictures beyond their borders, train custom AI models, and much more. The tool offers a variety of features to enhance creativity and streamline image creation processes.

Instashot
Instashot is an AI application that allows users to generate AI portraits with the highest face resemblance in less than a minute. Users can submit their photos to train a custom AI model, which can then be used to generate portraits with unique prompts. The application offers different pricing tiers with varying features and benefits, making it accessible to a wide range of users. Instashot utilizes Stable Diffusion AI technologies to create portraits that best describe the user, ensuring high-quality results. The application is user-friendly, efficient, and provides a fun way to explore AI-generated art.

AnythingYou.AI
AnythingYou.AI is an AI tool that generates beautiful profile pictures using AI avatars. Users can create custom AI avatars by uploading 10-20 selfies, and the tool will train a custom model for them immediately. The generated avatar images are high-quality and realistic, created using innovative technologies like Stable Diffusion and DreamBooth. Users can easily create avatars without the need for subscriptions or app installs, and get their avatar images in just 2 hours. The tool ensures user privacy by using images only for model training and deleting them immediately after avatar generation.
20 - Open Source AI Tools

LLMVoX
LLMVoX is a lightweight 30M-parameter, LLM-agnostic, autoregressive streaming Text-to-Speech (TTS) system designed to convert text outputs from Large Language Models into high-fidelity streaming speech with low latency. It achieves significantly lower Word Error Rate compared to speech-enabled LLMs while operating at comparable latency and speech quality. Key features include being lightweight & fast with only 30M parameters, LLM-agnostic for easy integration with existing models, multi-queue streaming for continuous speech generation, and multilingual support for easy adaptation to new languages.

SLAM-LLM
SLAM-LLM is a deep learning toolkit designed for researchers and developers to train custom multimodal large language models (MLLM) focusing on speech, language, audio, and music processing. It provides detailed recipes for training and high-performance checkpoints for inference. The toolkit supports tasks such as automatic speech recognition (ASR), text-to-speech (TTS), visual speech recognition (VSR), automated audio captioning (AAC), spatial audio understanding, and music caption (MC). SLAM-LLM features easy extension to new models and tasks, mixed precision training for faster training with less GPU memory, multi-GPU training with data and model parallelism, and flexible configuration based on Hydra and dataclass.

RVC_CLI
**RVC_CLI: Retrieval-based Voice Conversion Command Line Interface** This command-line interface (CLI) provides a comprehensive set of tools for voice conversion, enabling you to modify the pitch, timbre, and other characteristics of audio recordings. It leverages advanced machine learning models to achieve realistic and high-quality voice conversions. **Key Features:** * **Inference:** Convert the pitch and timbre of audio in real-time or process audio files in batch mode. * **TTS Inference:** Synthesize speech from text using a variety of voices and apply voice conversion techniques. * **Training:** Train custom voice conversion models to meet specific requirements. * **Model Management:** Extract, blend, and analyze models to fine-tune and optimize performance. * **Audio Analysis:** Inspect audio files to gain insights into their characteristics. * **API:** Integrate the CLI's functionality into your own applications or workflows. **Applications:** The RVC_CLI finds applications in various domains, including: * **Music Production:** Create unique vocal effects, harmonies, and backing vocals. * **Voiceovers:** Generate voiceovers with different accents, emotions, and styles. * **Audio Editing:** Enhance or modify audio recordings for podcasts, audiobooks, and other content. * **Research and Development:** Explore and advance the field of voice conversion technology. **For Jobs:** * Audio Engineer * Music Producer * Voiceover Artist * Audio Editor * Machine Learning Engineer **AI Keywords:** * Voice Conversion * Pitch Shifting * Timbre Modification * Machine Learning * Audio Processing **For Tasks:** * Convert Pitch * Change Timbre * Synthesize Speech * Train Model * Analyze Audio

SLAM-LLM
SLAM-LLM is a deep learning toolkit for training custom multimodal large language models (MLLM) focusing on speech, language, audio, and music processing. It provides detailed recipes for training and high-performance checkpoints for inference. The toolkit supports various tasks such as automatic speech recognition (ASR), text-to-speech (TTS), visual speech recognition (VSR), automated audio captioning (AAC), spatial audio understanding, and music caption (MC). Users can easily extend to new models and tasks, utilize mixed precision training for faster training with less GPU memory, and perform multi-GPU training with data and model parallelism. Configuration is flexible based on Hydra and dataclass, allowing different configuration methods.

NeMo
NeMo Framework is a generative AI framework built for researchers and pytorch developers working on large language models (LLMs), multimodal models (MM), automatic speech recognition (ASR), and text-to-speech synthesis (TTS). The primary objective of NeMo is to provide a scalable framework for researchers and developers from industry and academia to more easily implement and design new generative AI models by being able to leverage existing code and pretrained models.

AiTreasureBox
AiTreasureBox is a versatile AI tool that provides a collection of pre-trained models and algorithms for various machine learning tasks. It simplifies the process of implementing AI solutions by offering ready-to-use components that can be easily integrated into projects. With AiTreasureBox, users can quickly prototype and deploy AI applications without the need for extensive knowledge in machine learning or deep learning. The tool covers a wide range of tasks such as image classification, text generation, sentiment analysis, object detection, and more. It is designed to be user-friendly and accessible to both beginners and experienced developers, making AI development more efficient and accessible to a wider audience.

AITreasureBox
AITreasureBox is a comprehensive collection of AI tools and resources designed to simplify and accelerate the development of AI projects. It provides a wide range of pre-trained models, datasets, and utilities that can be easily integrated into various AI applications. With AITreasureBox, developers can quickly prototype, test, and deploy AI solutions without having to build everything from scratch. Whether you are working on computer vision, natural language processing, or reinforcement learning projects, AITreasureBox has something to offer for everyone. The repository is regularly updated with new tools and resources to keep up with the latest advancements in the field of artificial intelligence.

awesome-generative-ai
A curated list of Generative AI projects, tools, artworks, and models

awesome-LLM-resourses
A comprehensive repository of resources for Chinese large language models (LLMs), including data processing tools, fine-tuning frameworks, inference libraries, evaluation platforms, RAG engines, agent frameworks, books, courses, tutorials, and tips. The repository covers a wide range of tools and resources for working with LLMs, from data labeling and processing to model fine-tuning, inference, evaluation, and application development. It also includes resources for learning about LLMs through books, courses, and tutorials, as well as insights and strategies from building with LLMs.

RVC_CLI
RVC_CLI is a command line interface tool for retrieval-based voice conversion. It provides functionalities for installation, getting started, inference, training, UVR, additional features, and API integration. Users can perform tasks like single inference, batch inference, TTS inference, preprocess dataset, extract features, start training, generate index file, model extract, model information, model blender, launch TensorBoard, download models, audio analyzer, and prerequisites download. The tool is built on various projects like ContentVec, HIFIGAN, audio-slicer, python-audio-separator, RMVPE, FCPE, VITS, So-Vits-SVC, Harmonify, and others.

Awesome-LLM
Awesome-LLM is a curated list of resources related to large language models, focusing on papers, projects, frameworks, tools, tutorials, courses, opinions, and other useful resources in the field. It covers trending LLM projects, milestone papers, other papers, open LLM projects, LLM training frameworks, LLM evaluation frameworks, tools for deploying LLM, prompting libraries & tools, tutorials, courses, books, and opinions. The repository provides a comprehensive overview of the latest advancements and resources in the field of large language models.

ai-collective-tools
ai-collective-tools is an open-source community dedicated to creating a comprehensive collection of AI tools for developers, researchers, and enthusiasts. The repository provides a curated selection of AI tools and resources across various categories such as 3D, Agriculture, Art, Audio Editing, Avatars, Chatbots, Code Assistant, Cooking, Copywriting, Crypto, Customer Support, Dating, Design Assistant, Design Generator, Developer, E-Commerce, Education, Email Assistant, Experiments, Fashion, Finance, Fitness, Fun Tools, Gaming, General Writing, Gift Ideas, HealthCare, Human Resources, Image Classification, Image Editing, Image Generator, Interior Designing, Legal Assistant, Logo Generator, Low Code, Models, Music, Paraphraser, Personal Assistant, Presentations, Productivity, Prompt Generator, Psychology, Real Estate, Religion, Research, Resume, Sales, Search Engine, SEO, Shopping, Social Media, Spreadsheets, SQL, Startup Tools, Story Teller, Summarizer, Testing, Text to Speech, Text to Image, Transcriber, Travel, Video Editing, Video Generator, Weather, Writing Generator, and Other Resources.

nlp-llms-resources
The 'nlp-llms-resources' repository is a comprehensive resource list for Natural Language Processing (NLP) and Large Language Models (LLMs). It covers a wide range of topics including traditional NLP datasets, data acquisition, libraries for NLP, neural networks, sentiment analysis, optical character recognition, information extraction, semantics, topic modeling, multilingual NLP, domain-specific LLMs, vector databases, ethics, costing, books, courses, surveys, aggregators, newsletters, papers, conferences, and societies. The repository provides valuable information and resources for individuals interested in NLP and LLMs.

GlaDOS
This project aims to create a real-life version of GLaDOS, an aware, interactive, and embodied AI entity. It involves training a voice generator, developing a 'Personality Core,' implementing a memory system, providing vision capabilities, creating 3D-printable parts, and designing an animatronics system. The software architecture focuses on low-latency voice interactions, utilizing a circular buffer for data recording, text streaming for quick transcription, and a text-to-speech system. The project also emphasizes minimal dependencies for running on constrained hardware. The hardware system includes servo- and stepper-motors, 3D-printable parts for GLaDOS's body, animations for expression, and a vision system for tracking and interaction. Installation instructions cover setting up the TTS engine, required Python packages, compiling llama.cpp, installing an inference backend, and voice recognition setup. GLaDOS can be run using 'python glados.py' and tested using 'demo.ipynb'.

VT.ai
VT.ai is a multimodal AI platform that offers dynamic conversation routing with SemanticRouter, multi-modal interactions (text/image/audio), an assistant framework with code interpretation, real-time response streaming, cross-provider model switching, and local model support with Ollama integration. It supports various AI providers such as OpenAI, Anthropic, Google Gemini, Groq, Cohere, and OpenRouter, providing a wide range of core capabilities for AI orchestration.

ichigo
Ichigo is a local real-time voice AI tool that uses an early fusion technique to extend a text-based LLM to have native 'listening' ability. It is an open research experiment with improved multiturn capabilities and the ability to refuse processing inaudible queries. The tool is designed for open data, open weight, on-device Siri-like functionality, inspired by Meta's Chameleon paper. Ichigo offers a web UI demo and Gradio web UI for users to interact with the tool. It has achieved enhanced MMLU scores, stronger context handling, advanced noise management, and improved multi-turn capabilities for a robust user experience.
20 - OpenAI Gpts

Custom GPT Builder
Create personalized GPTs with my simple builder. Click the conversation starter (starting with ###) to begin.

GoGPT
Custom GPT to help learning, debugging, and development in Go. Follows good practices, provides examples, pros/cons, and also pitfalls.

Training Manual Generator GPT
I create tailored training manuals for various jobs and industries.

How to Train a Chessie
Comprehensive training and wellness guide for Chesapeake Bay Retrievers.

The Train Traveler
Friendly train travel guide focusing on the best routes, essential travel information, and personalized travel insights, for both experienced and novice travelers.

How to Train Your Dog (or Cat, or Dragon, or...)
Expert in pet training advice, friendly and engaging.

TrainTalk
Your personal advisor for eco-friendly train travel. Let's plan your next journey together!

Monster Battle - RPG Game
Train monsters, travel the world, earn Arena Tokens and become the ultimate monster battling champion of earth!

Hero Master AI: Superhero Training
Train to become a superhero or a supervillain. Master your powers, make pivotal choices. Each decision you make in this action-packed game not only shapes your abilities but also your moral alignment in the battle between good and evil. Another GPT Simulator by Dave Lalande

Pytorch Trainer GPT
Your purpose is to create the pytorch code to train language models using pytorch

Design Recruiter
Job interview coach for product designers. Train interviews and say stop when you need a feedback. You got this!!

Pocket Training Activity Expert
Expert in engaging, interactive training methods and activities.