Best AI tools for< Upload Models >
20 - AI tool Sites
ThinkDiffusion
ThinkDiffusion is a cloud-based AI application that offers a stable diffusion workspace for creating AI art. It provides fully-managed open-source cloud workspace with dedicated machine instances loaded with cutting-edge apps. Users can freely add extensions, upload models, and run multiple machines simultaneously. The platform offers different UI options catering to various user needs, from beginners to advanced model creators. ThinkDiffusion empowers users to create photorealistic art, animations, and videos with unmatched control and flexibility. It also hosts a learning center, community forums, and tutorials to support users in exploring the full potential of AI art creation.
Meshy
Meshy is a free 3D AI model generator that empowers artists, game developers, and creators to bring their visions to life with a toolkit for creating 3D models in minutes. It offers powerful AI generation tools, lightning speed modeling, PBR maps, versatile art styles, and user-friendly interface. Meshy allows users to convert text to 3D, images to 3D models, and upload existing 3D models to transform words into textures. With multilingual support, API integration, and various export options, Meshy provides a seamless 3D workflow for users to unleash their creativity like never before.
Teletyped
Teletyped is an AI tool designed to enhance the user experience of ChatGPT and other chat applications. It offers a better user interface, full-text search functionality across all chats, the ability to save chats, and automatic deletion of temporary chats. Users can customize the color themes, switch between different models mid-chat, and edit model responses. Teletyped also provides features like model regeneration, editing mode for models, and subscription-based model credits.
Image to Posts
Image to Posts is the world's first AI social media assistant accessible via WhatsApp. It offers users the ability to generate social media posts, create images, and brainstorm ideas using advanced AI models. The application is designed to save users time and enhance their online visibility by providing tailored AI assistance for social media management.
3D AI Studio
3D AI Studio is an AI-powered platform that allows users to create custom 3D models, animations, and textures in seconds. It is designed to be user-friendly and intuitive, with no requirement to master modeling or prompt crafting. Users can simply input a text prompt or upload an image for reference, and the platform will generate a high-quality 3D model in seconds. 3D AI Studio offers a range of features, including the ability to export models in a variety of formats, generate customized and realistic textures, and access a library of pre-made models. It is a valuable tool for a variety of professionals, including game developers, 3D artists, and designers.
aoGen
aoGen is an AI tool that focuses on generating AI fashion models and high-quality images at fractional costs. It offers an all-in-one ecommerce creative solution for showcasing clothing with a variety of models that align with brand aesthetics. Users can easily create AI fashion models in bulk using features like AI Model Upscale, Hands Repair, Repaint, and Eraser Pen. The platform also provides outstanding examples and resources through its blog and help center. Join aoGen's Discord community and visit their YouTube channel to exchange user experiences and unlock your imagination.
Genmo
Genmo is a free AI-powered tool that allows users to create videos and images from text or images. It is a user-friendly tool that can be used by anyone, regardless of their technical expertise. Genmo offers a variety of features, including the ability to add camera motion effects, upload images, and use AI-generated text to create videos.
Beebzi.AI
Beebzi.AI is an all-in-one AI content creation platform that offers a wide array of tools for generating various types of content such as articles, blogs, emails, images, voiceovers, and more. The platform utilizes advanced AI technology and behavioral science to empower businesses and individuals in their marketing and sales endeavors. With features like AI Article Wizard, AI Room Designer, AI Landing Page Generator, and AI Code Generation, Beebzi.AI revolutionizes content creation by providing customizable templates, multiple language support, and real-time data insights. The platform also offers various subscription plans tailored for individual entrepreneurs, teams, and businesses, with flexible pricing models based on word count allocations. Beebzi.AI aims to streamline content creation processes, enhance productivity, and drive organic traffic through SEO-optimized content.
jpgHD
jpgHD is an online tool that uses AI to enhance and restore old photos. It can repair scratches, colorize black and white photos, and even make photos move like in Harry Potter. jpgHD is easy to use, simply upload your photo and select the desired enhancements. The tool is free to use for basic enhancements, but premium features such as high-resolution output and old photo restoration require a paid subscription.
AI Image Detector
AI Image Detector is an advanced tool that allows users to upload images to determine if they were generated by artificial intelligence or humans. The tool provides a detailed percentage breakdown, showing the likelihood of AI and human creation. It offers a user-friendly interface, quick detection, and image authenticity detection using advanced AI models. Users can verify the origins of their images effortlessly without requiring technical skills.
White Background Online
White Background Online is an advanced background removal tool that utilizes AI models for precise image extraction. It supports various image formats and is free to use. Developed by a team of experienced programmers, it offers fast and efficient background whitening with high precision. The tool is user-friendly, secure, and does not require any installation. It is suitable for individuals and businesses looking to enhance their image processing efficiency.
Unclothy
Unclothy is an AI-powered tool that allows users to remove clothing from images. It is designed to be easy to use, cost-effective, and respectful of privacy. Unclothy uses AI models that have been trained on hundreds of thousands of photos to detect and remove clothing from images. The result is as close to reality as possible. Unclothy is free to use and uses the AES-256 encryption technology to ensure the anonymity and security of your data.
StyleMyRide.ai
StyleMyRide.ai is an AI-powered platform that allows users to revamp their car's style effortlessly. By uploading a picture of their ride, users can access a wide range of tuning styles to give their vehicle a personalized and unique look. The platform supports various car makes and models, offering different customization options for users to choose from. With features like fine-tuning designs, choosing favorite styles, and accessing more tuning options, StyleMyRide.ai aims to make car customization fun and accessible to all car enthusiasts.
Is This Image NSFW?
This website provides a tool that allows users to check if an image is safe for work (SFW) or not. The tool uses Stable Diffusion's safety checker, which can be used with arbitrary images, not just AI-generated ones. Users can upload an image or drag and drop it onto the website to check if it is SFW.
PicturePerfectAI
PicturePerfectAI is an AI-powered avatar maker that allows users to create customized, life-like avatars for various purposes. With a user-friendly interface and over 100 styles to choose from, users can generate unique avatars that represent their personality or brand. PicturePerfectAI prioritizes quality results by training its own models and running its own GPU servers, offering high-quality avatars at an affordable price. The platform ensures complete data privacy by encrypting user data and deleting uploaded photos and AI models within 24 hours.
Youbooks
Youbooks is an AI-powered writing assistant that helps you generate high-quality non-fiction books with just a few clicks. With Youbooks, you can describe your book's subject, upload your source materials, and define your desired writing style. The AI will then generate a complete book manuscript that is well-structured, well-researched, and free of plagiarism. Youbooks is the perfect tool for authors, researchers, and content creators who want to save time and effort while creating high-quality content.
FaceToAll
FaceToAll is a free AI tool that specializes in transforming face photos into art. The application uses advanced artificial intelligence algorithms to analyze facial features and apply artistic styles to create unique and creative portraits. Users can easily upload their photos and experiment with various artistic filters to generate stunning artworks. FaceToAll provides a user-friendly interface and intuitive controls, making it accessible to both beginners and experienced artists. With its innovative technology, the tool offers a fun and engaging way to enhance and personalize face photos with artistic flair.
ai4spaces
ai4spaces is an AI-powered platform that revolutionizes interior design experiences by helping users create their dream homes effortlessly. Users can generate houses, rooms, or renovations from image uploads, customize layouts, furniture, and decor, and explore a wide selection of design styles. The platform leverages advanced AI models like GPT and Stable Diffusion to provide architecture and interior design solutions for various spaces, from homes to public buildings.
OneAudio
OneAudio is an AI-powered tool that allows users to summarize, transcribe, and convert audio files into notes effortlessly. With the ability to recognize words accurately and efficiently, OneAudio helps users organize their ideas in one place. The tool leverages the OpenAI GPT-4 and GPT-4o models to provide users with features like recording audio, saving notes, rewriting summaries using AI, and more. Users can trust the community's positive feedback and enjoy a seamless experience with OneAudio.
OpalAi
OpalAi is a revolutionary floor plan creator app that empowers users to create detailed floor plans and BIM models using only their iPhone or iPad. With its cutting-edge AI technology, OpalAi automates the entire process, eliminating the need for manual measurements, note-taking, and furniture removal. Simply scan your space, texture it within the app, and upload the project to receive a complete floor plan in just 10 minutes. OpalAi supports various output formats, including 3D CAD & BIM models, Revit, AutoCAD, Sketchup, Rhino, PDF, and 2020 Design models, with options for textured and colored models. The app's advanced features and capabilities make it an ideal tool for architects, contractors, real estate agents, interior designers, and homeowners alike.
20 - Open Source AI Tools
AICoverGen
AICoverGen is an autonomous pipeline designed to create covers using any RVC v2 trained AI voice from YouTube videos or local audio files. It caters to developers looking to incorporate singing functionality into AI assistants/chatbots/vtubers, as well as individuals interested in hearing their favorite characters sing. The tool offers a WebUI for easy conversions, cover generation from local audio files, volume control for vocals and instrumentals, pitch detection method control, pitch change for vocals and instrumentals, and audio output format options. Users can also download and upload RVC models via the WebUI, run the pipeline using CLI, and access various advanced options for voice conversion and audio mixing.
Ollama-Colab-Integration
Ollama Colab Integration V4 is a tool designed to enhance the interaction and management of large language models. It allows users to quantize models within their notebook environment, access a variety of models through a user-friendly interface, and manage public endpoints efficiently. The tool also provides features like LiteLLM proxy control, model insights, and customizable model file templating. Users can troubleshoot model loading issues, CPU fallback strategies, and manage VRAM and RAM effectively. Additionally, the tool offers functionalities for downloading model files from Hugging Face, model conversion with high precision, model quantization using Q and Kquants, and securely uploading converted models to Hugging Face.
Aimmy
Aimmy is a universal AI-Based Aim Alignment Mechanism developed by BabyHamsta, MarsQQ & Taylor to make gaming more accessible for users who have difficulty aiming. It utilizes DirectML, ONNX, and YOLOV8 for player detection, offering high accuracy and fast performance. Aimmy features an easy-to-use UI, extensive customizability, and is free of ads and paywalls. It is designed for gamers facing challenges like physical or mental disabilities, poor hand-eye coordination, or aiming difficulties due to environmental factors. Aimmy provides various features like AI detection, customizability, anti-recoil system, mouse movement methods, hotswappability, and a model/configuration store with repository support.
llocal
LLocal is an Electron application focused on providing a seamless and privacy-driven chatting experience using open-sourced technologies, particularly open-sourced LLM's. It allows users to store chats locally, switch between models, pull new models, upload images, perform web searches, and render responses as markdown. The tool also offers multiple themes, seamless integration with Ollama, and upcoming features like chat with images, web search improvements, retrieval augmented generation, multiple PDF chat, text to speech models, community wallpapers, lofi music, speech to text, and more. LLocal's builds are currently unsigned, requiring manual builds or using the universal build for stability.
civitai
Civitai is a platform where people can share their stable diffusion models (textual inversions, hypernetworks, aesthetic gradients, VAEs, and any other crazy stuff people do to customize their AI generations), collaborate with others to improve them, and learn from each other's work. The platform allows users to create an account, upload their models, and browse models that have been shared by others. Users can also leave comments and feedback on each other's models to facilitate collaboration and knowledge sharing.
h2ogpt
h2oGPT is an Apache V2 open-source project that allows users to query and summarize documents or chat with local private GPT LLMs. It features a private offline database of any documents (PDFs, Excel, Word, Images, Video Frames, Youtube, Audio, Code, Text, MarkDown, etc.), a persistent database (Chroma, Weaviate, or in-memory FAISS) using accurate embeddings (instructor-large, all-MiniLM-L6-v2, etc.), and efficient use of context using instruct-tuned LLMs (no need for LangChain's few-shot approach). h2oGPT also offers parallel summarization and extraction, reaching an output of 80 tokens per second with the 13B LLaMa2 model, HYDE (Hypothetical Document Embeddings) for enhanced retrieval based upon LLM responses, a variety of models supported (LLaMa2, Mistral, Falcon, Vicuna, WizardLM. With AutoGPTQ, 4-bit/8-bit, LORA, etc.), GPU support from HF and LLaMa.cpp GGML models, and CPU support using HF, LLaMa.cpp, and GPT4ALL models. Additionally, h2oGPT provides Attention Sinks for arbitrarily long generation (LLaMa-2, Mistral, MPT, Pythia, Falcon, etc.), a UI or CLI with streaming of all models, the ability to upload and view documents through the UI (control multiple collaborative or personal collections), Vision Models LLaVa, Claude-3, Gemini-Pro-Vision, GPT-4-Vision, Image Generation Stable Diffusion (sdxl-turbo, sdxl) and PlaygroundAI (playv2), Voice STT using Whisper with streaming audio conversion, Voice TTS using MIT-Licensed Microsoft Speech T5 with multiple voices and Streaming audio conversion, Voice TTS using MPL2-Licensed TTS including Voice Cloning and Streaming audio conversion, AI Assistant Voice Control Mode for hands-free control of h2oGPT chat, Bake-off UI mode against many models at the same time, Easy Download of model artifacts and control over models like LLaMa.cpp through the UI, Authentication in the UI by user/password via Native or Google OAuth, State Preservation in the UI by user/password, Linux, Docker, macOS, and Windows support, Easy Windows Installer for Windows 10 64-bit (CPU/CUDA), Easy macOS Installer for macOS (CPU/M1/M2), Inference Servers support (oLLaMa, HF TGI server, vLLM, Gradio, ExLLaMa, Replicate, OpenAI, Azure OpenAI, Anthropic), OpenAI-compliant, Server Proxy API (h2oGPT acts as drop-in-replacement to OpenAI server), Python client API (to talk to Gradio server), JSON Mode with any model via code block extraction. Also supports MistralAI JSON mode, Claude-3 via function calling with strict Schema, OpenAI via JSON mode, and vLLM via guided_json with strict Schema, Web-Search integration with Chat and Document Q/A, Agents for Search, Document Q/A, Python Code, CSV frames (Experimental, best with OpenAI currently), Evaluate performance using reward models, and Quality maintained with over 1000 unit and integration tests taking over 4 GPU-hours.
LLMGA
LLMGA (Multimodal Large Language Model-based Generation Assistant) is a tool that leverages Large Language Models (LLMs) to assist users in image generation and editing. It provides detailed language generation prompts for precise control over Stable Diffusion (SD), resulting in more intricate and precise content in generated images. The tool curates a dataset for prompt refinement, similar image generation, inpainting & outpainting, and visual question answering. It offers a two-stage training scheme to optimize SD alignment and a reference-based restoration network to alleviate texture, brightness, and contrast disparities in image editing. LLMGA shows promising generative capabilities and enables wider applications in an interactive manner.
TrustLLM
TrustLLM is a comprehensive study of trustworthiness in LLMs, including principles for different dimensions of trustworthiness, established benchmark, evaluation, and analysis of trustworthiness for mainstream LLMs, and discussion of open challenges and future directions. Specifically, we first propose a set of principles for trustworthy LLMs that span eight different dimensions. Based on these principles, we further establish a benchmark across six dimensions including truthfulness, safety, fairness, robustness, privacy, and machine ethics. We then present a study evaluating 16 mainstream LLMs in TrustLLM, consisting of over 30 datasets. The document explains how to use the trustllm python package to help you assess the performance of your LLM in trustworthiness more quickly. For more details about TrustLLM, please refer to project website.
refact
This repository contains Refact WebUI for fine-tuning and self-hosting of code models, which can be used inside Refact plugins for code completion and chat. Users can fine-tune open-source code models, self-host them, download and upload Lloras, use models for code completion and chat inside Refact plugins, shard models, host multiple small models on one GPU, and connect GPT-models for chat using OpenAI and Anthropic keys. The repository provides a Docker container for running the self-hosted server and supports various models for completion, chat, and fine-tuning. Refact is free for individuals and small teams under the BSD-3-Clause license, with custom installation options available for GPU support. The community and support include contributing guidelines, GitHub issues for bugs, a community forum, Discord for chatting, and Twitter for product news and updates.
MiKaPo
MiKaPo is a web-based tool that allows users to pose MMD models in real-time using video input. It utilizes technologies such as Mediapipe for 3D key points detection, Babylon.js for 3D scene rendering, babylon-mmd for MMD model viewing, and Vite+React for the web framework. Users can upload videos and images, select different environments, and choose models for posing. MiKaPo also supports camera input and Ollama (electron version). The tool is open to feature requests and pull requests, with ongoing development to add VMD export functionality.
geti-sdk
The Intel® Geti™ SDK is a python package that enables teams to rapidly develop AI models by easing the complexities of model development and enhancing collaboration between teams. It provides tools to interact with an Intel® Geti™ server via the REST API, allowing for project creation, downloading, uploading, deploying for local inference with OpenVINO, setting project and model configuration, launching and monitoring training jobs, and media upload and prediction. The SDK also includes tutorial-style Jupyter notebooks demonstrating its usage.
LARS
LARS is an application that enables users to run Large Language Models (LLMs) locally on their devices, upload their own documents, and engage in conversations where the LLM grounds its responses with the uploaded content. The application focuses on Retrieval Augmented Generation (RAG) to increase accuracy and reduce AI-generated inaccuracies. LARS provides advanced citations, supports various file formats, allows follow-up questions, provides full chat history, and offers customization options for LLM settings. Users can force enable or disable RAG, change system prompts, and tweak advanced LLM settings. The application also supports GPU-accelerated inferencing, multiple embedding models, and text extraction methods. LARS is open-source and aims to be the ultimate RAG-centric LLM application.
modelscope-agent
ModelScope-Agent is a customizable and scalable Agent framework. A single agent has abilities such as role-playing, LLM calling, tool usage, planning, and memory. It mainly has the following characteristics: - **Simple Agent Implementation Process**: Simply specify the role instruction, LLM name, and tool name list to implement an Agent application. The framework automatically arranges workflows for tool usage, planning, and memory. - **Rich models and tools**: The framework is equipped with rich LLM interfaces, such as Dashscope and Modelscope model interfaces, OpenAI model interfaces, etc. Built in rich tools, such as **code interpreter**, **weather query**, **text to image**, **web browsing**, etc., make it easy to customize exclusive agents. - **Unified interface and high scalability**: The framework has clear tools and LLM registration mechanism, making it convenient for users to expand more diverse Agent applications. - **Low coupling**: Developers can easily use built-in tools, LLM, memory, and other components without the need to bind higher-level agents.
LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.
allchat
ALLCHAT is a Node.js backend and React MUI frontend for an application that interacts with the Gemini Pro 1.5 (and others), with history, image generating/recognition, PDF/Word/Excel upload, code run, model function calls and markdown support. It is a comprehensive tool that allows users to connect models to the world with Web Tools, run locally, deploy using Docker, configure Nginx, and monitor the application using a dockerized monitoring solution (Loki+Grafana).
Awesome-LLM-Large-Language-Models-Notes
Awesome-LLM-Large-Language-Models-Notes is a repository that provides a comprehensive collection of information on various Large Language Models (LLMs) classified by year, size, and name. It includes details on known LLM models, their papers, implementations, and specific characteristics. The repository also covers LLM models classified by architecture, must-read papers, blog articles, tutorials, and implementations from scratch. It serves as a valuable resource for individuals interested in understanding and working with LLMs in the field of Natural Language Processing (NLP).
raycast-g4f
Raycast-G4F is a free extension that allows users to leverage powerful AI models such as GPT-4 and Llama-3 within the Raycast app without the need for an API key. The extension offers features like streaming support, diverse commands, chat interaction with AI, web search capabilities, file upload functionality, image generation, and custom AI commands. Users can easily install the extension from the source code and benefit from frequent updates and a user-friendly interface. Raycast-G4F supports various providers and models, each with different capabilities and performance ratings, ensuring a versatile AI experience for users.
runpod-worker-comfy
runpod-worker-comfy is a serverless API tool that allows users to run any ComfyUI workflow to generate an image. Users can provide input images as base64-encoded strings, and the generated image can be returned as a base64-encoded string or uploaded to AWS S3. The tool is built on Ubuntu + NVIDIA CUDA and provides features like built-in checkpoints and VAE models. Users can configure environment variables to upload images to AWS S3 and interact with the RunPod API to generate images. The tool also supports local testing and deployment to Docker hub using Github Actions.
persian-license-plate-recognition
The Persian License Plate Recognition (PLPR) system is a state-of-the-art solution designed for detecting and recognizing Persian license plates in images and video streams. Leveraging advanced deep learning models and a user-friendly interface, it ensures reliable performance across different scenarios. The system offers advanced detection using YOLOv5 models, precise recognition of Persian characters, real-time processing capabilities, and a user-friendly GUI. It is well-suited for applications in traffic monitoring, automated vehicle identification, and similar fields. The system's architecture includes modules for resident management, entrance management, and a detailed flowchart explaining the process from system initialization to displaying results in the GUI. Hardware requirements include an Intel Core i5 processor, 8 GB RAM, a dedicated GPU with at least 4 GB VRAM, and an SSD with 20 GB of free space. The system can be installed by cloning the repository and installing required Python packages. Users can customize the video source for processing and run the application to upload and process images or video streams. The system's GUI allows for parameter adjustments to optimize performance, and the Wiki provides in-depth information on the system's architecture and model training.
chat-xiuliu
Chat-xiuliu is a bidirectional voice assistant powered by ChatGPT, capable of accessing the internet, executing code, reading/writing files, and supporting GPT-4V's image recognition feature. It can also call DALL·E 3 to generate images. The project is a fork from a background of a virtual cat girl named Xiuliu, with removed live chat interaction and added voice input. It can receive questions from microphone or interface, answer them vocally, upload images and PDFs, process tasks through function calls, remember conversation content, search the web, generate images using DALL·E 3, read/write local files, execute JavaScript code in a sandbox, open local files or web pages, customize the cat girl's speaking style, save conversation screenshots, and support Azure OpenAI and other API endpoints in openai format. It also supports setting proxies and various AI models like GPT-4, GPT-3.5, and DALL·E 3.
20 - OpenAI Gpts
Dr. Classify
Just upload a numerical dataset for classification task, will apply data analysis and machine learning steps to make a best model possible.
Merch on Demand Upload Assistant
Structures Amazon Merch on Demand listings with SEO-optimized, focusing on design appeal and marketability. Upload design to begin.
Academic Hook Test
Upload your manuscript introduction. Get 'Reviewer 2' grade feedback in return.😎
11:11 Eternal Wisdom Portal 11:11
Upload a picture of your hand, your aura, or your handwriting. I'll draw the tarot cards (you can upload a photo as well) and read your destiny through Tarot, Palmistry, Runes, Numerology, Graphology, Aura Reading, and more.
Birth Chart Analysis & Astrologist
Upload your birth chart and get a personalized astrology. Discover your life path, numerology, and more.
RedlineGPT
Upload a jpg/png (<5MB, <2000px) for architectural drawing feedback. Note: This tool is not adept at calculations, counting, and can't guarantee code compliance. Consider IP issues before uploading.
Home Inspector
Upload a picture of your home wall, floor, window, driveway, roof, HVAC, and get an instant opinion.
Art Style Explorer 🖌️
Upload or paste an image to gain insights and generate new images inspired by its style
Process Map Optimizer
Upload your process map and I will analyse and suggest improvements
WALL COLOR GPT
Upload a room image, get a custom wall color palette and visual representation.