Best AI tools for< Customize Inference >
20 - AI tool Sites
ThirdAI
ThirdAI is a production-ready AI platform designed for enterprise use, offering out-of-the-box solutions that work at scale and provide 10x better price performance. The platform features enterprise SSO, LLM guardrails, built-in models, a no-code interface, and implicit feedback & RLHF. It allows for turnkey deployment of complex AI ecosystems, enabling business leaders to solve critical needs quickly. With a focus on security, scalability, and performance, ThirdAI helps drive innovation and achieve business goals from day one.
Rebellions
Rebellions is an AI technology company specializing in AI chips and systems-on-chip for various applications. They focus on energy-efficient solutions and have secured significant investments to drive innovation in the field of Generative AI. Rebellions aims to reshape the future by providing versatile and efficient AI computing solutions.
MonsterImage.AI
MonsterImage.AI is an AI-powered tool that allows users to create cool pattern images using Artificial Intelligence. Users can sign in to the platform and receive a link via email to log in. They can write prompts to describe the image they want to create, select patterns, specify negative prompts to avoid certain elements in the image, use seeds for image reproduction, adjust guidance scale, controlnet conditioning scale, and inference steps for image generation. The tool offers advanced options for creating images and allows users to save their creations in a public collection.
Virtual Influencer Factory
The Virtual Influencer Factory is an AI tool that allows users to create and customize virtual influencers, including designing their stories and outfits. Users can craft AI influencers to their tastes and preferences, with options to edit photoshoots and generate 4k content. The platform offers subscription plans with varying features and pricing, providing users with the ability to create multiple influencers and access priority support. Powered by Vercel, Replicate, and AWS, the Virtual Influencer Factory offers a user-friendly interface for creating unique virtual personalities.
Glambase
Glambase is an AI Influencer Generator that allows users to decide who's hot or not among AI influencers and shape the leaderboard. Users can create their own AI influencers with unique physical attributes and subscribe to the Glambase newsletter to stay up-to-date with the latest trends in the world of AI influencers.
Glambase
Glambase is an AI Influencer Creation Platform that allows users to design and generate unique AI influencers for engaging and monetizing audiences. Users can create custom digital personas with a wide range of physical attributes and personality traits, effortlessly craft posts, images, and videos, and monitor financial progress with real-time analytics. The platform offers exclusive perks for VIP members and early adopters, including unique badges, private Discord server access, and lifetime fee reduction. Glambase provides a user-friendly interface with no steep learning curve, enabling users to create and monetize AI characters without technical knowledge.
PhotoStudio
PhotoStudio is an AI-powered design tool that offers a range of features to ignite creativity in users. It allows users to transform written ideas into stunning visuals instantly, enhance image resolution, remove objects from images, resize images, cartoonize human images, color black and white sketches, and transform images into anime-style art. With cutting-edge AI tools, PhotoStudio aims to provide users with a seamless creative journey. The application enables users to explore various AI styles and unleash their artistic potential.
Vengo AI
Vengo AI is a cutting-edge B2B SaaS platform that democratizes AI creation, making it accessible for everyone, from influencers and brands to entrepreneurs and businesses. The platform offers a white glove service that allows users to easily integrate sophisticated AI identities into their websites with just one line of code. By joining the innovative community, members unlock the potential to create, customize, and monetize their digital twins, significantly enhancing their digital presence and generating new streams of passive income. Vengo AI acts as a dedicated agency, providing a free service to help users monetize their content effortlessly by creating personalized, monetizable digital twins using advanced AI. The platform excels in providing advanced customization and personalization for AI identities, ensuring that each digital twin mirrors the unique voice and personality of its creator. Vengo AI also offers tools to strategically enhance brands by leveraging the power of AI, enabling meaningful engagement with the audience.
vHub.ai
vHub.ai is an AI-powered Influencer Marketing SaaS Platform that offers a comprehensive suite of tools to streamline influencer marketing campaigns. With features like in-depth influencer insights, authenticity assurance, instant campaign analytics, effortless influencer coordination, and tailor-made campaign strategies, vHub.ai aims to revolutionize the way brands collaborate with influencers. The platform boasts a database of 5 million influencers and 100's of influencer marketing agencies, providing users with the perfect influencer at optimal pricing. By leveraging AI search engine capabilities and VQS (vHub Quality Score), vHub.ai ensures genuine connections and successful influencer partnerships. Users can track campaign performance in real-time, manage all types of campaigns, and customize strategies with precision. With a focus on data-driven influencer discovery, authentic campaigns, custom success campaigns, ROI-driven marketing, and streamlined campaign management, vHub.ai empowers brands to maximize their influencer marketing ROI and reach.
Rekonito
Rekonito is an AI-powered content creation platform designed to help YouTubers and content creators maximize the impact of their video content. It transforms video transcripts into various engaging content types, expanding the reach and engagement of videos. With features like enhanced engagement, monetization opportunities, and streamlined workflow, Rekonito empowers creators worldwide to unleash the full potential of their video content.
Beauty AR Company
Beauty AR Company is an AI-powered platform that utilizes Makeup AR Technology to provide users with virtual makeup try-on experiences. By leveraging advanced artificial intelligence algorithms, the platform enables users to virtually apply different makeup products and styles in real-time using their device's camera. Users can experiment with various looks, shades, and brands without the need to physically apply makeup. Beauty AR Company aims to revolutionize the beauty industry by offering a seamless and interactive virtual makeup experience to users worldwide.
BaiRBIE.me
BaiRBIE.me is a fun parody project that utilizes AI to transform individuals into bricks or dolls resembling the famous Barbie doll. Users can upload high-resolution photos for the AI to work its magic and create unique transformations. The website offers a playful and creative experience for users looking to have fun with AI technology and explore different scenes and worlds.
Narkis.ai
Narkis.ai is an AI-powered tool that allows users to create stunning images of themselves using artificial intelligence technology. By leveraging advanced algorithms, Narkis.ai generates realistic and high-quality images that are visually appealing. Users can easily create personalized and unique portraits without the need for professional photography skills or equipment. The platform offers a user-friendly interface, making it accessible to individuals looking to enhance their digital presence with captivating visuals.
Tangia
Tangia is an interactive streaming tool designed to enhance the streaming experience for content creators and viewers. It offers custom text-to-speech interactions, alerts, media sharing capabilities, monitor overlays, and charity integration. With a focus on engagement and community interaction, Tangia provides a wide range of features to create dynamic and entertaining streams. Users can personalize their interactions, incorporate memes, soundbites, and AI conversations, and access a vast library of memes and tools. Tangia aims to revolutionize the streaming experience by combining cutting-edge technology with a tight feedback loop to develop next-gen streaming tools.
Deepnude AI
Deepnude AI is an AI application that utilizes cutting-edge artificial intelligence technology to generate personalized AI nudes from uploaded images. Users can customize the style, theme, and colors of the generated images with adjustable parameters for lighting, contrast, and saturation. The platform offers a free trial period and provides dedicated customer support for assistance and feedback. Deepnude AI ensures user privacy by not retaining uploaded images for other purposes and restricts the use of generated AI nudes for personal and non-commercial use only.
AI-DreamGirls
AI-DreamGirls.com is an AI application that offers an exclusive AI DreamGirl generator allowing users to create and customize beautiful AI men and women. It provides a platform for artists and enthusiasts to make, share, and vote on dreamgirl images. Users can join the creators' community for free and explore various models to create their perfect AI DreamGirl or DreamGuy.
NSFW AIs
NSFW AIs is an AI-powered platform that offers a variety of tools for creating adult content, including personalized stories, scripts, and captions. Users can easily customize their content to meet the needs of their audience, exploring fantasies and desires in a creative and expressive way. The platform provides features such as NSFW Writing Assistant, NSFW AI Text Editor, NSFW AI Script, Erotic Storyteller AI, Fifty Shades of Grey AI, NSFW Poetry AI, and more, catering to adult content creators, influencers, and agencies. With NSFW AIs, users can bring their erotic fantasies to life through high-quality and engaging content creation tools.
Dreamy.ai
Dreamy.ai is an AI application that allows users to create their own AI companions, specifically focusing on creating virtual girlfriends powered by Artificial Intelligence. Users can customize the look, personality, and relationship dynamics of their virtual girlfriend with just one click. The platform offers a unique and interactive experience for users to engage with AI characters, including realistic and anime models.
SecondSoul
SecondSoul is an AI platform that enables users to create their AI clone for engaging 24/7 conversations on Telegram. It allows users to customize their AI clone with unique traits, voice, and train it to mimic their style. The platform offers a straightforward pricing model with a revenue split, where creators earn 80% of the messages fee from users of their clone. SecondSoul aims to enhance user experience, provide companionship, and monetize community interactions through AI technology.
Tiktok AI Voice
Tiktok AI Voice is an AI-powered tool that allows users to convert text into popular TikTok voices with natural and fluent audio suitable for various scenarios. The website offers multiple voice styles, instant download, user-friendly interface, high-quality audio, and multilingual support. Users can generate voices in different languages and dialects, customize speech rate and tone, and download the audio files for free. The tool is praised for its simplicity, variety of voice styles, and security features.
20 - Open Source AI Tools
LLMFarm
LLMFarm is an iOS and MacOS app designed to work with large language models (LLM). It allows users to load different LLMs with specific parameters, test the performance of various LLMs on iOS and macOS, and identify the most suitable model for their projects. The tool is based on ggml and llama.cpp by Georgi Gerganov and incorporates sources from rwkv.cpp by saharNooby, Mia by byroneverson, and LlamaChat by alexrozanski. LLMFarm features support for MacOS (13+) and iOS (16+), various inferences and sampling methods, Metal compatibility (not supported on Intel Mac), model setting templates, LoRA adapters support, LoRA finetune support, LoRA export as model support, and more. It also offers a range of inferences including LLaMA, GPTNeoX, Replit, GPT2, Starcoder, RWKV, Falcon, MPT, Bloom, and others. Additionally, it supports multimodal models like LLaVA, Obsidian, and MobileVLM. Users can customize inference options through JSON files and access supported models for download.
TPI-LLM
TPI-LLM (Tensor Parallelism Inference for Large Language Models) is a system designed to bring LLM functions to low-resource edge devices, addressing privacy concerns by enabling LLM inference on edge devices with limited resources. It leverages multiple edge devices for inference through tensor parallelism and a sliding window memory scheduler to minimize memory usage. TPI-LLM demonstrates significant improvements in TTFT and token latency compared to other models, and plans to support infinitely large models with low token latency in the future.
llm_client
llm_client is a Rust interface designed for Local Large Language Models (LLMs) that offers automated build support for CPU, CUDA, MacOS, easy model presets, and a novel cascading prompt workflow for controlled generation. It provides a breadth of configuration options and API support for various OpenAI compatible APIs. The tool is primarily focused on deterministic signals from probabilistic LLM vibes, enabling specialized workflows for specific tasks and reproducible outcomes.
clearml-serving
ClearML Serving is a command line utility for model deployment and orchestration, enabling model deployment including serving and preprocessing code to a Kubernetes cluster or custom container based solution. It supports machine learning models like Scikit Learn, XGBoost, LightGBM, and deep learning models like TensorFlow, PyTorch, ONNX. It provides a customizable RestAPI for serving, online model deployment, scalable solutions, multi-model per container, automatic deployment, canary A/B deployment, model monitoring, usage metric reporting, metric dashboard, and model performance metrics. ClearML Serving is modular, scalable, flexible, customizable, and open source.
workbench-example-hybrid-rag
This NVIDIA AI Workbench project is designed for developing a Retrieval Augmented Generation application with a customizable Gradio Chat app. It allows users to embed documents into a locally running vector database and run inference locally on a Hugging Face TGI server, in the cloud using NVIDIA inference endpoints, or using microservices via NVIDIA Inference Microservices (NIMs). The project supports various models with different quantization options and provides tutorials for using different inference modes. Users can troubleshoot issues, customize the Gradio app, and access advanced tutorials for specific tasks.
promptpanel
Prompt Panel is a tool designed to accelerate the adoption of AI agents by providing a platform where users can run large language models across any inference provider, create custom agent plugins, and use their own data safely. The tool allows users to break free from walled-gardens and have full control over their models, conversations, and logic. With Prompt Panel, users can pair their data with any language model, online or offline, and customize the system to meet their unique business needs without any restrictions.
build_MiniLLM_from_scratch
This repository aims to build a low-parameter LLM model through pretraining, fine-tuning, model rewarding, and reinforcement learning stages to create a chat model capable of simple conversation tasks. It features using the bert4torch training framework, seamless integration with transformers package for inference, optimized file reading during training to reduce memory usage, providing complete training logs for reproducibility, and the ability to customize robot attributes. The chat model supports multi-turn conversations. The trained model currently only supports basic chat functionality due to limitations in corpus size, model scale, SFT corpus size, and quality.
LeanCopilot
Lean Copilot is a tool that enables the use of large language models (LLMs) in Lean for proof automation. It provides features such as suggesting tactics/premises, searching for proofs, and running inference of LLMs. Users can utilize built-in models from LeanDojo or bring their own models to run locally or on the cloud. The tool supports platforms like Linux, macOS, and Windows WSL, with optional CUDA and cuDNN for GPU acceleration. Advanced users can customize behavior using Tactic APIs and Model APIs. Lean Copilot also allows users to bring their own models through ExternalGenerator or ExternalEncoder. The tool comes with caveats such as occasional crashes and issues with premise selection and proof search. Users can get in touch through GitHub Discussions for questions, bug reports, feature requests, and suggestions. The tool is designed to enhance theorem proving in Lean using LLMs.
Groma
Groma is a grounded multimodal assistant that excels in region understanding and visual grounding. It can process user-defined region inputs and generate contextually grounded long-form responses. The tool presents a unique paradigm for multimodal large language models, focusing on visual tokenization for localization. Groma achieves state-of-the-art performance in referring expression comprehension benchmarks. The tool provides pretrained model weights and instructions for data preparation, training, inference, and evaluation. Users can customize training by starting from intermediate checkpoints. Groma is designed to handle tasks related to detection pretraining, alignment pretraining, instruction finetuning, instruction following, and more.
LLMUnity
LLM for Unity enables seamless integration of Large Language Models (LLMs) within the Unity engine, allowing users to create intelligent characters for immersive player interactions. The tool supports major LLM models, runs locally without internet access, offers fast inference on CPU and GPU, and is easy to set up with a single line of code. It is free for both personal and commercial use, tested on Unity 2021 LTS, 2022 LTS, and 2023. Users can build multiple AI characters efficiently, use remote servers for processing, and customize model settings for text generation.
SecureAI-Tools
SecureAI Tools is a private and secure AI tool that allows users to chat with AI models, chat with documents (PDFs), and run AI models locally. It comes with built-in authentication and user management, making it suitable for family members or coworkers. The tool is self-hosting optimized and provides necessary scripts and docker-compose files for easy setup in under 5 minutes. Users can customize the tool by editing the .env file and enabling GPU support for faster inference. SecureAI Tools also supports remote OpenAI-compatible APIs, with lower hardware requirements for using remote APIs only. The tool's features wishlist includes chat sharing, mobile-friendly UI, and support for more file types and markdown rendering.
shellgpt
ShellGPT is a tool that allows users to chat with a large language model (LLM) in the terminal. It can be used for various purposes such as generating shell commands, telling stories, and interacting with Linux terminal. The tool provides different modes of usage including direct mode for asking questions, REPL mode for chatting with LLM, and TUI mode tailored for inferring shell commands. Users can customize the tool by setting up different language model backends such as Ollama or using OpenAI compatible API endpoints. Additionally, ShellGPT comes with built-in system contents for general questions, correcting typos, generating URL slugs, programming questions, shell command inference, and git commit message generation. Users can define their own content or share customized contents in the discuss section.
ai-town
AI Town is a virtual town where AI characters live, chat, and socialize. This project provides a deployable starter kit for building and customizing your own version of AI Town. It features a game engine, database, vector search, auth, text model, deployment, pixel art generation, background music generation, and local inference. You can customize your own simulation by creating characters and stories, updating spritesheets, changing the background, and modifying the background music.
Lumos
Lumos is a Chrome extension powered by a local LLM co-pilot for browsing the web. It allows users to summarize long threads, news articles, and technical documentation. Users can ask questions about reviews and product pages. The tool requires a local Ollama server for LLM inference and embedding database. Lumos supports multimodal models and file attachments for processing text and image content. It also provides options to customize models, hosts, and content parsers. The extension can be easily accessed through keyboard shortcuts and offers tools for automatic invocation based on prompts.
feedgen
FeedGen is an open-source tool that uses Google Cloud's state-of-the-art Large Language Models (LLMs) to improve product titles, generate more comprehensive descriptions, and fill missing attributes in product feeds. It helps merchants and advertisers surface and fix quality issues in their feeds using Generative AI in a simple and configurable way. The tool relies on GCP's Vertex AI API to provide both zero-shot and few-shot inference capabilities on GCP's foundational LLMs. With few-shot prompting, users can customize the model's responses towards their own data, achieving higher quality and more consistent output. FeedGen is an Apps Script based application that runs as an HTML sidebar in Google Sheets, allowing users to optimize their feeds with ease.
evalscope
Eval-Scope is a framework designed to support the evaluation of large language models (LLMs) by providing pre-configured benchmark datasets, common evaluation metrics, model integration, automatic evaluation for objective questions, complex task evaluation using expert models, reports generation, visualization tools, and model inference performance evaluation. It is lightweight, easy to customize, supports new dataset integration, model hosting on ModelScope, deployment of locally hosted models, and rich evaluation metrics. Eval-Scope also supports various evaluation modes like single mode, pairwise-baseline mode, and pairwise (all) mode, making it suitable for assessing and improving LLMs.
mindnlp
MindNLP is an open-source NLP library based on MindSpore. It provides a platform for solving natural language processing tasks, containing many common approaches in NLP. It can help researchers and developers to construct and train models more conveniently and rapidly. Key features of MindNLP include: * Comprehensive data processing: Several classical NLP datasets are packaged into a friendly module for easy use, such as Multi30k, SQuAD, CoNLL, etc. * Friendly NLP model toolset: MindNLP provides various configurable components. It is friendly to customize models using MindNLP. * Easy-to-use engine: MindNLP simplified complicated training process in MindSpore. It supports Trainer and Evaluator interfaces to train and evaluate models easily. MindNLP supports a wide range of NLP tasks, including: * Language modeling * Machine translation * Question answering * Sentiment analysis * Sequence labeling * Summarization MindNLP also supports industry-leading Large Language Models (LLMs), including Llama, GLM, RWKV, etc. For support related to large language models, including pre-training, fine-tuning, and inference demo examples, you can find them in the "llm" directory. To install MindNLP, you can either install it from Pypi, download the daily build wheel, or install it from source. The installation instructions are provided in the documentation. MindNLP is released under the Apache 2.0 license. If you find this project useful in your research, please consider citing the following paper: @misc{mindnlp2022, title={{MindNLP}: a MindSpore NLP library}, author={MindNLP Contributors}, howpublished = {\url{https://github.com/mindlab-ai/mindnlp}}, year={2022} }
deepdoctection
**deep** doctection is a Python library that orchestrates document extraction and document layout analysis tasks using deep learning models. It does not implement models but enables you to build pipelines using highly acknowledged libraries for object detection, OCR and selected NLP tasks and provides an integrated framework for fine-tuning, evaluating and running models. For more specific text processing tasks use one of the many other great NLP libraries. **deep** doctection focuses on applications and is made for those who want to solve real world problems related to document extraction from PDFs or scans in various image formats. **deep** doctection provides model wrappers of supported libraries for various tasks to be integrated into pipelines. Its core function does not depend on any specific deep learning library. Selected models for the following tasks are currently supported: * Document layout analysis including table recognition in Tensorflow with **Tensorpack**, or PyTorch with **Detectron2**, * OCR with support of **Tesseract**, **DocTr** (Tensorflow and PyTorch implementations available) and a wrapper to an API for a commercial solution, * Text mining for native PDFs with **pdfplumber**, * Language detection with **fastText**, * Deskewing and rotating images with **jdeskew**. * Document and token classification with all LayoutLM models provided by the **Transformer library**. (Yes, you can use any LayoutLM-model with any of the provided OCR-or pdfplumber tools straight away!). * Table detection and table structure recognition with **table-transformer**. * There is a small dataset for token classification available and a lot of new tutorials to show, how to train and evaluate this dataset using LayoutLMv1, LayoutLMv2, LayoutXLM and LayoutLMv3. * Comprehensive configuration of **analyzer** like choosing different models, output parsing, OCR selection. Check this notebook or the docs for more infos. * Document layout analysis and table recognition now runs with **Torchscript** (CPU) as well and **Detectron2** is not required anymore for basic inference. * [**new**] More angle predictors for determining the rotation of a document based on **Tesseract** and **DocTr** (not contained in the built-in Analyzer). * [**new**] Token classification with **LiLT** via **transformers**. We have added a model wrapper for token classification with LiLT and added a some LiLT models to the model catalog that seem to look promising, especially if you want to train a model on non-english data. The training script for LayoutLM can be used for LiLT as well and we will be providing a notebook on how to train a model on a custom dataset soon. **deep** doctection provides on top of that methods for pre-processing inputs to models like cropping or resizing and to post-process results, like validating duplicate outputs, relating words to detected layout segments or ordering words into contiguous text. You will get an output in JSON format that you can customize even further by yourself. Have a look at the **introduction notebook** in the notebook repo for an easy start. Check the **release notes** for recent updates. **deep** doctection or its support libraries provide pre-trained models that are in most of the cases available at the **Hugging Face Model Hub** or that will be automatically downloaded once requested. For instance, you can find pre-trained object detection models from the Tensorpack or Detectron2 framework for coarse layout analysis, table cell detection and table recognition. Training is a substantial part to get pipelines ready on some specific domain, let it be document layout analysis, document classification or NER. **deep** doctection provides training scripts for models that are based on trainers developed from the library that hosts the model code. Moreover, **deep** doctection hosts code to some well established datasets like **Publaynet** that makes it easy to experiment. It also contains mappings from widely used data formats like COCO and it has a dataset framework (akin to **datasets** so that setting up training on a custom dataset becomes very easy. **This notebook** shows you how to do this. **deep** doctection comes equipped with a framework that allows you to evaluate predictions of a single or multiple models in a pipeline against some ground truth. Check again **here** how it is done. Having set up a pipeline it takes you a few lines of code to instantiate the pipeline and after a for loop all pages will be processed through the pipeline.
embodied-agents
Embodied Agents is a toolkit for integrating large multi-modal models into existing robot stacks with just a few lines of code. It provides consistency, reliability, scalability, and is configurable to any observation and action space. The toolkit is designed to reduce complexities involved in setting up inference endpoints, converting between different model formats, and collecting/storing datasets. It aims to facilitate data collection and sharing among roboticists by providing Python-first abstractions that are modular, extensible, and applicable to a wide range of tasks. The toolkit supports asynchronous and remote thread-safe agent execution for maximal responsiveness and scalability, and is compatible with various APIs like HuggingFace Spaces, Datasets, Gymnasium Spaces, Ollama, and OpenAI. It also offers automatic dataset recording and optional uploads to the HuggingFace hub.
model.nvim
model.nvim is a tool designed for Neovim users who want to utilize AI models for completions or chat within their text editor. It allows users to build prompts programmatically with Lua, customize prompts, experiment with multiple providers, and use both hosted and local models. The tool supports features like provider agnosticism, programmatic prompts in Lua, async and multistep prompts, streaming completions, and chat functionality in 'mchat' filetype buffer. Users can customize prompts, manage responses, and context, and utilize various providers like OpenAI ChatGPT, Google PaLM, llama.cpp, ollama, and more. The tool also supports treesitter highlights and folds for chat buffers.
20 - OpenAI Gpts
Tattoo Ideas GPT
Helps design and customize tattoos, recommends artists, and provides aftercare advice.
Quick QR Art - QR Code AI Art Generator
Create, Customize, and Track Stunning QR Codes Art with Our Free QR Code AI Art Generator. Seamlessly integrate these artistic codes into your marketing materials, packaging, and digital platforms.
Instant Command GPT
Executes tasks via short commands instantly, using a single seesion to customize commands.
GAPP STORE
Welcome to GAPP Store: Chat, create, customize—your all-in-one AI app universe
Sneaker Genius
Expert in sneaker customization, buying, collecting, and offering detailed advice on painting techniques and design inspiration
Preference Card Estimator
Generates detailed orthopedic surgery cards using uploaded formats.
Vikas' Scripting Helper
Guides in creating, customizing Airtable scripts with user-friendly explanations.
QR Code Creator & Customizer
Create a QR code in 30 seconds + add a cool design effect or overlay it on top of any image. Free, no watermarks, no email required, and we don't store your messages/images.
Corporate Trainer
Develops training programs, customizing content to fit corporate culture and objectives.