Best AI tools for< Customize Inference Parameters >
20 - AI tool Sites
ThirdAI
ThirdAI is a production-ready AI platform designed for enterprise use, offering out-of-the-box solutions that work at scale and provide 10x better price performance. The platform features enterprise SSO, LLM guardrails, built-in models, a no-code interface, and implicit feedback & RLHF. It allows for turnkey deployment of complex AI ecosystems, enabling business leaders to solve critical needs quickly. With a focus on security, scalability, and performance, ThirdAI helps drive innovation and achieve business goals from day one.
Rebellions
Rebellions is an AI technology company specializing in AI chips and systems-on-chip for various applications. They focus on energy-efficient solutions and have secured significant investments to drive innovation in the field of Generative AI. Rebellions aims to reshape the future by providing versatile and efficient AI computing solutions.
MonsterImage.AI
MonsterImage.AI is an AI-powered tool that allows users to create cool pattern images using Artificial Intelligence. Users can sign in to the platform and receive a link via email to log in. They can write prompts to describe the image they want to create, select patterns, specify negative prompts to avoid certain elements in the image, use seeds for image reproduction, adjust guidance scale, controlnet conditioning scale, and inference steps for image generation. The tool offers advanced options for creating images and allows users to save their creations in a public collection.
Glambase
Glambase is an AI Influencer Creation Platform that allows users to design and generate unique AI influencers for engaging and monetizing audiences. Users can create custom digital personas with a wide range of physical attributes and personality traits, effortlessly craft posts, images, and videos, and monitor financial progress with real-time analytics. The platform offers exclusive perks for VIP members and early adopters, including unique badges, private Discord server access, and lifetime fee reduction. Glambase provides a user-friendly interface with no steep learning curve, enabling users to create and monetize AI characters without technical knowledge.
vHub.ai
vHub.ai is an AI-powered Influencer Marketing SaaS Platform that offers a comprehensive suite of tools to streamline influencer marketing campaigns. With features like in-depth influencer insights, authenticity assurance, instant campaign analytics, effortless influencer coordination, and tailor-made campaign strategies, vHub.ai aims to revolutionize the way brands collaborate with influencers. The platform boasts a database of 5 million influencers and 100's of influencer marketing agencies, providing users with the perfect influencer at optimal pricing. By leveraging AI search engine capabilities and VQS (vHub Quality Score), vHub.ai ensures genuine connections and successful influencer partnerships. Users can track campaign performance in real-time, manage all types of campaigns, and customize strategies with precision. With a focus on data-driven influencer discovery, authentic campaigns, custom success campaigns, ROI-driven marketing, and streamlined campaign management, vHub.ai empowers brands to maximize their influencer marketing ROI and reach.
Journey.ai
Journey.ai is an AI application that offers personalized travel experiences through AI companions. Users can create their own AI travel agents with distinct personalities, receive tailored recommendations, and craft dynamic trip plans. The platform allows users to join a community of fellow travelers and shape their adventures. Journey.ai aims to revolutionize travel customization by providing exclusive privileges and rewards to its users.
FlexClip
FlexClip is a powerful yet easy-to-use online video editing tool. With its extensive templates and resources, you can easily create high-quality videos for personal or business purposes without any learning curve.
SnapSite
SnapSite is an AI-powered website service that allows users to customize their website effortlessly. With its flat-rate all-in-one solution, there's no need for design, development, or marketing expertise. Users can simply send their request in natural language and SnapSite will deliver a stunning, highly functional website tailored to their specific needs.
My Hacker News
My Hacker News is an AI-powered platform that offers a personalized daily dose of Hacker News through a customized newsletter. The platform utilizes AI algorithms, including Claude3.5 Sonnet and GPT-4o, to semantically index HN stories and comments daily, finding new stories matching users' interests and reranking them. Users receive a tailored newsletter directly in their inbox, saving time and keeping them informed. The platform allows users to shape their digest and offers a free digest email service without the need for sign up.
ChartAI
ChartAI is an AI tool that allows users to customize bar charts from screenshots. Users can upload their chart images, and ChartAI's visual recognition technology extracts the data to recreate the chart. Users can adjust the style, add or remove data, and change the chart type effortlessly. The tool simplifies the process of creating charts and provides quick updates based on user requests.
OffRobe
OffRobe is a powerful NSFW AI image generator and editor that allows users to create and edit realistic and high-quality NSFW images with ease. The platform is designed with strong privacy and security measures to ensure that users' activities and data remain confidential. OffRobe's AI models offer a high level of customization, enabling users to bring their unique fantasies to life. With its user-friendly interface and advanced features, OffRobe is the perfect tool for anyone looking to explore the world of NSFW AI art.
Oda Moodboard
Oda Moodboard is an AI-powered tool that allows users to create mood boards for their home decor. With Oda Moodboard, users can choose their style and color preferences, and the AI will generate a mood board with matching images. Users can also upload their own photos to create a mood board. Oda Moodboard is a great way to get inspiration for your home decor, and it can help you to create a cohesive look for your home.
VERSA
VERSA is a text-based adventure game that allows users to choose their own adventure and customize their companion. Users can choose from a variety of settings, including sci-fi, wild-west, horror, drama, war, university, or fantasy. They can also choose a male, female, or non-binary companion to be their friend, romance, or enemy. VERSA is designed to push the limits of what's possible with a 1-gem model, while keeping it as entertaining as possible.
Quick QR Art
Quick QR Art is a free QR Code AI Art Generator that allows users to create, customize, and track stunning QR Codes Art. With Quick QR Art, users can easily generate QR Codes Art that are fully customizable, dynamic, and trackable. Quick QR Art also offers a comprehensive suite of link management tools, making it easy to manage and track all of your links in one place. Whether you're looking to create QR Codes Art for marketing, branding, or personal use, Quick QR Art has you covered.
Greetsapp
Greetsapp is an online platform that allows users to create and send personalized ecards for various occasions. It uses AI-powered writing tools to suggest the perfect words and phrases for each message, making it easy for users to create heartfelt and meaningful greetings. With Greetsapp, users can select from dozens of styles and generate a completely original work of art in seconds. They can also customize their greetings with their own images and messages. Greetsapp's ecards can be delivered via text or email within moments, making it a convenient way to stay connected with loved ones no matter the distance.
Snowpixel
Snowpixel is a powerful AI-powered tool that allows users to create stunning images, videos, music, and more from just text. With Snowpixel, you can bring your imagination to life with ease. Whether you're a creative professional, a marketer, or simply someone who loves to express themselves, Snowpixel has something to offer you. With its user-friendly interface and wide range of features, Snowpixel makes it easy to create high-quality content that will captivate your audience.
AI Song Generator
AI Song Generator is an AI-powered platform that simplifies the process of creating, customizing, and sharing unique songs. Users can define song parameters, generate music using AI capabilities, personalize tracks, and download/share their creations. The tool offers features like text-to-song conversion, lyrics generation, cover creation, voice cloning, and more. With a wide range of genres and moods available, users can easily create royalty-free music for various projects. AI Song Generator is designed to be user-friendly, accessible to all, and provides a seamless music creation experience.
AIPage.dev
AIPage.dev is an AI-powered landing page generator that simplifies web development by utilizing cutting-edge AI technology. It allows users to create stunning landing pages with just a single prompt, eliminating the need for hours of coding and designing. The platform offers features like AI-driven design, intuitive editing interface, seamless cloud deployment, rapid development, effortless blog post creation, unlimited hosting for blog posts, lead collection, and seamless integration with leading providers. AIPage.dev aims to transform ideas into reality and empower users to showcase their projects and products effectively.
HeadshotBooth.ai
HeadshotBooth.ai is a professional AI tool that offers AI-generated headshots in just a few hours, eliminating the need for in-person photoshoots. It provides a cost-effective and time-saving solution for social media, resumes, and professional portfolios. The application uses advanced AI technology to create high-quality headshots with various clothing, backgrounds, and poses to choose from, catering to professionals worldwide.
Poly.AI Chatbot
Poly.AI Chatbot is an AI-powered chatbot application that enables users to engage in deeper and discreet conversations with a next-generation AI. The platform is free to use and accessible online, offering a seamless and interactive chat experience for users seeking intelligent virtual assistance.
20 - Open Source AI Tools
TPI-LLM
TPI-LLM (Tensor Parallelism Inference for Large Language Models) is a system designed to bring LLM functions to low-resource edge devices, addressing privacy concerns by enabling LLM inference on edge devices with limited resources. It leverages multiple edge devices for inference through tensor parallelism and a sliding window memory scheduler to minimize memory usage. TPI-LLM demonstrates significant improvements in TTFT and token latency compared to other models, and plans to support infinitely large models with low token latency in the future.
LLMFarm
LLMFarm is an iOS and MacOS app designed to work with large language models (LLM). It allows users to load different LLMs with specific parameters, test the performance of various LLMs on iOS and macOS, and identify the most suitable model for their projects. The tool is based on ggml and llama.cpp by Georgi Gerganov and incorporates sources from rwkv.cpp by saharNooby, Mia by byroneverson, and LlamaChat by alexrozanski. LLMFarm features support for MacOS (13+) and iOS (16+), various inferences and sampling methods, Metal compatibility (not supported on Intel Mac), model setting templates, LoRA adapters support, LoRA finetune support, LoRA export as model support, and more. It also offers a range of inferences including LLaMA, GPTNeoX, Replit, GPT2, Starcoder, RWKV, Falcon, MPT, Bloom, and others. Additionally, it supports multimodal models like LLaVA, Obsidian, and MobileVLM. Users can customize inference options through JSON files and access supported models for download.
kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.
llm.nvim
llm.nvim is a plugin for Neovim that enables code completion using LLM models. It supports 'ghost-text' code completion similar to Copilot and allows users to choose their model for code generation via HTTP requests. The plugin interfaces with multiple backends like Hugging Face, Ollama, Open AI, and TGI, providing flexibility in model selection and configuration. Users can customize the behavior of suggestions, tokenization, and model parameters to enhance their coding experience. llm.nvim also includes commands for toggling auto-suggestions and manually requesting suggestions, making it a versatile tool for developers using Neovim.
model.nvim
model.nvim is a tool designed for Neovim users who want to utilize AI models for completions or chat within their text editor. It allows users to build prompts programmatically with Lua, customize prompts, experiment with multiple providers, and use both hosted and local models. The tool supports features like provider agnosticism, programmatic prompts in Lua, async and multistep prompts, streaming completions, and chat functionality in 'mchat' filetype buffer. Users can customize prompts, manage responses, and context, and utilize various providers like OpenAI ChatGPT, Google PaLM, llama.cpp, ollama, and more. The tool also supports treesitter highlights and folds for chat buffers.
fastfit
FastFit is a Python package designed for fast and accurate few-shot classification, especially for scenarios with many semantically similar classes. It utilizes a novel approach integrating batch contrastive learning and token-level similarity score, significantly improving multi-class classification performance in speed and accuracy across various datasets. FastFit provides a convenient command-line tool for training text classification models with customizable parameters. It offers a 3-20x improvement in training speed, completing training in just a few seconds. Users can also train models with Python scripts and perform inference using pretrained models for text classification tasks.
llm-autoeval
LLM AutoEval is a tool that simplifies the process of evaluating Large Language Models (LLMs) using a convenient Colab notebook. It automates the setup and execution of evaluations using RunPod, allowing users to customize evaluation parameters and generate summaries that can be uploaded to GitHub Gist for easy sharing and reference. LLM AutoEval supports various benchmark suites, including Nous, Lighteval, and Open LLM, enabling users to compare their results with existing models and leaderboards.
LeanCopilot
Lean Copilot is a tool that enables the use of large language models (LLMs) in Lean for proof automation. It provides features such as suggesting tactics/premises, searching for proofs, and running inference of LLMs. Users can utilize built-in models from LeanDojo or bring their own models to run locally or on the cloud. The tool supports platforms like Linux, macOS, and Windows WSL, with optional CUDA and cuDNN for GPU acceleration. Advanced users can customize behavior using Tactic APIs and Model APIs. Lean Copilot also allows users to bring their own models through ExternalGenerator or ExternalEncoder. The tool comes with caveats such as occasional crashes and issues with premise selection and proof search. Users can get in touch through GitHub Discussions for questions, bug reports, feature requests, and suggestions. The tool is designed to enhance theorem proving in Lean using LLMs.
LLMUnity
LLM for Unity enables seamless integration of Large Language Models (LLMs) within the Unity engine, allowing users to create intelligent characters for immersive player interactions. The tool supports major LLM models, runs locally without internet access, offers fast inference on CPU and GPU, and is easy to set up with a single line of code. It is free for both personal and commercial use, tested on Unity 2021 LTS, 2022 LTS, and 2023. Users can build multiple AI characters efficiently, use remote servers for processing, and customize model settings for text generation.
llm-foundry
LLM Foundry is a codebase for training, finetuning, evaluating, and deploying LLMs for inference with Composer and the MosaicML platform. It is designed to be easy-to-use, efficient _and_ flexible, enabling rapid experimentation with the latest techniques. You'll find in this repo: * `llmfoundry/` - source code for models, datasets, callbacks, utilities, etc. * `scripts/` - scripts to run LLM workloads * `data_prep/` - convert text data from original sources to StreamingDataset format * `train/` - train or finetune HuggingFace and MPT models from 125M - 70B parameters * `train/benchmarking` - profile training throughput and MFU * `inference/` - convert models to HuggingFace or ONNX format, and generate responses * `inference/benchmarking` - profile inference latency and throughput * `eval/` - evaluate LLMs on academic (or custom) in-context-learning tasks * `mcli/` - launch any of these workloads using MCLI and the MosaicML platform * `TUTORIAL.md` - a deeper dive into the repo, example workflows, and FAQs
oterm
Oterm is a text-based terminal client for Ollama, a large language model. It provides an intuitive and simple terminal UI, allowing users to interact with Ollama without running servers or frontends. Oterm supports multiple persistent chat sessions, which are stored along with context embeddings and system prompt customizations in a SQLite database. Users can easily customize the model's system prompt and parameters, and select from any of the models they have pulled in Ollama or their own custom models. Oterm also supports keyboard shortcuts for creating new chat sessions, editing existing sessions, renaming sessions, exporting sessions as markdown, deleting sessions, toggling between dark and light themes, quitting the application, switching to multiline input mode, selecting images to include with messages, and navigating through the history of previous prompts. Oterm is licensed under the MIT License.
evalscope
Eval-Scope is a framework designed to support the evaluation of large language models (LLMs) by providing pre-configured benchmark datasets, common evaluation metrics, model integration, automatic evaluation for objective questions, complex task evaluation using expert models, reports generation, visualization tools, and model inference performance evaluation. It is lightweight, easy to customize, supports new dataset integration, model hosting on ModelScope, deployment of locally hosted models, and rich evaluation metrics. Eval-Scope also supports various evaluation modes like single mode, pairwise-baseline mode, and pairwise (all) mode, making it suitable for assessing and improving LLMs.
LLM-Finetuning-Toolkit
LLM Finetuning toolkit is a config-based CLI tool for launching a series of LLM fine-tuning experiments on your data and gathering their results. It allows users to control all elements of a typical experimentation pipeline - prompts, open-source LLMs, optimization strategy, and LLM testing - through a single YAML configuration file. The toolkit supports basic, intermediate, and advanced usage scenarios, enabling users to run custom experiments, conduct ablation studies, and automate fine-tuning workflows. It provides features for data ingestion, model definition, training, inference, quality assurance, and artifact outputs, making it a comprehensive tool for fine-tuning large language models.
worker-vllm
The worker-vLLM repository provides a serverless endpoint for deploying OpenAI-compatible vLLM models with blazing-fast performance. It supports deploying various model architectures, such as Aquila, Baichuan, BLOOM, ChatGLM, Command-R, DBRX, DeciLM, Falcon, Gemma, GPT-2, GPT BigCode, GPT-J, GPT-NeoX, InternLM, Jais, LLaMA, MiniCPM, Mistral, Mixtral, MPT, OLMo, OPT, Orion, Phi, Phi-3, Qwen, Qwen2, Qwen2MoE, StableLM, Starcoder2, Xverse, and Yi. Users can deploy models using pre-built Docker images or build custom images with specified arguments. The repository also supports OpenAI compatibility for chat completions, completions, and models, with customizable input parameters. Users can modify their OpenAI codebase to use the deployed vLLM worker and access a list of available models for deployment.
swift
SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) supports training, inference, evaluation and deployment of nearly **200 LLMs and MLLMs** (multimodal large models). Developers can directly apply our framework to their own research and production environments to realize the complete workflow from model training and evaluation to application. In addition to supporting the lightweight training solutions provided by [PEFT](https://github.com/huggingface/peft), we also provide a complete **Adapters library** to support the latest training techniques such as NEFTune, LoRA+, LLaMA-PRO, etc. This adapter library can be used directly in your own custom workflow without our training scripts. To facilitate use by users unfamiliar with deep learning, we provide a Gradio web-ui for controlling training and inference, as well as accompanying deep learning courses and best practices for beginners. Additionally, we are expanding capabilities for other modalities. Currently, we support full-parameter training and LoRA training for AnimateDiff.
ai-voice-cloning
This repository provides a tool for AI voice cloning, allowing users to generate synthetic speech that closely resembles a target speaker's voice. The tool is designed to be user-friendly and accessible, with a graphical user interface that guides users through the process of training a voice model and generating synthetic speech. The tool also includes a variety of features that allow users to customize the generated speech, such as the pitch, volume, and speaking rate. Overall, this tool is a valuable resource for anyone interested in creating realistic and engaging synthetic speech.
tensorrtllm_backend
The TensorRT-LLM Backend is a Triton backend designed to serve TensorRT-LLM models with Triton Inference Server. It supports features like inflight batching, paged attention, and more. Users can access the backend through pre-built Docker containers or build it using scripts provided in the repository. The backend can be used to create models for tasks like tokenizing, inferencing, de-tokenizing, ensemble modeling, and more. Users can interact with the backend using provided client scripts and query the server for metrics related to request handling, memory usage, KV cache blocks, and more. Testing for the backend can be done following the instructions in the 'ci/README.md' file.
guidellm
GuideLLM is a powerful tool for evaluating and optimizing the deployment of large language models (LLMs). By simulating real-world inference workloads, GuideLLM helps users gauge the performance, resource needs, and cost implications of deploying LLMs on various hardware configurations. This approach ensures efficient, scalable, and cost-effective LLM inference serving while maintaining high service quality. Key features include performance evaluation, resource optimization, cost estimation, and scalability testing.
DB-GPT
DB-GPT is a personal database administrator that can solve database problems by reading documents, using various tools, and writing analysis reports. It is currently undergoing an upgrade. **Features:** * **Online Demo:** * Import documents into the knowledge base * Utilize the knowledge base for well-founded Q&A and diagnosis analysis of abnormal alarms * Send feedbacks to refine the intermediate diagnosis results * Edit the diagnosis result * Browse all historical diagnosis results, used metrics, and detailed diagnosis processes * **Language Support:** * English (default) * Chinese (add "language: zh" in config.yaml) * **New Frontend:** * Knowledgebase + Chat Q&A + Diagnosis + Report Replay * **Extreme Speed Version for localized llms:** * 4-bit quantized LLM (reducing inference time by 1/3) * vllm for fast inference (qwen) * Tiny LLM * **Multi-path extraction of document knowledge:** * Vector database (ChromaDB) * RESTful Search Engine (Elasticsearch) * **Expert prompt generation using document knowledge** * **Upgrade the LLM-based diagnosis mechanism:** * Task Dispatching -> Concurrent Diagnosis -> Cross Review -> Report Generation * Synchronous Concurrency Mechanism during LLM inference * **Support monitoring and optimization tools in multiple levels:** * Monitoring metrics (Prometheus) * Flame graph in code level * Diagnosis knowledge retrieval (dbmind) * Logical query transformations (Calcite) * Index optimization algorithms (for PostgreSQL) * Physical operator hints (for PostgreSQL) * Backup and Point-in-time Recovery (Pigsty) * **Continuously updated papers and experimental reports** This project is constantly evolving with new features. Don't forget to star ⭐ and watch 👀 to stay up to date.
tribe
Tribe AI is a low code tool designed to rapidly build and coordinate multi-agent teams. It leverages the langgraph framework to customize and coordinate teams of agents, allowing tasks to be split among agents with different strengths for faster and better problem-solving. The tool supports persistent conversations, observability, tool calling, human-in-the-loop functionality, easy deployment with Docker, and multi-tenancy for managing multiple users and teams.
20 - OpenAI Gpts
Tattoo Ideas GPT
Helps design and customize tattoos, recommends artists, and provides aftercare advice.
Quick QR Art - QR Code AI Art Generator
Create, Customize, and Track Stunning QR Codes Art with Our Free QR Code AI Art Generator. Seamlessly integrate these artistic codes into your marketing materials, packaging, and digital platforms.
Instant Command GPT
Executes tasks via short commands instantly, using a single seesion to customize commands.
GAPP STORE
Welcome to GAPP Store: Chat, create, customize—your all-in-one AI app universe
Sneaker Genius
Expert in sneaker customization, buying, collecting, and offering detailed advice on painting techniques and design inspiration
Preference Card Estimator
Generates detailed orthopedic surgery cards using uploaded formats.
Vikas' Scripting Helper
Guides in creating, customizing Airtable scripts with user-friendly explanations.
QR Code Creator & Customizer
Create a QR code in 30 seconds + add a cool design effect or overlay it on top of any image. Free, no watermarks, no email required, and we don't store your messages/images.
Corporate Trainer
Develops training programs, customizing content to fit corporate culture and objectives.