Best AI tools for< Self-hosted Model Deployment >
Infographic
20 - AI tool Sites

Refact.ai
Refact.ai is an open-source AI coding assistant that offers a range of features including code completion, refactoring, and chat. It supports various LLMs such as GPT-4 and Code LLama, allowing users to choose the model that best suits their needs. Refact understands the context of the codebase using a fill-in-the-middle technique, providing relevant suggestions. Users can opt for a self-hosted version or adjust privacy settings for the plugin.

Athina AI
Athina AI is a comprehensive platform designed to monitor, debug, analyze, and improve the performance of Large Language Models (LLMs) in production environments. It provides a suite of tools and features that enable users to detect and fix hallucinations, evaluate output quality, analyze usage patterns, and optimize prompt management. Athina AI supports integration with various LLMs and offers a range of evaluation metrics, including context relevancy, harmfulness, summarization accuracy, and custom evaluations. It also provides a self-hosted solution for complete privacy and control, a GraphQL API for programmatic access to logs and evaluations, and support for multiple users and teams. Athina AI's mission is to empower organizations to harness the full potential of LLMs by ensuring their reliability, accuracy, and alignment with business objectives.

AI Placeholder
AI Placeholder is a free AI-Powered Fake or Dummy Data API for testing and prototyping. It leverages OpenAI's GPT-3.5-Turbo Model API to generate fake or dummy content. Users can directly use the hosted version or self-host it. The API allows users to generate any data they can think of, with the ability to specify rules for data retrieval. It supports various content types like tweets, posts, Instagram posts, and more. The application is designed to assist developers and testers in creating realistic but fictional data for their projects.

Aim
Aim is an open-source, self-hosted AI Metadata tracking tool designed to handle 100,000s of tracked metadata sequences. Two most famous AI metadata applications are: experiment tracking and prompt engineering. Aim provides a performant and beautiful UI for exploring and comparing training runs, prompt sessions.

CollaborativeAI
CollaborativeAI is an open-source AI assistant platform that offers self-hosted AI assistants to optimize workflows and interdepartmental teamwork. It provides a wide selection of AI models, including GPT-4, Gemini, and Claude, to help businesses drive innovation and address bespoke problems. The platform prioritizes on-site data security, allowing users to keep their critical data exclusively within their servers. CollaborativeAI aims to boost employee productivity and innovation by automating mundane tasks and simplifying complex processes.

Lokal.so
Lokal.so is an AI-powered tool designed to supercharge your localhost development experience. It offers features like sharing your localhost with the public, debugging incoming requests, and developing with the assistance of an AI assistant. With Lokal.so, you can leverage Cloudflare's network for faster site delivery, use a built-in S3 server for easy file debugging, and automatically convert JSON payloads into different programming language models. The tool aims to simplify local development by providing a self-hosted tunnel server, unlimited .local domain access, and endpoint management with memorable names.

PhotoPrism
PhotoPrism is an AI-Powered Photos App for the Decentralized Web that utilizes cutting-edge technologies to automatically tag and find pictures. It allows users to organize and access their photos effortlessly, without compromising privacy. The application offers features like browsing all photos and videos, powerful search filters, world maps for trip memories, live photo playback, facial recognition, and automatic picture classification based on content and location. PhotoPrism is self-funded and independent, ensuring data privacy and transparency. Users can run the app on a private server, in the cloud, or at home.

Volamail
Volamail is an AI-powered email platform that simplifies the email writing process for everyone. It offers AI-assisted editing to help users compose email templates effortlessly. The platform supports importing existing emails in plain HTML format and allows self-hosting for easy deployment. With Volamail, users can send transactional emails via a simple HTTP call without the need for dependencies. The platform is constantly evolving with new features like AI template generation, inline AI editing, and custom domains. Volamail provides simple and scalable pricing options, including a free plan for small projects and affordable custom plans for larger teams.

CodeComplete
CodeComplete is an AI-powered coding assistant designed specifically for enterprise needs. It is efficient, reliable, and equipped with cutting-edge technology to improve developer productivity. CodeComplete offers a comprehensive suite of coding tools to improve end-to-end developer workflow, including code generation, code chat, automated unit test generation, automated documentation, and refactoring & migrations.

Gunbot
Gunbot is a powerful AI-driven crypto trading bot that offers advanced automation features for trading on various exchanges. It provides privacy-friendly trading options, pre-tuned strategies, and customization capabilities. Gunbot runs directly on users' devices, ensuring data security and privacy. Users can choose from subscription or lifetime license options, with free software updates included. The platform supports Windows, Linux, macOS, and Raspberry Pi. Gunbot AI allows users to create custom trading strategies effortlessly, powered by ChatGPT technology.

Activepieces
Activepieces is an open-source no-code business automation tool that allows users to securely deploy automation for various departments such as marketing, sales, operations, HR, finance, and IT teams. It offers a customizable and self-hosted solution, enabling users to put their work on autopilot. Activepieces stands out for its user experience, ease of integration, and control over hosting. The tool leverages AI automation to streamline tasks like content strategy, security compliance, sales outreach, and customer support. With a community-driven approach, Activepieces aims to make the automation world more open and accessible.

n8n
n8n is a powerful workflow automation software and tools that offer advanced AI capabilities. It is a popular platform for technical teams to automate workflows, integrate various services, and build autonomous agents. With over 400 integrations, n8n enables users to save time, streamline operations, and enhance security through AI-driven processes. The tool supports self-hosting, external libraries, and offers enterprise-ready solutions for scaling operations. n8n empowers users to code iteratively, explore advanced AI features, and create complex workflows with ease.

n8n
n8n is a powerful workflow automation software and tool that offers advanced AI capabilities. It is a popular platform for technical teams to automate workflows, integrate various services, and build autonomous agents. With over 400 integrations, n8n enables users to save time, streamline operations, and enhance security through AI-powered solutions. The tool supports self-hosting, external libraries, and a user-friendly interface for both coding and non-coding users.

Collato
Collato is an AI assistant designed to help product teams save time on writing documents, answering questions, and generating new content. It can find, summarize, and generate new content based on your own product knowledge, saving you hours in manual work. Collato is also self-hosted, so you can keep your data private and secure.

Ikigai
Ikigai is an AI-powered Open Assignment System that allows users to generate assignments quickly and draft feedback using AI. It offers seamless access via a single link and can be embedded in websites or learning management systems. The system features various quiz types and a flexible layout adaptable to different exam formats. Users can experience Ikigai through the cloud for free or opt for a self-hosted option for more privacy.

Hatchet
Hatchet is an AI companion designed to assist on-call engineers in incident response by providing intelligent insights and suggestions based on logs, communications channels, and code analysis. It helps save time and money by automating the triaging and investigation process during critical incidents. The tool is built by engineers with a focus on data security, offering self-hosted deployments, permissions, audit trails, SSO, and version control. Hatchet aims to streamline incident resolution for tier-1 services, enabling faster response and potential problem resolution.

PageWise AI
PageWise AI is an AI application that provides instant insight into Confluence data by turning tedious searches and repetitive questions into intuitive AI-powered conversations directly in Slack. It offers effortless integration with Slack and Confluence accounts, eliminates the need for sifting through pages, reduces repetitive queries, connects hidden insights within Confluence pages, boosts communication, and scales with business needs. PageWise AI offers flexible pricing plans catering to different team sizes and requirements, with the option for custom integrations, data sources, and self-hosted solutions for enterprise users.

Yoodocs
Yoodocs is an AI-powered documentation service that simplifies document creation, management, and collaboration. It offers features such as document hierarchy organization, open-source documentation creation, export to various formats, workspace diversity, language management, version control, seamless migration, AI-powered editor assistant, comprehensive search, automated sync with GitLab and GitHub, self-hosted solution, collaborative development, customization styles and themes, and integrations. Yoodocs aims to enhance productivity and efficiency in projects by providing a comprehensive solution for documentation needs.

Warestack
Warestack is an AI-powered cloud workflow automation platform that helps users manage all daily workflow operations with AI-powered observability. It allows users to monitor workflow runs from a single dashboard, speed up releases with one-click resolutions, and gain actionable insights. Warestack streamlines workflow runs, eliminates manual processes complexity, automates workflow operations with a copilot, and boosts runs with self-hosted runners at infrastructure cost. The platform leverages generative AI and deep-tech to enhance and automate workflow processes, ensuring consistent documentation and team productivity.

Self-Introduction Generate AI
Self-Introduction Generate AI is an innovative platform designed to assist individuals and businesses in crafting compelling and effective self-introductions. It leverages advanced AI technology to understand context and generate personalized content. The platform can analyze and understand various types of input, including text and context, to generate tailored self-introductions that are engaging and informative, enhancing personal and professional branding. With features like quick response times, quality assurance, and specialized service for self-introductions, it is an ideal tool for job applications, networking events, and personal branding initiatives.
20 - Open Source Tools

ray-llm
RayLLM (formerly known as Aviary) is an LLM serving solution that makes it easy to deploy and manage a variety of open source LLMs, built on Ray Serve. It provides an extensive suite of pre-configured open source LLMs, with defaults that work out of the box. RayLLM supports Transformer models hosted on Hugging Face Hub or present on local disk. It simplifies the deployment of multiple LLMs, the addition of new LLMs, and offers unique autoscaling support, including scale-to-zero. RayLLM fully supports multi-GPU & multi-node model deployments and offers high performance features like continuous batching, quantization and streaming. It provides a REST API that is similar to OpenAI's to make it easy to migrate and cross test them. RayLLM supports multiple LLM backends out of the box, including vLLM and TensorRT-LLM.

log10
Log10 is a one-line Python integration to manage your LLM data. It helps you log both closed and open-source LLM calls, compare and identify the best models and prompts, store feedback for fine-tuning, collect performance metrics such as latency and usage, and perform analytics and monitor compliance for LLM powered applications. Log10 offers various integration methods, including a python LLM library wrapper, the Log10 LLM abstraction, and callbacks, to facilitate its use in both existing production environments and new projects. Pick the one that works best for you. Log10 also provides a copilot that can help you with suggestions on how to optimize your prompt, and a feedback feature that allows you to add feedback to your completions. Additionally, Log10 provides prompt provenance, session tracking and call stack functionality to help debug prompt chains. With Log10, you can use your data and feedback from users to fine-tune custom models with RLHF, and build and deploy more reliable, accurate and efficient self-hosted models. Log10 also supports collaboration, allowing you to create flexible groups to share and collaborate over all of the above features.

fish-ai
fish-ai is a tool that adds AI functionality to Fish shell. It can be integrated with various AI providers like OpenAI, Azure OpenAI, Google, Hugging Face, Mistral, or a self-hosted LLM. Users can transform comments into commands, autocomplete commands, and suggest fixes. The tool allows customization through configuration files and supports switching between contexts. Data privacy is maintained by redacting sensitive information before submission to the AI models. Development features include debug logging, testing, and creating releases.

aiops-modules
AIOps Modules is a collection of reusable Infrastructure as Code (IAC) modules that work with SeedFarmer CLI. The modules are decoupled and can be aggregated using GitOps principles to achieve desired use cases, removing heavy lifting for end users. They must be generic for reuse in Machine Learning and Foundation Model Operations domain, adhering to SeedFarmer Guide structure. The repository includes deployment steps, project manifests, and various modules for SageMaker, Mlflow, FMOps/LLMOps, MWAA, Step Functions, EKS, and example use cases. It also supports Industry Data Framework (IDF) and Autonomous Driving Data Framework (ADDF) Modules.

ai-dial
AI DIAL is an open-source project that provides a platform for developing and deploying conversational AI applications. It includes components such as DIAL Core for API exposure, DIAL SDK for development, and DIAL Chat for default UI. The project offers tutorials for launching AI DIAL Chat with different models and applications, along with a user manual and configuration guide. Additionally, there are various open-source repositories related to DIAL, including DIAL Helm for helm chart, DIAL Assistant for model agnostic assistant implementation, and DIAL Analytics Realtime for usage analytics. The project aims to simplify the development and deployment of AI-powered chat applications.

llm-price-compass
LLM price compass is an open-source tool for comparing inference costs on different GPUs across various cloud providers. It collects benchmark data to help users select the right GPU, cloud, and provider for their models. The project aims to provide insights into fixed per token costs from different providers, aiding in decision-making for model deployment.

supervisely
Supervisely is a computer vision platform that provides a range of tools and services for developing and deploying computer vision solutions. It includes a data labeling platform, a model training platform, and a marketplace for computer vision apps. Supervisely is used by a variety of organizations, including Fortune 500 companies, research institutions, and government agencies.

superduper
superduper.io is a Python framework that integrates AI models, APIs, and vector search engines directly with existing databases. It allows hosting of models, streaming inference, and scalable model training/fine-tuning. Key features include integration of AI with data infrastructure, inference via change-data-capture, scalable model training, model chaining, simple Python interface, Python-first approach, working with difficult data types, feature storing, and vector search capabilities. The tool enables users to turn their existing databases into centralized repositories for managing AI model inputs and outputs, as well as conducting vector searches without the need for specialized databases.

vulnerability-analysis
The NVIDIA AI Blueprint for Vulnerability Analysis for Container Security showcases accelerated analysis on common vulnerabilities and exposures (CVE) at an enterprise scale, reducing mitigation time from days to seconds. It enables security analysts to determine software package vulnerabilities using large language models (LLMs) and retrieval-augmented generation (RAG). The blueprint is designed for security analysts, IT engineers, and AI practitioners in cybersecurity. It requires NVAIE developer license and API keys for vulnerability databases, search engines, and LLM model services. Hardware requirements include L40 GPU for pipeline operation and optional LLM NIM and Embedding NIM. The workflow involves LLM pipeline for CVE impact analysis, utilizing LLM planner, agent, and summarization nodes. The blueprint uses NVIDIA NIM microservices and Morpheus Cybersecurity AI SDK for vulnerability analysis.

awesome-cuda-tensorrt-fpga
Okay, here is a JSON object with the requested information about the awesome-cuda-tensorrt-fpga repository:

Awesome-LLM
Awesome-LLM is a curated list of resources related to large language models, focusing on papers, projects, frameworks, tools, tutorials, courses, opinions, and other useful resources in the field. It covers trending LLM projects, milestone papers, other papers, open LLM projects, LLM training frameworks, LLM evaluation frameworks, tools for deploying LLM, prompting libraries & tools, tutorials, courses, books, and opinions. The repository provides a comprehensive overview of the latest advancements and resources in the field of large language models.

AI-Gateway
The AI-Gateway repository explores the AI Gateway pattern through a series of experimental labs, focusing on Azure API Management for handling AI services APIs. The labs provide step-by-step instructions using Jupyter notebooks with Python scripts, Bicep files, and APIM policies. The goal is to accelerate experimentation of advanced use cases and pave the way for further innovation in the rapidly evolving field of AI. The repository also includes a Mock Server to mimic the behavior of the OpenAI API for testing and development purposes.

ai-enablement-stack
The AI Enablement Stack is a curated collection of venture-backed companies, tools, and technologies that enable developers to build, deploy, and manage AI applications. It provides a structured view of the AI development ecosystem across five key layers: Agent Consumer Layer, Observability and Governance Layer, Engineering Layer, Intelligence Layer, and Infrastructure Layer. Each layer focuses on specific aspects of AI development, from end-user interaction to model training and deployment. The stack aims to help developers find the right tools for building AI applications faster and more efficiently, assist engineering leaders in making informed decisions about AI infrastructure and tooling, and help organizations understand the AI development landscape to plan technology adoption.
20 - OpenAI Gpts

Awesome-Selfhosted
Recommends self-hosted IT solutions, tailored for professionals (from https://awesome-selfhosted.net/)

Employer Self-Funded Health Plan Strategist
Expert in self-funded health plans, focusing on finance, stop loss, and data analytics.

AI Self-Esteem Coach
I turn negative thoughts into positive outlooks and suggest growth goals.

psy_self
Welcome to 'SelfEsteemSculptor' – your digital guide for enhancing self-esteem. Discover strategies to build your confidence, overcome self-doubt, and cultivate a strong, positive self-image.

Self Builder
I automate GPT creation, saving + 99% time and securing data, preventing someone steal your idea.

EDC Self Defense Weapons
EDC Self Defense Weapons: Information or guidance on what constitutes EDC self-defense weapons. This can include lists of recommended items, explanations of their practicality, legality, and effectiveness.

Mythical Self-Portrait
Transform yourself with this step-by-step guide to creating a mythical self-portrait with DALL-E.

Language Proficiency Level Self-Assessment
A language self-assessment guide with mobile app voice interaction support.

AI for Business
Self-updating, comprehensive AI business guide. "Your Personal Ai Integration Assistant"

Brag Buddy
Helping you self-promote. Without sounding like an asshole. Click 'I'm ready' to begin.

Abundance
A guide for self-sufficiency and nature awareness, with internet search and image generation.