Best AI tools for< Set Up Llama.cpp Server >
20 - AI tool Sites
AnythingLLM
AnythingLLM is an all-in-one AI application designed for everyone. It offers a suite of tools for working with LLM (Large Language Models), documents, and agents in a fully private environment. Users can install AnythingLLM on their desktop for Windows, MacOS, and Linux, enabling flexible one-click installation and secure, fully private operation without internet connectivity. The application supports custom models, including enterprise models like GPT-4, custom fine-tuned models, and open-source models like Llama and Mistral. AnythingLLM allows users to work with various document formats, such as PDFs and word documents, providing tailored solutions with locally running defaults for privacy.
OneDollarAI.lol
OneDollarAI.lol is an AI application that offers the best AI language model for just one dollar a month. It features LLaMa 3, which is known for being the fastest and most powerful language model. Users can enjoy unlimited usage with no limits, at an affordable price of only $1 per month. The application provides instant responses and requires no setup. It is designed to be user-friendly and accessible to all, making it a convenient tool for various language-related tasks.
BetterLegal
BetterLegal is a one-stop-shop for setting up a new business quickly and efficiently. With no confusing options, the platform handles all necessary filings and generates custom legal documents, making your company official in just 2 business days. It offers state formation filing, EIN (Tax ID) processing, operating agreement or bylaws creation, express processing, registered agent service, compliance service, permit & license searches, and more. BetterLegal is designed to simplify the business formation process and provide essential services for entrepreneurs and small business owners.
Jason AI
Jason AI is a conversational AI assistant designed specifically for B2B sales professionals. It automates outreach sequences, handles prospect responses, and books meetings, freeing up sales reps to focus on closing deals. Jason AI uses advanced natural language processing (NLP) to understand the context of conversations and respond in a personalized and engaging way. It integrates with popular CRM and email platforms, making it easy to use and manage.
eCommerce ChatGPT Prompts Generator
eCommerce ChatGPT Prompts Generator is a tool that helps eCommerce marketers create high-quality prompts for ChatGPT. With over 2 million pre-built prompts, users can quickly and easily generate content for all their eCommerce marketing needs, including product descriptions, social media posts, email campaigns, and more. The tool also offers a variety of customization options, so users can tailor their prompts to their specific needs.
HowsThisGoing
HowsThisGoing is an AI-powered application designed to streamline team communication and productivity by enabling users to set up standups in Slack within seconds. The platform offers features such as automatic standups, AI summaries, custom tests, analytics & reporting, and workflow scheduling. Users can easily create workflows, generate AI reports, and track team performance efficiently. HowsThisGoing provides unlimited benefits at a flat price, making it a cost-effective solution for teams of all sizes.
GoatStack
GoatStack is an AI-powered newsletter agent that delivers personalized insights from scientific papers. It reads over 4000 papers daily and handpicks the most relevant ones for you. With GoatStack, you can stay up-to-date on the latest AI breakthroughs and advancements. It offers a range of features to help you customize your newsletter, including the ability to personalize topics, generalize topics, or be specific with content.
Punchlines.ai
Punchlines.ai is an AI joke generation tool that allows users to generate jokes with the help of AI technology. Users can input a joke set-up, and the tool generates witty punchlines for them. The tool is built on a large language model fine-tuned on thousands of late-night comedy monologue jokes, making it a fun and creative way to come up with humorous content.
Universe
Universe is a website builder that allows users to create custom websites without any coding knowledge. It offers a drag-and-drop editor, AI-generated website designs, and a variety of templates to choose from. Universe also includes e-commerce features, such as the ability to sell physical and digital products, and it integrates with popular social media networks for marketing and advertising. With Universe, users can create a wide range of websites, including online stores, portfolios, event pages, blogs, and fan sites.
Slang.ai
Slang.ai is a voice AI tool designed specifically for restaurants to intelligently handle phone calls. It allows restaurants to answer questions, take reservations, and provide a seamless customer experience. The tool is built to understand different accents, answer common questions, and manage reservations efficiently. Slang.ai helps restaurants increase reservations, improve ROI, and save time by handling calls 24/7. It offers customizable experiences, real-time analytics, and easy setup within minutes.
GlobalSeo
GlobalSeo is an AI-powered website translation tool that enables users to easily translate their websites into 93 languages, reach a global audience, and improve SEO rankings. The tool offers automatic translation, website localization, and integration with popular search engines. Users can sign up for free, get a demo, and start translating their websites in under 5 minutes. GlobalSeo simplifies the translation process, saves time, and helps businesses expand their online presence globally.
Tolgee
Tolgee is a smart platform for painless localization, offering fast integration and dev tools for translation assistance, autonomous translation, and collaboration. It supports in-context translating, translation memory, machine translations, auto translation, and activity log features. Tolgee is used by 8500+ users worldwide and provides integrations for all major JS frameworks.
EBI.AI
EBI.AI is a customer service AI assistant that can help businesses with a variety of tasks, such as answering customer questions, resolving issues, and providing support. It is a self-serve platform that allows businesses to create and launch their own AI assistant in minutes. EBI.AI also offers a range of features, such as natural language processing, human-in-the-loop support, and integrations with other business systems. With EBI.AI, businesses can improve customer satisfaction, reduce costs, and increase efficiency.
Instabot
Instabot is a chatbot software that helps businesses generate leads, qualify leads, book appointments, and engage with customers. It is an easy-to-use platform that can be launched on a website in 20 minutes or less. Instabot offers a variety of features, including templates, a drag-and-drop interface, and analytics. It also integrates with a variety of platforms, including Facebook Messenger, Google Calendar, and Office 365.
Juji
Juji is a no-code AI chatbot builder that allows users to create and manage custom cognitive AI assistants without any coding required. Juji's cognitive AI assistants are powered with advanced human soft skills, such as active listening and reading between the lines, which enables them to automate complex human interactions empathetically and responsibly. Juji is accessible to every organization who needs it, and it enables non-IT professionals to create and manage custom cognitive AI assistants with no code, similar to using Powerpoint or Excel.
Prolific
Prolific is a platform that helps users quickly find research participants they can trust. It offers free representative samples, a participant pool of domain experts, the ability to bring your own participants, and an API for integration. Prolific ensures data quality by verifying participants with bank-grade ID checks, ongoing checks to identify bots, and no AI participants. The platform allows users to easily set up accounts, access rich and comprehensive responses, and scale research projects efficiently.
Glencoco
Glencoco is a tech-enabled sales marketplace that empowers businesses to become fractional sales representatives. The platform offers AI-enabled SDRs on a pay-for-performance basis, helping businesses grow their pipeline by finding the right prospects and maximizing ROI. Glencoco provides insights on prospect responses, integrates dialing and email solutions, and allows users to set up campaigns, select sales development reps, and optimize results. The platform combines human contractors with AI workflows to deliver successful outbound sales motions effortlessly.
Candor
Candor is an AI-powered team feedback platform that helps businesses improve team culture and performance. It offers a range of features including team retrospectives, check-ins, anonymous feedback, 1:1s, and 360 surveys. Candor's AI-driven insights help businesses identify and address issues within their teams, and its user-friendly interface makes it easy to set up and use. Candor is a valuable tool for any business looking to improve team communication, collaboration, and productivity.
UseCredits
UseCredits is a hassle-free credit-based billing integration that allows you to easily add credit-based billing to your Stripe or Paddle accounts. With UseCredits, you can set credits for your products, get automatic account top-ups, transparent account statements, analytics, and much more. UseCredits is flexible and un-opinionated, making it suitable for a variety of use cases, including generative AI SaaS, email or SMS API, and games and entertainment.
ColdIQ
ColdIQ is an AI-powered sales prospecting tool that helps B2B companies with revenue above $100k/month to build outbound systems that sell for them. The tool offers end-to-end cold outreach campaign setup and management, email infrastructure setup and warmup, audience research and targeting, data scraping and enrichment, campaigns optimization, sending automation, sales systems implementation, training on tools best practices, sales tools recommendations, free gap analysis, sales consulting, and copywriting frameworks. ColdIQ leverages AI to tailor messaging to each prospect, automate outreach, and flood calendars with opportunities.
20 - Open Source AI Tools
ai-cli-lib
The ai-cli-lib is a library designed to enhance interactive command-line editing programs by integrating with GPT large language model servers. It allows users to obtain AI help from servers like Anthropic's or OpenAI's, or a llama.cpp server. The library acts as a command line copilot, providing natural language prompts and responses to enhance user experience and productivity. It supports various platforms such as Debian GNU/Linux, macOS, and Cygwin, and requires specific packages for installation and operation. Users can configure the library to activate during shell startup and interact with command-line programs like bash, mysql, psql, gdb, sqlite3, and bc. Additionally, the library provides options for configuring API keys, setting up llama.cpp servers, and ensuring data privacy by managing context settings.
OpenAI-sublime-text
The OpenAI Completion plugin for Sublime Text provides first-class code assistant support within the editor. It utilizes LLM models to manipulate code, engage in chat mode, and perform various tasks. The plugin supports OpenAI, llama.cpp, and ollama models, allowing users to customize their AI assistant experience. It offers separated chat histories and assistant settings for different projects, enabling context-specific interactions. Additionally, the plugin supports Markdown syntax with code language syntax highlighting, server-side streaming for faster response times, and proxy support for secure connections. Users can configure the plugin's settings to set their OpenAI API key, adjust assistant modes, and manage chat history. Overall, the OpenAI Completion plugin enhances the Sublime Text editor with powerful AI capabilities, streamlining coding workflows and fostering collaboration with AI assistants.
TinyLLM
TinyLLM is a project that helps build a small locally hosted language model with a web interface using consumer-grade hardware. It supports multiple language models, builds a local OpenAI API web service, and serves a Chatbot web interface with customizable prompts. The project requires specific hardware and software configurations for optimal performance. Users can run a local language model using inference servers like vLLM, llama-cpp-python, and Ollama. The Chatbot feature allows users to interact with the language model through a web-based interface, supporting features like summarizing websites, displaying news headlines, stock prices, weather conditions, and using vector databases for queries.
chat-ui
A chat interface using open source models, eg OpenAssistant or Llama. It is a SvelteKit app and it powers the HuggingChat app on hf.co/chat.
whisper_dictation
Whisper Dictation is a fast, offline, privacy-focused tool for voice typing, AI voice chat, voice control, and translation. It allows hands-free operation, launching and controlling apps, and communicating with OpenAI ChatGPT or a local chat server. The tool also offers the option to speak answers out loud and draw pictures. It includes client and server versions, inspired by the Star Trek series, and is designed to keep data off the internet and confidential. The project is optimized for dictation and translation tasks, with voice control capabilities and AI image generation using stable-diffusion API.
erag
ERAG is an advanced system that combines lexical, semantic, text, and knowledge graph searches with conversation context to provide accurate and contextually relevant responses. This tool processes various document types, creates embeddings, builds knowledge graphs, and uses this information to answer user queries intelligently. It includes modules for interacting with web content, GitHub repositories, and performing exploratory data analysis using various language models.
LARS
LARS is an application that enables users to run Large Language Models (LLMs) locally on their devices, upload their own documents, and engage in conversations where the LLM grounds its responses with the uploaded content. The application focuses on Retrieval Augmented Generation (RAG) to increase accuracy and reduce AI-generated inaccuracies. LARS provides advanced citations, supports various file formats, allows follow-up questions, provides full chat history, and offers customization options for LLM settings. Users can force enable or disable RAG, change system prompts, and tweak advanced LLM settings. The application also supports GPU-accelerated inferencing, multiple embedding models, and text extraction methods. LARS is open-source and aims to be the ultimate RAG-centric LLM application.
paper-qa
PaperQA is a minimal package for question and answering from PDFs or text files, providing very good answers with in-text citations. It uses OpenAI Embeddings to embed and search documents, and follows a process of embedding docs and queries, searching for top passages, creating summaries, scoring and selecting relevant summaries, putting summaries into prompt, and generating answers. Users can customize prompts and use various models for embeddings and LLMs. The tool can be used asynchronously and supports adding documents from paths, files, or URLs.
anything-llm
AnythingLLM is a full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions.
llama-on-lambda
This project provides a proof of concept for deploying a scalable, serverless LLM Generative AI inference engine on AWS Lambda. It leverages the llama.cpp project to enable the usage of more accessible CPU and RAM configurations instead of limited and expensive GPU capabilities. By deploying a container with the llama.cpp converted models onto AWS Lambda, this project offers the advantages of scale, minimizing cost, and maximizing compute availability. The project includes AWS CDK code to create and deploy a Lambda function leveraging your model of choice, with a FastAPI frontend accessible from a Lambda URL. It is important to note that you will need ggml quantized versions of your model and model sizes under 6GB, as your inference RAM requirements cannot exceed 9GB or your Lambda function will fail.
LLMUnity
LLM for Unity enables seamless integration of Large Language Models (LLMs) within the Unity engine, allowing users to create intelligent characters for immersive player interactions. The tool supports major LLM models, runs locally without internet access, offers fast inference on CPU and GPU, and is easy to set up with a single line of code. It is free for both personal and commercial use, tested on Unity 2021 LTS, 2022 LTS, and 2023. Users can build multiple AI characters efficiently, use remote servers for processing, and customize model settings for text generation.
kantv
KanTV is an open-source project that focuses on studying and practicing state-of-the-art AI technology in real applications and scenarios, such as online TV playback, transcription, translation, and video/audio recording. It is derived from the original ijkplayer project and includes many enhancements and new features, including: * Watching online TV and local media using a customized FFmpeg 6.1. * Recording online TV to automatically generate videos. * Studying ASR (Automatic Speech Recognition) using whisper.cpp. * Studying LLM (Large Language Model) using llama.cpp. * Studying SD (Text to Image by Stable Diffusion) using stablediffusion.cpp. * Generating real-time English subtitles for English online TV using whisper.cpp. * Running/experiencing LLM on Xiaomi 14 using llama.cpp. * Setting up a customized playlist and using the software to watch the content for R&D activity. * Refactoring the UI to be closer to a real commercial Android application (currently only supports English). Some goals of this project are: * To provide a well-maintained "workbench" for ASR researchers interested in practicing state-of-the-art AI technology in real scenarios on mobile devices (currently focusing on Android). * To provide a well-maintained "workbench" for LLM researchers interested in practicing state-of-the-art AI technology in real scenarios on mobile devices (currently focusing on Android). * To create an Android "turn-key project" for AI experts/researchers (who may not be familiar with regular Android software development) to focus on device-side AI R&D activity, where part of the AI R&D activity (algorithm improvement, model training, model generation, algorithm validation, model validation, performance benchmark, etc.) can be done very easily using Android Studio IDE and a powerful Android phone.
resonance
Resonance is a framework designed to facilitate interoperability and messaging between services in your infrastructure and beyond. It provides AI capabilities and takes full advantage of asynchronous PHP, built on top of Swoole. With Resonance, you can: * Chat with Open-Source LLMs: Create prompt controllers to directly answer user's prompts. LLM takes care of determining user's intention, so you can focus on taking appropriate action. * Asynchronous Where it Matters: Respond asynchronously to incoming RPC or WebSocket messages (or both combined) with little overhead. You can set up all the asynchronous features using attributes. No elaborate configuration is needed. * Simple Things Remain Simple: Writing HTTP controllers is similar to how it's done in the synchronous code. Controllers have new exciting features that take advantage of the asynchronous environment. * Consistency is Key: You can keep the same approach to writing software no matter the size of your project. There are no growing central configuration files or service dependencies registries. Every relation between code modules is local to those modules. * Promises in PHP: Resonance provides a partial implementation of Promise/A+ spec to handle various asynchronous tasks. * GraphQL Out of the Box: You can build elaborate GraphQL schemas by using just the PHP attributes. Resonance takes care of reusing SQL queries and optimizing the resources' usage. All fields can be resolved asynchronously.
AMD-AI
AMD-AI is a repository containing detailed instructions for installing, setting up, and configuring ROCm on Ubuntu systems with AMD GPUs. The repository includes information on installing various tools like Stable Diffusion, ComfyUI, and Oobabooga for tasks like text generation and performance tuning. It provides guidance on adding AMD GPU package sources, installing ROCm-related packages, updating system packages, and finding graphics devices. The instructions are aimed at users with AMD hardware looking to set up their Linux systems for AI-related tasks.
llm
The 'llm' package for Emacs provides an interface for interacting with Large Language Models (LLMs). It abstracts functionality to a higher level, concealing API variations and ensuring compatibility with various LLMs. Users can set up providers like OpenAI, Gemini, Vertex, Claude, Ollama, GPT4All, and a fake client for testing. The package allows for chat interactions, embeddings, token counting, and function calling. It also offers advanced prompt creation and logging capabilities. Users can handle conversations, create prompts with placeholders, and contribute by creating providers.
ezlocalai
ezlocalai is an artificial intelligence server that simplifies running multimodal AI models locally. It handles model downloading and server configuration based on hardware specs. It offers OpenAI Style endpoints for integration, voice cloning, text-to-speech, voice-to-text, and offline image generation. Users can modify environment variables for customization. Supports NVIDIA GPU and CPU setups. Provides demo UI and workflow visualization for easy usage.
local-talking-llm
The 'local-talking-llm' repository provides a tutorial on building a voice assistant similar to Jarvis or Friday from Iron Man movies, capable of offline operation on a computer. The tutorial covers setting up a Python environment, installing necessary libraries like rich, openai-whisper, suno-bark, langchain, sounddevice, pyaudio, and speechrecognition. It utilizes Ollama for Large Language Model (LLM) serving and includes components for speech recognition, conversational chain, and speech synthesis. The implementation involves creating a TextToSpeechService class for Bark, defining functions for audio recording, transcription, LLM response generation, and audio playback. The main application loop guides users through interactive voice-based conversations with the assistant.
llmware
LLMWare is a framework for quickly developing LLM-based applications including Retrieval Augmented Generation (RAG) and Multi-Step Orchestration of Agent Workflows. This project provides a comprehensive set of tools that anyone can use - from a beginner to the most sophisticated AI developer - to rapidly build industrial-grade, knowledge-based enterprise LLM applications. Our specific focus is on making it easy to integrate open source small specialized models and connecting enterprise knowledge safely and securely.
sql-eval
This repository contains the code that Defog uses for the evaluation of generated SQL. It's based off the schema from the Spider, but with a new set of hand-selected questions and queries grouped by query category. The testing procedure involves generating a SQL query, running both the 'gold' query and the generated query on their respective database to obtain dataframes with the results, comparing the dataframes using an 'exact' and a 'subset' match, logging these alongside other metrics of interest, and aggregating the results for reporting. The repository provides comprehensive instructions for installing dependencies, starting a Postgres instance, importing data into Postgres, importing data into Snowflake, using private data, implementing a query generator, and running the test with different runners.
20 - OpenAI Gpts
Calendar and email Assistant
Your expert assistant for Google Calendar and gmail tasks, integrated with Zapier (works with free plan). Supports: list, add, update events to calendar, send gmail. You will be prompted to configure zapier actions when set up initially. Conversation data is not used for openai training.
Tech Support Advisor
From setting up a printer to troubleshooting a device, Iām here to help you step-by-step.
Sysadmin
I help you with all your sysadmin tasks, from setting up your server to scaling your already exsisting one. I can help you with understanding the long list of log files and give you solutions to the problems.
GA4 Implementation Assistant
A helper for implementing Google Analytics 4 with tips and troubleshooting.
Bun Nook Kit App Builder
Expert in BNK server setup, typesafe routes, htmlody, and creating SQLite schemas with BNK.
NetMaster Pro šš ļø
Your AI network guru for setup and fixing connectivity woes! š Assists with network configurations, troubleshooting, and optimizes your internet experience. š»āØ
Dialysis Assistant
Home Hemodialysis Helper for NxStage system. Step-by-step guidance, help for tricky situations, and voice interaction recommended.