Best AI tools for< Improve Rag Performance >
20 - AI tool Sites
Langtrace AI
Langtrace AI is an open-source observability tool powered by Scale3 Labs that helps monitor, evaluate, and improve LLM (Large Language Model) applications. It collects and analyzes traces and metrics to provide insights into the ML pipeline, ensuring security through SOC 2 Type II certification. Langtrace supports popular LLMs, frameworks, and vector databases, offering end-to-end observability and the ability to build and deploy AI applications with confidence.
Cloudflare
Cloudflare is a platform that offers a range of products and services to help improve website performance, security, and reliability. It provides solutions such as web analytics, troubleshooting errors, domain registration, and content delivery network services. Cloudflare also offers developer products like Workers and AI products like RAG Workers, AI Vectorize, and AI Gateway. The platform aims to simplify website management and enhance user experience by leveraging cloud-based technologies.
N/A
The website seems to be experiencing technical difficulties as indicated by the error message '502 Bad Gateway'. This error typically occurs when a server acting as a gateway or proxy receives an invalid response from an upstream server. The message 'openresty' suggests that the server may be using the OpenResty web platform. Users encountering a 502 Bad Gateway error may need to wait for the issue to be resolved by the website's administrators or try accessing the site at a later time.
Vellum AI
Vellum AI is an AI platform that supports using Microsoft Azure hosted OpenAI models. It offers tools for prompt engineering, semantic search, prompt chaining, evaluations, and monitoring. Vellum enables users to build AI systems with features like workflow automation, document analysis, fine-tuning, Q&A over documents, intent classification, summarization, vector search, chatbots, blog generation, sentiment analysis, and more. The platform is backed by top VCs and founders of well-known companies, providing a complete solution for building LLM-powered applications.
Pongo
Pongo is an AI-powered tool that helps reduce hallucinations in Large Language Models (LLMs) by up to 80%. It utilizes multiple state-of-the-art semantic similarity models and a proprietary ranking algorithm to ensure accurate and relevant search results. Pongo integrates seamlessly with existing pipelines, whether using a vector database or Elasticsearch, and processes top search results to deliver refined and reliable information. Its distributed architecture ensures consistent latency, handling a wide range of requests without compromising speed. Pongo prioritizes data security, operating at runtime with zero data retention and no data leaving its secure AWS VPC.
Glean
Glean is an AI-powered work assistant and enterprise search platform that enables teams to harness generative AI to make better decisions faster. It connects all company data, provides advanced personalization, and ensures retrieval of the most relevant information. Glean offers responsible AI solutions that scale to businesses, respecting permissions and providing secure, private, and fully referenceable answers. With turnkey deployment and a variety of platform tools, Glean helps teams move faster and be more productive.
Trieve
Trieve is an AI-first infrastructure API that offers a modern solution for search, recommendations, and RAG (Retrieve and Generate) tasks. It combines language models with tools for fine-tuning ranking and relevance, providing production-ready capabilities for building search, discovery, and RAG experiences. Trieve supports semantic vector search, full-text search using BM25 & SPLADE models, custom embedding models, hybrid search, and sub-sentence highlighting. With features like merchandising, relevance tuning, and self-hostable options, Trieve empowers companies to enhance their search capabilities and user experiences.
Simbo AI
Simbo AI is a Gen AI platform designed for healthcare enterprises, offering autonomous applications for medical practice automation. It combines LLM and symbolic knowledge bases to provide hallucination-free responses. The platform is fully controllable, consistent, secure, and responsible, ensuring accurate and reliable AI interactions. Simbo AI utilizes Symbolic RAG technology with Lossless NLU for exact search and fact-checking capabilities. It aims to automate medical practices, reduce costs, and improve patient care, ultimately enhancing the lives of doctors and patients.
Kore.ai
Kore.ai is an AI application that offers a comprehensive platform for generating value from AI across various business use cases. It provides a range of solutions such as generative AI applications, RAG/LLM-based search, and experience optimization. The platform enables users to automate processes, enhance productivity, and improve business operations using advanced AI technologies. Kore.ai caters to industries like financial services, healthcare, retail, and manufacturing, delivering human-like interactions through virtual assistants and AI models.
Clarifai
Clarifai is an AI Workflow Orchestration Platform that helps businesses establish an AI Operating Model and transition from prototype to production efficiently. It offers end-to-end solutions for operationalizing AI, including Retrieval Augmented Generation (RAG), Generative AI, Digital Asset Management, Visual Inspection, Automated Data Labeling, and Content Moderation. Clarifai's platform enables users to build and deploy AI faster, reduce development costs, ensure oversight and security, and unlock AI capabilities across the organization. The platform simplifies data labeling, content moderation, intelligence & surveillance, generative AI, content organization & personalization, and visual inspection. Trusted by top enterprises, Clarifai helps companies overcome challenges in hiring AI talent and misuse of data, ultimately leading to AI success at scale.
QOVES
QOVES is a website that provides tools and advice to help people improve their looks. The website offers a variety of services, including facial analysis, hairline design, style advice, and Photoshop retouching. QOVES also has a blog with articles on a variety of topics related to beauty and aesthetics.
Zevi
Zevi is an AI-powered site search and discovery platform that helps businesses improve their website search and chat experience. It offers a range of features including neural search, chat assistant, merchandising, and analytics. Zevi's AI-driven technology helps businesses understand their customers' queries and provide them with the most relevant results. It also helps businesses create a more personalized and conversational shopping experience for their customers.
FCK.School
FCK.School is an online platform that provides AI-powered writing tools to help students with various aspects of their academic work, including creating outlines, generating essay conclusions, crafting thesis statements, and much more. It offers a range of tools such as paraphrasing, text generation, summarizing, grammar and punctuation correction, title generation, thesis statement generation, outline generation, intro generation, paragraph generation, and conclusion generation. FCK.School aims to improve writing skills, save time and effort, and enhance the quality of written content.
Cyberday.ai
Cyberday.ai is an AI-powered platform designed to help organizations improve and certify their cybersecurity. The platform offers a comprehensive set of tools and resources to guide users in implementing security tasks, creating policies, and generating compliance reports. With a focus on automation and efficiency, Cyberday.ai streamlines the process of managing information security, from risk assessment to employee training. By leveraging AI technology, Cyberday.ai aims to simplify the complex task of cybersecurity management for organizations of all sizes.
Neurala
Neurala is a company that provides visual quality inspection software powered by AI. Their software is designed to help manufacturers improve their inspection process by reducing product defects, increasing inspection rates, and preventing production downtime. Neurala's software is flexible and can be easily retrofitted into existing production line infrastructure, without the need for AI experts or expensive capital expenditures. The company's software is used by a variety of manufacturers, including Sony, AITRIOS, and CB Insights.
Textbuddy
Textbuddy is an online writing editor that uses AI to analyze and improve your writing. It highlights potential flaws in your text, such as long sentences, complex words, formatting errors, passive voice, adverbs, filler and vague words. The tool also provides synonyms, CEFR language levels, its own readability score, text statistics, and a keyword list for your text. Textbuddy's AI writing assistant can help you with grammar, spelling, and punctuation, and can even (re-)write text in plain language. You can use the AI assistant to overcome writer's block, create headlines and summaries, paraphrase text, analyze text sentiment, or just let it write for you as a copywriting AI.
kOS
Helper Systems has developed technology that restores the trust between students who want to use AI tools for research and faculty who need to ensure academic integrity. With kOS (pronounced chaos), students can easily provide proof of work using a platform that significantly simplifies and enhances the research process in ways never before possible. Add PDF files from your desktop, shared drives or the web. Annotate them if you desire. Use AI responsibly, knowing when information is generated from your research vs. the web. Instantly create a presentation of all your resources. Share and prove your work. Try other cool features that offer a unique way to find, organize, discover, archive, and present information.
DesignRoasts
DesignRoasts is a web-based tool that provides personalized AI insights to help you optimize your website or app. Simply upload a screenshot of your product and select your goal (e.g., increase conversions, improve onboarding, etc.), and DesignRoasts will generate a list of actionable feedback tailored to your specific needs. The feedback focuses on improving the user experience, visual design, copywriting, and more.
PEAR Health Labs
PEAR Health Labs is an AI-powered adaptive digital coaching software that offers hyperpersonalized coaching solutions to help individuals move smarter, get healthier, and live happier. The platform utilizes AI technology and physiological science to create personalized fitness experiences based on wearables and biometric feedback. PEAR Training Intelligence aims to change the landscape of preventative care by identifying health issues before they become problems. The software is cost-effective, scalable, and provides guidance on physical activity to improve overall health outcomes.
LangWatch
LangWatch is a monitoring and analytics tool for Generative AI (GenAI) solutions. It provides detailed evaluations of the faithfulness and relevancy of GenAI responses, coupled with user feedback insights. LangWatch is designed for both technical and non-technical users to collaborate and comment on improvements. With LangWatch, you can understand your users, detect issues, and improve your GenAI products.
20 - Open Source AI Tools
LLMLingua
LLMLingua is a tool that utilizes a compact, well-trained language model to identify and remove non-essential tokens in prompts. This approach enables efficient inference with large language models, achieving up to 20x compression with minimal performance loss. The tool includes LLMLingua, LongLLMLingua, and LLMLingua-2, each offering different levels of prompt compression and performance improvements for tasks involving large language models.
tonic_validate
Tonic Validate is a framework for the evaluation of LLM outputs, such as Retrieval Augmented Generation (RAG) pipelines. Validate makes it easy to evaluate, track, and monitor your LLM and RAG applications. Validate allows you to evaluate your LLM outputs through the use of our provided metrics which measure everything from answer correctness to LLM hallucination. Additionally, Validate has an optional UI to visualize your evaluation results for easy tracking and monitoring.
LARS
LARS is an application that enables users to run Large Language Models (LLMs) locally on their devices, upload their own documents, and engage in conversations where the LLM grounds its responses with the uploaded content. The application focuses on Retrieval Augmented Generation (RAG) to increase accuracy and reduce AI-generated inaccuracies. LARS provides advanced citations, supports various file formats, allows follow-up questions, provides full chat history, and offers customization options for LLM settings. Users can force enable or disable RAG, change system prompts, and tweak advanced LLM settings. The application also supports GPU-accelerated inferencing, multiple embedding models, and text extraction methods. LARS is open-source and aims to be the ultimate RAG-centric LLM application.
awesome-deliberative-prompting
The 'awesome-deliberative-prompting' repository focuses on how to ask Large Language Models (LLMs) to produce reliable reasoning and make reason-responsive decisions through deliberative prompting. It includes success stories, prompting patterns and strategies, multi-agent deliberation, reflection and meta-cognition, text generation techniques, self-correction methods, reasoning analytics, limitations, failures, puzzles, datasets, tools, and other resources related to deliberative prompting. The repository provides a comprehensive overview of research, techniques, and tools for enhancing reasoning capabilities of LLMs.
vectordb-recipes
This repository contains examples, applications, starter code, & tutorials to help you kickstart your GenAI projects. * These are built using LanceDB, a free, open-source, serverless vectorDB that **requires no setup**. * It **integrates into python data ecosystem** so you can simply start using these in your existing data pipelines in pandas, arrow, pydantic etc. * LanceDB has **native Typescript SDK** using which you can **run vector search** in serverless functions! This repository is divided into 3 sections: - Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! - Applications - Ready to use Python and web apps using applied LLMs, VectorDB and GenAI tools - Tutorials - A curated list of tutorials, blogs, Colabs and courses to get you started with GenAI in greater depth.
langroid
Langroid is a Python framework that makes it easy to build LLM-powered applications. It uses a multi-agent paradigm inspired by the Actor Framework, where you set up Agents, equip them with optional components (LLM, vector-store and tools/functions), assign them tasks, and have them collaboratively solve a problem by exchanging messages. Langroid is a fresh take on LLM app-development, where considerable thought has gone into simplifying the developer experience; it does not use Langchain.
deepeval
DeepEval is a simple-to-use, open-source LLM evaluation framework specialized for unit testing LLM outputs. It incorporates various metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., and runs locally on your machine for evaluation. It provides a wide range of ready-to-use evaluation metrics, allows for creating custom metrics, integrates with any CI/CD environment, and enables benchmarking LLMs on popular benchmarks. DeepEval is designed for evaluating RAG and fine-tuning applications, helping users optimize hyperparameters, prevent prompt drifting, and transition from OpenAI to hosting their own Llama2 with confidence.
UFO
UFO is a UI-focused dual-agent framework to fulfill user requests on Windows OS by seamlessly navigating and operating within individual or spanning multiple applications.
RAG-FiT
RAG-FiT is a library designed to improve Language Models' ability to use external information by fine-tuning models on specially created RAG-augmented datasets. The library assists in creating training data, training models using parameter-efficient finetuning (PEFT), and evaluating performance using RAG-specific metrics. It is modular, customizable via configuration files, and facilitates fast prototyping and experimentation with various RAG settings and configurations.
RAGFoundry
RAG Foundry is a library designed to enhance Large Language Models (LLMs) by fine-tuning models on RAG-augmented datasets. It helps create training data, train models using parameter-efficient finetuning (PEFT), and measure performance using RAG-specific metrics. The library is modular, customizable using configuration files, and facilitates prototyping with various RAG settings and configurations for tasks like data processing, retrieval, training, inference, and evaluation.
RAGHub
RAGHub is a community-driven project focused on cataloging new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. It aims to help users stay ahead of changes in the field by providing a platform for the latest innovations in RAG. The repository includes information on RAG frameworks, evaluation frameworks, optimization frameworks, citation frameworks, engines, search reranker frameworks, projects, resources, and real-world use cases across industries and professions.
llmware
LLMWare is a framework for quickly developing LLM-based applications including Retrieval Augmented Generation (RAG) and Multi-Step Orchestration of Agent Workflows. This project provides a comprehensive set of tools that anyone can use - from a beginner to the most sophisticated AI developer - to rapidly build industrial-grade, knowledge-based enterprise LLM applications. Our specific focus is on making it easy to integrate open source small specialized models and connecting enterprise knowledge safely and securely.
rageval
Rageval is an evaluation tool for Retrieval-augmented Generation (RAG) methods. It helps evaluate RAG systems by performing tasks such as query rewriting, document ranking, information compression, evidence verification, answer generation, and result validation. The tool provides metrics for answer correctness and answer groundedness, along with benchmark results for ASQA and ALCE datasets. Users can install and use Rageval to assess the performance of RAG models in question-answering tasks.
ai-notes
Notes on AI state of the art, with a focus on generative and large language models. These are the "raw materials" for the https://lspace.swyx.io/ newsletter. This repo used to be called https://github.com/sw-yx/prompt-eng, but was renamed because Prompt Engineering is Overhyped. This is now an AI Engineering notes repo.
athina-evals
Athina is an open-source library designed to help engineers improve the reliability and performance of Large Language Models (LLMs) through eval-driven development. It offers plug-and-play preset evals for catching and preventing bad outputs, measuring model performance, running experiments, A/B testing models, detecting regressions, and monitoring production data. Athina provides a solution to the flaws in current LLM developer workflows by offering rapid experimentation, customizable evaluators, integrated dashboard, consistent metrics, historical record tracking, and easy setup. It includes preset evaluators for RAG applications and summarization accuracy, as well as the ability to write custom evals. Athina's evals can run on both development and production environments, providing consistent metrics and removing the need for manual infrastructure setup.
rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool that helps you conduct experiments and evaluations using Azure AI Search and RAG pattern. It offers a rich set of features, including experiment setup, integration with Azure AI Search, Azure Machine Learning, MLFlow, and Azure OpenAI, multiple document chunking strategies, query generation, multiple search types, sub-querying, re-ranking, metrics and evaluation, report generation, and multi-lingual support. The tool is designed to make it easier and faster to run experiments and evaluations of search queries and quality of response from OpenAI, and is useful for researchers, data scientists, and developers who want to test the performance of different search and OpenAI related hyperparameters, compare the effectiveness of various search strategies, fine-tune and optimize parameters, find the best combination of hyperparameters, and generate detailed reports and visualizations from experiment results.
ragna
Ragna is a RAG orchestration framework designed for managing workflows and orchestrating tasks. It provides a comprehensive set of features for users to streamline their processes and automate repetitive tasks. With Ragna, users can easily create, schedule, and monitor workflows, making it an ideal tool for teams and individuals looking to improve their productivity and efficiency. The framework offers extensive documentation, community support, and a user-friendly interface, making it accessible to users of all skill levels. Whether you are a developer, data scientist, or project manager, Ragna can help you simplify your workflow management and boost your overall performance.
LLaMa2lang
LLaMa2lang is a repository containing convenience scripts to finetune LLaMa3-8B (or any other foundation model) for chat towards any language that isn't English. The repository aims to improve the performance of LLaMa3 for non-English languages by combining fine-tuning with RAG. Users can translate datasets, extract threads, turn threads into prompts, and finetune models using QLoRA and PEFT. Additionally, the repository supports translation models like OPUS, M2M, MADLAD, and base datasets like OASST1 and OASST2. The process involves loading datasets, translating them, combining checkpoints, and running inference using the newly trained model. The repository also provides benchmarking scripts to choose the right translation model for a target language.
20 - OpenAI Gpts
UX & UI
Gives you tips and suggestions on how you can improve your application for your users.
Memory Enhancer
Offers exercises and techniques to improve memory retention and cognitive functions.
English Conversation Role Play Creator
Generates conversation examples and chunks for specified situations. Improve your instantaneous conversational skills through repetitive practice!
Customer Retention Consultant
Analyzes customer churn and provides strategies to improve loyalty and retention.
Agile Coach Expert
Agile expert providing practical, step-by-step advice with the agile way of working of your team and organisation. Whether you're looking to improve your Agile skills or find solutions to specific problems. Including Scrum, Kanban and SAFe knowledge.
Kemi - Research & Creative Assistant
I improve marketing effectiveness by designing stunning research-led assets in a flash!
Quickest Feedback for Language Learner
Helps improve language skills through interactive scenarios and feedback.
Le VPN - Your Secure Internet Proxy
Bypass Internet censorship & improve your security online
実践スキルが身につく営業ロールプレイング:【エキスパートクラス】
実践スキル向上のための対話型学習アシスタント (Interactive learning assistant to improve practical skills)
Your personal GRC & Security Tutor
A training tool for infosec professionals to improve their skills in GRC & security and help obtain related certifications.
Anna, the Ethical Essay Guide
Guides in structuring essays to improve writing skills, adapting to skill levels.
MetaGPT : Meta Ads AI Marketing Co-Pilot
Expert in Meta advertising that can improve your ROI. Official Meta GPT built by dicer.ai