Best AI tools for< Track Quality Performance >
20 - AI tool Sites

Navina AI
Navina AI is a clinician-first AI platform designed to streamline patient care by providing key insights and actionable recommendations to healthcare providers, ACOs, MSOs, and health plans. The platform leverages proprietary AI technology to improve clinical decision-making, reduce administrative burden, and enhance quality management and risk adjustment processes. Navina AI offers efficient chart review, accurate risk adjustment, streamlined quality management, robust analytics, and a user-friendly interface that integrates seamlessly into the clinical workflow.

MacWhisper
MacWhisper is a native macOS application that utilizes OpenAI's Whisper technology for transcribing audio files into text. It offers a user-friendly interface for recording, transcribing, and editing audio, making it suitable for various use cases such as transcribing meetings, lectures, interviews, and podcasts. The application is designed to protect user privacy by performing all transcriptions locally on the device, ensuring that no data leaves the user's machine.

Perspect
Perspect is an AI-powered platform designed for high-performance software teams. It offers real-time insights into team contributions and impact, optimizing developer experience, and rewarding high-performers. With 50+ integrations, Perspect enables visualization of impact, benchmarking performance, and uses machine learning models to identify and eliminate blockers. The platform is deeply integrated with web3 wallets and offers built-in reward mechanisms. Managers can align resources around crucial KPIs, identify top talent, and prevent burnout. Perspect aims to enhance team productivity and employee retention through AI and ML technologies.

ReachifyMe
ReachifyMe is an AI-powered LinkedIn post generator that helps users create high-quality LinkedIn posts in under 7 minutes. The tool provides users with ready-made post ideas, viral post templates, and a content editor to help them craft posts that stand out in the crowded LinkedIn landscape. ReachifyMe also offers a content planner that allows users to save and schedule their posts, as well as track their performance.

Bidmatic.io
Bidmatic.io is a publisher-centric monetization platform that helps publishers maximize their revenue through programmatic advertising. It offers a range of features including header bidding, programmatic direct sales, and access to premium demand partners. Bidmatic.io's AI-powered optimization technology helps publishers select the best set of partners for each auction in real-time, maximizing yield and ad quality. The platform also provides detailed and transparent reporting, allowing publishers to track their performance and identify optimization opportunities.

Bugasura
Bugasura is an AI-enabled bug management tool designed for fast-moving, modern technology teams. It offers features like issue tracking, bug reporting, performance monitoring, integrations, and API documentation. With intelligent features powered by AI, Bugasura streamlines the bug tracking and resolution process, empowering teams to handle complex challenges efficiently. The tool provides custom workflows, automatic issue assignment, seamless exports and imports, visual bug reporters, and in-app bug reporting widgets. Bugasura also offers performance monitoring, advanced filtering options, cloud-based and on-premise deployment choices, pocket-friendly pricing, and robust privacy and security measures.

aqua
aqua is a comprehensive Quality Assurance (QA) management tool designed to streamline testing processes and enhance testing efficiency. It offers a wide range of features such as AI Copilot, bug reporting, test management, requirements management, user acceptance testing, and automation management. aqua caters to various industries including banking, insurance, manufacturing, government, tech companies, and medical sectors, helping organizations improve testing productivity, software quality, and defect detection ratios. The tool integrates with popular platforms like Jira, Jenkins, JMeter, and offers both Cloud and On-Premise deployment options. With AI-enhanced capabilities, aqua aims to make testing faster, more efficient, and error-free.

Presspool.ai
Presspool.ai is an AI-powered platform that offers high-intent cybersecurity leads through endorsements from top industry voices. It provides a network of cybersecurity influencers trusted by CTOs, CISOs, and decision-makers at Fortune 500 companies. The platform helps brands and publishers create campaigns, match with ideal influencers, and optimize marketing strategies using real-time analytics.

AnalyStock.ai
AnalyStock.ai is a financial application leveraging AI to provide users with a next-generation investment toolbox. It helps users better understand businesses, risks, and make informed investment decisions. The platform offers direct access to the stock market, powerful data-driven tools to build top-ranking portfolios, and insights into company valuations and growth prospects. AnalyStock.ai aims to optimize the investment process, offering a reliable strategy with factors like A-Score, factor investing scores for value, growth, quality, volatility, momentum, and yield. Users can discover hidden gems, fine-tune filters, access company scorecards, perform activity analysis, understand industry dynamics, evaluate capital structure, profitability, and peers' valuation. The application also provides adjustable DCF valuation, portfolio management tools, net asset value computation, monthly commentary, and an AI assistant for personalized insights and assistance.

Anecdote
Anecdote is a customer feedback analytics hub that leverages automated AI tagging and precision NLP clustering to help businesses uncover product insights, detect bugs, analyze competitor feedback, and provide real-time feedback alerts. The platform offers semantic search, survey analysis, and integrates with over 65 sources to deliver accurate clusters from customer feedback. Anecdote is used by top customer-centric companies to save time, improve customer experiences, and track feedback in multiple languages securely.

Pitch Patterns
Pitch Patterns is an AI-powered Quality Control platform designed for teams analyzing sales and customer service calls to enhance close rates and CSAT scores. The platform offers innovative features such as Social Skill Markers, Conversation Analysis, and AI Tracking to provide valuable insights and improve agent performance. With integration capabilities with popular CRM systems like Salesforce and Pipedrive, Pitch Patterns aims to revolutionize call center excellence through AI analytics.

Coval
Coval is an AI tool designed to help users ship reliable AI agents faster by providing simulation and evaluations for voice and chat agents. It allows users to simulate thousands of scenarios from a few test cases, create prompts for testing, and evaluate agent interactions comprehensively. Coval offers AI-powered simulations, voice AI compatibility, performance tracking, workflow metrics, and customizable evaluation metrics to optimize AI agents efficiently.

Gitlights
Gitlights is a powerful Git analytics tool that leverages AI and NLP algorithms to provide enriched insights on commits, pull requests, and developer skills. It empowers teams with advanced analytics and insights, revolutionizing the development process. Gitlights offers features such as insightful commits and pull requests dashboard, advanced developer skills analysis, strategic investment balance monitoring, collaborative developers map, and benchmarking comparison with other teams. With Gitlights, users can stay ahead with comparative data, receive smart notifications, and make informed decisions based on precise and detailed data. The tool aims to provide a holistic view of a development team's activity, driving strategic decision-making, continuous improvement, and excellence in collaboration.

Forethought
Forethought is a customer support AI platform that uses generative AI to automate tasks and improve efficiency. It offers a range of features including automatic ticket resolution, sentiment analysis, and agent assist. Forethought's platform is designed to help businesses save costs, improve customer satisfaction, and increase agent productivity.

Univw
Univw is an AI-powered sales CRM designed for start-ups and small businesses. It offers custom dashboards, reports, and analytics, along with features like automated call quality assurance, external automations, agent coaching using AI, and intelligent summarization of notes. Univw aims to enhance business efficiency by providing powerful features that streamline sales processes, improve data access control, integrate cloud telephony, bring data from external sources, offer a flexible workflow engine, and enable data visualization for better decision-making.

AI Clearing
AI Clearing is an AI-powered progress tracking tool with quality control reporting specifically designed for construction projects. It allows users to monitor progress, track production, and align with schedules seamlessly. The tool is fully driven by site and project data, providing actionable insights to boost teams' performance, save resources, and cut administrative and supervision time. AI Clearing is trusted by industry professionals and offers a comprehensive solution to overcome challenges in the construction industry.

MgrWorkbench.ai
MgrWorkbench.ai is an AI-powered business writing tool that helps users create employee performance reviews, development plans, and business letters. It uses natural language processing and machine learning to generate first drafts of these documents, allowing users to focus on feedback and insights rather than writing. MgrWorkbench.ai is designed to save managers time and improve the quality of their written communication.

AIby.email
AIby.email is an AI-powered email assistant that helps you write better emails, faster. It uses natural language processing to understand your intent and generate personalized email responses. AIby.email also offers a variety of other features, such as email scheduling, tracking, and analytics.

Bearkd Clicktracker
Bearkd Clicktracker is an adblock-proof AI tracking and attribution software designed to help e-commerce stores maximize profitability. It offers fully automated, customizable click fraud prevention and reporting, allowing users to identify and exclude invalid and wasteful traffic, including click fraud and bot attacks. Bearkd's AI-powered system analyzes every click against a wide range of markers to determine its legitimacy, ensuring accurate and reliable data for campaign optimization. With Bearkd, businesses can optimize for intent, enhance traffic quality, increase conversion rates, and protect their PPC budget from fraudulent activities.

Squirrly SEO
Squirrly SEO is an AI-powered, all-in-one SEO suite designed to help users improve their search engine optimization efforts. With over 650 features, Squirrly covers every key ranking factor from content quality to user experience. The tool provides expert AI guidance, advanced keyword research and analysis, content optimization suggestions, performance tracking, and easy integration with WordPress and other platforms. Squirrly aims to simplify SEO management and offers a fail-proof approach to optimizing websites for better search rankings.
20 - Open Source AI Tools

zenml
ZenML is an extensible, open-source MLOps framework for creating portable, production-ready machine learning pipelines. By decoupling infrastructure from code, ZenML enables developers across your organization to collaborate more effectively as they develop to production.

skyvern
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions. Traditional approaches to browser automations required writing custom scripts for websites, often relying on DOM parsing and XPath-based interactions which would break whenever the website layouts changed. Instead of only relying on code-defined XPath interactions, Skyvern adds computer vision and LLMs to the mix to parse items in the viewport in real-time, create a plan for interaction and interact with them. This approach gives us a few advantages: 1. Skyvern can operate on websites it’s never seen before, as it’s able to map visual elements to actions necessary to complete a workflow, without any customized code 2. Skyvern is resistant to website layout changes, as there are no pre-determined XPaths or other selectors our system is looking for while trying to navigate 3. Skyvern leverages LLMs to reason through interactions to ensure we can cover complex situations. Examples include: 1. If you wanted to get an auto insurance quote from Geico, the answer to a common question “Were you eligible to drive at 18?” could be inferred from the driver receiving their license at age 16 2. If you were doing competitor analysis, it’s understanding that an Arnold Palmer 22 oz can at 7/11 is almost definitely the same product as a 23 oz can at Gopuff (even though the sizes are slightly different, which could be a rounding error!) Want to see examples of Skyvern in action? Jump to #real-world-examples-of- skyvern

langfuse
Langfuse is a powerful tool that helps you develop, monitor, and test your LLM applications. With Langfuse, you can: * **Develop:** Instrument your app and start ingesting traces to Langfuse, inspect and debug complex logs, and manage, version, and deploy prompts from within Langfuse. * **Monitor:** Track metrics (cost, latency, quality) and gain insights from dashboards & data exports, collect and calculate scores for your LLM completions, run model-based evaluations, collect user feedback, and manually score observations in Langfuse. * **Test:** Track and test app behaviour before deploying a new version, test expected in and output pairs and benchmark performance before deploying, and track versions and releases in your application. Langfuse is easy to get started with and offers a generous free tier. You can sign up for Langfuse Cloud or deploy Langfuse locally or on your own infrastructure. Langfuse also offers a variety of integrations to make it easy to connect to your LLM applications.

bugbug
Bugbug is a tool developed by Mozilla that leverages machine learning techniques to assist with bug and quality management, as well as other software engineering tasks like test selection and defect prediction. It provides various classifiers to suggest assignees, detect patches likely to be backed-out, classify bugs, assign product/components, distinguish between bugs and feature requests, detect bugs needing documentation, identify invalid issues, verify bugs needing QA, detect regressions, select relevant tests, track bugs, and more. Bugbug can be trained and tested using Python scripts, and it offers the ability to run model training tasks on Taskcluster. The project structure includes modules for data mining, bug/commit feature extraction, model implementations, NLP utilities, label handling, bug history playback, and GitHub issue retrieval.

PromptFuzz
**Description:** PromptFuzz is an automated tool that generates high-quality fuzz drivers for libraries via a fuzz loop constructed on mutating LLMs' prompts. The fuzz loop of PromptFuzz aims to guide the mutation of LLMs' prompts to generate programs that cover more reachable code and explore complex API interrelationships, which are effective for fuzzing. **Features:** * **Multiply LLM support** : Supports the general LLMs: Codex, Inocder, ChatGPT, and GPT4 (Currently tested on ChatGPT). * **Context-based Prompt** : Construct LLM prompts with the automatically extracted library context. * **Powerful Sanitization** : The program's syntax, semantics, behavior, and coverage are thoroughly analyzed to sanitize the problematic programs. * **Prioritized Mutation** : Prioritizes mutating the library API combinations within LLM's prompts to explore complex interrelationships, guided by code coverage. * **Fuzz Driver Exploitation** : Infers API constraints using statistics and extends fixed API arguments to receive random bytes from fuzzers. * **Fuzz engine integration** : Integrates with grey-box fuzz engine: LibFuzzer. **Benefits:** * **High branch coverage:** The fuzz drivers generated by PromptFuzz achieved a branch coverage of 40.12% on the tested libraries, which is 1.61x greater than _OSS-Fuzz_ and 1.67x greater than _Hopper_. * **Bug detection:** PromptFuzz detected 33 valid security bugs from 49 unique crashes. * **Wide range of bugs:** The fuzz drivers generated by PromptFuzz can detect a wide range of bugs, most of which are security bugs. * **Unique bugs:** PromptFuzz detects uniquely interesting bugs that other fuzzers may miss. **Usage:** 1. Build the library using the provided build scripts. 2. Export the LLM API KEY if using ChatGPT or GPT4. 3. Generate fuzz drivers using the `fuzzer` command. 4. Run the fuzz drivers using the `harness` command. 5. Deduplicate and analyze the reported crashes. **Future Works:** * **Custom LLMs suport:** Support custom LLMs. * **Close-source libraries:** Apply PromptFuzz to close-source libraries by fine tuning LLMs on private code corpus. * **Performance** : Reduce the huge time cost required in erroneous program elimination.

pytest-evals
pytest-evals is a minimalistic pytest plugin designed to help evaluate the performance of Language Model (LLM) outputs against test cases. It allows users to test and evaluate LLM prompts against multiple cases, track metrics, and integrate easily with pytest, Jupyter notebooks, and CI/CD pipelines. Users can scale up by running tests in parallel with pytest-xdist and asynchronously with pytest-asyncio. The tool focuses on simplifying evaluation processes without the need for complex frameworks, keeping tests and evaluations together, and emphasizing logic over infrastructure.

tonic_validate
Tonic Validate is a framework for the evaluation of LLM outputs, such as Retrieval Augmented Generation (RAG) pipelines. Validate makes it easy to evaluate, track, and monitor your LLM and RAG applications. Validate allows you to evaluate your LLM outputs through the use of our provided metrics which measure everything from answer correctness to LLM hallucination. Additionally, Validate has an optional UI to visualize your evaluation results for easy tracking and monitoring.

langwatch
LangWatch is a monitoring and analytics platform designed to track, visualize, and analyze interactions with Large Language Models (LLMs). It offers real-time telemetry to optimize LLM cost and latency, a user-friendly interface for deep insights into LLM behavior, user analytics for engagement metrics, detailed debugging capabilities, and guardrails to monitor LLM outputs for issues like PII leaks and toxic language. The platform supports OpenAI and LangChain integrations, simplifying the process of tracing LLM calls and generating API keys for usage. LangWatch also provides documentation for easy integration and self-hosting options for interested users.

ai-game-development-tools
Here we will keep track of the AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥 * Tool (AI LLM) * Game (Agent) * Code * Framework * Writer * Image * Texture * Shader * 3D Model * Avatar * Animation * Video * Audio * Music * Singing Voice * Speech * Analytics * Video Tool

LLMEvaluation
The LLMEvaluation repository is a comprehensive compendium of evaluation methods for Large Language Models (LLMs) and LLM-based systems. It aims to assist academics and industry professionals in creating effective evaluation suites tailored to their specific needs by reviewing industry practices for assessing LLMs and their applications. The repository covers a wide range of evaluation techniques, benchmarks, and studies related to LLMs, including areas such as embeddings, question answering, multi-turn dialogues, reasoning, multi-lingual tasks, ethical AI, biases, safe AI, code generation, summarization, software performance, agent LLM architectures, long text generation, graph understanding, and various unclassified tasks. It also includes evaluations for LLM systems in conversational systems, copilots, search and recommendation engines, task utility, and verticals like healthcare, law, science, financial, and others. The repository provides a wealth of resources for evaluating and understanding the capabilities of LLMs in different domains.

next-money
Next Money Stripe Starter is a SaaS Starter project that empowers your next project with a stack of Next.js, Prisma, Supabase, Clerk Auth, Resend, React Email, Shadcn/ui, and Stripe. It seamlessly integrates these technologies to accelerate your development and SaaS journey. The project includes frameworks, platforms, UI components, hooks and utilities, code quality tools, and miscellaneous features to enhance the development experience. Created by @koyaguo in 2023 and released under the MIT license.

AgentNeo
AgentNeo is an advanced, open-source Agentic AI Application Observability, Monitoring, and Evaluation Framework designed to provide deep insights into AI agents, Large Language Model (LLM) calls, and tool interactions. It offers robust logging, visualization, and evaluation capabilities to help debug and optimize AI applications with ease. With features like tracing LLM calls, monitoring agents and tools, tracking interactions, detailed metrics collection, flexible data storage, simple instrumentation, interactive dashboard, project management, execution graph visualization, and evaluation tools, AgentNeo empowers users to build efficient, cost-effective, and high-quality AI-driven solutions.

data-juicer
Data-Juicer is a one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs. It is a systematic & reusable library of 80+ core OPs, 20+ reusable config recipes, and 20+ feature-rich dedicated toolkits, designed to function independently of specific LLM datasets and processing pipelines. Data-Juicer allows detailed data analyses with an automated report generation feature for a deeper understanding of your dataset. Coupled with multi-dimension automatic evaluation capabilities, it supports a timely feedback loop at multiple stages in the LLM development process. Data-Juicer offers tens of pre-built data processing recipes for pre-training, fine-tuning, en, zh, and more scenarios. It provides a speedy data processing pipeline requiring less memory and CPU usage, optimized for maximum productivity. Data-Juicer is flexible & extensible, accommodating most types of data formats and allowing flexible combinations of OPs. It is designed for simplicity, with comprehensive documentation, easy start guides and demo configs, and intuitive configuration with simple adding/removing OPs from existing configs.

council
Council is an open-source platform designed for the rapid development and deployment of customized generative AI applications using teams of agents. It extends the LLM tool ecosystem by providing advanced control flow and scalable oversight for AI agents. Users can create sophisticated agents with predictable behavior by leveraging Council's powerful approach to control flow using Controllers, Filters, Evaluators, and Budgets. The framework allows for automated routing between agents, comparing, evaluating, and selecting the best results for a task. Council aims to facilitate packaging and deploying agents at scale on multiple platforms while enabling enterprise-grade monitoring and quality control.

CogVideo
CogVideo is an open-source repository that provides pretrained text-to-video models for generating videos based on input text. It includes models like CogVideoX-2B and CogVideo, offering powerful video generation capabilities. The repository offers tools for inference, fine-tuning, and model conversion, along with demos showcasing the model's capabilities through CLI, web UI, and online experiences. CogVideo aims to facilitate the creation of high-quality videos from textual descriptions, catering to a wide range of applications.

Awesome-Knowledge-Distillation-of-LLMs
A collection of papers related to knowledge distillation of large language models (LLMs). The repository focuses on techniques to transfer advanced capabilities from proprietary LLMs to smaller models, compress open-source LLMs, and refine their performance. It covers various aspects of knowledge distillation, including algorithms, skill distillation, verticalization distillation in fields like law, medical & healthcare, finance, science, and miscellaneous domains. The repository provides a comprehensive overview of the research in the area of knowledge distillation of LLMs.

SurveyX
SurveyX is an advanced academic survey automation system that leverages Large Language Models (LLMs) to generate high-quality, domain-specific academic papers and surveys. Users can request comprehensive academic papers or surveys tailored to specific topics by providing a paper title and keywords for literature retrieval. The system streamlines academic research by automating paper creation, saving users time and effort in compiling research content.

upscayl
Upscayl is a free and open-source AI image upscaler that uses advanced AI algorithms to enlarge and enhance low-resolution images without losing quality. It is a cross-platform application built with the Linux-first philosophy, available on all major desktop operating systems. Upscayl utilizes Real-ESRGAN and Vulkan architecture for image enhancement, and its backend is fully open-source under the AGPLv3 license. It is important to note that a Vulkan compatible GPU is required for Upscayl to function effectively.

awesome-langchain
LangChain is an amazing framework to get LLM projects done in a matter of no time, and the ecosystem is growing fast. Here is an attempt to keep track of the initiatives around LangChain. Subscribe to the newsletter to stay informed about the Awesome LangChain. We send a couple of emails per month about the articles, videos, projects, and tools that grabbed our attention Contributions welcome. Add links through pull requests or create an issue to start a discussion. Please read the contribution guidelines before contributing.
20 - OpenAI Gpts

Project Performance Monitoring Advisor
Guides project success through comprehensive performance monitoring.

High-Quality Review Analyzer
Analyses and gives actionable feedback on web Review type content using Google's Reviews System guidelines and Google's Quality Rater Guidelines

AquaAirAI
AquaAirAI is a specialized assistant that compares air and water quality across cities and regions, providing insightful reports and recommendations based on comprehensive environmental data analysis from Excel files.

MeepMouse
MeepMouse, the advanced computer mouse for developers, displays logs of edits made in a virtual IDE, simulating direct code manipulation.

FDA Advisor
Approachable expert on FDA medical device regulation. Offering direct download links for related regulation and guidance documents from FDA sites.

Certifications Master
Your go-to assistant for certification exam schedules and requirements management! 資格認定の試験スケジュールや要件管理等、あなたのお役に立つアシスタントです!

Feature Ticket Generator
This GPT writes tickets for software features. It uses Gherkin to specify scenarios. @cxmacedo

Project Documentation Advisor
Guides the organization in creating comprehensive project closure documents.

BizFix Agent
I'm BizFix, your guide to business optimization using BPI, 5s methods and AI powered Automations.

Prince2 Expert
Guides through Prince2 questions and answers, ensuring accuracy and engagement.