Best AI tools for< Find Data Issues >
20 - AI tool Sites

Ocular
Ocular is an AI-powered search platform that allows users to search, visualize, and take action on their work and engineering tools and data on one unified platform. It is designed to help engineers work more efficiently and effectively by providing them with a single, central location to access all of their relevant information.

Elastic
Elastic is a Search AI Company that offers a platform for building tailored experiences, search and analytics, data ingestion, visualization, and generative AI solutions. The company provides services like Elastic Cloud for real-time insights, Elastic AI Assistant for retrieval and generation, and Search AI Lake for faster integration with LLMs. Elastic aims to help businesses scale with low-latency search AI and accelerate problem resolution with observability powered by advanced ML and analytics.

HypeAuditor
HypeAuditor is a 100% AI-Powered Influencer Marketing Platform that offers a comprehensive suite of tools for influencer marketing campaigns. With a database of over 168.7 million influencers, HypeAuditor provides features such as influencer discovery, campaign management, market analysis, and influencer analytics. The platform helps brands and agencies grow their business by increasing efficiency and control over influencer marketing strategies. HypeAuditor's AI technology enables users to analyze influencer audiences, performance, and detect fraud issues, providing valuable insights for successful influencer collaborations.

Atlassian Intelligence
Atlassian Intelligence is an AI-powered tool that accelerates productivity on the Atlassian platform by transforming teamwork through AI-human collaboration. It provides insights from team knowledge, turns data into actionable insights, helps find issues in Jira using natural language, offers development insights, accelerates incident detection, and assists in work acceleration and project management. The tool also enables quick responses to customer requests, drives faster decision-making, delivers faster service through virtual agents, streamlines setup, and automates tedious tasks.

Mastertech.ai
Mastertech.ai is an AI tool designed to assist with manufacturer procedures and diagnose common issues for any vehicle. By providing personalized assistance based on the user's level of experience, it offers instant answers for questions related to torque specs, fluid capacity, component locations, and more. The tool cross-references symptoms against Technical Service Bulletins to identify common known issues and provides trustworthy vehicle data. With upcoming features like integration with shop management platforms and an interactive diagnostic AI assistant, Mastertech.ai aims to enhance accuracy and efficiency in automotive repair.

August
August is a free-to-use health AI available on WhatsApp. It provides direct answers to health questions, helps with mental health issues, creates personalized nutrition and fitness plans, and offers proactive support. August is designed to be a comprehensive health companion, available 24/7.

BrowseGPT
BrowseGPT is a free Chrome extension that uses artificial intelligence to automate your browser. You can give BrowseGPT instructions like "Find a place to stay in Seattle on February 22nd" or "buy a children's book on Amazon", and it will use OpenAI's GPT-3 model to process web pages and issue commands like CLICK, ENTER_TEXT, or NAVIGATE to complete the task for you.

Landbase
Landbase is an AI-powered platform that offers competitive Go-To-Market (GTM) insights for businesses. Users can search for any company and receive AI-powered overviews within seconds. The platform provides tools, guides, and more to help businesses prepare for their market strategies. Landbase offers features such as sentiment analysis, digital trust scoring, omnichannel lead generation, and automation of sales pipelines. With Landbase, users can unlock key data points and receive AI-powered analysis for their organizations, enabling them to focus on generating high-ticket leads and delivering exceptional service to customers.

Keylight AI
Keylight AI is an AI-powered solution designed to help users efficiently find information within their documents. It offers lightning-fast searches, precision accuracy, a user-friendly interface, customizable prompts, and ensures secure and confidential document handling. Ideal for professionals across various industries, Keylight AI revolutionizes document search by providing quick and efficient navigation. Users can boost their productivity and save time with this innovative tool.

Deepfind
Deepfind is a privacy-first AI search engine that prioritizes user data protection. It allows users to conduct searches without the use of cookies, tracking, or storing personal information. Deepfind aims to provide a secure and efficient search experience while maintaining user privacy and data security.

Phind AI
Phind AI is a cost-effective alternative to other AI search engines, making AI search accessible to everyone, regardless of location. It offers a comprehensive search experience with a user-friendly interface and advanced features.

Qatalog
Qatalog is a business search engine that provides real-time access to data across various company systems and applications. It uses natural language processing and machine learning to understand user queries and deliver relevant results from multiple data sources. Qatalog eliminates the need to search through multiple systems and applications, saving employees time and improving productivity.

Tremello
Tremello is a market research platform that uses AI to deliver off-market data. It combines a leading AI engine with human experts to provide bespoke intelligence delivered directly to the user's inbox. Tremello's AI analyzes relationships, identifies patterns, and considers the broader context, delivering meaningful and actionable insights on top of a base human layer. It leverages a diverse range of data sources, including public and private databases, industry reports, social media archives, company websites, and government filings, ensuring a complete and comprehensive picture of the research subject.

Explore AI
Explore AI is a semantic search engine that utilizes artificial intelligence technology to enhance search capabilities. It provides users with a more intuitive and efficient way to discover information by understanding the context and meaning behind the search queries. With its advanced algorithms, Explore AI offers a personalized search experience tailored to individual preferences and needs. The platform aims to revolutionize the way people interact with search engines, making information retrieval faster, more accurate, and user-friendly.

Robin AI
Robin AI is a legal AI application that offers a platform for accelerating contract review and analysis. It provides services such as generating contract reports 50 times faster, reviewing contracts 80% faster, and finding contract data in less than 3 seconds. The application combines LLMs, proprietary machine learning models, and legal experts to transform contract review for businesses worldwide. With features like precision edits, secure repository, fast turnaround times, and customizable report templates, Robin AI aims to simplify contract processes for legal teams. The platform also offers resources like blog insights, webinars, and legal dictionary definitions to empower users in the legal industry.

Kira Systems
Kira Systems is a machine learning contract search, review, and analysis software that helps businesses identify, extract, and analyze content in their contracts and documents. It uses patented machine learning technology to extract concepts and data points with high efficiency and accuracy. Kira also has built-in intelligence that streamlines the contract review process with out-of-the-box smart fields. Businesses can also create their own smart fields to find specific data points using Kira's no-code machine learning tool. Kira's adaptive workflows allow businesses to organize, track, and export results. Kira has a partner ecosystem that allows businesses to transform how teams work with their contracts.

Shieldbase
Shieldbase is an AI-powered enterprise search tool designed to provide secure and efficient search capabilities for businesses. It utilizes advanced artificial intelligence algorithms to index and retrieve information from various data sources within an organization, ensuring quick and accurate search results. With a focus on security, Shieldbase offers encryption and access control features to protect sensitive data. The platform is user-friendly and customizable, making it easy for businesses to implement and integrate into their existing systems. Shieldbase enhances productivity by enabling employees to quickly find the information they need, ultimately improving decision-making processes and overall operational efficiency.

Vanga AI
Vanga AI is an AI-powered upselling and cross-selling tool for Shopify stores. It helps businesses increase their revenue by automatically generating and displaying upsells and cross-sells on their post-purchase and thank you pages. Vanga AI uses data to find the products that customers are most likely to buy together, and it creates custom upsell funnels for each product. The tool is easy to use and requires no setup or maintenance. Vanga AI offers a 14-day free trial and two paid plans, starting at $9/month.

Jobs-Scout
Jobs-Scout is an AI-powered job search engine that helps you find your dream job. With Jobs-Scout, you can search for jobs by keyword, location, and industry. You can also filter your search results by salary, experience, and education level. Jobs-Scout also provides personalized job recommendations based on your skills and interests.

Picarta AI
Picarta AI is an image geolocalization solution that uses artificial intelligence to find where a photo has been taken in the world. By uploading a photo, users can get the GPS location, latitude, longitude, time stamp, and camera details of the image. Picarta AI also offers a map view of the image location and allows users to download the map. The company's vision is to empower individuals and businesses with the most accurate and reliable image geolocalization solution, unlocking new possibilities for exploration, research, and decision-making.
20 - Open Source AI Tools

cleanlab
Cleanlab helps you **clean** data and **lab** els by automatically detecting issues in a ML dataset. To facilitate **machine learning with messy, real-world data** , this data-centric AI package uses your _existing_ models to estimate dataset problems that can be fixed to train even _better_ models.

fiftyone
FiftyOne is an open-source tool designed for building high-quality datasets and computer vision models. It supercharges machine learning workflows by enabling users to visualize datasets, interpret models faster, and improve efficiency. With FiftyOne, users can explore scenarios, identify failure modes, visualize complex labels, evaluate models, find annotation mistakes, and much more. The tool aims to streamline the process of improving machine learning models by providing a comprehensive set of features for data analysis and model interpretation.

data-juicer
Data-Juicer is a one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs. It is a systematic & reusable library of 80+ core OPs, 20+ reusable config recipes, and 20+ feature-rich dedicated toolkits, designed to function independently of specific LLM datasets and processing pipelines. Data-Juicer allows detailed data analyses with an automated report generation feature for a deeper understanding of your dataset. Coupled with multi-dimension automatic evaluation capabilities, it supports a timely feedback loop at multiple stages in the LLM development process. Data-Juicer offers tens of pre-built data processing recipes for pre-training, fine-tuning, en, zh, and more scenarios. It provides a speedy data processing pipeline requiring less memory and CPU usage, optimized for maximum productivity. Data-Juicer is flexible & extensible, accommodating most types of data formats and allowing flexible combinations of OPs. It is designed for simplicity, with comprehensive documentation, easy start guides and demo configs, and intuitive configuration with simple adding/removing OPs from existing configs.

txtai
Txtai is an all-in-one embeddings database for semantic search, LLM orchestration, and language model workflows. It combines vector indexes, graph networks, and relational databases to enable vector search with SQL, topic modeling, retrieval augmented generation, and more. Txtai can stand alone or serve as a knowledge source for large language models (LLMs). Key features include vector search with SQL, object storage, topic modeling, graph analysis, multimodal indexing, embedding creation for various data types, pipelines powered by language models, workflows to connect pipelines, and support for Python, JavaScript, Java, Rust, and Go. Txtai is open-source under the Apache 2.0 license.

SQLAgent
DataAgent is a multi-agent system for data analysis, capable of understanding data development and data analysis requirements, understanding data, and generating SQL and Python code for tasks such as data query, data visualization, and machine learning.

airda
airda(Air Data Agent) is a multi-agent system for data analysis, which can understand data development and data analysis requirements, understand data, and generate SQL and Python code for data query, data visualization, machine learning and other tasks.

morph
Morph is a python-centric full-stack framework for building and deploying data apps. It is fast to start, deploy and operate, requires no HTML/CSS knowledge, and is customizable with Python and SQL for advanced data workflows. With Markdown-based syntax and pre-made components, users can create visually appealing designs without writing HTML or CSS.

qlib
Qlib is an open-source, AI-oriented quantitative investment platform that supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and reinforcement learning. It covers the entire chain of quantitative investment, from alpha seeking to order execution. The platform empowers researchers to explore ideas and implement productions using AI technologies in quantitative investment. Qlib collaboratively solves key challenges in quantitative investment by releasing state-of-the-art research works in various paradigms. It provides a full ML pipeline for data processing, model training, and back-testing, enabling users to perform tasks such as forecasting market patterns, adapting to market dynamics, and modeling continuous investment decisions.

ai-enablement-stack
The AI Enablement Stack is a curated collection of venture-backed companies, tools, and technologies that enable developers to build, deploy, and manage AI applications. It provides a structured view of the AI development ecosystem across five key layers: Agent Consumer Layer, Observability and Governance Layer, Engineering Layer, Intelligence Layer, and Infrastructure Layer. Each layer focuses on specific aspects of AI development, from end-user interaction to model training and deployment. The stack aims to help developers find the right tools for building AI applications faster and more efficiently, assist engineering leaders in making informed decisions about AI infrastructure and tooling, and help organizations understand the AI development landscape to plan technology adoption.

ModelCache
Codefuse-ModelCache is a semantic cache for large language models (LLMs) that aims to optimize services by introducing a caching mechanism. It helps reduce the cost of inference deployment, improve model performance and efficiency, and provide scalable services for large models. The project facilitates sharing and exchanging technologies related to large model semantic cache through open-source collaboration.

MiniCPM-V
MiniCPM-V is a series of end-side multimodal LLMs designed for vision-language understanding. The models take image and text inputs to provide high-quality text outputs. The series includes models like MiniCPM-Llama3-V 2.5 with 8B parameters surpassing proprietary models, and MiniCPM-V 2.0, a lighter model with 2B parameters. The models support over 30 languages, efficient deployment on end-side devices, and have strong OCR capabilities. They achieve state-of-the-art performance on various benchmarks and prevent hallucinations in text generation. The models can process high-resolution images efficiently and support multilingual capabilities.

VITA
VITA is an open-source interactive omni multimodal Large Language Model (LLM) capable of processing video, image, text, and audio inputs simultaneously. It stands out with features like Omni Multimodal Understanding, Non-awakening Interaction, and Audio Interrupt Interaction. VITA can respond to user queries without a wake-up word, track and filter external queries in real-time, and handle various query inputs effectively. The model utilizes state tokens and a duplex scheme to enhance the multimodal interactive experience.

crewAI
CrewAI is a cutting-edge framework designed to orchestrate role-playing autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks. It enables AI agents to assume roles, share goals, and operate in a cohesive unit, much like a well-oiled crew. Whether you're building a smart assistant platform, an automated customer service ensemble, or a multi-agent research team, CrewAI provides the backbone for sophisticated multi-agent interactions. With features like role-based agent design, autonomous inter-agent delegation, flexible task management, and support for various LLMs, CrewAI offers a dynamic and adaptable solution for both development and production workflows.

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.

invariant
Invariant Analyzer is an open-source scanner designed for LLM-based AI agents to find bugs, vulnerabilities, and security threats. It scans agent execution traces to identify issues like looping behavior, data leaks, prompt injections, and unsafe code execution. The tool offers a library of built-in checkers, an expressive policy language, data flow analysis, real-time monitoring, and extensible architecture for custom checkers. It helps developers debug AI agents, scan for security violations, and prevent security issues and data breaches during runtime. The analyzer leverages deep contextual understanding and a purpose-built rule matching engine for security policy enforcement.

x-crawl
x-crawl is a flexible Node.js AI-assisted crawler library that offers powerful AI assistance functions to make crawler work more efficient, intelligent, and convenient. It consists of a crawler API and various functions that can work normally even without relying on AI. The AI component is currently based on a large AI model provided by OpenAI, simplifying many tedious operations. The library supports crawling dynamic pages, static pages, interface data, and file data, with features like control page operations, device fingerprinting, asynchronous sync, interval crawling, failed retry handling, rotation proxy, priority queue, crawl information control, and TypeScript support.

langfuse
Langfuse is a powerful tool that helps you develop, monitor, and test your LLM applications. With Langfuse, you can: * **Develop:** Instrument your app and start ingesting traces to Langfuse, inspect and debug complex logs, and manage, version, and deploy prompts from within Langfuse. * **Monitor:** Track metrics (cost, latency, quality) and gain insights from dashboards & data exports, collect and calculate scores for your LLM completions, run model-based evaluations, collect user feedback, and manually score observations in Langfuse. * **Test:** Track and test app behaviour before deploying a new version, test expected in and output pairs and benchmark performance before deploying, and track versions and releases in your application. Langfuse is easy to get started with and offers a generous free tier. You can sign up for Langfuse Cloud or deploy Langfuse locally or on your own infrastructure. Langfuse also offers a variety of integrations to make it easy to connect to your LLM applications.

ragas
Ragas is a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines. RAG denotes a class of LLM applications that use external data to augment the LLM’s context. There are existing tools and frameworks that help you build these pipelines but evaluating it and quantifying your pipeline performance can be hard. This is where Ragas (RAG Assessment) comes in. Ragas provides you with the tools based on the latest research for evaluating LLM-generated text to give you insights about your RAG pipeline. Ragas can be integrated with your CI/CD to provide continuous checks to ensure performance.

eureka-ml-insights
The Eureka ML Insights Framework is a repository containing code designed to help researchers and practitioners run reproducible evaluations of generative models efficiently. Users can define custom pipelines for data processing, inference, and evaluation, as well as utilize pre-defined evaluation pipelines for key benchmarks. The framework provides a structured approach to conducting experiments and analyzing model performance across various tasks and modalities.
20 - OpenAI Gpts

GovChat - Government API Guide
Friendly, technical API expert offering clear guidance on government APIs.

OpenData Explorer
I'll help you access and understand open data published by central government, local authorities and public bodies. You can ask me in your native language.

Chronic Disease Indicators Expert
This chatbot answers questions about the CDC’s Chronic Disease Indicators dataset

Sommelier de dados
Opa! Cole o texto da sua reportagem ou trecho para que eu possa analisá-la com base em manuais de uso de dados em textos jornalísticos.

PPT Expert
PPT Assistant for creating detailed outlines in Markdown, using Chinese by default.

AI OSINT
Your AI OSINT assistant. Our tool helps you find the data needle in the internet haystack.
Open Data Italia bot
Fornisce informazioni sulla normativa italiana in materia di open data, con un tono professionale e divulgativo. In modo che sia più facile chiederne e/o pretenderne la pubblicazione.

BCorpGPT
Query BCorp company data. All data is publicly available. United Kingdom only (for now).

Ordinals API
Knows the docs and can query official ordinal endpoints—Sat Numbers, Inscription IDs, and more.

Graphene Explorer AI
Leading AI in graphene research, offering innovative insights and solutions, powered by OpenAI.