Best AI tools for< Correct Data >
20 - AI tool Sites
Bifrost AI
Bifrost AI is a data generation engine designed for AI and robotics applications. It enables users to train and validate AI models faster by generating physically accurate synthetic datasets in 3D simulations, eliminating the need for real-world data. The platform offers pixel-perfect labels, scenario metadata, and a simulated 3D world to enhance AI understanding. Bifrost AI empowers users to create new scenarios and datasets rapidly, stress test AI perception, and improve model performance. It is built for teams at every stage of AI development, offering features like automated labeling, class imbalance correction, and performance enhancement.
Football Predictions AI
Football Predictions AI is a website that provides users with accurate and reliable football match predictions. Users can access a variety of prediction types, including 1x2, BTTS, Over Under, and Correct Score predictions for matches across different leagues. The site also offers live scores, articles, and settings to customize the user experience. With a focus on user privacy and data protection, Football Predictions AI aims to enhance the football betting experience for enthusiasts and fans.
GrabText
GrabText is an online OCR tool that allows users to convert handwritten or printed text from photos, graphics, or documents into editable text. It uses ChatGPT to automatically correct spelling, grammar, and other illegal writings. The tool also supports math equations and offers flexible output options such as txt, latex, doc, and pdf.
AIJ
AIJ is an AI tool designed to streamline the job application process by automating tasks such as job search, application submission, and answering frequently asked questions. Users can save time and energy by letting AIJ handle these tasks efficiently. The tool also allows users to correct AI mistakes and save common questions for AI to answer. With pricing plans available, AIJ aims to simplify the job application process for users.
HARPA AI
HARPA AI is a Google Chrome extension that brings AI to your browser. It can summarize and reply to emails, rewrite, rephrase, correct and expand text, read articles, translate and scan web pages for data. HARPA has a hybrid AI engine and works with OpenAI GPT-3 & GPT-4 API, ChatGPT, Claude2 and Google Gemini.
AI Seed Phrase Finder & BTC balance checker tool for Windows PC
The AI Seed Phrase Finder & BTC balance checker tool for Windows PC is an innovative application designed to prevent the loss of access to Bitcoin wallets. Leveraging advanced algorithms and artificial intelligence techniques, this program efficiently analyzes vast amounts of data to pre-train AI models. Consequently, it generates and searches for mnemonic phrases that grant access to abandoned Bitcoin wallets holding nonzero balances. With the “AI Seed Finder tool for Windows PC”, locating a complete 12-word seed phrase for a specific Bitcoin wallet becomes effortless. Even if you possess only partial knowledge of the mnemonic phrase or individual words comprising it, this tool can swiftly identify the entire seed phrase. Furthermore, by providing the address of a specific Bitcoin wallet you wish to regain access to, the program narrows down the search area. This targeted approach significantly enhances the program’s efficiency and reduces the time required to ascertain the correct mnemonic phrase.
Tekst
Tekst is an AI-powered platform that helps businesses automate their email, ticket, and other communication workflows. By leveraging text intelligence, Tekst enables enterprises to gain insights into their communication data, optimize critical workflows, and deliver better customer service. With its advanced AI capabilities, Tekst can analyze messages, enrich them with tags, reroute them to the correct inbox, and even automate responses, significantly reducing the time wasted on manual tasks.
Legal Data
Legal Data is a comprehensive legal research platform developed by lawyers for lawyers. It offers a powerful search feature that covers various legal areas from commercial to criminal law. The platform recognizes synonyms, legalese, and abbreviations, corrects typos, and provides suggestions as you type. Additionally, Legal Data includes an AI-assistant called FlyBot, trained on carefully selected laws and cases, to provide accurate legal answers without fabricating information.
Kodezi
Kodezi is an AI-powered development tool that helps developers write better code. It offers a range of features to help developers with tasks such as code autocorrect, code review, and debugging. Kodezi is available as a web-based IDE, a VS Code extension, and an enterprise solution.
Resumecheck.net
Resumecheck.net is an AI-powered resume improvement platform that helps users create error-free, professional resumes that stand out to recruiters. The platform uses GPT4 technology to provide personalized feedback and suggestions, including grammar corrections, formatting adjustments, and industry-specific keyword optimization. Additionally, Resumecheck.net offers an AI Cover Letter Writer that generates tailored cover letters based on the user's resume and the specific job position they are applying for.
AiText
AiText is an AI-powered writing assistant that enhances the quality of texts, corrects spelling and grammar errors, and generates new texts effortlessly. It offers a range of features including vocabulary enhancement, sentence structure optimization, grammar and spell checking, and text generation. The application is versatile and can be used for various purposes such as creating social media posts, emails, articles, reports, and professional documents. It is designed to improve communication clarity, accelerate writing tasks, and enhance writing skills for academic and professional pursuits.
RewriteBar
RewriteBar is an AI assistant tool designed for macOS users to improve their writing in any application. It offers features such as correcting grammar mistakes, changing writing styles, translating text, simplifying language, adjusting writing tone, summarizing paragraphs, and enhancing ChatGPT output. RewriteBar is a cost-effective solution that works seamlessly across various applications, making it a valuable tool for non-native English speakers, developers, and content creators.
Texthelper
Texthelper is an AI-powered text correction tool designed to assist users in identifying and correcting errors in their written content. Users can input text, which will be analyzed by the tool's AI algorithms to detect and fix mistakes. The tool aims to enhance the overall quality and accuracy of written communication by providing quick and efficient error detection and correction. Texthelper is user-friendly and suitable for individuals, students, professionals, and anyone looking to improve the correctness of their written text.
SafeSpelling
SafeSpelling is an AI-powered writing tool that helps users write without mistakes. It offers accurate corrections using the GPT-4o model, ensuring error-free content. Users can create an account to access the tool and benefit from its powerful features.
slAItor
slAItor is an AI translation assistant powered by GPT technology. It offers advanced translation features and customization options to enhance the translation experience. Users can benefit from step-by-step translations, multiple translation alternatives, and unique translation styles. The tool supports 28 language pairs and combines recent AI advancements with traditional translation techniques to deliver accurate and efficient translations. slAItor also provides post-processing and evaluation steps to ensure translation quality and offers a user-friendly interface for seamless translation management.
NeuroSpell
NeuroSpell is a universal auto-corrector powered by deep learning. It can be used to correct spelling, grammar, and style errors in text. NeuroSpell can be deployed on-premise or in the cloud, and it can be trained on domain-specific vocabulary and sentence structures. NeuroSpell is used by businesses and individuals to improve the quality of their written communication.
Spruce Autocorrect
Spruce is an AI tool designed to automatically correct typos in your Slack messages. It edits your messages in real-time to ensure accurate communication. Users can easily undo corrections by adding a specific reaction to the message. Spruce is a helpful tool for enhancing the quality of written communication in Slack channels. It is created with care and attention to detail by the developers @Hema_Thanki and @s16h.
LanguageTool
LanguageTool is an AI-based spelling, style, and grammar checker that helps correct or paraphrase texts across languages. It offers a range of features including grammar checking, paraphrasing, punctuation correction, style improvement, and more. LanguageTool is available as a browser extension, desktop app, and mobile app, and it supports over 30 languages. It is used by over 2000 organizations, including BMW Group, European Union, Spiegel Magazine, and Deutsche Presse-Agentur (dpa).
Trinka
Trinka is an AI-powered English grammar checker and language enhancement writing assistant designed for academic and technical writing. It corrects contextual spelling mistakes and advanced grammar errors by providing writing suggestions in real-time. Trinka helps professionals and academics ensure formal, concise, and engaging writing.
Gramara
Gramara is an AI-powered grammar checker that helps you write error-free text. It can detect and correct a wide range of grammar, spelling, and punctuation errors, and it can also provide suggestions for improving your writing style.
20 - Open Source AI Tools
lfai-landscape
LF AI & Data Landscape is a map to explore open source projects in the AI & Data domains, highlighting companies that are members of LF AI & Data. It showcases members of the Foundation and is modelled after the Cloud Native Computing Foundation landscape. The landscape includes current version, interactive version, new entries, logos, proper SVGs, corrections, external data, best practices badge, non-updated items, license, formats, installation, vulnerability reporting, and adjusting the landscape view.
strictjson
Strict JSON is a framework designed to handle JSON outputs with complex structures, fixing issues that standard json.loads() cannot resolve. It provides functionalities for parsing LLM outputs into dictionaries, supporting various data types, type forcing, and error correction. The tool allows easy integration with OpenAI JSON Mode and offers community support through tutorials and discussions. Users can download the package via pip, set up API keys, and import functions for usage. The tool works by extracting JSON values using regex, matching output values to literals, and ensuring all JSON fields are output by LLM with optional type checking. It also supports LLM-based checks for type enforcement and error correction loops.
llm-leaderboard
Nejumi Leaderboard 3 is a comprehensive evaluation platform for large language models, assessing general language capabilities and alignment aspects. The evaluation framework includes metrics for language processing, translation, summarization, information extraction, reasoning, mathematical reasoning, entity extraction, knowledge/question answering, English, semantic analysis, syntactic analysis, alignment, ethics/moral, toxicity, bias, truthfulness, and robustness. The repository provides an implementation guide for environment setup, dataset preparation, configuration, model configurations, and chat template creation. Users can run evaluation processes using specified configuration files and log results to the Weights & Biases project.
HuggingFists
HuggingFists is a low-code data flow tool that enables convenient use of LLM and HuggingFace models. It provides functionalities similar to Langchain, allowing users to design, debug, and manage data processing workflows, create and schedule workflow jobs, manage resources environment, and handle various data artifact resources. The tool also offers account management for users, allowing centralized management of data source accounts and API accounts. Users can access Hugging Face models through the Inference API or locally deployed models, as well as datasets on Hugging Face. HuggingFists supports breakpoint debugging, branch selection, function calls, workflow variables, and more to assist users in developing complex data processing workflows.
ivy
Ivy is an open-source machine learning framework that enables you to: * 🔄 **Convert code into any framework** : Use and build on top of any model, library, or device by converting any code from one framework to another using `ivy.transpile`. * ⚒️ **Write framework-agnostic code** : Write your code once in `ivy` and then choose the most appropriate ML framework as the backend to leverage all the benefits and tools. Join our growing community 🌍 to connect with people using Ivy. **Let's** unify.ai **together 🦾**
ivy
Ivy is an open-source machine learning framework that enables users to convert code between different ML frameworks and write framework-agnostic code. It allows users to transpile code from one framework to another, making it easy to use building blocks from different frameworks in a single project. Ivy also serves as a flexible framework that breaks free from framework limitations, allowing users to publish code that is interoperable with various frameworks and future frameworks. Users can define trainable modules and layers using Ivy's stateful API, making it easy to build and train models across different backends.
ai-audio-datasets
AI Audio Datasets List (AI-ADL) is a comprehensive collection of datasets consisting of speech, music, and sound effects, used for Generative AI, AIGC, AI model training, and audio applications. It includes datasets for speech recognition, speech synthesis, music information retrieval, music generation, audio processing, sound synthesis, and more. The repository provides a curated list of diverse datasets suitable for various AI audio tasks.
rag
RAG with txtai is a Retrieval Augmented Generation (RAG) Streamlit application that helps generate factually correct content by limiting the context in which a Large Language Model (LLM) can generate answers. It supports two categories of RAG: Vector RAG, where context is supplied via a vector search query, and Graph RAG, where context is supplied via a graph path traversal query. The application allows users to run queries, add data to the index, and configure various parameters to control its behavior.
llm-datasets
LLM Datasets is a repository containing high-quality datasets, tools, and concepts for LLM fine-tuning. It provides datasets with characteristics like accuracy, diversity, and complexity to train large language models for various tasks. The repository includes datasets for general-purpose, math & logic, code, conversation & role-play, and agent & function calling domains. It also offers guidance on creating high-quality datasets through data deduplication, data quality assessment, data exploration, and data generation techniques.
BIG-Bench-Mistake
BIG-Bench Mistake is a dataset of chain-of-thought (CoT) outputs annotated with the location of the first logical mistake. It was released as part of a research paper focusing on benchmarking LLMs in terms of their mistake-finding ability. The dataset includes CoT traces for tasks like Word Sorting, Tracking Shuffled Objects, Logical Deduction, Multistep Arithmetic, and Dyck Languages. Human annotators were recruited to identify mistake steps in these tasks, with automated annotation for Dyck Languages. Each JSONL file contains input questions, steps in the chain of thoughts, model's answer, correct answer, and the index of the first logical mistake.
dwata
Dwata is a desktop application that allows users to chat with any AI model and gain insights from their data. Chats are organized into threads, similar to Discord, with each thread connecting to a different AI model. Dwata can connect to databases, APIs (such as Stripe), or CSV files and send structured data as prompts when needed. The AI's response will often include SQL or Python code, which can be used to extract the desired insights. Dwata can validate AI-generated SQL to ensure that the tables and columns referenced are correct and can execute queries against the database from within the application. Python code (typically using Pandas) can also be executed from within Dwata, although this feature is still in development. Dwata supports a range of AI models, including OpenAI's GPT-4, GPT-4 Turbo, and GPT-3.5 Turbo; Groq's LLaMA2-70b and Mixtral-8x7b; Phind's Phind-34B and Phind-70B; Anthropic's Claude; and Ollama's Llama 2, Mistral, and Phi-2 Gemma. Dwata can compare chats from different models, allowing users to see the responses of multiple models to the same prompts. Dwata can connect to various data sources, including databases (PostgreSQL, MySQL, MongoDB), SaaS products (Stripe, Shopify), CSV files/folders, and email (IMAP). The desktop application does not collect any private or business data without the user's explicit consent.
amber-data-prep
This repository contains the code to prepare the data for the Amber 7B language model. The final training data comes from three sources: RedPajama V1, RefinedWeb, and StarCoderData. The data preparation involves downloading untokenized data, tokenizing the data using the Huggingface tokenizer, concatenating tokens into 2048 token sequences, merging datasets, and splitting the merged dataset into 360 chunks. Each tokenized data chunk is a jsonl file containing samples with 2049 tokens. The repository provides scripts for downloading datasets, tokenizing and concatenating sequences, validating data, and merging subsets into chunks.
DataEngineeringPilipinas
DataEngineeringPilipinas is a repository dedicated to data engineering resources in the Philippines. It serves as a platform for data engineering professionals to contribute and access high-quality content related to data engineering. The repository provides guidelines for contributing, including forking the repository, making changes, and submitting contributions. It emphasizes the importance of quality, relevance, and respect in the contributions made to the project. By following the guidelines and contributing to the repository, users can help build a valuable resource for the data engineering community in the Philippines and beyond.
TurtleBenchmark
Turtle Benchmark is a novel and cheat-proof benchmark test used to evaluate large language models (LLMs). It is based on the Turtle Soup game, focusing on logical reasoning and context understanding abilities. The benchmark does not require background knowledge or model memory, providing all necessary information for judgment from stories under 200 words. The results are objective and unbiased, quantifiable as correct/incorrect/unknown, and impossible to cheat due to using real user-generated questions and dynamic data generation during online gameplay.
Open_Data_QnA
Open Data QnA is a Python library that allows users to interact with their PostgreSQL or BigQuery databases in a conversational manner, without needing to write SQL queries. The library leverages Large Language Models (LLMs) to bridge the gap between human language and database queries, enabling users to ask questions in natural language and receive informative responses. It offers features such as conversational querying with multiturn support, table grouping, multi schema/dataset support, SQL generation, query refinement, natural language responses, visualizations, and extensibility. The library is built on a modular design and supports various components like Database Connectors, Vector Stores, and Agents for SQL generation, validation, debugging, descriptions, embeddings, responses, and visualizations.
WeatherGFT
WeatherGFT is a physics-AI hybrid model designed to generalize weather forecasts to finer-grained temporal scales beyond the training dataset. It incorporates physical partial differential equations (PDEs) into neural networks to simulate fine-grained physical evolution and correct biases. The model achieves state-of-the-art performance in forecasting tasks at different time scales, from nowcasting to medium-range forecasts, by utilizing a lead time-aware training framework and a carefully designed PDE kernel. WeatherGFT bridges the gap between nowcast and medium-range forecast by extending forecasting abilities to predict accurately at a 30-minute time scale.
monitors4codegen
This repository hosts the official code and data artifact for the paper 'Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context'. It introduces Monitor-Guided Decoding (MGD) for code generation using Language Models, where a monitor uses static analysis to guide the decoding. The repository contains datasets, evaluation scripts, inference results, a language server client 'multilspy' for static analyses, and implementation of various monitors monitoring for different properties in 3 programming languages. The monitors guide Language Models to adhere to properties like valid identifier dereferences, correct number of arguments to method calls, typestate validity of method call sequences, and more.
Woodpecker
Woodpecker is a tool designed to correct hallucinations in Multimodal Large Language Models (MLLMs) by introducing a training-free method that picks out and corrects inconsistencies between generated text and image content. It consists of five stages: key concept extraction, question formulation, visual knowledge validation, visual claim generation, and hallucination correction. Woodpecker can be easily integrated with different MLLMs and provides interpretable results by accessing intermediate outputs of the stages. The tool has shown significant improvements in accuracy over baseline models like MiniGPT-4 and mPLUG-Owl.
ethereum-etl-airflow
This repository contains Airflow DAGs for extracting, transforming, and loading (ETL) data from the Ethereum blockchain into BigQuery. The DAGs use the Google Cloud Platform (GCP) services, including BigQuery, Cloud Storage, and Cloud Composer, to automate the ETL process. The repository also includes scripts for setting up the GCP environment and running the DAGs locally.
AirCasting
AirCasting is a platform for gathering, visualizing, and sharing environmental data. It aims to provide a central hub for environmental data, making it easier for people to access and use this information to make informed decisions about their environment.
20 - OpenAI Gpts
Hallucinate
Highly accurate and reliable, ensures information is 100% correct, never hallucinates.
Word Problem Solver
Expert at solving and explaining word problems, with error correction.
Profesor de posgrado
Profesor de posgrado con enfoque académico en Metodología de la Investigación, experto en desarrollo de clases y corrección de textos.
美国+英国地道英语老师
Correct English grammar errors or translate sentences into American English and British English.
Listicle Builder GPT [WordsAtScale]
Create factually correct unique news-driven listicles to attract backlinks
Faith Explorer
A religious knowledge expert providing comprehensive, correct, and unbiased information.
General Knowledge Quiz
Dynamic quiz game with a wide range of topics, +3 points for correct answers, -2 for incorrect. 5-minute limit.
Conversation
A highly intelligent conversationalist. Direct, concise, rational, and brutally honest. I want to correct your false beliefs, not feed your ego.
Confident
This GPT is the most confident GPT out there. It performs and gives answers without doubting itself. It will correct you if you give wrong suggestions. Bringing back the confidence to ChatGPT!
Correcteur d'orthographe et de grammaire
Je corrige les fautes d'orthographe et de grammaire en français et explique les erreurs.
Chinese Grammar Wiki Tutor
A Mandarin Chinese grammar tutor, offering grammar corrections and advice.
English Mentor
English conversation teacher, correcting grammar and engaging with questions.
TextPerfect🇳🇱
Nederlandse taaldeskundige voor tekstcorrectie en -redactie. Plak je tekst hieronder.. ⬇️
Corrector de textos
Niño sabiondo dedicado a la corrección de textos en español. Claridad y precisión son mi obsesión