Best AI tools for< Add Evaluation Datasets >
20 - AI tool Sites

AILYZE
AILYZE is an AI tool designed for qualitative data collection and analysis. Users can upload various document formats in any language to generate codes, conduct thematic, frequency, content, and cross-group analysis, extract top quotes, and more. The tool also allows users to create surveys, utilize an AI voice interviewer, and recruit participants globally. AILYZE offers different plans with varying features and data security measures, including options for advanced analysis and AI interviewer add-ons. Additionally, users can tap into data scientists for detailed and customized analyses on a wide range of documents.

usefulAI
usefulAI is a platform that allows users to easily add AI features to their products in minutes. Users can find AI features that best meet their needs, test them using the platform's playground, and integrate them into their products through a single API. The platform offers a user-friendly playground to test and compare AI solutions, provides pricing and metrics for evaluation, and allows integration within applications using a single API. usefulAI aims to provide practical AI engines in one place, without hype, for users to leverage in their products.

SDXL Turbo
SDXL Turbo is a cutting-edge text-to-image generation model that leverages Adversarial Diffusion Distillation (ADD) technology for high-quality, real-time image synthesis. Developed by Stability AI, SDXL Turbo is a distilled version of the SDXL 1.0 model, specifically trained for real-time synthesis. It excels in generating photorealistic images from text prompts in a single network evaluation, making it ideal for applications demanding speed and efficiency, such as video games, virtual reality, and instant content creation. SDXL Turbo is accessible to both professionals and hobbyists alike, with simple setup requirements and an intuitive interface. It presents unparalleled opportunities for research and development in advanced AI and image synthesis.

Clarity AI
Clarity AI is an AI-powered technology platform that offers a Sustainability Tech Kit for sustainable investing, shopping, reporting, and benchmarking. The platform provides built-in sustainability technology with customizable solutions for various needs related to data, methodologies, and tools. It seamlessly integrates into workflows, offering scalable and flexible end-to-end SaaS tools to address sustainability use cases. Clarity AI leverages powerful AI and machine learning to analyze vast amounts of data points, ensuring reliable and transparent data coverage. The platform is designed to empower users to assess, analyze, and report on sustainability aspects efficiently and confidently.

Feelsy
Feelsy is a social media sentiment analysis tool that helps businesses understand how their audience feels about their content. With Feelsy, businesses can track the sentiment of their Instagram comments in real-time, identify the content that resonates most with their audience, and measure the effectiveness of their social media campaigns.

Supertranslate
Supertranslate is an AI-powered tool that allows users to automatically add English subtitles to videos in any language. It is powered by OpenAI's Whisper, the world's most accurate speech-to-text engine. The tool offers a fast and efficient way to generate subtitles, with features such as intuitive subtitle editing capabilities. Users can upload videos, generate subtitles, and download .srt/.vtt files with ease. Supertranslate ensures high-quality subtitle generation using OpenAI-Whisper technology.

EnhanceAI
EnhanceAI is a tool that allows you to add AI-powered autocomplete to any website in just 2 minutes. It integrates with all major no-code tools and UI frameworks, and it's easy to use. Just add a few lines of code to your website, and you're good to go. EnhanceAI uses OpenAI's GPT models, so you can be sure that you're getting the most accurate and up-to-date AI technology. With EnhanceAI, you can create AI-powered forms, surveys, text inputs, and more. You can also use EnhanceAI to generate creative content, such as art prompts and sales copy. EnhanceAI is the perfect tool for anyone who wants to add AI to their website or product.

Double Subtitles
Double Subtitles is an AI-powered tool that helps you add subtitles to your videos quickly and easily. With 90% of mobile videos being watched on mute, it's more important than ever to make sure your viewers can understand your content. Double Subtitles uses advanced AI algorithms to generate accurate, precise, and fast subtitles. It's 90% more accurate, 20x faster, and 3x lower cost than the competition. Plus, you can customize the style of your subtitles to match your brand. With Double Subtitles, you can be sure that your viewers will understand your content, no matter how they're watching it.

AI Captions for Videos: VidCap
AI Captions for Videos: VidCap is an application available on the App Store that allows users to add accurate automatic subtitles to their videos in seconds. The app offers ultra-accurate automatic subtitles in over 100 languages, custom text font, style, and animations, AI features like Eye Contact and Video Teleprompter, background noise removal, and the ability to add custom logos or watermarks. Users can preview their designs on Instagram and TikTok before exporting, save videos in 4K, and export transcriptions in various formats.

DocGPT.ai
DocGPT.ai is an AI-powered tool designed to enhance productivity and efficiency in various tasks such as email writing, document generation, content creation, SEO optimization, data enrichment, and more. It seamlessly integrates with Google Workspace applications to provide users with advanced AI capabilities for content generation and management. With support for multiple AI models and a wide range of features, DocGPT.ai is a comprehensive solution for individuals and businesses looking to streamline their workflows and improve their content creation processes.

AI Mail Assistant
AI Mail Assistant is a Gmail add-on powered by OpenAI's ChatGPT that helps users streamline their email communication. It offers a range of features including generating quick and accurate email responses, summarizing emails, translating messages, and improving the grammar and wording of drafts. The tool is designed to save time, avoid language barriers and mistakes, and ensure clear and professional communication.

VEED.IO
VEED.IO is an online video editor that uses AI to help users create professional-quality videos quickly and easily. With VEED.IO, users can add subtitles, remove background noise, and more. VEED.IO is also a great tool for creating videos for social media, marketing, and education.

Nova AI
Nova AI is a cloud-based video editing platform that provides users with a wide range of tools and features to create and edit videos. The platform is designed to be easy to use, even for beginners, and it offers a variety of features that are typically only found in professional video editing software. With Nova AI, users can trim, cut, merge, and resize videos, as well as add text, images, and music. The platform also includes a number of AI-powered features, such as automatic subtitle generation, text-to-speech, and video translation. Nova AI is a powerful and versatile video editing platform that is suitable for a wide range of users, from beginners to professionals.

Editby
Editby is an AI-powered content creation tool that helps users create SEO-optimized content that ranks on Google and social media. It offers a range of features to help users create high-quality content, including AI-powered recommendations, trending content suggestions, and plagiarism detection. Editby also integrates with a variety of platforms, making it easy to publish content anywhere you need it.

Zeemo AI
Zeemo AI is a powerful caption generator tool that enables users to add subtitles to videos, transcribe video and audio to text, and generate captions using AI technology. It supports multiple languages and provides dynamic visual effects for captions. The tool is designed for content creators, educators, and product sellers to enhance their videos and reach a wider audience across various platforms.

SubTitles.Love
SubTitles.Love is an AI-powered online subtitles editor that helps users easily add subtitles to their videos. The tool offers features such as auto speech recognition, support for 10+ languages, and simple editing capabilities. Users can upload any video format, tune subtitles with high accuracy, and customize the appearance before downloading the subtitled video. SubTitles.Love aims to save time and enhance audience engagement by providing automatic subtitles, resizing for social media, and affordable pricing. The platform is trusted by bloggers, podcast makers, and content producers for its quality service and community-driven approach.

Bytecap
Bytecap is an AI application that allows users to immerse their videos with custom AI captions. It offers features such as auto creation of 99% accurate captions using advanced speech recognition, customization of captions with fonts, colors, emojis, effects, music, and highlights, and AI-generated hook titles and descriptions for boosting engagement. Bytecap supports over 99 languages, provides complete caption control, and offers trendy sounds and background music options. The application caters to video editors, content creators, podcasters, and streamers, enabling them to save time, expand reach, and increase brand awareness. Bytecap ensures privacy and security, offers free trial options, and allows users to edit captions after creation.

Alva Solutions
Alva Solutions is an AI-powered browser extension application that aims to simplify browsing experience by providing a range of AI browser extensions. The application offers diverse browser extensions such as Alva AI, Alva Network, and Snap AI, each designed to enhance productivity and streamline tasks. Users can benefit from features like AI-powered assistance, network insights, and voice recording capabilities. Alva Solutions prioritizes user privacy and data security, offering a safe environment with premium protection features. With a user-friendly interface and intuitive dashboard, users can easily manage and control their extensions. The application also fosters a community environment through various social media platforms, providing users with updates, tutorials, and engaging discussions.

ChatBuild AI
ChatBuild AI is a website that allows users to create custom trained AI chatbots for their website in minutes. No coding experience is needed. Users can upload files to train their chatbot, and ChatBuild AI will generate a custom chatbot that is trained on the user's own data. ChatBuild AI also offers global support, so users can use their chatbot in any language.

Doclingo
Doclingo is an AI-powered document translation tool that supports translating documents in various formats such as PDF, Word, Excel, PowerPoint, SRT subtitles, ePub ebooks, AR&ZIP packages, and more. It utilizes large language models to provide accurate and professional translations, preserving the original layout of the documents. Users can enjoy a limited-time free trial upon registration, with the option to subscribe for more features. Doclingo aims to offer high-quality translation services through continuous algorithm improvements.
20 - Open Source AI Tools

llm-jp-eval
LLM-jp-eval is a tool designed to automatically evaluate Japanese large language models across multiple datasets. It provides functionalities such as converting existing Japanese evaluation data to text generation task evaluation datasets, executing evaluations of large language models across multiple datasets, and generating instruction data (jaster) in the format of evaluation data prompts. Users can manage the evaluation settings through a config file and use Hydra to load them. The tool supports saving evaluation results and logs using wandb. Users can add new evaluation datasets by following specific steps and guidelines provided in the tool's documentation. It is important to note that using jaster for instruction tuning can lead to artificially high evaluation scores, so caution is advised when interpreting the results.

evalscope
Eval-Scope is a framework designed to support the evaluation of large language models (LLMs) by providing pre-configured benchmark datasets, common evaluation metrics, model integration, automatic evaluation for objective questions, complex task evaluation using expert models, reports generation, visualization tools, and model inference performance evaluation. It is lightweight, easy to customize, supports new dataset integration, model hosting on ModelScope, deployment of locally hosted models, and rich evaluation metrics. Eval-Scope also supports various evaluation modes like single mode, pairwise-baseline mode, and pairwise (all) mode, making it suitable for assessing and improving LLMs.

ai-audio-datasets
AI Audio Datasets List (AI-ADL) is a comprehensive collection of datasets consisting of speech, music, and sound effects, used for Generative AI, AIGC, AI model training, and audio applications. It includes datasets for speech recognition, speech synthesis, music information retrieval, music generation, audio processing, sound synthesis, and more. The repository provides a curated list of diverse datasets suitable for various AI audio tasks.

AutoRAG
AutoRAG is an AutoML tool designed to automatically find the optimal RAG pipeline for your data. It simplifies the process of evaluating various RAG modules to identify the best pipeline for your specific use-case. The tool supports easy evaluation of different module combinations, making it efficient to find the most suitable RAG pipeline for your needs. AutoRAG also offers a cloud beta version to assist users in running and optimizing the tool, along with building RAG evaluation datasets for a starting price of $9.99 per optimization.

prometheus-eval
Prometheus-Eval is a repository dedicated to evaluating large language models (LLMs) in generation tasks. It provides state-of-the-art language models like Prometheus 2 (7B & 8x7B) for assessing in pairwise ranking formats and achieving high correlation scores with benchmarks. The repository includes tools for training, evaluating, and using these models, along with scripts for fine-tuning on custom datasets. Prometheus aims to address issues like fairness, controllability, and affordability in evaluations by simulating human judgments and proprietary LM-based assessments.

deepeval
DeepEval is a simple-to-use, open-source LLM evaluation framework specialized for unit testing LLM outputs. It incorporates various metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., and runs locally on your machine for evaluation. It provides a wide range of ready-to-use evaluation metrics, allows for creating custom metrics, integrates with any CI/CD environment, and enables benchmarking LLMs on popular benchmarks. DeepEval is designed for evaluating RAG and fine-tuning applications, helping users optimize hyperparameters, prevent prompt drifting, and transition from OpenAI to hosting their own Llama2 with confidence.

PIXIU
PIXIU is a project designed to support the development, fine-tuning, and evaluation of Large Language Models (LLMs) in the financial domain. It includes components like FinBen, a Financial Language Understanding and Prediction Evaluation Benchmark, FIT, a Financial Instruction Dataset, and FinMA, a Financial Large Language Model. The project provides open resources, multi-task and multi-modal financial data, and diverse financial tasks for training and evaluation. It aims to encourage open research and transparency in the financial NLP field.

giskard
Giskard is an open-source Python library that automatically detects performance, bias & security issues in AI applications. The library covers LLM-based applications such as RAG agents, all the way to traditional ML models for tabular data.

awesome-generative-ai-guide
This repository serves as a comprehensive hub for updates on generative AI research, interview materials, notebooks, and more. It includes monthly best GenAI papers list, interview resources, free courses, and code repositories/notebooks for developing generative AI applications. The repository is regularly updated with the latest additions to keep users informed and engaged in the field of generative AI.

zshot
Zshot is a highly customizable framework for performing Zero and Few shot named entity and relationships recognition. It can be used for mentions extraction, wikification, zero and few shot named entity recognition, zero and few shot named relationship recognition, and visualization of zero-shot NER and RE extraction. The framework consists of two main components: the mentions extractor and the linker. There are multiple mentions extractors and linkers available, each serving a specific purpose. Zshot also includes a relations extractor and a knowledge extractor for extracting relations among entities and performing entity classification. The tool requires Python 3.6+ and dependencies like spacy, torch, transformers, evaluate, and datasets for evaluation over datasets like OntoNotes. Optional dependencies include flair and blink for additional functionalities. Zshot provides examples, tutorials, and evaluation methods to assess the performance of the components.

Awesome-LLM-Eval
Awesome-LLM-Eval: a curated list of tools, benchmarks, demos, papers for Large Language Models (like ChatGPT, LLaMA, GLM, Baichuan, etc) Evaluation on Language capabilities, Knowledge, Reasoning, Fairness and Safety.

RagaAI-Catalyst
RagaAI Catalyst is a comprehensive platform designed to enhance the management and optimization of LLM projects. It offers features such as project management, dataset management, evaluation management, trace management, prompt management, synthetic data generation, and guardrail management. These functionalities enable efficient evaluation and safeguarding of LLM applications.

Awesome-LLM-Constrained-Decoding
Awesome-LLM-Constrained-Decoding is a curated list of papers, code, and resources related to constrained decoding of Large Language Models (LLMs). The repository aims to facilitate reliable, controllable, and efficient generation with LLMs by providing a comprehensive collection of materials in this domain.

Reflection_Tuning
Reflection-Tuning is a project focused on improving the quality of instruction-tuning data through a reflection-based method. It introduces Selective Reflection-Tuning, where the student model can decide whether to accept the improvements made by the teacher model. The project aims to generate high-quality instruction-response pairs by defining specific criteria for the oracle model to follow and respond to. It also evaluates the efficacy and relevance of instruction-response pairs using the r-IFD metric. The project provides code for reflection and selection processes, along with data and model weights for both V1 and V2 methods.

RobustVLM
This repository contains code for the paper 'Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models'. It focuses on fine-tuning CLIP in an unsupervised manner to enhance its robustness against visual adversarial attacks. By replacing the vision encoder of large vision-language models with the fine-tuned CLIP models, it achieves state-of-the-art adversarial robustness on various vision-language tasks. The repository provides adversarially fine-tuned ViT-L/14 CLIP models and offers insights into zero-shot classification settings and clean accuracy improvements.

llm-leaderboard
Nejumi Leaderboard 3 is a comprehensive evaluation platform for large language models, assessing general language capabilities and alignment aspects. The evaluation framework includes metrics for language processing, translation, summarization, information extraction, reasoning, mathematical reasoning, entity extraction, knowledge/question answering, English, semantic analysis, syntactic analysis, alignment, ethics/moral, toxicity, bias, truthfulness, and robustness. The repository provides an implementation guide for environment setup, dataset preparation, configuration, model configurations, and chat template creation. Users can run evaluation processes using specified configuration files and log results to the Weights & Biases project.

opencompass
OpenCompass is a one-stop platform for large model evaluation, aiming to provide a fair, open, and reproducible benchmark for large model evaluation. Its main features include: * Comprehensive support for models and datasets: Pre-support for 20+ HuggingFace and API models, a model evaluation scheme of 70+ datasets with about 400,000 questions, comprehensively evaluating the capabilities of the models in five dimensions. * Efficient distributed evaluation: One line command to implement task division and distributed evaluation, completing the full evaluation of billion-scale models in just a few hours. * Diversified evaluation paradigms: Support for zero-shot, few-shot, and chain-of-thought evaluations, combined with standard or dialogue-type prompt templates, to easily stimulate the maximum performance of various models. * Modular design with high extensibility: Want to add new models or datasets, customize an advanced task division strategy, or even support a new cluster management system? Everything about OpenCompass can be easily expanded! * Experiment management and reporting mechanism: Use config files to fully record each experiment, and support real-time reporting of results.

open-unlearning
OpenUnlearning is an easily extensible framework that unifies LLM unlearning evaluation benchmarks. It provides efficient implementations of TOFU and MUSE unlearning benchmarks, supporting 5 unlearning methods, 3+ datasets, 6+ evaluation metrics, and 7+ LLMs. Users can easily extend the framework to incorporate more variants, collaborate by adding new benchmarks, unlearning methods, datasets, and evaluation metrics, and drive progress in the field.
20 - OpenAI Gpts

AIProductGPT: Add AI to your Product and get a PRD
With simple prompts, AIProductGPT instantly crafts detailed AI-powered requirements (PRD) and mocks so that you team can hit the ground running

GroceriesGPT
I manage your grocery lists to help you stay organized. *1/ Tell me what to add to a list. 2/ Ask me to add all ingredients for a receipe. 3/ Upload a receipt to remove items from your lists 4/ Add an item by simply uploading a picture. 5/ Ask me what items I would recommend you add to your lists.*

SpintaxGPT
I add spintax to emails for Instantly.ai. For more cold email tips, follow me on Twitter/𝕏 at @kenautoup

Meal Planner + Home Delivery
Find your next favorite recipe and instantly add fresh, affordable ingredients to your Walmart cart. Enjoy the convenience of home delivery or pickup. Delicious, healthy, and budget-friendly.

QR Code Creator & Customizer
Create a QR code in 30 seconds + add a cool design effect or overlay it on top of any image. Free, no watermarks, no email required, and we don't store your messages/images.

WP coding assistant
Friendly WordPress expert that will help you write custom plugins, functions, add custom fields and enhance your WordPress website.

AI Tools Guru
Find the best AI tools. Want to add your tool? Fill the form: https://forms.gle/uqMaC2EFZzh3Y4yT6

Awesome BFCM Deals Finder 2023
Get Suggestion on best BFMC deals. Add your deal ➡️ https://bit.ly/3sqY7DV

Fashion Sentinel
Expert GPT for fashion authenticity. Add photos and ask if it's real or fake. By neuralvault.

Turnitin Rate Killer
Help your essay get 0% rate! Will not add strange expression to you essay! Will not change the professional terminology you used in the essay! Reducing Turnitin similarity scores. 论文润色、论文降重、Ai率0%

Homescreen Analyzer
Get recommendations based on your phone's Homescreen screenshot! Just add the screenshot in here for analysis 📱🧐