Best AI tools for< Filter Datasets >
20 - AI tool Sites
Zelma
Zelma is an AI-powered research assistant that enables users to find, graph, and understand U.S. school testing data using plain English queries. It allows users to search student test data by school district, demographics, grade, and more, and presents the results with graphs, tables, and descriptions. Zelma aims to make education data accessible and understandable for everyone.
FoodAI
FoodAI.app is an AI-powered application that helps users generate cooking recipes based on the ingredients they have. Users can select the ingredients they want to use, and the AI will provide them with recipes using those ingredients. The application offers options to filter results based on dietary preferences, regions, and additional ingredients. With a user-friendly interface, FoodAI.app aims to simplify the cooking process and inspire creativity in the kitchen.
Binary Vulnerability Analysis
The website offers an AI-powered binary vulnerability scanner that allows users to upload a binary file for analysis. The tool decompiles the executable, removes filler, formats the code, and checks for vulnerabilities by comparing against a database of historical vulnerabilities. It utilizes a finetuned CodeT5+ Embedding model to generate function-wise embeddings and checks for similarities against the DiverseVul Dataset. The tool also uses SemGrep to identify vulnerabilities in the code.
PS2 AI Filter
PS2 AI Filter is an AI-powered tool that allows users to create PS2-style images from their photos. It is easy to use and can be done in seconds. The tool is available as a web app and as an iOS app.
PS2 Filter AI
PS2 Filter AI is an online tool that allows users to transform their photos and videos into a PS2-style aesthetic. The tool uses advanced algorithms to replicate the graphic style of PlayStation 2 games, giving users the ability to add a nostalgic, retro gaming look to their content. PS2 Filter AI is easy to use, with a user-friendly interface that makes it simple for anyone to apply the filter and transform their content. The tool is compatible with various file formats and devices, ensuring that users can enjoy the retro gaming aesthetics on any platform.
PS2 Filter AI
PS2 Filter AI is an artificial intelligence tool that allows users to transform themselves into PS2 Playstation video game characters with just one click. Users can create PS2 game covers, customize artwork, and relive the nostalgia of the PS2 era through this user-friendly and intuitive AI application. With a range of parameters and advanced features, users can easily craft unique and visually appealing images without the need for artistic or coding skills.
AI Filter
AI Filter is an online tool that allows users to transform their photos using various AI filters such as anime, clay, 3D, pixel, emoji, PS2, sticker, and more. It caters to both beginners and professionals, offering a simple and user-friendly experience to enhance and stylize images. Users can easily upload their photos and apply different filters to give them a unique and artistic touch.
PS2 Filter AI
PS2 Filter AI is an online platform that utilizes artificial intelligence to transform photos and images into the distinctive visual style of PlayStation 2 games. By leveraging advanced AI image generation techniques, users can create retro-gaming inspired artwork from their own pictures. The platform offers a user-friendly experience with customizable style options and text prompts to enhance creative freedom. PS2 Filter AI goes beyond traditional filters by faithfully recreating the iconic visuals of the PS2 era, including low polygons, pixelated textures, and unique lighting techniques. With lightning-fast processing, users can generate PS2-style artwork in seconds, making it a fun and creative tool for nostalgic gaming enthusiasts and art enthusiasts alike.
PS2 Filter AI Tool
PS2 Filter AI Tool is an online application that allows users to easily generate PS2 style images. By uploading an image, the AI quickly transforms it into a retro gaming visual experience reminiscent of the PlayStation 2 era. Users can download the generated images for free and share them on social media platforms like Twitter or Facebook. The tool provides a fun and nostalgic way to create unique visuals with a vintage gaming vibe.
Fuk.ai
Fuk.ai is a hate speech and profanity detection tool that utilizes Transformer-based neural network architectures with advanced natural language processing capabilities to filter out hate, bigotry, and profanity from online content. It offers a free software pricing model and allows users to analyze up to 1,000 characters for free. By creating an account, users can analyze up to 10,000 characters per month. Fuk.ai can be integrated into user-generated apps and websites to maintain a positive online environment.
CrushOn.AI
CrushOn.AI is a NSFW character AI chat where you can create and chat with your own custom AI characters. With our advanced AI technology, you can create characters that are truly unique and lifelike. You can choose their appearance, personality, and even their sexual preferences. Once you've created your character, you can chat with them about anything you want. They'll respond in a realistic and engaging way, and they'll even learn from your conversations. CrushOn.AI is the perfect way to explore your fantasies and have some fun with AI.
NSFW Character AI
NSFW Character AI is a free and unfiltered AI chatbot that allows users to create and interact with their own custom AI characters. With NSFW Character AI, you can create characters of any gender, race, or sexual orientation, and explore a wide range of topics, including sex, relationships, and other adult themes. NSFW Character AI is a great way to explore your sexuality and fantasies in a safe and private environment.
AI Girlfriend
AI Girlfriend is an AI application that offers users the experience of engaging in no-filter chats and interactions with virtual AI characters tailored to their preferences. Users can create memorable moments with over 1000 AI characters, enjoy responsive conversations, and explore new perspectives through personalized dialogues. The application aims to provide companionship and enhance mental health through immersive interactions with AI chatbots.
PS2filter.me
PS2filter.me is an AI-powered tool that allows users to transform their photos into images that resemble PS2 video game graphics. It is powered by the same AI technology as Replicate's Face to Many filter, but with some improvements to make it more fun and engaging. Users can simply select a photo from their camera roll or snap a new one, and the AI will apply the PS2 effect to their image seamlessly. They can then share their new look on social media or with their friends to show off their retro style.
Rubii
Rubii is an AI character platform that allows users to explore and create unique AI characters without any filters. Users can bring their characters to life, capturing memorable moments and memories. The platform offers a seamless login/register process for users to start their creative journey. With Rubii, users can unleash their creativity and imagination by designing and customizing AI characters in a personalized way, making each character truly unique and special.
Photo to Anime
Photo to Anime is a free, privacy-centric AI anime filter that allows users to transform their photos and text descriptions into captivating anime-style art. With its user-friendly interface and powerful AI capabilities, Photo to Anime empowers users to unleash their creativity and explore the world of anime art without any artistic or coding skills. The platform offers two main features: Photo-to-Anime, which converts personal photos into unique anime artwork, and Text-to-Anime, which turns written prompts into custom anime-style images. Photo to Anime ensures user privacy by processing images on-device, eliminating the need for cloud uploads. The platform is free to use, with no login or credit card required, making it accessible to all.
Image to Clay Style Online
Image to Clay Style Online is a free AI tool that allows users to generate custom clay-style images from uploaded images or text prompts. The tool uses AI technology to transform regular images into unique clay-style artworks. Users can explore various clay images, customize their creations, and download the final results. With a user-friendly interface, Image to Clay Style Online provides a fun and creative way to generate artistic clay images effortlessly.
AI Disturbance Overlay
AI Disturbance Overlay is an innovative tool designed to protect digital artwork from unauthorized copying and imitation by leveraging AI technology. The tool introduces subtle adjustments to images that are imperceptible to humans but significantly disrupt AI models, ensuring the security and integrity of artists' original creations. With features like Blind Spot Protection, Resistance to Image Processing Attacks, and Anti-Interference Protection, AI Disturbance Overlay offers comprehensive defense mechanisms against AI style theft. The tool is user-friendly, affordable, and provides different protection levels to cater to artists' diverse needs.
AI Girlfriend WTF
AI Girlfriend WTF is an ultimate AI roleplay chat application that allows users to interact with virtual AI girlfriends in various fantasy scenarios, from casual to extreme. Users can create their ideal girl, chat, and enjoy spicy images with personalized AI characters. The platform prioritizes user safety and privacy, offering a premium sexting AI chat and NSFW art experience. With cutting-edge artificial intelligence technology, AI Girlfriend WTF provides a realistic and engaging virtual companionship experience through AI chatting capabilities and an AI image generator.
NSFWCharacter AI
NSFWCharacter AI is an innovative AI application that allows users to create their own AI sex characters and engage in intimate conversations through chatbot interactions. The platform offers a unique experience by providing AI companionship with no filters, supporting NSFW content, and enabling users to generate AI hentai characters. With a global audience in mind, NSFWCharacter AI breaks traditional boundaries by offering unrestricted content and accepting inputs in multiple languages. Users can personalize characters easily by providing a short description and are encouraged to actively participate in character development. The application aims to turn virtual fantasies into actual experiences, providing a safe space for personal and tempting communication.
20 - Open Source AI Tools
DeepDanbooru
DeepDanbooru is an anime-style girl image tag estimation system written in Python. It allows users to estimate images using a live demo site. The tool requires specific packages to be installed and provides a structured dataset for training projects. Users can create training projects, download tags, filter datasets, and start training to estimate tags for images. The tool uses a specific dataset structure and project structure to facilitate the training process.
llm-datasets
LLM Datasets is a repository containing high-quality datasets, tools, and concepts for LLM fine-tuning. It provides datasets with characteristics like accuracy, diversity, and complexity to train large language models for various tasks. The repository includes datasets for general-purpose, math & logic, code, conversation & role-play, and agent & function calling domains. It also offers guidance on creating high-quality datasets through data deduplication, data quality assessment, data exploration, and data generation techniques.
HuggingFaceModelDownloader
The HuggingFace Model Downloader is a utility tool for downloading models and datasets from the HuggingFace website. It offers multithreaded downloading for LFS files and ensures the integrity of downloaded models with SHA256 checksum verification. The tool provides features such as nested file downloading, filter downloads for specific LFS model files, support for HuggingFace Access Token, and configuration file support. It can be used as a library or a single binary for easy model downloading and inference in projects.
UglyFeed
UglyFeed is a simple Python application designed to retrieve, aggregate, filter, rewrite, evaluate, and serve content (RSS feeds) written by a large language model. It provides features such as retrieving RSS feeds, aggregating feed items by similarity, rewriting content using various APIs, saving rewritten feeds to JSON files, converting JSON to valid RSS feed, serving XML feed via an HTTP server, deploying XML feed to GitHub or GitLab, and evaluating generated content. The tool can be used for smart content curation, dynamic blog generation, interactive educational tools, personalized reading experiences, brand monitoring, multilingual content delivery, enhanced RSS feeds, creative writing assistance, content repurposing, and fake news detection datasets. It is modular, extensible, and aims to empower users in content manipulation and delivery.
dolma
Dolma is a dataset and toolkit for curating large datasets for (pre)-training ML models. The dataset consists of 3 trillion tokens from a diverse mix of web content, academic publications, code, books, and encyclopedic materials. The toolkit provides high-performance, portable, and extensible tools for processing, tagging, and deduplicating documents. Key features of the toolkit include built-in taggers, fast deduplication, and cloud support.
awesome-mobile-robotics
The 'awesome-mobile-robotics' repository is a curated list of important content related to Mobile Robotics and AI. It includes resources such as courses, books, datasets, software and libraries, podcasts, conferences, journals, companies and jobs, laboratories and research groups, and miscellaneous resources. The repository covers a wide range of topics in the field of Mobile Robotics and AI, providing valuable information for enthusiasts, researchers, and professionals in the domain.
kernel-memory
Kernel Memory (KM) is a multi-modal AI Service specialized in the efficient indexing of datasets through custom continuous data hybrid pipelines, with support for Retrieval Augmented Generation (RAG), synthetic memory, prompt engineering, and custom semantic memory processing. KM is available as a Web Service, as a Docker container, a Plugin for ChatGPT/Copilot/Semantic Kernel, and as a .NET library for embedded applications. Utilizing advanced embeddings and LLMs, the system enables Natural Language querying for obtaining answers from the indexed data, complete with citations and links to the original sources. Designed for seamless integration as a Plugin with Semantic Kernel, Microsoft Copilot and ChatGPT, Kernel Memory enhances data-driven features in applications built for most popular AI platforms.
LLMeBench
LLMeBench is a flexible framework designed for accelerating benchmarking of Large Language Models (LLMs) in the field of Natural Language Processing (NLP). It supports evaluation of various NLP tasks using model providers like OpenAI, HuggingFace Inference API, and Petals. The framework is customizable for different NLP tasks, LLM models, and datasets across multiple languages. It features extensive caching capabilities, supports zero- and few-shot learning paradigms, and allows on-the-fly dataset download and caching. LLMeBench is open-source and continuously expanding to support new models accessible through APIs.
qb
QANTA is a system and dataset for question answering tasks. It provides a script to download datasets, preprocesses questions, and matches them with Wikipedia pages. The system includes various datasets, training, dev, and test data in JSON and SQLite formats. Dependencies include Python 3.6, `click`, and NLTK models. Elastic Search 5.6 is needed for the Guesser component. Configuration is managed through environment variables and YAML files. QANTA supports multiple guesser implementations that can be enabled/disabled. Running QANTA involves using `cli.py` and Luigi pipelines. The system accesses raw Wikipedia dumps for data processing. The QANTA ID numbering scheme categorizes datasets based on events and competitions.
datachain
DataChain is an open-source Python library for processing and curating unstructured data at scale. It supports AI-driven data curation using local ML models and LLM APIs, handles large datasets, and is Python-friendly with Pydantic objects. It excels at optimizing batch operations and is designed for offline data processing, curation, and ETL. Typical use cases include Computer Vision data curation, LLM analytics, and validation.
FedLLM-Bench
FedLLM-Bench is a realistic benchmark for the Federated Learning of Large Language Models community. It includes datasets for federated instruction tuning and preference alignment tasks, exhibiting diversities in language, quality, quantity, instruction, sequence length, embedding, and preference. The repository provides training scripts and code for open-ended evaluation, aiming to facilitate research and development in federated learning of large language models.
Cherry_LLM
Cherry Data Selection project introduces a self-guided methodology for LLMs to autonomously discern and select cherry samples from open-source datasets, minimizing manual curation and cost for instruction tuning. The project focuses on selecting impactful training samples ('cherry data') to enhance LLM instruction tuning by estimating instruction-following difficulty. The method involves phases like 'Learning from Brief Experience', 'Evaluating Based on Experience', and 'Retraining from Self-Guided Experience' to improve LLM performance.
premsql
PremSQL is an open-source library designed to help developers create secure, fully local Text-to-SQL solutions using small language models. It provides essential tools for building and deploying end-to-end Text-to-SQL pipelines with customizable components, ideal for secure, autonomous AI-powered data analysis. The library offers features like Local-First approach, Customizable Datasets, Robust Executors and Evaluators, Advanced Generators, Error Handling and Self-Correction, Fine-Tuning Support, and End-to-End Pipelines. Users can fine-tune models, generate SQL queries from natural language inputs, handle errors, and evaluate model performance against predefined metrics. PremSQL is extendible for customization and private data usage.
awesome-generative-information-retrieval
This repository contains a curated list of resources on generative information retrieval, including research papers, datasets, tools, and applications. Generative information retrieval is a subfield of information retrieval that uses generative models to generate new documents or passages of text that are relevant to a given query. This can be useful for a variety of tasks, such as question answering, summarization, and document generation. The resources in this repository are intended to help researchers and practitioners stay up-to-date on the latest advances in generative information retrieval.
SheetCopilot
SheetCopilot is an assistant agent that manipulates spreadsheets by following user commands. It leverages Large Language Models (LLMs) to interact with spreadsheets like a human expert, enabling non-expert users to complete tasks on complex software such as Google Sheets and Excel via a language interface. The tool observes spreadsheet states, polishes generated solutions based on external action documents and error feedback, and aims to improve success rate and efficiency. SheetCopilot offers a dataset with diverse task categories and operations, supporting operations like entry & manipulation, management, formatting, charts, and pivot tables. Users can interact with SheetCopilot in Excel or Google Sheets, executing tasks like calculating revenue, creating pivot tables, and plotting charts. The tool's evaluation includes performance comparisons with leading LLMs and VBA-based methods on specific datasets, showcasing its capabilities in controlling various aspects of a spreadsheet.
data-juicer
Data-Juicer is a one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs. It is a systematic & reusable library of 80+ core OPs, 20+ reusable config recipes, and 20+ feature-rich dedicated toolkits, designed to function independently of specific LLM datasets and processing pipelines. Data-Juicer allows detailed data analyses with an automated report generation feature for a deeper understanding of your dataset. Coupled with multi-dimension automatic evaluation capabilities, it supports a timely feedback loop at multiple stages in the LLM development process. Data-Juicer offers tens of pre-built data processing recipes for pre-training, fine-tuning, en, zh, and more scenarios. It provides a speedy data processing pipeline requiring less memory and CPU usage, optimized for maximum productivity. Data-Juicer is flexible & extensible, accommodating most types of data formats and allowing flexible combinations of OPs. It is designed for simplicity, with comprehensive documentation, easy start guides and demo configs, and intuitive configuration with simple adding/removing OPs from existing configs.
DataFrame
DataFrame is a C++ analytical library designed for data analysis similar to libraries in Python and R. It allows you to slice, join, merge, group-by, and perform various statistical, summarization, financial, and ML algorithms on your data. DataFrame also includes a large collection of analytical algorithms in form of visitors, ranging from basic stats to more involved analysis. You can easily add your own algorithms as well. DataFrame employs extensive multithreading in almost all its APIs, making it suitable for analyzing large datasets. Key principles followed in the library include supporting any type without needing new code, avoiding pointer chasing, having all column data in contiguous memory space, minimizing space usage, avoiding data copying, using multi-threading judiciously, and not protecting the user against garbage in, garbage out.
MMOS
MMOS (Mix of Minimal Optimal Sets) is a dataset designed for math reasoning tasks, offering higher performance and lower construction costs. It includes various models and data subsets for tasks like arithmetic reasoning and math word problem solving. The dataset is used to identify minimal optimal sets through reasoning paths and statistical analysis, with a focus on QA-pairs generated from open-source datasets. MMOS also provides an auto problem generator for testing model robustness and scripts for training and inference.
cambrian
Cambrian-1 is a fully open project focused on exploring multimodal Large Language Models (LLMs) with a vision-centric approach. It offers competitive performance across various benchmarks with models at different parameter levels. The project includes training configurations, model weights, instruction tuning data, and evaluation details. Users can interact with Cambrian-1 through a Gradio web interface for inference. The project is inspired by LLaVA and incorporates contributions from Vicuna, LLaMA, and Yi. Cambrian-1 is licensed under Apache 2.0 and utilizes datasets and checkpoints subject to their respective original licenses.
20 - OpenAI Gpts
ChromaSpectra Filter Creator
Merge a holographic shimmer with RGB splitting for a surreal, digital-art look.
Signal Processing Advisor
Provides expert guidance on signal processing in engineering projects.
Air Purifier Servicer Assistant
Hello I'm Air Purifier Servicer Assistant! What would you like help with today?
Prompt Injection Detector
GPT used to classify prompts as valid inputs or injection attempts. Json output.
Photo Mentor
Upload photo! I will provide clear, concise photo analysis and improvement advice.
South Parkify
Transform any photo into a visually stunning South Park moment with just a few clicks.
Photo Multiverse
Upload your photo to create an AI persona, then change 🏞️ background, convert to ✏️ cartoon, or edit character styles. Try with selfies, items or pet images!