Best AI tools for< Submit To Leaderboard >
20 - AI tool Sites

Quicklisting
Quicklisting is an AI-powered tool that helps startups submit their information to over 200 directories and 500 newsletters. It automates the submission process, saving startups time and effort. Quicklisting also provides startups with access to a database of directories and newsletters, making it easy for them to find the right ones to submit to. With Quicklisting, startups can increase their online visibility, reach a wider audience, and get more backlinks to their website.

Unprompted
Unprompted is an AI image guessing game where players guess the words used to create AI-generated images. Players type words into the text box and submit to see if their guesses are correct. The game offers three new images to try every day, and players can check the answers from the previous day under the 'Yesterday' tab. Unprompted provides a fun and interactive way to engage with AI technology and test your creativity and imagination.

Afroverse
Afroverse is an AI-powered music investment platform designed specifically for the Afrobeat industry. It provides tools for demo submissions, cross-border collaborations, music distribution, investment opportunities, community engagement, and more.

SubmitAI
SubmitAI is an AI tool that offers a service to submit AI tools to over 100 directories, aiming to enhance visibility and impact for AI products. The platform handpicks influential directories to optimize traffic and backlinks, providing a seamless submission process. Users can choose from different submission plans to save time and effort, with detailed reports and data insights included. SubmitAI also offers community engagement opportunities within the AI industry, fostering collaboration and networking. The tool prioritizes user satisfaction and data security, ensuring encrypted information for directory submissions.

Fake Hacker News
The website is a platform where users can submit fake hacker news for testing purposes. Users can log in to submit their titles and test their submissions. The platform allows users to see how readers may respond to their posts. The website was built by Justin and Michael.

SoraPrompting
SoraPrompting is a website that provides a collection of prompts to help users get started with Sora prompting and create high-quality video content. The website also includes a form for users to submit their own prompts, which can then be reviewed and added to the collection for the community to explore and create videos from. Sora is OpenAI's revolutionary text-to-video model, designed to understand and simulate the physical world in motion. It aims to assist in solving real-world problems through dynamic interaction. Sora stands out by generating high-quality videos up to a minute long while maintaining visual excellence and adhering to user prompts. Its unique capabilities make it a game-changer in the AI landscape.

SciSummary
SciSummary is an AI tool designed to summarize scientific articles and research papers quickly and efficiently. It utilizes advanced AI technology, specifically GPT-3.5 and GPT-4 models, to provide accurate and concise summaries for busy scientists, students, and enthusiasts. The platform allows users to submit documents via email, upload articles to the dashboard, or attach PDFs for summarization. With features like unlimited summaries, figure and table analysis, and chat messages, SciSummary is a valuable resource for researchers looking to stay updated with the latest trends in research.

Free AI Apps Directory
The Free AI Apps Directory is a curated list of all free AI applications available for immediate use. Users can easily find and explore various AI apps through this platform. The website provides information on app launch status, device compatibility, and allows users to submit new apps. It is a valuable resource for individuals interested in leveraging AI technology for different purposes.

Arya by Leoforce
Arya by Leoforce is an AI recruitment software powered by ML technology. It offers advanced AI capabilities to source, screen, rank, and engage high-quality candidates faster. Arya empowers recruiters to focus on building meaningful candidate connections by going beyond simple keyword matching. It provides comprehensive talent sourcing and engagement capabilities, driving measurable improvements to candidate quality and reducing time to submit. Arya also enhances engagement and the candidate experience by providing access to new and updated candidate information daily, a 24/7 AI recruiting assistant, and a centralized multi-channel communication dashboard.

Navi AI Tools Directory
The website is a comprehensive AI directory platform that showcases a wide range of AI tools and applications. Users can explore and discover various AI-powered tools for different purposes, such as writing, marketing, paraphrasing, SEO, study, generating content, research, art, music, video, coding, photo editing, and more. The platform offers a free listing service for AI tool developers and is regularly updated with new tools. Users can easily navigate through the directory to find and access their favorite AI tools. Additionally, the platform provides information on how to submit AI tools, the categories supported, and the frequency of updates. The content is generated by GPT-4o from OpenAI, ensuring high-quality descriptions and details about the listed AI tools.

Huntr
Huntr is the world's first bug bounty platform for AI/ML. It provides a single place for security researchers to submit vulnerabilities, ensuring the security and stability of AI/ML applications, including those powered by Open Source Software (OSS).

Beauty.AI
Beauty.AI is an AI application that hosts an international beauty contest judged by artificial intelligence. The app allows humans to submit selfies for evaluation by AI algorithms that assess criteria linked to human beauty and health. The platform aims to challenge biases in perception and promote healthy aging through the use of deep learning and semantic analysis. Beauty.AI offers a unique opportunity for individuals to participate in a groundbreaking competition that combines technology and beauty standards.

Ask a Philosopher
Ask a Philosopher is a website where users can submit questions to be answered by a philosopher. The platform allows individuals to seek philosophical insights and perspectives on various topics. It serves as a space for intellectual discourse and exploration of ideas, offering a unique opportunity to engage with philosophical thinking in a practical and accessible manner.

WorkSync.AI
WorkSync.AI is an AI-powered job application extension designed to significantly speed up the job application process. It helps users apply to multiple job positions effortlessly by autofilling job applications with AI technology. The tool aims to improve the quality of job applications by providing prompts for cover letters and responses, making it easier for users to submit high-effort applications. Additionally, WorkSync.AI offers ApplyBot, a feature that automates the application process for LinkedIn EasyApply job applications with just one click. The tool also provides a dashboard with CV formats, candidate tips, and analytics to help users become better candidates and track their job application progress.

Imagica
Imagica is an innovative platform that allows users to build AI applications without any coding knowledge. Users can create AI functions, chat interfaces, and generate images using plain language descriptions. The platform offers real-time data integration, category templates, and multimodal input/output options. Imagica also provides monetization features and the ability to submit apps to Natural OS for wider distribution. With a focus on simplicity and creativity, Imagica empowers users to bring their ideas to life and create functional AI apps at the speed of thought.

Vivid
Vivid is a tool that syncs Figma designs with your codebase by generating and updating UI code. It allows users to submit designs directly in Figma, make edits to generated divs, and sync changes with design updates. Vivid isolates design styles, making it easier for developers to focus on functionality. The tool minimizes style clutter and provides auto-updating code that tracks changes in Figma.

AutoRepurpose
AutoRepurpose is a platform that allows users to repurpose YouTube videos into Twitter threads and LinkedIn posts effortlessly. With AutoRepurpose, users can grow their social media presence 10x faster by converting their video content into text for various platforms like Twitter, LinkedIn, newsletters, and more. The tool simplifies the process by enabling users to submit a YouTube video URL and receive the generated Twitter thread and LinkedIn post within minutes. AutoRepurpose offers a pay-as-you-go model, eliminating the need for subscriptions and allowing users to purchase credits only when needed.

Kindred Tales
Kindred Tales is an AI-assisted memoir writing service that helps users capture and preserve their life stories in a beautiful keepsake book. With the help of AI, Kindred Tales makes authoring your life story simple and enjoyable, offering various ways to write, including a classic composer, email, biographer, and transcription. The service provides over 100 meaningful questions to inspire writing, and users can also create their own topics or invite family to submit topics for a truly customized experience. Kindred Tales is perfect for preserving family legacy and sharing memories with future generations.

MeddiPop
MeddiPop is an AI-powered platform that seamlessly connects patients with medical practices in various industries such as plastic surgery, dermatology, cosmetic dentistry, and ophthalmology. The application streamlines the process by allowing patients to submit applications for services, which are then matched with the most suitable practice using AI algorithms. MeddiPop aims to revolutionize healthcare by simplifying patient-practice connections and optimizing appointment scheduling.

KhojGPT
KhojGPT is an AI tool that serves as a store and curation platform for GPTs (Generative Pre-trained Transformers). It allows users to submit their GPTs and sign in with Google for easy access. The platform aims to provide a curated collection of GPTs for various purposes, enhancing user experience and productivity in AI-related tasks.
20 - Open Source AI Tools

raid
RAID is the largest and most comprehensive dataset for evaluating AI-generated text detectors. It contains over 10 million documents spanning 11 LLMs, 11 genres, 4 decoding strategies, and 12 adversarial attacks. RAID is designed to be the go-to location for trustworthy third-party evaluation of popular detectors. The dataset covers diverse models, domains, sampling strategies, and attacks, making it a valuable resource for training detectors, evaluating generalization, protecting against adversaries, and comparing to state-of-the-art models from academia and industry.

Q-Bench
Q-Bench is a benchmark for general-purpose foundation models on low-level vision, focusing on multi-modality LLMs performance. It includes three realms for low-level vision: perception, description, and assessment. The benchmark datasets LLVisionQA and LLDescribe are collected for perception and description tasks, with open submission-based evaluation. An abstract evaluation code is provided for assessment using public datasets. The tool can be used with the datasets API for single images and image pairs, allowing for automatic download and usage. Various tasks and evaluations are available for testing MLLMs on low-level vision tasks.

CJA_Comprehensive_Jailbreak_Assessment
This public repository contains the paper 'Comprehensive Assessment of Jailbreak Attacks Against LLMs'. It provides a labeling method to label results using Python and offers the opportunity to submit evaluation results to the leaderboard. Full codes will be released after the paper is accepted.

VoiceBench
VoiceBench is a repository containing code and data for benchmarking LLM-Based Voice Assistants. It includes a leaderboard with rankings of various voice assistant models based on different evaluation metrics. The repository provides setup instructions, datasets, evaluation procedures, and a curated list of awesome voice assistants. Users can submit new voice assistant results through the issue tracker for updates on the ranking list.

OlympicArena
OlympicArena is a comprehensive benchmark designed to evaluate advanced AI capabilities across various disciplines. It aims to push AI towards superintelligence by tackling complex challenges in science and beyond. The repository provides detailed data for different disciplines, allows users to run inference and evaluation locally, and offers a submission platform for testing models on the test set. Additionally, it includes an annotation interface and encourages users to cite their paper if they find the code or dataset helpful.

TempCompass
TempCompass is a benchmark designed to evaluate the temporal perception ability of Video LLMs. It encompasses a diverse set of temporal aspects and task formats to comprehensively assess the capability of Video LLMs in understanding videos. The benchmark includes conflicting videos to prevent models from relying on single-frame bias and language priors. Users can clone the repository, install required packages, prepare data, run inference using examples like Video-LLaVA and Gemini, and evaluate the performance of their models across different tasks such as Multi-Choice QA, Yes/No QA, Caption Matching, and Caption Generation.

appworld
AppWorld is a high-fidelity execution environment of 9 day-to-day apps, operable via 457 APIs, populated with digital activities of ~100 people living in a simulated world. It provides a benchmark of natural, diverse, and challenging autonomous agent tasks requiring rich and interactive coding. The repository includes implementations of AppWorld apps and APIs, along with tests. It also introduces safety features for code execution and provides guides for building agents and extending the benchmark.

eval-scope
Eval-Scope is a framework for evaluating and improving large language models (LLMs). It provides a set of commonly used test datasets, metrics, and a unified model interface for generating and evaluating LLM responses. Eval-Scope also includes an automatic evaluator that can score objective questions and use expert models to evaluate complex tasks. Additionally, it offers a visual report generator, an arena mode for comparing multiple models, and a variety of other features to support LLM evaluation and development.

AgentLab
AgentLab is an open, easy-to-use, and extensible framework designed to accelerate web agent research. It provides features for developing and evaluating agents on various benchmarks supported by BrowserGym. The framework allows for large-scale parallel agent experiments using ray, building blocks for creating agents over BrowserGym, and a unified LLM API for OpenRouter, OpenAI, Azure, or self-hosted using TGI. AgentLab also offers reproducibility features, a unified LeaderBoard, and supports multiple benchmarks like WebArena, WorkArena, WebLinx, VisualWebArena, AssistantBench, GAIA, Mind2Web-live, and MiniWoB.

WildBench
WildBench is a tool designed for benchmarking Large Language Models (LLMs) with challenging tasks sourced from real users in the wild. It provides a platform for evaluating the performance of various models on a range of tasks. Users can easily add new models to the benchmark by following the provided guidelines. The tool supports models from Hugging Face and other APIs, allowing for comprehensive evaluation and comparison. WildBench facilitates running inference and evaluation scripts, enabling users to contribute to the benchmark and collaborate on improving model performance.

MMStar
MMStar is an elite vision-indispensable multi-modal benchmark comprising 1,500 challenge samples meticulously selected by humans. It addresses two key issues in current LLM evaluation: the unnecessary use of visual content in many samples and the existence of unintentional data leakage in LLM and LVLM training. MMStar evaluates 6 core capabilities across 18 detailed axes, ensuring a balanced distribution of samples across all dimensions.

Korean-SAT-LLM-Leaderboard
The Korean SAT LLM Leaderboard is a benchmarking project that allows users to test their fine-tuned Korean language models on a 10-year dataset of the Korean College Scholastic Ability Test (CSAT). The project provides a platform to compare human academic ability with the performance of large language models (LLMs) on various question types to assess reading comprehension, critical thinking, and sentence interpretation skills. It aims to share benchmark data, utilize a reliable evaluation dataset curated by the Korea Institute for Curriculum and Evaluation, provide annual updates to prevent data leakage, and promote open-source LLM advancement for achieving top-tier performance on the Korean CSAT.

LLMs-Planning
This repository contains code for three papers related to evaluating large language models on planning and reasoning about change. It includes benchmarking tools and analysis for assessing the planning abilities of large language models. The latest addition evaluates and enhances the planning and scheduling capabilities of a specific language reasoning model. The repository provides a static test set leaderboard showcasing model performance on various tasks with natural language and planning domain prompts.

babilong
BABILong is a generative benchmark designed to evaluate the performance of NLP models in processing long documents with distributed facts. It consists of 20 tasks that simulate interactions between characters and objects in various locations, requiring models to distinguish important information from irrelevant details. The tasks vary in complexity and reasoning aspects, with test samples potentially containing millions of tokens. The benchmark aims to challenge and assess the capabilities of Large Language Models (LLMs) in handling complex, long-context information.

rlhf_trojan_competition
This competition is organized by Javier Rando and Florian Tramèr from the ETH AI Center and SPY Lab at ETH Zurich. The goal of the competition is to create a method that can detect universal backdoors in aligned language models. A universal backdoor is a secret suffix that, when appended to any prompt, enables the model to answer harmful instructions. The competition provides a set of poisoned generation models, a reward model that measures how safe a completion is, and a dataset with prompts to run experiments. Participants are encouraged to use novel methods for red-teaming, automated approaches with low human oversight, and interpretability tools to find the trojans. The best submissions will be offered the chance to present their work at an event during the SaTML 2024 conference and may be invited to co-author a publication summarizing the competition results.

magpie
This is the official repository for 'Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing'. Magpie is a tool designed to synthesize high-quality instruction data at scale by extracting it directly from an aligned Large Language Models (LLMs). It aims to democratize AI by generating large-scale alignment data and enhancing the transparency of model alignment processes. Magpie has been tested on various model families and can be used to fine-tune models for improved performance on alignment benchmarks such as AlpacaEval, ArenaHard, and WildBench.

jobs
The 'jobs' repository by comma.ai focuses on solving self-driving cars by building a robotics stack that includes state-of-the-art machine learning models, operating system design, hardware development, and manufacturing. The company aims to deliver constant incremental progress in self-driving technology to users, with a focus on practical solutions rather than hype. Job opportunities at comma.ai include technical challenges, phone screenings, and paid micro-internships, with perks such as chef-prepared meals, on-site gym access, and health insurance. The teams at comma.ai are organized into web, systems, infrastructure, product, design, and electrical engineering, with specific challenges for each team. The repository also offers opportunities for non-job seekers to participate in challenges and win prizes.

nexa-sdk
Nexa SDK is a comprehensive toolkit supporting ONNX and GGML models for text generation, image generation, vision-language models (VLM), and text-to-speech (TTS) capabilities. It offers an OpenAI-compatible API server with JSON schema mode and streaming support, along with a user-friendly Streamlit UI. Users can run Nexa SDK on any device with Python environment, with GPU acceleration supported. The toolkit provides model support, conversion engine, inference engine for various tasks, and differentiating features from other tools.

LLM4SE
The collection is actively updated with the help of an internal literature search engine.

nlp-llms-resources
The 'nlp-llms-resources' repository is a comprehensive resource list for Natural Language Processing (NLP) and Large Language Models (LLMs). It covers a wide range of topics including traditional NLP datasets, data acquisition, libraries for NLP, neural networks, sentiment analysis, optical character recognition, information extraction, semantics, topic modeling, multilingual NLP, domain-specific LLMs, vector databases, ethics, costing, books, courses, surveys, aggregators, newsletters, papers, conferences, and societies. The repository provides valuable information and resources for individuals interested in NLP and LLMs.
20 - OpenAI Gpts

Better GPT Builder
Guides users in creating GPTs with a structured approach. Experimental! See https://github.com/allisonmorrell/gptbuilder for background, full prompts and files, and to submit ideas and issues.

EE-GPT
A search engine and troubleshooter for electrical engineers to promote an open-source community. Submit your questions, corrections and feedback to [email protected]

Metaverse Radio GPT
* Submit Your Music * Get Acquainted * Music * News * Talk * Broadcasting EVERYWHERE 24/7 * Metaverse Radio WMVR-db Chicago (www.Metaverse.Radio) * Ideal for music lovers and creators, it offers album art creation, music submission guidance, and a splash of humor.

Borrower's Defense Assistant
Assistance in understanding and filling out the Borrower's Defense to Repayment Form provided by the United States Department of Education.

(Unofficial) Bullhorn Support Agent
I am not affiliated with Bullhorn, nor do I have rights to this software. For this, please visit Bullhorn.com as they are the owner. The rights holders may ask me to remove this test bot.

Project Deliverable Submission Advisor
Guides project teams towards successful deliverable submissions.

Pawtrait Creator
Creates cartoon pet portraits. Upload a photo of your pet, type its name, submit it, and watch the magic happen.

Winternet - (Project Proposals)
Assists with Information Technology related project proposal creation and submission.

Hur bra är remissvaret?
Få feed-back på hur väl ett remissvar svarar mot Regeringskansliets önskemål om hur remissvar bör utformas.

孙溢高级护理职称申报材料准备助手
帮助你准备高级护理职称申报所需的各种材料的助手。可以根据你的申报职称级别、申报专业方向、申报单位等信息,为你生成一份符合格式要求和内容要求的申报材料清单,包括申报表、考核表、临床成果等资料。它还可以提供一些参考文献和范文,帮助你完善和优化你的申报材料。