Best AI tools for< Extend Benchmark >
20 - AI tool Sites

ExtendImageAI
ExtendImageAI is an AI-powered tool that allows you to extend your images using generative AI models like Dalle, Stable Diffusion, and Midjourney. With ExtendImageAI, you can create variations of your images while preserving the depth and context. This tool is perfect for designers, artists, and anyone who wants to explore the possibilities of generative AI.

Rerun
Rerun is an SDK, time-series database, and visualizer for temporal and multimodal data. It is used in fields like robotics, spatial computing, 2D/3D simulation, and finance to verify, debug, and explain data. Rerun allows users to log data like tensors, point clouds, and text to create streams, visualize and interact with live and recorded streams, build layouts, customize visualizations, and extend data and UI functionalities. The application provides a composable data model, dynamic schemas, and custom views for enhanced data visualization and analysis.

Staccato
Staccato is an AI-powered music creation and songwriting tool that helps musicians, songwriters, and producers overcome writer's block, generate unique melodies and lyrics, and learn music theory. With its intuitive interface and powerful AI algorithms, Staccato provides a range of features to enhance the creative process, including AI Instrument™ for generating MIDI music, AI Lyrics for creating song lyrics, and educational tools for understanding music theory and songwriting techniques.

QuickData Cloud
QuickData Cloud is an innovative platform designed to simplify collaboration on online notes and text data storage. It empowers users to store, manage, and retrieve text data effortlessly through a single API endpoint, providing real-time access to information. QuickData Cloud is the simplest and fastest method to collaborate and maintain continuity in data handling, ensuring data is accessible, secure, and easy to manage. With a focus on no-code developers, it offers storage of text, comments, JSON, and databases, along with upcoming AI features for data analysis.

OAK
OAK is an open-source platform for building and deploying custom AI agents quickly and easily. It offers a modular design, powerful plugins, and seamless integration with various AI models. OAK is scalable, flexible, and developer-friendly, allowing users to create AI agents in minutes without hassle.

Soundverse AI
Soundverse AI is an AI music generator and music assistant that allows users to create music instantly from text prompts, interact with a voice assistant for music-related help, chat with the assistant for music recommendations, extend existing tracks with new sections, isolate individual audio tracks from a mix, auto-complete songs using initial ideas, craft lyrics with AI assistance, and more. The platform offers a range of AI tools to help users iterate and personalize their music creation process, making it easy to transform ideas into music in seconds.

Sora AI Tech
Sora AI Tech is an advanced diffusion model capable of generating videos. It starts with a video that looks like static noise and gradually transforms it by removing the noise over many steps to produce a clear video. Sora can generate entire videos at once or extend the length of videos, catering to a wide range of video production needs.

Writekit
Writekit is an AI assistant designed to help users create high-quality content efficiently. It learns from user data to provide tailored suggestions, streamlining the content creation process. With features like adaptive learning, brand authenticity, and real-time collaboration, Writekit aims to enhance productivity and creativity for writers and content creators.

AI Video API
AI Video API is an all-in-one API hub for AI-generated video, offering a cost-effective, user-friendly, and robust solution for creating videos in various styles. The platform allows users to transform their ideas into stunning videos with just a few words, enabling text-to-video generation, image to animated video conversion, extended video length, dual output formats, and real-time alerts. With seamless integration into popular frameworks and support for multiple programming languages, AI Video API empowers users to innovate effortlessly, stay ahead of the curve, and scale their projects limitlessly.

123RF
123RF is a stock photo website that offers a variety of AI tools for photo editing. These tools include AI Image Generator, AI Image Upscaler, AI Generative Fill, AI Background Remix, AI Image Extender, and AI Writer. 123RF also offers a variety of other features, such as a photo editor, a video editor, and a music editor. 123RF's AI tools are designed to make photo editing easier and faster. With AI Image Generator, users can create unique visuals from scratch. AI Image Upscaler can be used to improve the quality of low-resolution images. AI Generative Fill can be used to remove or replace objects in images. AI Background Remix can be used to create professional backgrounds for products. AI Image Extender can be used to extend images to different ratios. AI Writer can be used to generate text for websites, social media, and other marketing materials. 123RF's AI tools are available to both free and paid users. Free users have access to a limited number of AI tools, while paid users have access to all of the AI tools. 123RF's AI tools are a valuable resource for anyone who needs to edit photos. These tools are easy to use and can save users a lot of time and effort.

PicSo
PicSo is an AI art generator that allows you to create artworks from text prompts. With PicSo, you can create images in any art style, from realistic to abstract. You can also edit existing images, extend images, and create AI portraits. PicSo is available as a web app and a mobile app.

OpenAI Sora
OpenAI Sora is a text-to-video model that can generate realistic and imaginative video scenes from text instructions. It's designed to simulate the physical world in motion, generating videos up to a minute long while maintaining visual quality and adhering to the user's prompt.

Aimerce
Aimerce is an AI tool designed to help Shopify brands unlock additional revenue in the cookieless world. It provides a solution to capture, use, and monetize high-quality first-party data, enabling marketers to overcome challenges posed by evolving data privacy regulations and maximize marketing performance.

BabySleepBot™
BabySleepBot™ is an AI-powered online DIY program designed to help parents teach their babies to sleep through the night and take longer day naps. The program offers personalized training tailored to different parenting styles and babies' individual needs. It includes audio clips, personalized training, companion guide, education on decoding baby's tired cues, custom routines, and access to results within three weeks. The program is led by Jennifer, Australia's leading baby sleep consultant with 22+ years of experience and a proven track record of helping thousands of families achieve successful sleep outcomes.

OpenResty
The website appears to be displaying a '403 Forbidden' error message, which indicates that the server is refusing to respond to the request. This error is often caused by incorrect permissions on the server or a misconfiguration in the server settings. The message 'openresty' suggests that the server may be running the OpenResty web platform. OpenResty is a web platform based on NGINX and Lua that is commonly used to build high-performance web applications. It provides a powerful and flexible way to extend NGINX with Lua scripts, allowing for advanced web server functionality.

QuillWord
QuillWord is an AI-powered text editor designed to enhance academic and research writing. It offers a range of AI-powered tools, including an email writer, text summarizer, outline generator, essay rewriter, essay extender, essay shortener, essay introduction generator, essay conclusion generator, essay topic generator, research title generator, abstract generator, essay checker, and AI-powered autocompletion. QuillWord also provides citation support in various styles, a reference library, and an AI assistant called Copilot. It is suitable for students, teachers, researchers, and writers who want to improve their writing efficiency and quality.

IntelliPlugin
IntelliPlugin is an AI-powered WordPress plugin development tool that allows users to generate custom-made WordPress plugins without the need to write any code. By leveraging artificial intelligence, IntelliPlugin creates precise plugins tailored to the user's requirements. Users can easily edit and customize the functionality of the generated plugins. The tool seamlessly integrates with WordPress, BuddyPress, WooCommerce, and other plugins to extend website functionality. IntelliPlugin provides full control to review the generated plugin before activation, ensuring a seamless user experience.

Astra
Astra is a universal API for LLM function calling that supercharges LLMs with integrations using a single line of code. It allows users to conveniently leverage function calling in LLMs with over 2,200 integrations, manage authentication profiles, import tools easily, and enable function calling with any LLM. Astra replaces JSON with a type-safe UI, making integration management simpler. The application extends the capabilities of LLMs without altering their core structure, offering a seamless layer of integrations and function execution.

Heal.dev
Heal.dev is an AI-powered platform that offers an easy way to write stable end-to-end tests by automating regression testing, end-to-end tests, and production smoke tests in minutes. It provides tools for defining stable tests in plain English, automating complex checks with AI-powered assertions, composing tests with blocks, extending functionality with JavaScript code, and detecting bugs smartly. Heal.dev aims to speed up development cycles, eliminate flaky tests, and allow teams to focus on shipping great software.

SPUN
SPUN is a platform that helps foreigners relocate to or extend their stay in Indonesia. It provides a range of services, including visa and permit assistance, travel insurance, and accommodation options. SPUN is powered by a network of virtual assistants and AI, which helps to guide users through the relocation process and answer their questions.
20 - Open Source AI Tools

appworld
AppWorld is a high-fidelity execution environment of 9 day-to-day apps, operable via 457 APIs, populated with digital activities of ~100 people living in a simulated world. It provides a benchmark of natural, diverse, and challenging autonomous agent tasks requiring rich and interactive coding. The repository includes implementations of AppWorld apps and APIs, along with tests. It also introduces safety features for code execution and provides guides for building agents and extending the benchmark.

PDEBench
PDEBench provides a diverse and comprehensive set of benchmarks for scientific machine learning, including challenging and realistic physical problems. The repository consists of code for generating datasets, uploading and downloading datasets, training and evaluating machine learning models as baselines. It features a wide range of PDEs, realistic and difficult problems, ready-to-use datasets with various conditions and parameters. PDEBench aims for extensibility and invites participation from the SciML community to improve and extend the benchmark.

eval-dev-quality
DevQualityEval is an evaluation benchmark and framework designed to compare and improve the quality of code generation of Language Model Models (LLMs). It provides developers with a standardized benchmark to enhance real-world usage in software development and offers users metrics and comparisons to assess the usefulness of LLMs for their tasks. The tool evaluates LLMs' performance in solving software development tasks and measures the quality of their results through a point-based system. Users can run specific tasks, such as test generation, across different programming languages to evaluate LLMs' language understanding and code generation capabilities.

Tutel
Tutel MoE is an optimized Mixture-of-Experts implementation that offers a parallel solution with 'No-penalty Parallism/Sparsity/Capacity/Switching' for modern training and inference. It supports Pytorch framework (version >= 1.10) and various GPUs including CUDA and ROCm. The tool enables Full Precision Inference of MoE-based Deepseek R1 671B on AMD MI300. Tutel provides features like all-to-all benchmarking, tensorcore option, NCCL timeout settings, Megablocks solution, and dynamic switchable configurations. Users can run Tutel in distributed mode across multiple GPUs and machines. The tool allows for custom MoE implementations and offers detailed usage examples and reference documentation.

open-unlearning
OpenUnlearning is an easily extensible framework that unifies LLM unlearning evaluation benchmarks. It provides efficient implementations of TOFU and MUSE unlearning benchmarks, supporting 5 unlearning methods, 3+ datasets, 6+ evaluation metrics, and 7+ LLMs. Users can easily extend the framework to incorporate more variants, collaborate by adding new benchmarks, unlearning methods, datasets, and evaluation metrics, and drive progress in the field.

RouteLLM
RouteLLM is a framework for serving and evaluating LLM routers. It allows users to launch an OpenAI-compatible API that routes requests to the best model based on cost thresholds. Trained routers are provided to reduce costs while maintaining performance. Users can easily extend the framework, compare router performance, and calibrate cost thresholds. RouteLLM supports multiple routing strategies and benchmarks, offering a lightweight server and evaluation framework. It enables users to evaluate routers on benchmarks, calibrate thresholds, and modify model pairs. Contributions for adding new routers and benchmarks are welcome.

ck
Collective Mind (CM) is a collection of portable, extensible, technology-agnostic and ready-to-use automation recipes with a human-friendly interface (aka CM scripts) to unify and automate all the manual steps required to compose, run, benchmark and optimize complex ML/AI applications on any platform with any software and hardware: see online catalog and source code. CM scripts require Python 3.7+ with minimal dependencies and are continuously extended by the community and MLCommons members to run natively on Ubuntu, MacOS, Windows, RHEL, Debian, Amazon Linux and any other operating system, in a cloud or inside automatically generated containers while keeping backward compatibility - please don't hesitate to report encountered issues here and contact us via public Discord Server to help this collaborative engineering effort! CM scripts were originally developed based on the following requirements from the MLCommons members to help them automatically compose and optimize complex MLPerf benchmarks, applications and systems across diverse and continuously changing models, data sets, software and hardware from Nvidia, Intel, AMD, Google, Qualcomm, Amazon and other vendors: * must work out of the box with the default options and without the need to edit some paths, environment variables and configuration files; * must be non-intrusive, easy to debug and must reuse existing user scripts and automation tools (such as cmake, make, ML workflows, python poetry and containers) rather than substituting them; * must have a very simple and human-friendly command line with a Python API and minimal dependencies; * must require minimal or zero learning curve by using plain Python, native scripts, environment variables and simple JSON/YAML descriptions instead of inventing new workflow languages; * must have the same interface to run all automations natively, in a cloud or inside containers. CM scripts were successfully validated by MLCommons to modularize MLPerf inference benchmarks and help the community automate more than 95% of all performance and power submissions in the v3.1 round across more than 120 system configurations (models, frameworks, hardware) while reducing development and maintenance costs.

LongRoPE
LongRoPE is a method to extend the context window of large language models (LLMs) beyond 2 million tokens. It identifies and exploits non-uniformities in positional embeddings to enable 8x context extension without fine-tuning. The method utilizes a progressive extension strategy with 256k fine-tuning to reach a 2048k context. It adjusts embeddings for shorter contexts to maintain performance within the original window size. LongRoPE has been shown to be effective in maintaining performance across various tasks from 4k to 2048k context lengths.

RAGLAB
RAGLAB is a modular, research-oriented open-source framework for Retrieval-Augmented Generation (RAG) algorithms. It offers reproductions of 6 existing RAG algorithms and a comprehensive evaluation system with 10 benchmark datasets, enabling fair comparisons between RAG algorithms and easy expansion for efficient development of new algorithms, datasets, and evaluation metrics. The framework supports the entire RAG pipeline, provides advanced algorithm implementations, fair comparison platform, efficient retriever client, versatile generator support, and flexible instruction lab. It also includes features like Interact Mode for quick understanding of algorithms and Evaluation Mode for reproducing paper results and scientific research.

aiida-core
AiiDA (www.aiida.net) is a workflow manager for computational science with a strong focus on provenance, performance and extensibility. **Features** * **Workflows:** Write complex, auto-documenting workflows in python, linked to arbitrary executables on local and remote computers. The event-based workflow engine supports tens of thousands of processes per hour with full checkpointing. * **Data provenance:** Automatically track inputs, outputs & metadata of all calculations in a provenance graph for full reproducibility. Perform fast queries on graphs containing millions of nodes. * **HPC interface:** Move your calculations to a different computer by changing one line of code. AiiDA is compatible with schedulers like SLURM, PBS Pro, torque, SGE or LSF out of the box. * **Plugin interface:** Extend AiiDA with plugins for new simulation codes (input generation & parsing), data types, schedulers, transport modes and more. * **Open Science:** Export subsets of your provenance graph and share them with peers or make them available online for everyone on the Materials Cloud. * **Open source:** AiiDA is released under the MIT open source license

InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) based on InternLM2-7B excelling in free-form text-image composition and comprehension. It boasts several amazing capabilities and applications: * **Free-form Interleaved Text-Image Composition** : InternLM-XComposer2 can effortlessly generate coherent and contextual articles with interleaved images following diverse inputs like outlines, detailed text requirements and reference images, enabling highly customizable content creation. * **Accurate Vision-language Problem-solving** : InternLM-XComposer2 accurately handles diverse and challenging vision-language Q&A tasks based on free-form instructions, excelling in recognition, perception, detailed captioning, visual reasoning, and more. * **Awesome performance** : InternLM-XComposer2 based on InternLM2-7B not only significantly outperforms existing open-source multimodal models in 13 benchmarks but also **matches or even surpasses GPT-4V and Gemini Pro in 6 benchmarks** We release InternLM-XComposer2 series in three versions: * **InternLM-XComposer2-4KHD-7B** 🤗: The high-resolution multi-task trained VLLM model with InternLM-7B as the initialization of the LLM for _High-resolution understanding_ , _VL benchmarks_ and _AI assistant_. * **InternLM-XComposer2-VL-7B** 🤗 : The multi-task trained VLLM model with InternLM-7B as the initialization of the LLM for _VL benchmarks_ and _AI assistant_. **It ranks as the most powerful vision-language model based on 7B-parameter level LLMs, leading across 13 benchmarks.** * **InternLM-XComposer2-VL-1.8B** 🤗 : A lightweight version of InternLM-XComposer2-VL based on InternLM-1.8B. * **InternLM-XComposer2-7B** 🤗: The further instruction tuned VLLM for _Interleaved Text-Image Composition_ with free-form inputs. Please refer to Technical Report and 4KHD Technical Reportfor more details.

ShapeLLM
ShapeLLM is the first 3D Multimodal Large Language Model designed for embodied interaction, exploring a universal 3D object understanding with 3D point clouds and languages. It supports single-view colored point cloud input and introduces a robust 3D QA benchmark, 3D MM-Vet, encompassing various variants. The model extends the powerful point encoder architecture, ReCon++, achieving state-of-the-art performance across a range of representation learning tasks. ShapeLLM can be used for tasks such as training, zero-shot understanding, visual grounding, few-shot learning, and zero-shot learning on 3D MM-Vet.

LLaMa2lang
LLaMa2lang is a repository containing convenience scripts to finetune LLaMa3-8B (or any other foundation model) for chat towards any language that isn't English. The repository aims to improve the performance of LLaMa3 for non-English languages by combining fine-tuning with RAG. Users can translate datasets, extract threads, turn threads into prompts, and finetune models using QLoRA and PEFT. Additionally, the repository supports translation models like OPUS, M2M, MADLAD, and base datasets like OASST1 and OASST2. The process involves loading datasets, translating them, combining checkpoints, and running inference using the newly trained model. The repository also provides benchmarking scripts to choose the right translation model for a target language.

khoj
Khoj is an open-source, personal AI assistant that extends your capabilities by creating always-available AI agents. You can share your notes and documents to extend your digital brain, and your AI agents have access to the internet, allowing you to incorporate real-time information. Khoj is accessible on Desktop, Emacs, Obsidian, Web, and Whatsapp, and you can share PDF, markdown, org-mode, notion files, and GitHub repositories. You'll get fast, accurate semantic search on top of your docs, and your agents can create deeply personal images and understand your speech. Khoj is self-hostable and always will be.

EasyEdit
EasyEdit is a Python package for edit Large Language Models (LLM) like `GPT-J`, `Llama`, `GPT-NEO`, `GPT2`, `T5`(support models from **1B** to **65B**), the objective of which is to alter the behavior of LLMs efficiently within a specific domain without negatively impacting performance across other inputs. It is designed to be easy to use and easy to extend.

Qwen
Qwen is a series of large language models developed by Alibaba DAMO Academy. It outperforms the baseline models of similar model sizes on a series of benchmark datasets, e.g., MMLU, C-Eval, GSM8K, MATH, HumanEval, MBPP, BBH, etc., which evaluate the models’ capabilities on natural language understanding, mathematic problem solving, coding, etc. Qwen models outperform the baseline models of similar model sizes on a series of benchmark datasets, e.g., MMLU, C-Eval, GSM8K, MATH, HumanEval, MBPP, BBH, etc., which evaluate the models’ capabilities on natural language understanding, mathematic problem solving, coding, etc. Qwen-72B achieves better performance than LLaMA2-70B on all tasks and outperforms GPT-3.5 on 7 out of 10 tasks.

MM-RLHF
MM-RLHF is a comprehensive project for aligning Multimodal Large Language Models (MLLMs) with human preferences. It includes a high-quality MLLM alignment dataset, a Critique-Based MLLM reward model, a novel alignment algorithm MM-DPO, and benchmarks for reward models and multimodal safety. The dataset covers image understanding, video understanding, and safety-related tasks with model-generated responses and human-annotated scores. The reward model generates critiques of candidate texts before assigning scores for enhanced interpretability. MM-DPO is an alignment algorithm that achieves performance gains with simple adjustments to the DPO framework. The project enables consistent performance improvements across 10 dimensions and 27 benchmarks for open-source MLLMs.

FlagEmbedding
FlagEmbedding focuses on retrieval-augmented LLMs, consisting of the following projects currently: * **Long-Context LLM** : Activation Beacon * **Fine-tuning of LM** : LM-Cocktail * **Embedding Model** : Visualized-BGE, BGE-M3, LLM Embedder, BGE Embedding * **Reranker Model** : llm rerankers, BGE Reranker * **Benchmark** : C-MTEB

Awesome-LLMs-for-Video-Understanding
Awesome-LLMs-for-Video-Understanding is a repository dedicated to exploring Video Understanding with Large Language Models. It provides a comprehensive survey of the field, covering models, pretraining, instruction tuning, and hybrid methods. The repository also includes information on tasks, datasets, and benchmarks related to video understanding. Contributors are encouraged to add new papers, projects, and materials to enhance the repository.
8 - OpenAI Gpts

Prompt Muse
Extend the utility of readymade prompt libraries with your SMB's personalized prompt prefix.

Smartphone Repair Manual
A virtual smartphone repair manual offering detailed fixing instructions.

Fragrance Creator and Connoisseur GPT
I am a GPT specialized in providing bespoke recommendations for colognes and perfumes. My expertise extends to crafting unique fragrance creations, tailored to align with your individual preferences.
Scraping GPT Proxy and Web Scraping Tips
Scraping ChatGPT helps you with web scraping and proxy management. It provides advanced tips and strategies for efficiently handling CAPTCHAs, and managing IP rotations. Its expertise extends to ethical scraping practices, and optimizing proxy usage for seamless data retrieval