Best AI tools for< Experiment With Tests >
20 - AI tool Sites

Heatseeker
Heatseeker is an AI-powered market experimentation tool that helps businesses predict customer preferences, conduct feature tests, and generate value propositions. It enables users to answer critical growth questions about market, audience, and product features through AI-powered experiments. Heatseeker provides insights into market trends, competitor analysis, and helps in making data-driven decisions. The platform offers curated recommendations, competitive intelligence, and continuous testing for refining strategies. It automates ad campaign generation, data collection, and provides recommendations for launching new products. Heatseeker is designed to help businesses optimize their marketing efforts and improve their product offerings.

Permar
Permar is an AI-powered website optimization tool that helps businesses increase their conversion rates. It uses reinforcement learning techniques to dynamically adapt website optimization, resulting in an average uplift in conversion rates of 10-12% compared to static A/B tests. Permar also offers a complete toolkit of features to help businesses create high-converting landing pages, including dynamic A/B testing, real-time optimization, and growth experiment ideas.

AI Test Kitchen
AI Test Kitchen is a website that provides a variety of AI-powered tools for creative professionals. These tools can be used to generate images, music, and text, as well as to explore different creative concepts. The website is designed to be a place where users can experiment with AI and learn how to use it to enhance their creative process.

Freeplay
Freeplay is a tool that helps product teams experiment, test, monitor, and optimize AI features for customers. It provides a single pane of glass for the entire team, lightweight developer SDKs for Python, Node, and Java, and deployment options to meet compliance needs. Freeplay also offers best practices for the entire AI development lifecycle.

Spacely AI
Spacely AI is an AI rendering solution that specializes in interior, room, and home design. It utilizes advanced artificial intelligence algorithms to create realistic and immersive visualizations of interior spaces. With Spacely AI, users can easily experiment with different design elements, layouts, and color schemes to bring their ideas to life. Whether you are an interior designer, architect, or homeowner, Spacely AI provides a powerful tool to visualize and plan your design projects with precision and creativity.

Release.ai
Release.ai is an AI-centric platform that allows developers, operations, and leadership teams to easily deploy and manage AI applications. It offers pre-configured templates for popular open-source technologies, private AI environments for secure development, and access to GPU resources. With Release.ai, users can build, test, and scale AI solutions quickly and efficiently within their own boundaries.

Sophi
Mather Economics is a global leader in providing technology, analytics, and advisory services to enable publishers to monetize content and grow subscriber revenue. Our Sophi platform uses industry-leading AI technology and automation to monetize content more effectively and efficiently, driving both revenue growth and cost savings. Our approach to dynamic pricing has proven to drive significant revenue increases over the past 20 years and continues to lead the industry. Overall, our value proposition enables fast, economical, and verifiable return on investment through our efficient application of predictive analytics, technology, A/B testing, and data science.

RagaAI Catalyst
RagaAI Catalyst is a sophisticated AI observability, monitoring, and evaluation platform designed to help users observe, evaluate, and debug AI agents at all stages of Agentic AI workflows. It offers features like visualizing trace data, instrumenting and monitoring tools and agents, enhancing AI performance, agentic testing, comprehensive trace logging, evaluation for each step of the agent, enterprise-grade experiment management, secure and reliable LLM outputs, finetuning with human feedback integration, defining custom evaluation logic, generating synthetic data, and optimizing LLM testing with speed and precision. The platform is trusted by AI leaders globally and provides a comprehensive suite of tools for AI developers and enterprises.

Kameleoon
Kameleoon is an AI-driven A/B testing platform and personalization tool designed to optimize web experiences through experimentation and feature management. It offers a single platform with AI-powered conversion capabilities, strong security features, and powerful integrations. Kameleoon caters to a wide range of industries, including E-commerce, Retail, Travel, Automotive, Financial Services, Media, Healthcare, and B2B SaaS. The platform enables users to run experiments, personalize content, manage features, and analyze real-time data to enhance user experiences and drive growth.

UpTrain
UpTrain is a full-stack LLMOps platform designed to help users confidently scale AI by providing a comprehensive solution for all production needs, from evaluation to experimentation to improvement. It offers diverse evaluations, automated regression testing, enriched datasets, and innovative techniques to generate high-quality scores. UpTrain is built for developers, compliant to data governance needs, cost-efficient, remarkably reliable, and open-source. It provides precision metrics, task understanding, safeguard systems, and covers a wide range of language features and quality aspects. The platform is suitable for developers, product managers, and business leaders looking to enhance their LLM applications.

Teste.ai
Teste.ai is an AI-powered platform that allows users to create software testing scenarios and test cases using top-notch artificial intelligence technology. The platform offers a variety of tools based on AI to accelerate the software quality testing journey, helping testers cover a wide range of requirements with a vast array of test scenarios efficiently. Teste.ai's intelligent features enable users to save time and enhance efficiency in creating, executing, and managing software tests. With advanced AI integration, the platform provides automatic generation of test cases based on software documentation or specific requirements, ensuring comprehensive test coverage and precise responses to testing queries.

More Views AI
More Views AI is an AI tool designed to help YouTube content creators increase their view count by testing different video settings. The tool uses artificial intelligence to analyze video titles, suggest variations, and run A/B tests to determine the best-performing title. It offers features such as automatic A/B toggling, performance tracking, views optimization, AI-generated title suggestions, and thumbnail testing. Users can connect their YouTube account, set up experiments in seconds, and let the algorithm optimize their video titles to attract more views.

Human or Not
Human or Not is a social Turing game where you chat with someone for two minutes and try to figure out if it was a fellow human or an AI bot. The experiment has ended, but you can read more about the research here.

AI Website & Landing Pages
The AI Website & Landing Pages tool allows users to create AI-designed websites and landing pages in just 10 seconds. It offers a streamlined experience with features such as AI design and copy, free and custom domains, analytics and insights, A/B testing, AI sales and support chatbot, SEO optimization, free image and video library, custom forms, webhook integration, auto page translation, high-speed streaming, adaptive design, curated playlists, and more. Users can optimize their results with AB testing, AI-generated versions, quick experiments, and in-depth reports. The tool also enables users to run ads globally, create multilingual landing pages, and reach a global audience with fast campaigns. It provides effortless editing with 1-click edit and publish functionality, instant previews, and seamless publishing. The tool is user-friendly, requiring no code or drag-and-drop actions, making website and landing page creation quick and easy.

Qlik AutoML
Qlik AutoML is an AI tool that offers automated machine learning for analytics teams. It allows users to create machine learning experiments, identify key drivers in data, train models, and make predictions. With a focus on no-code machine learning, Qlik AutoML simplifies the process of generating predictive models and understanding outcomes. The tool enables users to explore predictive data, test what-if scenarios, and leverage AI-powered connectors for seamless integration with other AI and machine learning tools.

QuarkIQL
QuarkIQL is a generative testing tool for computer vision APIs. It allows users to create custom test images and requests with just a few clicks. QuarkIQL also provides a log of your queries so you can run more experiments without starting from square one.

Byterat
Byterat is a cloud-based platform that provides battery data management, visualization, and analytics. It offers an end-to-end data pipeline that automatically synchronizes, processes, and visualizes materials, manufacturing, and test data from all labs. Byterat also provides 24/7 access to experiments from anywhere in the world and integrates seamlessly with current workflows. It is customizable to specific cell chemistries and allows users to build custom visualizations, dashboards, and analyses. Byterat's AI-powered battery research has been published in leading journals, and its team has pioneered a new class of models that extract tell-tale signals of battery health from electrical signals to forecast future performance.

Evolv AI
Evolv AI is an AI-led experience optimization platform that drives measurable business growth by continuously learning, optimizing, and accelerating UX experimentation to deliver results. It uses generative AI to evaluate digital experiences, identify conversion issues, and provide performance-boosting UX recommendations. Users can train the AI with specific business information, simplify prototyping, and implement with support. Evolv AI focuses on active learning through experimentation, leveraging AI and machine learning to create personalized experiences across multiple touchpoints. The platform integrates well with existing technology stacks, enabling continuous optimization and impactful business growth.

Google Labs
Google Labs is a website that showcases experimental AI tools and technology developed by Google. These tools are designed to help users explore the potential of AI in various fields, including creativity, productivity, and education. Some of the featured tools include: - **LABS.GOOGLE**: A platform for experimenting with the future of AI, including tools for creating images from text, generating music, and writing scripts for home automation. - **NotebookLM**: A personalized AI collaborator designed to help users with their thinking and writing. - **Say What You See**: A tool that helps users learn the art of prompting and improving their image-reading skills. - **Help Me Script**: A tool that turns text into home automation scripts for Google Home. - **ImageFX**: A tool that transforms text into images, allowing users to explore endless possibilities. - **Gen AI in Chrome**: A tool that creates themes with AI, organizes tabs, and helps users write more confidently on the web. - **MusicFX**: A tool that describes a musical idea and brings it to life. - **Duet AI**: A tool that helps users create, write, visualize, and organize in new ways with collaborative AI tools in Google Workspace. - **TextFX**: A tool that supercharges the writing process with AI-powered language tools.

GPT4Free
GPT4Free is a free playground for experimenting with ChatGPT without the hassle of APIs, logins, or restrictions. It offers a wide range of features, including a prompt library, organized chats, import/export functionality, customizable model parameters, and multiple language support. GPT4Free also provides access to GPT Reverse Proxy, allowing users to interact with GPT4 & GPT3.5 and GPT4 Vision from anywhere in the world.
20 - Open Source AI Tools

LLM-Learn-PK
LLM-Learn-PK is a repository for testing various LLM and RAG tests. It serves as a learning platform where the creator experiments with different tests and learns in the process.

rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool that helps you conduct experiments and evaluations using Azure AI Search and RAG pattern. It offers a rich set of features, including experiment setup, integration with Azure AI Search, Azure Machine Learning, MLFlow, and Azure OpenAI, multiple document chunking strategies, query generation, multiple search types, sub-querying, re-ranking, metrics and evaluation, report generation, and multi-lingual support. The tool is designed to make it easier and faster to run experiments and evaluations of search queries and quality of response from OpenAI, and is useful for researchers, data scientists, and developers who want to test the performance of different search and OpenAI related hyperparameters, compare the effectiveness of various search strategies, fine-tune and optimize parameters, find the best combination of hyperparameters, and generate detailed reports and visualizations from experiment results.

generative-ai-application-builder-on-aws
The Generative AI Application Builder on AWS (GAAB) is a solution that provides a web-based management dashboard for deploying customizable Generative AI (Gen AI) use cases. Users can experiment with and compare different combinations of Large Language Model (LLM) use cases, configure and optimize their use cases, and integrate them into their applications for production. The solution is targeted at novice to experienced users who want to experiment and productionize different Gen AI use cases. It uses LangChain open-source software to configure connections to Large Language Models (LLMs) for various use cases, with the ability to deploy chat use cases that allow querying over users' enterprise data in a chatbot-style User Interface (UI) and support custom end-user implementations through an API.

llm-analysis
llm-analysis is a tool designed for Latency and Memory Analysis of Transformer Models for Training and Inference. It automates the calculation of training or inference latency and memory usage for Large Language Models (LLMs) or Transformers based on specified model, GPU, data type, and parallelism configurations. The tool helps users to experiment with different setups theoretically, understand system performance, and optimize training/inference scenarios. It supports various parallelism schemes, communication methods, activation recomputation options, data types, and fine-tuning strategies. Users can integrate llm-analysis in their code using the `LLMAnalysis` class or use the provided entry point functions for command line interface. The tool provides lower-bound estimations of memory usage and latency, and aims to assist in achieving feasible and optimal setups for training or inference.

plandex
Plandex is an open source, terminal-based AI coding engine designed for complex tasks. It uses long-running agents to break up large tasks into smaller subtasks, helping users work through backlogs, navigate unfamiliar technologies, and save time on repetitive tasks. Plandex supports various AI models, including OpenAI, Anthropic Claude, Google Gemini, and more. It allows users to manage context efficiently in the terminal, experiment with different approaches using branches, and review changes before applying them. The tool is platform-independent and runs from a single binary with no dependencies.

model.nvim
model.nvim is a tool designed for Neovim users who want to utilize AI models for completions or chat within their text editor. It allows users to build prompts programmatically with Lua, customize prompts, experiment with multiple providers, and use both hosted and local models. The tool supports features like provider agnosticism, programmatic prompts in Lua, async and multistep prompts, streaming completions, and chat functionality in 'mchat' filetype buffer. Users can customize prompts, manage responses, and context, and utilize various providers like OpenAI ChatGPT, Google PaLM, llama.cpp, ollama, and more. The tool also supports treesitter highlights and folds for chat buffers.

superpipe
Superpipe is a lightweight framework designed for building, evaluating, and optimizing data transformation and data extraction pipelines using LLMs. It allows users to easily combine their favorite LLM libraries with Superpipe's building blocks to create pipelines tailored to their unique data and use cases. The tool facilitates rapid prototyping, evaluation, and optimization of end-to-end pipelines for tasks such as classification and evaluation of job departments based on work history. Superpipe also provides functionalities for evaluating pipeline performance, optimizing parameters for cost, accuracy, and speed, and conducting grid searches to experiment with different models and prompts.

appworld
AppWorld is a high-fidelity execution environment of 9 day-to-day apps, operable via 457 APIs, populated with digital activities of ~100 people living in a simulated world. It provides a benchmark of natural, diverse, and challenging autonomous agent tasks requiring rich and interactive coding. The repository includes implementations of AppWorld apps and APIs, along with tests. It also introduces safety features for code execution and provides guides for building agents and extending the benchmark.

genkit
Firebase Genkit (beta) is a framework with powerful tooling to help app developers build, test, deploy, and monitor AI-powered features with confidence. Genkit is cloud optimized and code-centric, integrating with many services that have free tiers to get started. It provides unified API for generation, context-aware AI features, evaluation of AI workflow, extensibility with plugins, easy deployment to Firebase or Google Cloud, observability and monitoring with OpenTelemetry, and a developer UI for prototyping and testing AI features locally. Genkit works seamlessly with Firebase or Google Cloud projects through official plugins and templates.

llmops-promptflow-template
LLMOps with Prompt flow is a template and guidance for building LLM-infused apps using Prompt flow. It provides centralized code hosting, lifecycle management, variant and hyperparameter experimentation, A/B deployment, many-to-many dataset/flow relationships, multiple deployment targets, comprehensive reporting, BYOF capabilities, configuration-based development, local prompt experimentation and evaluation, endpoint testing, and optional Human-in-loop validation. The tool is customizable to suit various application needs.

moatless-tools
Moatless Tools is a hobby project focused on experimenting with using Large Language Models (LLMs) to edit code in large existing codebases. The project aims to build tools that insert the right context into prompts and handle responses effectively. It utilizes an agentic loop functioning as a finite state machine to transition between states like Search, Identify, PlanToCode, ClarifyChange, and EditCode for code editing tasks.

csghub
CSGHub is an open source platform for managing large model assets, including datasets, model files, and codes. It offers functionalities similar to a privatized Huggingface, managing assets in a manner akin to how OpenStack Glance manages virtual machine images. Users can perform operations such as uploading, downloading, storing, verifying, and distributing assets through various interfaces. The platform provides microservice submodules and standardized OpenAPIs for easy integration with users' systems. CSGHub is designed for large models and can be deployed On-Premise for offline operation.

ai-dev-2024-ml-workshop
The 'ai-dev-2024-ml-workshop' repository contains materials for the Deploy and Monitor ML Pipelines workshop at the AI_dev 2024 conference in Paris, focusing on deployment designs of machine learning pipelines using open-source applications and free-tier tools. It demonstrates automating data refresh and forecasting using GitHub Actions and Docker, monitoring with MLflow and YData Profiling, and setting up a monitoring dashboard with Quarto doc on GitHub Pages.

gorilla
Gorilla is a tool that enables LLMs to use tools by invoking APIs. Given a natural language query, Gorilla comes up with the semantically- and syntactically- correct API to invoke. With Gorilla, you can use LLMs to invoke 1,600+ (and growing) API calls accurately while reducing hallucination. Gorilla also releases APIBench, the largest collection of APIs, curated and easy to be trained on!

gptme
GPTMe is a tool that allows users to interact with an LLM assistant directly in their terminal in a chat-style interface. The tool provides features for the assistant to run shell commands, execute code, read/write files, and more, making it suitable for various development and terminal-based tasks. It serves as a local alternative to ChatGPT's 'Code Interpreter,' offering flexibility and privacy when using a local model. GPTMe supports code execution, file manipulation, context passing, self-correction, and works with various AI models like GPT-4. It also includes a GitHub Bot for requesting changes and operates entirely in GitHub Actions. In progress features include handling long contexts intelligently, a web UI and API for conversations, web and desktop vision, and a tree-based conversation structure.

HuggingFaceGuidedTourForMac
HuggingFaceGuidedTourForMac is a guided tour on how to install optimized pytorch and optionally Apple's new MLX, JAX, and TensorFlow on Apple Silicon Macs. The repository provides steps to install homebrew, pytorch with MPS support, MLX, JAX, TensorFlow, and Jupyter lab. It also includes instructions on running large language models using HuggingFace transformers. The repository aims to help users set up their Macs for deep learning experiments with optimized performance.

kwaak
Kwaak is a tool that allows users to run a team of autonomous AI agents locally from their own machine. It enables users to write code, improve test coverage, update documentation, and enhance code quality while focusing on building innovative projects. Kwaak is designed to run multiple agents in parallel, interact with codebases, answer questions about code, find examples, write and execute code, create pull requests, and more. It is free and open-source, allowing users to bring their own API keys or models via Ollama. Kwaak is part of the bosun.ai project, aiming to be a platform for autonomous code improvement.

AceCoder
AceCoder is a tool that introduces a fully automated pipeline for synthesizing large-scale reliable tests used for reward model training and reinforcement learning in the coding scenario. It curates datasets, trains reward models, and performs RL training to improve coding abilities of language models. The tool aims to unlock the potential of RL training for code generation models and push the boundaries of LLM's coding abilities.

ludwig
Ludwig is a declarative deep learning framework designed for scale and efficiency. It is a low-code framework that allows users to build custom AI models like LLMs and other deep neural networks with ease. Ludwig offers features such as optimized scale and efficiency, expert level control, modularity, and extensibility. It is engineered for production with prebuilt Docker containers, support for running with Ray on Kubernetes, and the ability to export models to Torchscript and Triton. Ludwig is hosted by the Linux Foundation AI & Data.
20 - OpenAI Gpts

Digital Experiment Analyst
Demystifying Experimentation and Causal Inference with 1-Sided Tests Focus

BioChomps
Assume the role of a MAD SCIENTIST bent on creating the most powerful animal in this turn based creature creator AI battler!

Shoes Design Image Generator | Discover Creativity
Explore a wide range of shoe styles and learn about design with AI-generated images.

Case Digests on Demand (a Jurisage experiment)
Upload a court judgment and get back a collection of topical case digests based on the case. Oh - don't trust the "Topic 2210" or similar number, it's random. Also, probably best you not fully trust the output either. We're just playing with the GPT maker. More about us at Jurisage.com.

Ask Cris about File Maker
An experiment in personal FileMaker guidance from the collective works of lifetime award-winning FileMaker trainer, Cris Ippolite. Not just links to resources, but direct access to 20+ years of custom training curriculum combined with expert AI instruction without the noise of external web links.

NYC Dog Data Guide
EXPERIMENT - Friendly expert on NYC dog license data from 2015-2016, with info on names, breeds and boroughs

E&L and Pharmaceutical Regulatory Compliance AI
This GPT chat AI is specialized in understanding Extractables and Leachables studies, aligning with pharmaceutical guidelines, and aiding in the design and interpretation of relevant experiments.

Product Coach
Guiding your product journey with expert insights. Made by the team behind kraftful.com

Genetic Explorer
Innovative GPT in genetic engineering, compliant with OpenAI's usage policies

Glass Blower, Laboratory Apparatus Assistant
Hello I'm Glass Blower, Laboratory Apparatus Assistant! What would you like help with today?

Hypothesis Generator
Generates research hypotheses in various fields, ensuring scientific plausibility.