Best AI tools for Instruction Evaluation
20 - AI tool Sites
Applicant AI
Applicant AI is an AI-powered applicant tracking system (ATS) and recruiting software. It helps companies review job applicants 10x faster by using AI to screen thousands of applicants and identify the right candidates in seconds. The tool transforms the traditional applicant selection process, saving users 80% of the time spent on screening. With features like AI-generated summaries, ratings, and evaluation against custom instructions, Applicant AI streamlines the hiring process and ensures only high-quality applicants are considered. The platform is compliant with EU AI regulation, prioritizes human decision-making, and aims to minimize risks of unfair or biased outcomes in employment.
MASCAA
MASCAA is a comprehensive human confidence analysis platform that focuses on evaluating the confidence of users through video and audio during various tasks. It integrates advanced facial expression and voice analysis technologies to provide valuable feedback for students, instructors, individuals, businesses, and teams. MASCAA offers quick and easy test creation, evaluation, and confidence assessment for educational settings, personal use, startups, small organizations, universities, and large organizations. The platform aims to unlock long-term value and enhance customer experience by helping users assess and improve their confidence levels.
IELTSWritingPro
IELTSWritingPro is an AI-powered platform designed to help users improve their IELTS writing skills. It offers detailed feedback, band estimation, practice questions, and correction services. The platform utilizes advanced AI technology to analyze grammar, coherence, task response, and lexical resource. Users can receive personalized improvement suggestions and insights to enhance their writing abilities. IELTSWritingPro aims to assist individuals in preparing effectively for the IELTS exam by providing comprehensive evaluation and valuable feedback.
LingoLeap
LingoLeap is an AI-powered tool and platform designed for TOEFL and IELTS preparation. It leverages artificial intelligence to provide personalized feedback and guidance tailored to individual learning needs. With features such as instant feedback, practice tests, high-score answer generation, and vocabulary boost, LingoLeap aims to help users improve their English skills efficiently. The tool offers subscription plans with varying credits for speaking and writing evaluations, along with a free trial option. LingoLeap's innovative approach enhances language learning by analyzing users' language expression, grammar accuracy, and vocabulary application, similar to the official TOEFL test standards.
Eduaide.Ai
Eduaide.Ai is an AI-driven platform designed to assist educators in creating lesson plans, teaching resources, and assessments. It offers features such as AI-assisted lesson planning, teaching resources generation, feedback bot, personalization tools, and assessment builder. The platform aims to streamline administrative tasks, provide personalized learning experiences, and enhance teaching efficiency through AI technology.
MailMaestro
MailMaestro is an AI email assistant that helps users write better emails faster, allowing them to be more productive and focus on important tasks. The company behind MailMaestro, Maestro Labs, acquired Flowrite, the AI writing division of Flow AI, in September 2024. The team behind Flowrite is now focusing on Flow AI, an advanced platform for AI teams to evaluate and improve LLM systems. MailMaestro integrates Flowrite's capabilities to enhance the email writing experience for users.
Canopy Directory
Canopy Directory is an AI tool designed specifically for educators. It provides a comprehensive directory of AI tools that can be used in educational settings. The platform aims to streamline the process of finding and utilizing AI tools for teaching and learning purposes. Educators can explore a wide range of tools categorized based on their functionalities and applications, making it easier to integrate AI technology into their teaching practices. Canopy Directory serves as a valuable resource for educators looking to enhance their teaching methods through the use of AI tools.
Google Gemma
Google Gemma is a family of lightweight, state-of-the-art open large language models (LLMs) developed by Google. It is built from the same research used to create Google's Gemini models. Gemma models come in two sizes, 2B and 7B parameters, and each size is available in a base (pre-trained) variant and an instruction-tuned variant. Gemma models are designed to be cross-device compatible and are optimized for Google Cloud and NVIDIA GPUs. They are also accessible through Kaggle, Hugging Face, and Google Cloud with Vertex AI or GKE. Gemma models can be used for a variety of applications, including text generation, summarization, and RAG, and are available for both commercial and research use.
DoubleO AI
DoubleO AI is an AI automation tool designed for non-developers to easily create powerful AI automations. The tool allows users to give simple instructions, connect tools, and let a team of highly-trained DoubleO AI agents automate complex tasks. It offers pre-built and custom workflows for various teams, such as Sales, Marketing, Product, and Operations. The tool integrates with popular tools like Intercom, Slack, Salesforce, and more, ensuring data security and privacy with end-to-end encryption and compliance with data security standards. Users can benefit from features like automating pre-call prep, analyzing customer feedback, creating launch plans, and maintaining roadmaps.
OdiaGenAI
OdiaGenAI is a collaborative initiative focused on conducting research on Generative AI and Large Language Models (LLMs) for the Odia language. The project aims to develop Generative AI and LLM-based solutions for the overall development of Odisha and the Odia language through collaboration among Odia technologists. The initiative offers pre-trained models, code, and datasets for non-commercial and research purposes, with a focus on building language models for Indic languages like Odia and Bengali.
Diffit
Diffit is an AI-powered educational tool designed to provide learning resources for teachers and students. It helps teachers create customized, grade-level content by generating standards-aligned resources from scratch. With features like text re-leveling, vocabulary customization, and question addition, Diffit aims to make instructional materials accessible to all students. The application offers a library of high-quality, student-ready exports to facilitate the teaching process. Testimonials from educators highlight the tool's effectiveness in differentiating instruction and engaging students across various subjects.
Grow with Google
Grow with Google is an AI tool designed to provide training and resources to help individuals boost their productivity and skills in various fields such as cybersecurity, data analytics, digital marketing, IT support, project management, UX design, and AI essentials. The platform offers online courses, tools, and professional certificates to help users develop ideas, make informed decisions, and enhance their daily work tasks using generative AI tools. With a focus on career growth and business development, Grow with Google aims to empower individuals with essential AI skills to succeed in today's competitive job market.
LessonPlans.ai
LessonPlans.ai is an AI-powered lesson plan generator that helps teachers create high-quality lesson plans in seconds. With this tool, teachers can easily generate detailed, personalized lesson plans that are tailored to their students' needs. LessonPlans.ai also includes a step-by-step guide for each lesson, making it easy for teachers to follow and implement the plan in their classrooms.
Cognii
Cognii is an AI-based educational technology provider that offers solutions for K-12, higher education, and corporate training markets. Their award-winning EdTech product enables personalized learning, intelligent tutoring, open response assessments, and rich analytics. Cognii's Virtual Learning Assistant engages students in chatbot-style conversations, providing instant feedback, personalized hints, and guiding towards mastery. The platform aims to deliver 21st-century online education with superior learning outcomes and cost efficiency.
Breakout Learning
Breakout Learning is an AI-powered educational platform that transforms traditional case studies into engaging, multifaceted experiences. It empowers professors with AI insights into small-group discussions, enabling them to customize lectures and foster deeper student comprehension. Students benefit from rich content, peer-led discussions, and AI assessment that provides personalized feedback and tracks their progress.
Undress App
Undress App is an AI tool that allows users to nudify any person in a photo using AI technology. The application provides a 3-step instruction on how to achieve this, emphasizing the importance of photo quality and proper positioning of the person in the image. It discusses the ethical implications of using AI to undress individuals and highlights the creative and potentially harmful uses of such technology. Undress App aims to offer a platform for creative self-expression and entertainment while acknowledging the potential misuse of the tool.
Cerebras API
The Cerebras API is a high-speed inference service powered by Cerebras Wafer-Scale Engines and CS-3 systems. It offers developers access to two models, Meta’s Llama 3.1 8B and 70B, which are instruction-tuned and suitable for conversational applications. The API provides low-latency inference and invites developers to explore new possibilities in AI development.
Class Companion
Class Companion is an AI teaching assistant tool designed to provide instant coaching and AI feedback to students on their assignments. It helps improve student engagement and outcomes by offering multiple attempts, targeted help, and personalized feedback. The tool supports various subjects and grades, allowing teachers to save time on manual feedback and focus on lesson planning and individual instruction. With features like AI-generated content, in-depth reporting, and customizable rubrics, Class Companion aims to enhance student learning and comprehension.
DapperGPT
DapperGPT is a user interface (UI) for ChatGPT that provides a better user experience and additional features. It offers an intuitive interface, AI-powered notes, a Chrome extension, smart search, the ability to pin favorites, image generation, character instruction prompts, and code generation. DapperGPT is free to use, but requires a valid OpenAI API key. Premium features are also available for purchase, which include additional customization options and cloud sync.
Promptmatic
Promptmatic is a free Google Chrome extension that helps you bookmark, save, and organize your best ChatGPT prompt templates and GPTs all in one place. It also includes a smart prompt editor with built-in variable editor and 200+ role, instruction, style, and tone presets. With Promptmatic, you can easily create reusable prompt templates, save and bookmark GPTs and templates in folders, and access them whenever you need with a single click right inside your ChatGPT dashboard.
20 - Open Source Tools
EasyInstruct
EasyInstruct is a Python package that provides an easy-to-use instruction processing framework for Large Language Models (LLMs) such as GPT-4, LLaMA, and ChatGLM in research experiments. EasyInstruct modularizes instruction generation, selection, and prompting, while also considering their combination and interaction.
KULLM
KULLM (구름) is a Korean Large Language Model developed by Korea University NLP & AI Lab and HIAI Research Institute. It is based on the upstage/SOLAR-10.7B-v1.0 model and has been instruction-tuned. The model was trained on 8×A100 GPUs and generates responses in Korean. KULLM can exhibit hallucination and repetition due to its decoding strategy, so users should be cautious: the model may produce inaccurate or harmful results. Benchmark performance may vary when no fixed system prompt is used.
LLMEvaluation
The LLMEvaluation repository is a comprehensive compendium of evaluation methods for Large Language Models (LLMs) and LLM-based systems. It aims to assist academics and industry professionals in creating effective evaluation suites tailored to their specific needs by reviewing industry practices for assessing LLMs and their applications. The repository covers a wide range of evaluation techniques, benchmarks, and studies related to LLMs, including areas such as embeddings, question answering, multi-turn dialogues, reasoning, multi-lingual tasks, ethical AI, biases, safe AI, code generation, summarization, software performance, agent LLM architectures, long text generation, graph understanding, and various unclassified tasks. It also includes evaluations for LLM systems in conversational systems, copilots, search and recommendation engines, task utility, and verticals like healthcare, law, science, financial, and others. The repository provides a wealth of resources for evaluating and understanding the capabilities of LLMs in different domains.
rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool that helps you conduct experiments and evaluations using Azure AI Search and the RAG pattern. It offers a rich set of features, including experiment setup, integration with Azure AI Search, Azure Machine Learning, MLFlow, and Azure OpenAI, multiple document chunking strategies, query generation, multiple search types, sub-querying, re-ranking, metrics and evaluation, report generation, and multi-lingual support. The tool is designed to make it easier and faster to run experiments and evaluations of search queries and of the quality of responses from OpenAI. It is useful for researchers, data scientists, and developers who want to test the performance of different search and OpenAI-related hyperparameters, compare the effectiveness of various search strategies, find the best combination of hyperparameters, and generate detailed reports and visualizations from experiment results.
PIXIU
PIXIU is a project designed to support the development, fine-tuning, and evaluation of Large Language Models (LLMs) in the financial domain. It includes components like FinBen, a Financial Language Understanding and Prediction Evaluation Benchmark, FIT, a Financial Instruction Dataset, and FinMA, a Financial Large Language Model. The project provides open resources, multi-task and multi-modal financial data, and diverse financial tasks for training and evaluation. It aims to encourage open research and transparency in the financial NLP field.
Awesome-LLM-Eval
Awesome-LLM-Eval is a curated list of tools, benchmarks, demos, and papers for evaluating Large Language Models (such as ChatGPT, LLaMA, GLM, and Baichuan) on language capabilities, knowledge, reasoning, fairness, and safety.
DecryptPrompt
This repository does not provide a tool, but rather a collection of resources and strategies for academics in the field of artificial intelligence who are feeling depressed or overwhelmed by the rapid advancements in the field. The resources include articles, blog posts, and other materials that offer advice on how to cope with the challenges of working in a fast-paced and competitive environment.
arena-hard-auto
Arena-Hard-Auto-v0.1 is an automatic evaluation tool for instruction-tuned LLMs. It contains 500 challenging user queries. The tool prompts GPT-4-Turbo as a judge to compare models' responses against those of a baseline model (default: GPT-4-0314). Arena-Hard-Auto employs an automatic judge as a cheaper and faster approximation of human preference. It has the highest correlation with Chatbot Arena and the best separability among popular open-ended LLM benchmarks. Users can use Arena-Hard-Auto to estimate how their models would perform on Chatbot Arena.
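The baseline comparison described above can be illustrated with a minimal sketch. It assumes the judge's output has already been reduced to one label per query ("win", "loss", or "tie" for the candidate model against the baseline); these labels and the `win_rate` helper are hypothetical, and counting a tie as half a win is a simplification rather than the tool's actual scoring procedure.

```python
from collections import Counter

# Hypothetical verdict labels (not taken from the tool itself):
# "win"  = the judge preferred the candidate model over the baseline
# "loss" = the judge preferred the baseline
# "tie"  = the judge expressed no preference
VALID_LABELS = {"win", "loss", "tie"}


def win_rate(verdicts):
    """Return the candidate's win rate against the baseline.

    Ties count as half a win, which is an assumption of this sketch.
    """
    if not verdicts:
        raise ValueError("need at least one verdict")
    counts = Counter(verdicts)
    unknown = set(counts) - VALID_LABELS
    if unknown:
        raise ValueError(f"unexpected verdict labels: {sorted(unknown)}")
    return (counts["win"] + 0.5 * counts["tie"]) / len(verdicts)


# Four judged queries: two wins, one loss, one tie.
verdicts = ["win", "loss", "tie", "win"]
print(f"win rate vs. baseline: {win_rate(verdicts):.3f}")
```

A single aggregate like this hides per-query variance, so a real evaluation would normally also report some measure of uncertainty.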
llm-jp-eval
LLM-jp-eval is a tool designed to automatically evaluate Japanese large language models across multiple datasets. It provides functionalities such as converting existing Japanese evaluation data to text generation task evaluation datasets, executing evaluations of large language models across multiple datasets, and generating instruction data (jaster) in the format of evaluation data prompts. Users can manage the evaluation settings through a config file and use Hydra to load them. The tool supports saving evaluation results and logs using wandb. Users can add new evaluation datasets by following specific steps and guidelines provided in the tool's documentation. It is important to note that using jaster for instruction tuning can lead to artificially high evaluation scores, so caution is advised when interpreting the results.
gritlm
The 'gritlm' repository provides all materials for the paper Generative Representational Instruction Tuning. It includes code for inference, training, evaluation, and known issues related to the GritLM model. The repository also offers models for embedding and generation tasks, along with instructions on how to train and evaluate the models. Additionally, it contains visualizations, acknowledgements, and a citation for referencing the work.
LESS
This repository contains the code for the paper 'LESS: Selecting Influential Data for Targeted Instruction Tuning'. The work proposes a data selection method to choose influential data for inducing a target capability. It includes steps for warmup training, building the gradient datastore, selecting data for a task, and training with the selected data. The repository provides tools for data preparation, data selection pipeline, and evaluation of the model trained on the selected data.
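The data selection step can be pictured with a small sketch. It assumes each training example and the target task are already represented by a gradient feature vector, and it ranks training examples by cosine similarity to the target vector. The similarity measure and the function names `cosine` and `select_top_k` are assumptions made for illustration, not the repository's API.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def select_top_k(train_grads, target_grad, k):
    """Return indices of the k training examples most aligned with the target.

    train_grads: one gradient feature vector per training example
                 (a stand-in for the gradient datastore).
    target_grad: gradient feature vector for the target task.
    Ties are broken by the lower index so the result is deterministic.
    """
    scored = [(cosine(g, target_grad), idx) for idx, g in enumerate(train_grads)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:k]]


# Toy datastore with four examples in a 2-dimensional feature space.
train_grads = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
target_grad = [1.0, 0.0]
print(select_top_k(train_grads, target_grad, k=2))
```

In this toy data the first and third examples point most nearly in the target direction, so they are the two selected for training.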
chinese-llm-benchmark
The Chinese LLM Benchmark (CLiB) is a continuously updated evaluation list of large models, covering a wide range of commercial and open-source models from various companies and research institutions. It supports multidimensional evaluation of capabilities including classification, information extraction, reading comprehension, data analysis, Chinese encoding efficiency, and Chinese instruction compliance. The benchmark not only provides capability score rankings but also publishes the original outputs of all models so that interested readers can score and rank them themselves.
LangBridge
LangBridge is a tool that bridges the mT5 encoder and a target LM using only English data. It enables models to effectively solve multilingual reasoning tasks without the need for multilingual supervision. The tool provides pretrained models built on Orca 2, MetaMath, Code Llama, Llemma, and Llama 2, covering both instruction-tuned and non-instruction-tuned scenarios. Users can install the tool to replicate evaluations from the paper and use the models for multilingual reasoning tasks. LangBridge is particularly useful for low-resource languages, but it may lower performance in languages where the language model is already proficient.
IG-LLM
IG-LLM is a framework for solving inverse-graphics problems by instruction-tuning a Large Language Model (LLM) to decode visual embeddings into graphics code. The framework demonstrates natural generalization across distribution shifts without special inductive biases. It provides training and evaluation data for various scenarios like CLEVR, 2D, SO(3), 6-DoF, and ShapeNet. The environment setup can be done using conda/micromamba or Dockerfile. Training can be initiated for each scenario with specific commands, and inference can be performed using the provided script.
AGiXT
AGiXT is a dynamic Artificial Intelligence Automation Platform engineered to orchestrate efficient AI instruction management and task execution across a multitude of providers. It combines adaptive memory handling with a broad spectrum of commands to enhance AI's understanding and responsiveness, leading to improved task completion. The platform's smart features, like Smart Instruct and Smart Chat, integrate web search, planning strategies, and conversation continuity. Through a plugin system that includes web browsing and command execution, AGiXT acts as a versatile bridge between AI models and users. With an expanding roster of AI providers, code evaluation capabilities, comprehensive chain management, and platform interoperability, AGiXT continues to evolve to support a wide range of applications.
LongLLaVA
LongLLaVA is a tool for scaling multi-modal LLMs to 1000 images efficiently via hybrid architecture. It includes stages for single-image alignment, instruction-tuning, and multi-image instruction-tuning, with evaluation through a command line interface and model inference. The tool aims to achieve GPT-4V level capabilities and beyond, providing reproducibility of results and benchmarks for efficiency and performance.
Awesome-LLM-RAG
This repository, Awesome-LLM-RAG, aims to record advanced papers on Retrieval Augmented Generation (RAG) in Large Language Models (LLMs). It serves as a resource hub for researchers interested in promoting their work related to LLM RAG by updating paper information through pull requests. The repository covers various topics such as workshops, tutorials, papers, surveys, benchmarks, retrieval-enhanced LLMs, RAG instruction tuning, RAG in-context learning, RAG embeddings, RAG simulators, RAG search, RAG long-text and memory, RAG evaluation, RAG optimization, and RAG applications.
20 - OpenAI Gpts
EduCheck
Automatically evaluates uploaded lesson plans against educational standards. Upload text or a PDF.
Bloom's Reading Comprehension
Create comprehension questions based on a shared text. These questions will be designed to assess understanding at different levels of Bloom's taxonomy, from basic recall to more complex analytical and evaluative thinking skills.
Concept Tutor
Assistant focused on teaching concepts, evaluating comprehension, and recommending subsequent topics. USE WITH VOICE.
Rúbricas de evaluación - ProfesTV
GPT specialized in generating educational assessment rubrics.
Instruction Assistant Operating Director
Full step by step guidance and copy & paste text for developing assistants with specific use cases.
GPT Instruction Builder
Write your GPT instructions, context, persona, constraints. The more detailed the better.
Custom Instruction Creator
Describe your role and get a tailored persona for your ChatGPT custom instructions.
Origami Instruction Companion
Teaches origami with step-by-step visual instructions and provides templates for various skill levels.
invideoAI instruction support bot
Send keywords and an overview of the video you want to make, and this bot will create invideoAI (AI Video Creator) instructions for you!
LDS Church Instruction
A GPT of the General Handbook of Instructions for the Church of Jesus Christ of Latter-day Saints.
EL Advisor
Differentiation advice for English Learners / Developing Bilinguals. For K-12 Teachers. EL, ESL, ELL, Bilingual, Dual Language instruction.
Rosenshine GPT
Give me a lesson and I can give you feedback based on Rosenshine's "Principles of Instruction"
Korean for Beginners
I'm a Language Tutor Bot for beginner Korean learners, offering personalized, engaging instruction.
Ask Cris about File Maker
An experiment in personal FileMaker guidance from the collective works of lifetime award-winning FileMaker trainer, Cris Ippolite. Not just links to resources, but direct access to 20+ years of custom training curriculum combined with expert AI instruction without the noise of external web links.