Best AI tools for< Generate Instruction Data >
20 - AI tool Sites
OdiaGenAI
OdiaGenAI is a collaborative initiative focused on conducting research on Generative AI and Large Language Models (LLM) for the Odia Language. The project aims to leverage AI technology to develop Generative AI and LLM-based solutions for the overall development of Odisha and the Odia language through collaboration among Odia technologists. The initiative offers pre-trained models, codes, and datasets for non-commercial and research purposes, with a focus on building language models for Indic languages like Odia and Bengali.
Leny.ai
Leny.ai is an AI-powered medical assistant designed to provide instant support to medical professionals and patients. It offers features such as differential diagnosis, treatment plan drafting, discharge instructions, referral letters, and lab value analysis. Leny.ai aims to streamline healthcare processes, save time, and provide reliable and accurate medical information. The platform is still in beta mode and continuously improving to offer more accurate responses. It is focused on data security and privacy, although not currently HIPAA compliant. Leny.ai is free of charge at present and plans to transition to a subscription-based model in the future.
Excel Formula Bot
Excel Formula Bot is an AI-powered tool that helps users generate Excel and Google Sheet formulas using simple text instructions. It provides a user-friendly interface that allows users to write formulas in natural language, eliminating the need for complex syntax and manual calculations. With Excel Formula Bot, users can save time and effort while ensuring accuracy and consistency in their spreadsheet tasks.
Text2SQL.AI
Text2SQL.AI is an AI-powered SQL query builder that helps users generate optimized SQL queries effortlessly. It supports various AI-powered services, including SQL query building from textual instructions, SQL query explanation to plain English, SQL query error fixation, adding custom database schemas, SQL dialects for various database types, Microsoft Excel and Google Sheets formula generation and explanation, and Regex expression generation and explanation. The tool is designed to improve SQL skills, save time, and assist beginners, data analysts, data scientists, data engineers, and software developers in their work.
VBA Code Generator
VBA Code Generator is an AI-powered tool that allows users to generate VBA code quickly and efficiently. By inputting requirements, users can instantly generate complex VBA code using simple text instructions with the help of AI. The tool is designed for both beginners and experienced users, offering a versatile application that can handle various VBA tasks, from Excel automation to Access database management. With a focus on saving time and streamlining workflows, VBA Code Generator simplifies the coding process and provides accurate formulas for users' specific needs.
Formula Dog
Formula Dog is an AI-powered tool that helps users generate Excel formulas and solve spreadsheet problems quickly and easily. With Formula Dog, users can simply enter their problem or question into the AI assistant, and the tool will generate a step-by-step solution with the appropriate formula. Formula Dog also offers a variety of other features, such as a formula library, a help center, and a community forum, to help users learn more about Excel and solve their spreadsheet problems.
Formularizer
Formularizer is an AI-powered assistant that helps users create formulas in Excel, Google Sheets, and Notion. It supports a variety of formula types, including Excel, Google Apps Script, and regular expressions. Formularizer can generate formulas from natural language instructions, explain how formulas work, and even help users debug their formulas. It is designed to be user-friendly and accessible to everyone, regardless of their level of expertise.
Devika AI
Devika AI is an open-source AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and generate code for particular tasks. It uses Claude 3, GPT-4, GPT-3.5, and Local LLMs via Ollama.
16x Prompt
16x Prompt is a desktop application that helps developers compose prompts for coding tasks in ChatGPT. It simplifies prompt creation by adding context, source code, and formatting instructions. The app supports all major programming languages and frameworks, and it can be used to generate prompts for a variety of coding tasks, including coding from scratch, debugging, refactoring, and more. 16x Prompt is free to download and use, and it can be used with both ChatGPT and GPT-4.
Bash Senpai
Bash Senpai is a terminal assistant powered by ChatGPT that transforms instructions into ready-to-use commands. It provides convenience by allowing users to get answers without leaving the terminal and offers better answers by providing context with questions. The tool also incorporates self-reflection to improve the quality of its responses.
Fly AI
Fly AI is a powerful and user-friendly AI application designed for Mac users. It leverages the capabilities of OpenAI's ChatGPT to provide fast responses, unlimited custom instructions, and quick access. With a beautiful design and context-aware features, Fly AI allows users to work faster and smarter by creating custom AI mini-apps. The application prioritizes user privacy by not recording data and offering a standalone subscription model. Fly AI is a native Mac app, ensuring a seamless user experience without the need to open a browser.
TailorTask
TailorTask is an AI-powered automation tool designed to help users automate repetitive tasks efficiently. It offers a user-friendly interface that allows non-technical individuals to easily interact with the AI. The tool integrates with various platforms and tools, enabling seamless automation processes. TailorTask prioritizes data privacy and security, ensuring that users have full control over the AI's actions. With features like custom workflows, task scheduling, and detailed instructions, TailorTask aims to save users time and effort by automating tasks across different domains.
SearchGPT
SearchGPT is an AI tool that offers a Spreadsheet Generator feature. Users can enter a prompt and generate a spreadsheet based on the given instructions. The tool can handle complicated spreadsheets, although they may take some time to generate. SearchGPT is designed to assist users in creating spreadsheets efficiently and accurately.
Examize
Examize is an AI Quiz Generator that revolutionizes the process of exam building by seamlessly integrating with Google Workspace. It allows users to generate quizzes from their files and easily convert them into Google Forms and Google Docs. With dynamic and creative question options, Examize is tailored for educators looking to enhance their classroom experience. The application offers flexible pricing plans to cater to various needs and ensures user data privacy by not storing any file data.
Humy.ai
Humy.ai is an AI-powered educational tool that provides personalized tutoring, assignments, and study materials for history and social studies classes. It leverages AI to create engaging and interactive learning experiences for students, allowing them to have life-like conversations with historical figures, receive tailored feedback on assignments, and access a vast library of educational resources.
QWiser
QWiser is an AI-powered learning platform that transforms entire courses into interactive study materials. It uses AI to instantly convert complex information into easy-to-understand visuals, organizes study materials into topics and subtopics, generates personalized quizzes and exams, and enhances learning efficiency. QWiser aims to revolutionize the education sector by providing a smarter learning journey for students and educators worldwide.
Enlighten AI
Enlighten AI is an AI grading tool designed to assist teachers in providing personalized feedback to students efficiently. The tool optimizes the grading process by leveraging AI technology while preserving the human touch in feedback delivery. It offers features such as optimized grading in three steps, personalized feedback suggestions, and data-driven insights to enhance student outcomes. Enlighten AI aims to save educators time, improve feedback quality, and maximize teaching impact.
Unschooler
Unschooler is an AI-powered platform offering video courses for educators, universities, and schools. It enables users to generate educational videos for any question, create AI courses in minutes, and convert articles or websites into step-by-step courses. The platform provides personalized curriculum, interactive quizzes, and insights on student skills and interests. Unschooler also offers career matching based on student performance in tests. It emphasizes data privacy, collaboration, and time-saving features for educators.
ibl.ai
ibl.ai is a generative AI platform that focuses on education, providing cutting-edge solutions for institutions to create AI mentors, tutoring apps, and content creation tools. The platform empowers educators by giving them full control over their code, data, and models. With advanced features and support for both web and native mobile platforms, ibl.ai seamlessly integrates with existing infrastructure, making it easy to deploy across organizations. The platform is designed to enhance learning experiences, foster critical thinking, and engage students deeply in educational content.
Supafit
Supafit is an AI-powered personal trainer application that provides exclusive access to personalized fitness programs and location-specific workouts. The app features a 24/7 chat with a virtual coach, integration with third-party apps, centralization of fitness data, calorie tracking, and progress reports. Users can generate custom routines and receive tailored workout plans that evolve with their progress. Supafit aims to revolutionize the fitness industry by offering a comprehensive and adaptive training experience.
20 - Open Source AI Tools
magpie
This is the official repository for 'Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing'. Magpie is a tool designed to synthesize high-quality instruction data at scale by extracting it directly from an aligned Large Language Models (LLMs). It aims to democratize AI by generating large-scale alignment data and enhancing the transparency of model alignment processes. Magpie has been tested on various model families and can be used to fine-tune models for improved performance on alignment benchmarks such as AlpacaEval, ArenaHard, and WildBench.
LLM-Alchemy-Chamber
LLM Alchemy Chamber is a repository dedicated to exploring the world of Language Models (LLMs) through various experiments and projects. It contains scripts, notebooks, and experiments focused on tasks such as fine-tuning different LLM models, quantization for performance optimization, dataset generation for instruction/QA tasks, and more. The repository offers a collection of resources for beginners and enthusiasts interested in delving into the mystical realm of LLMs.
llm-jp-eval
LLM-jp-eval is a tool designed to automatically evaluate Japanese large language models across multiple datasets. It provides functionalities such as converting existing Japanese evaluation data to text generation task evaluation datasets, executing evaluations of large language models across multiple datasets, and generating instruction data (jaster) in the format of evaluation data prompts. Users can manage the evaluation settings through a config file and use Hydra to load them. The tool supports saving evaluation results and logs using wandb. Users can add new evaluation datasets by following specific steps and guidelines provided in the tool's documentation. It is important to note that using jaster for instruction tuning can lead to artificially high evaluation scores, so caution is advised when interpreting the results.
bonito
Bonito is an open-source model for conditional task generation, converting unannotated text into task-specific training datasets for instruction tuning. It is a lightweight library built on top of Hugging Face `transformers` and `vllm` libraries. The tool supports various task types such as question answering, paraphrase generation, sentiment analysis, summarization, and more. Users can easily generate synthetic instruction tuning datasets using Bonito for zero-shot task adaptation.
Reflection_Tuning
Reflection-Tuning is a project focused on improving the quality of instruction-tuning data through a reflection-based method. It introduces Selective Reflection-Tuning, where the student model can decide whether to accept the improvements made by the teacher model. The project aims to generate high-quality instruction-response pairs by defining specific criteria for the oracle model to follow and respond to. It also evaluates the efficacy and relevance of instruction-response pairs using the r-IFD metric. The project provides code for reflection and selection processes, along with data and model weights for both V1 and V2 methods.
NExT-GPT
NExT-GPT is an end-to-end multimodal large language model that can process input and generate output in various combinations of text, image, video, and audio. It leverages existing pre-trained models and diffusion models with end-to-end instruction tuning. The repository contains code, data, and model weights for NExT-GPT, allowing users to work with different modalities and perform tasks like encoding, understanding, reasoning, and generating multimodal content.
Cherry_LLM
Cherry Data Selection project introduces a self-guided methodology for LLMs to autonomously discern and select cherry samples from open-source datasets, minimizing manual curation and cost for instruction tuning. The project focuses on selecting impactful training samples ('cherry data') to enhance LLM instruction tuning by estimating instruction-following difficulty. The method involves phases like 'Learning from Brief Experience', 'Evaluating Based on Experience', and 'Retraining from Self-Guided Experience' to improve LLM performance.
distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency. It helps you synthesize data and provide AI feedback to improve the quality of your AI models. With Distilabel, you can: * **Synthesize data:** Generate synthetic data to train your AI models. This can help you to overcome the challenges of data scarcity and bias. * **Provide AI feedback:** Get feedback from AI models on your data. This can help you to identify errors and improve the quality of your data. * **Improve your AI output quality:** By using Distilabel to synthesize data and provide AI feedback, you can improve the quality of your AI models and get better results.
IvyGPT
IvyGPT is a medical large language model that aims to generate the most realistic doctor consultation effects. It has been fine-tuned on high-quality medical Q&A data and trained using human feedback reinforcement learning. The project features full-process training on medical Q&A LLM, multiple fine-tuning methods support, efficient dataset creation tools, and a dataset of over 300,000 high-quality doctor-patient dialogues for training.
Taiyi-LLM
Taiyi (太一) is a bilingual large language model fine-tuned for diverse biomedical tasks. It aims to facilitate communication between healthcare professionals and patients, provide medical information, and assist in diagnosis, biomedical knowledge discovery, drug development, and personalized healthcare solutions. The model is based on the Qwen-7B-base model and has been fine-tuned using rich bilingual instruction data. It covers tasks such as question answering, biomedical dialogue, medical report generation, biomedical information extraction, machine translation, title generation, text classification, and text semantic similarity. The project also provides standardized data formats, model training details, model inference guidelines, and overall performance metrics across various BioNLP tasks.
chatgpt-universe
ChatGPT is a large language model that can generate human-like text, translate languages, write different kinds of creative content, and answer your questions in a conversational way. It is trained on a massive amount of text data, and it is able to understand and respond to a wide range of natural language prompts. Here are 5 jobs suitable for this tool, in lowercase letters: 1. content writer 2. chatbot assistant 3. language translator 4. creative writer 5. researcher
SeaLLMs
SeaLLMs are a family of language models optimized for Southeast Asian (SEA) languages. They were pre-trained from Llama-2, on a tailored publicly-available dataset, which comprises texts in Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai 🇹🇭, Malay 🇲🇾, Khmer🇰🇭, Lao🇱🇦, Tagalog🇵🇭 and Burmese🇲🇲. The SeaLLM-chat underwent supervised finetuning (SFT) and specialized self-preferencing DPO using a mix of public instruction data and a small number of queries used by SEA language native speakers in natural settings, which **adapt to the local cultural norms, customs, styles and laws in these areas**. SeaLLM-13b models exhibit superior performance across a wide spectrum of linguistic tasks and assistant-style instruction-following capabilities relative to comparable open-source models. Moreover, they outperform **ChatGPT-3.5** in non-Latin languages, such as Thai, Khmer, Lao, and Burmese.
EasyInstruct
EasyInstruct is a Python package proposed as an easy-to-use instruction processing framework for Large Language Models (LLMs) like GPT-4, LLaMA, ChatGLM in your research experiments. EasyInstruct modularizes instruction generation, selection, and prompting, while also considering their combination and interaction.
AnyGPT
AnyGPT is a unified multimodal language model that utilizes discrete representations for processing various modalities like speech, text, images, and music. It aligns the modalities for intermodal conversions and text processing. AnyInstruct dataset is constructed for generative models. The model proposes a generative training scheme using Next Token Prediction task for training on a Large Language Model (LLM). It aims to compress vast multimodal data on the internet into a single model for emerging capabilities. The tool supports tasks like text-to-image, image captioning, ASR, TTS, text-to-music, and music captioning.
AGI-Papers
This repository contains a collection of papers and resources related to Large Language Models (LLMs), including their applications in various domains such as text generation, translation, question answering, and dialogue systems. The repository also includes discussions on the ethical and societal implications of LLMs. **Description** This repository is a collection of papers and resources related to Large Language Models (LLMs). LLMs are a type of artificial intelligence (AI) that can understand and generate human-like text. They have a wide range of applications, including text generation, translation, question answering, and dialogue systems. **For Jobs** - **Content Writer** - **Copywriter** - **Editor** - **Journalist** - **Marketer** **AI Keywords** - **Large Language Models** - **Natural Language Processing** - **Machine Learning** - **Artificial Intelligence** - **Deep Learning** **For Tasks** - **Generate text** - **Translate text** - **Answer questions** - **Engage in dialogue** - **Summarize text**
nlp-llms-resources
The 'nlp-llms-resources' repository is a comprehensive resource list for Natural Language Processing (NLP) and Large Language Models (LLMs). It covers a wide range of topics including traditional NLP datasets, data acquisition, libraries for NLP, neural networks, sentiment analysis, optical character recognition, information extraction, semantics, topic modeling, multilingual NLP, domain-specific LLMs, vector databases, ethics, costing, books, courses, surveys, aggregators, newsletters, papers, conferences, and societies. The repository provides valuable information and resources for individuals interested in NLP and LLMs.
awesome-llms-fine-tuning
This repository is a curated collection of resources for fine-tuning Large Language Models (LLMs) like GPT, BERT, RoBERTa, and their variants. It includes tutorials, papers, tools, frameworks, and best practices to aid researchers, data scientists, and machine learning practitioners in adapting pre-trained models to specific tasks and domains. The resources cover a wide range of topics related to fine-tuning LLMs, providing valuable insights and guidelines to streamline the process and enhance model performance.
Efficient-LLMs-Survey
This repository provides a systematic and comprehensive review of efficient LLMs research. We organize the literature in a taxonomy consisting of three main categories, covering distinct yet interconnected efficient LLMs topics from **model-centric** , **data-centric** , and **framework-centric** perspective, respectively. We hope our survey and this GitHub repository can serve as valuable resources to help researchers and practitioners gain a systematic understanding of the research developments in efficient LLMs and inspire them to contribute to this important and exciting field.
20 - OpenAI Gpts
AutoChatGPT
Have a large task to accomplish? AutoChatGPT will continually review and give itself new instructions to complete a task using expert agents.
Custom Instruction Creator
Write your role and get your tailored persona for a tailored ChatGPT instructions.
IronswornGPT RPG Oracle
Your Ironsworn Oracle for Solo & Co-op Roleplaying! Type /welcome to begin, /help for instructions or learn more about IronswornGPT with /oracle. Or, just start playing! • YouTube.com/@IversusAIGaming •
Research Paper GPT
Drafts detailed research papers with web-sourced citations, following user-specific instructions.
Multimedia Content Creator
Generates diverse media content; doesn't repeat or clarify instructions.
GPT-Builders' Assistant
Effortless GPT Creation : Your Go-To Assistant for Tailoring Perfect Descriptions, Instructions, and Behaviors for Custom GPTs