Best AI tools for <Construct SFT Datasets>
20 - AI tool Sites
NovelAI
NovelAI is an AI-powered storytelling platform that offers a monthly subscription service for AI-assisted image generation and storytelling. Users can create unique stories, illustrate thrilling tales, and write seductive romances with the help of AI technology. The platform provides a creative sandbox for imagination without censorship or guidelines, allowing users to freely express their creativity. NovelAI features advanced image generation, customizable editor, AI output control, secure writing storage, memory expansion, and module-powered tools to enhance storytelling. Users can engage in text adventures, push writing limits with enhanced detail, and give personalized instructions to guide their stories.
Lumina
Lumina is a research tool that uses artificial intelligence to help researchers find and analyze information more quickly and easily. It can be used to search for articles, books, and other resources, and it can also be used to analyze data and create visualizations. Lumina is designed to make research more efficient and productive.
Nichely
Nichely is an AI-powered SEO tool that helps users dominate their niche by utilizing cutting-edge AI technology to navigate and correlate millions of topics in various niches. It assists in building detailed topical maps, constructing comprehensive topic clusters, and discovering untapped content opportunities. With features like topic discovery, topic research, and keyword research, Nichely empowers users to find relevant long-tail keywords, analyze SERPs, and enhance their topical authority. The tool is suitable for content/niche website owners, entrepreneurs, bloggers, and individuals looking to improve their SEO strategies and content creation.
TestFit
TestFit is a real estate feasibility platform that uses AI to help developers, architects, contractors, and brokers evaluate deals and make better decisions. It provides real-time insights into design, cost, and constructability, and integrates with a variety of other software tools. TestFit can help users save time and money, and make more informed decisions about their real estate projects.
Animant
Animant is an interactive AR tool that allows users to create engaging 3D scenes, conduct 3D scanning, and capture rooms. It leverages AI to enable users to build interactive 3D scenes using natural language, without the need for 3D animation knowledge. Animant is designed for AR experiences, enabling users to visualize 3D models in their real-world environment. The tool offers features like Object Capture, Room Capture, SharePlay for collaboration, and innovative 3D path construction. It prioritizes user privacy by not collecting personally identifiable information and supports offline rendering for creative flexibility.
Siml.ai
Siml.ai is a software platform designed for fast AI-driven physics simulations. It combines state-of-the-art machine learning with physics simulation to provide interactive visualization. The platform allows users to work with high-performance AI-based numerical simulators without the need for installation, offering painless scalability and one-click access to high-performance computing resources. Siml.ai aims to democratize scientific-grade simulation tools by simplifying the development and deployment of physics-based simulations for engineers and researchers.
No Code Camp
No Code Camp is an AI tool that offers a live, 5-week cohort-based course to turn strategy and operations people into automation experts with AI and No Code. The platform enables non-technical individuals to build applications, automate workflows, and develop web platforms using graphical interfaces, AI, and tool configuration instead of writing code. No Code Camp democratizes software development, making it accessible to a broader audience, speeding up the development process, and reducing the reliance on specialized software development skills. The course covers essential topics such as Data Architecture, Interface Design, AI Scaling, and No Code Automation, equipping participants with the skills needed to automate business processes and build internal tools.
LangChain
LangChain is an AI tool that offers a suite of products supporting developers in the LLM application lifecycle. It provides a framework to construct LLM-powered apps easily, visibility into app performance, and a turnkey solution for serving APIs. LangChain enables developers to build context-aware, reasoning applications and future-proof their applications by incorporating vendor optionality. LangSmith, a part of LangChain, helps teams improve accuracy and performance, iterate faster, and ship new AI features efficiently. The tool is designed to drive operational efficiency, increase discovery & personalization, and deliver premium products that generate revenue.
Magic AI Avatars
Magic AI Avatars is an AI-powered tool that allows users to create custom profile pictures using artificial intelligence. The app analyzes uploaded photos, recognizes facial features and expressions, and then uses a deep learning algorithm to construct a realistic digital photo that closely resembles the person in the picture. Magic AI Avatars is free to use and offers a variety of different themes and styles to choose from. The app is also committed to maintaining user privacy and data security.
WebsiteColorsAI
WebsiteColorsAI is an AI tool that effortlessly captures colors from any website by analyzing the HTML and CSS files to identify all HEX color codes. Users can construct and evaluate diverse color schemes and palettes, transforming the aesthetic of their websites. The tool provides an easy and time-saving way to explore and use colors for design projects.
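The HEX-code extraction described above can be illustrated with a short sketch. This is not WebsiteColorsAI's implementation; the function name `extract_hex_colors` and the regular expression are assumptions chosen for illustration, and a plain pattern match will also pick up fragment links such as `#add` that happen to look like colors.

```python
import re

# Matches #RGB, #RGBA, #RRGGBB and #RRGGBBAA (hypothetical pattern for this sketch).
HEX_COLOR = re.compile(r"#(?:[0-9a-fA-F]{8}|[0-9a-fA-F]{6}|[0-9a-fA-F]{3,4})\b")


def extract_hex_colors(text: str) -> list[str]:
    """Return unique HEX color codes found in HTML/CSS text, in first-seen order."""
    seen: dict[str, None] = {}
    for match in HEX_COLOR.finditer(text):
        code = match.group(0).lower()
        # Expand shorthand (#rgb / #rgba) so that #FFF and #ffffff count as one color.
        if len(code) in (4, 5):
            code = "#" + "".join(ch * 2 for ch in code[1:])
        seen.setdefault(code, None)
    return list(seen)


css = "body { color: #FFF; background: #1a2b3c; } a { color: #fff; border-color: #1A2B3C80 }"
palette = extract_hex_colors(css)
print(palette)
```

Normalizing case and shorthand before de-duplicating keeps the resulting palette free of entries that differ only in notation.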
Pixable
Pixable is a technology company that specializes in transforming organizations through the intelligent implementation of technology. They create beautiful websites and apps, automate systems, and implement artificial intelligence to revolutionize the way organizations operate and drive their growth. Pixable offers end-to-end technology services, including web development, connected solutions, artificial intelligence, and technology consulting. They help organizations navigate the complex web development landscape and realize their technological goals by embedding AI into the digital core of organizations. Pixable constructs elegant solutions that solve complex technological challenges, adding value for clients worldwide.
Email To Contract
Email To Contract is an AI tool that transforms emails into contracts seamlessly. It simplifies the process of creating tailored contracts by analyzing email conversations and generating contracts based on predefined templates. The tool is designed to work with various types of contracts such as NDAs, influencer agreements, and freelancer contracts. Users can forward email threads to the designated email address and receive a customized contract in return. Email To Contract offers affordable pricing plans, unlimited credits, and modular access to different contract types. The application is user-friendly, fast, and eliminates the hassle of manual contract creation.
ContractWorks
ContractWorks is a contract management software that helps businesses organize, track, and manage their contracts. It offers a centralized repository for storing contracts, automated alerts and notifications, custom reporting, and electronic signature capabilities. ContractWorks also uses AI to power its search and review functionality, allowing users to quickly find any contract, clause, or key term. With ContractWorks, businesses can improve contract visibility, reduce risk, and save time and money.
SpotDraft
SpotDraft is an AI-powered contract management system that helps businesses of all sizes simplify, automate, and accelerate their legal processes. With SpotDraft, you can create contracts in minutes, close deals faster, and gain better control over your business's cash flow. SpotDraft is trusted by general counsel and legal teams at cutting-edge organizations such as Beamery, Chargebee, Zai, and PostScript.
Evisort
Evisort is an AI-powered contract management software that simplifies contract management at every stage. It offers a complete, AI-native platform for end-to-end contract lifecycle management, including the first large language model built specifically for contracts. Evisort's AI capabilities enable users to ask questions about their contracts in simple, natural language and get clear, reasoned answers. It can also track terms of interest across all contracts and related documents, and generate data points that matter for sales, procurement, risk, and finance teams. Additionally, Evisort's AI-powered workflows automate tasks such as redlining, clause generation, and contract approvals, saving time and reducing risk.
SpeedLegal
SpeedLegal is a technology startup that uses machine learning (specifically deep learning, LLMs, and generative AI) to highlight the terms and key risks of any contract. It analyzes uploaded documents and returns a simplified report so users can make a more informed decision before signing on the dotted line.
Diligen
Diligen is a machine learning powered contract analysis tool that helps teams streamline their contract review process. It can identify key provisions, generate contract summaries, and help teams manage review with machine learning powered analysis. Diligen is used by law firms, legal service providers, and corporations around the world to make high quality contract review faster, more efficient, and more cost effective.
Pincites
Pincites is an AI contract review tool designed for busy legal teams. It offers AI-generated redlines and comments within Microsoft Word, helping in-house legal teams to review contracts faster and more consistently. Pincites allows users to scan agreements for potential issues, apply AI-generated redlines, and interactively chat with documents. The tool also provides playbook management, enabling users to control redlines suggested by the AI based on their preferences and existing guidance.
Zefort
Zefort is an AI-powered contract management solution that offers a zero-effort approach to managing contracts. It allows users to create, sign, and store contracts with ease, providing features like eSignatures, automated reminders, and secure storage. Zefort is designed to streamline contract processes for legal teams, procurement, HR teams, sales teams, and company administration. The platform integrates advanced AI technology to enhance contract management efficiency and accuracy, catering to organizations of all sizes. With bank-level security measures and a user-friendly interface, Zefort ensures a seamless contract management experience.
Exante
Exante is an AI-powered contract intelligence platform that offers a single source of truth for organizations' contracts. It revolutionizes contract handling by providing centralized, secure storage, AI-powered extraction and organization of unstructured data, real-time visibility, user-friendly reporting, and collaboration tools. The platform aims to streamline processes, reduce risks, and improve compliance for efficient contract management. Exante delivers tangible value by automating data extraction, reducing costs, improving accuracy, reinforcing compliance, enhancing accessibility, and providing actionable insights.
20 - Open Source AI Tools
LLMs
LLMs is a Chinese large language model technology stack for practical use. It includes high-availability pre-training, SFT, and DPO preference alignment code framework. The repository covers pre-training data cleaning, high-concurrency framework, SFT dataset cleaning, data quality improvement, and security alignment work for Chinese large language models. It also provides open-source SFT dataset construction, pre-training from scratch, and various tools and frameworks for data cleaning, quality optimization, and task alignment.
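The SFT dataset cleaning step mentioned above can be sketched as follows. This is a minimal illustration, not the repository's code: the `instruction`/`output` schema, the function names, and the `min_output_chars` threshold are all assumptions.

```python
import json


def clean_sft_records(records, min_output_chars=5):
    """Normalize, filter, and de-duplicate instruction/response pairs.

    Assumed (hypothetical) schema: each record is a dict with
    "instruction" and "output" string fields.
    """
    seen = set()
    cleaned = []
    for rec in records:
        # Collapse runs of whitespace in the instruction; trim the response.
        instruction = " ".join(str(rec.get("instruction", "")).split())
        output = str(rec.get("output", "")).strip()
        # Drop records with no instruction or a response that is too short.
        if not instruction or len(output) < min_output_chars:
            continue
        # De-duplicate on the case-folded instruction, keeping the first occurrence.
        key = instruction.casefold()
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"instruction": instruction, "output": output})
    return cleaned


def to_jsonl(records):
    """Serialize records as JSON Lines; ensure_ascii=False keeps Chinese text readable."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


raw = [
    {"instruction": "  Translate 'hello' to  French. ", "output": "Bonjour."},
    {"instruction": "translate 'hello' to french.", "output": "Salut !"},
    {"instruction": "Summarize the text.", "output": ""},
    {"instruction": "", "output": "orphan answer"},
]
print(to_jsonl(clean_sft_records(raw)))
```

Exact-match de-duplication is the simplest choice here; real pipelines often add near-duplicate detection and quality scoring on top of it.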
LLMBox
LLMBox is a comprehensive library designed for implementing Large Language Models (LLMs) with a focus on a unified training pipeline and comprehensive model evaluation. It serves as a one-stop solution for training and utilizing LLMs, offering flexibility and efficiency in both stages. The library supports diverse training strategies, comprehensive datasets, tokenizer vocabulary merging, data construction strategies, parameter-efficient fine-tuning, and efficient training methods. For utilization, LLMBox provides comprehensive evaluation on various datasets, in-context learning strategies, chain-of-thought evaluation, evaluation methods, prefix caching for faster inference, support for vLLM and Flash Attention, and quantization options. The tool is suitable for researchers and developers working with LLMs for natural language processing tasks.
TableLLM
TableLLM is a large language model designed for efficient tabular data manipulation tasks in real office scenarios. It can generate code solutions or direct text answers for tasks like insert, delete, update, query, merge, and chart operations on tables embedded in spreadsheets or documents. The model has been fine-tuned from CodeLlama-7B and CodeLlama-13B, offering two scales: TableLLM-7B and TableLLM-13B. Evaluation results report its performance on benchmarks such as WikiSQL, Spider, and a self-created table operation benchmark. Users can use TableLLM for code and text generation tasks on tabular data.
Step-DPO
Step-DPO is a method for enhancing the long-chain reasoning ability of LLMs, with a data construction pipeline that creates a high-quality dataset. It significantly improves performance on the MATH and GSM8K benchmarks with minimal data and training steps. The tool fine-tunes pre-trained models like Qwen2-7B-Instruct with Step-DPO, achieving superior results compared to other models. It provides scripts for training, evaluation, and deployment, along with examples and acknowledgements.
octopus-v4
The Octopus-v4 project aims to build the world's largest graph of language models, integrating specialized models and training Octopus models to connect nodes efficiently. The project focuses on identifying, training, and connecting specialized models. The repository includes scripts for running the Octopus v4 model, methods for managing the graph, training code for specialized models, and inference code. Environment setup instructions are provided for Linux with NVIDIA GPU. The Octopus v4 model helps users find suitable models for tasks and reformats queries for effective processing. The project leverages Large Language Models for various domains and provides benchmark results. Users are encouraged to train and add specialized models following recommended procedures.
awesome-RLAIF
Reinforcement Learning from AI Feedback (RLAIF) is a concept that describes a type of machine learning approach where **an AI agent learns by receiving feedback or guidance from another AI system**. This concept is closely related to the field of Reinforcement Learning (RL), which is a type of machine learning where an agent learns to make a sequence of decisions in an environment to maximize a cumulative reward.
In traditional RL, an agent interacts with an environment and receives feedback in the form of rewards or penalties based on the actions it takes. It learns to improve its decision-making over time to achieve its goals. In the context of Reinforcement Learning from AI Feedback, the AI agent still aims to learn optimal behavior through interactions, but **the feedback comes from another AI system rather than from the environment or human evaluators**. This can be **particularly useful in situations where it may be challenging to define clear reward functions or when it is more efficient to use another AI system to provide guidance**.
The feedback from the AI system can take various forms, such as:
- **Demonstrations**: The AI system provides demonstrations of desired behavior, and the learning agent tries to imitate these demonstrations.
- **Comparison Data**: The AI system ranks or compares different actions taken by the learning agent, helping it to understand which actions are better or worse.
- **Reward Shaping**: The AI system provides additional reward signals to guide the learning agent's behavior, supplementing the rewards from the environment.
This approach is often used in scenarios where the RL agent needs to learn from **limited human or expert feedback or when the reward signal from the environment is sparse or unclear**. It can also be used to **accelerate the learning process and make RL more sample-efficient**.
Reinforcement Learning from AI Feedback is an area of ongoing research and has applications in various domains, including robotics, autonomous vehicles, and game playing, among others.
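The comparison-data form of AI feedback described in this entry can be sketched in a few lines. This is an illustrative sketch only: `build_preference_pairs` and `toy_judge` are hypothetical names, and the word-overlap judge is a stand-in for a real AI labeler such as an LLM prompted to score or rank responses.

```python
def build_preference_pairs(samples, judge):
    """Turn (prompt, response_a, response_b) triples into preference records.

    `judge` is any callable (prompt, response) -> numeric score; in RLAIF it
    would wrap an AI labeler rather than a human annotator.
    """
    pairs = []
    for prompt, resp_a, resp_b in samples:
        score_a = judge(prompt, resp_a)
        score_b = judge(prompt, resp_b)
        if score_a == score_b:
            continue  # A tie carries no preference signal, so skip it.
        chosen, rejected = (resp_a, resp_b) if score_a > score_b else (resp_b, resp_a)
        pairs.append({"prompt": prompt, "chosen": chosen, "rejected": rejected})
    return pairs


def toy_judge(prompt, response):
    """Placeholder labeler: counts distinct response words that also occur in the prompt."""
    prompt_words = set(prompt.lower().split())
    return sum(1 for word in set(response.lower().split()) if word in prompt_words)


samples = [
    ("name a red fruit", "a red fruit is the apple", "bananas"),
    ("say hi", "ok", "no"),
]
for pair in build_preference_pairs(samples, toy_judge):
    print(pair)
```

The resulting `prompt`/`chosen`/`rejected` records have the shape commonly used to train a reward model or to run preference optimization.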
merlin
Merlin is a model capable of generating natural language responses that are closely linked to object trajectories across multiple images. It predicts and reasons about future events based on initial observations. Merlin achieves state-of-the-art performance on the Future Reasoning Benchmark and on multiple existing multimodal language model benchmarks, demonstrating strong general multimodal ability and foresight minds.
MedLLMsPracticalGuide
This repository serves as a practical guide for Medical Large Language Models (Medical LLMs) and provides resources, surveys, and tools for building, fine-tuning, and utilizing LLMs in the medical domain. It covers a wide range of topics including pre-training, fine-tuning, downstream biomedical tasks, clinical applications, challenges, future directions, and more. The repository aims to provide insights into the opportunities and challenges of LLMs in medicine and serve as a practical resource for constructing effective medical LLMs.
LongCite
LongCite is a tool that enables Large Language Models (LLMs) to generate fine-grained citations in long-context Question Answering (QA) scenarios. It provides models trained on GLM-4-9B and Meta-Llama-3.1-8B, supporting up to 128K context. Users can deploy LongCite chatbots, generate accurate responses, and obtain precise sentence-level citations. The tool includes components for model deployment, Coarse to Fine (CoF) pipeline for data construction, model training using LongCite-45k dataset, evaluation with LongBench-Cite benchmark, and citation generation.
llms-tools
The 'llms-tools' repository is a comprehensive collection of AI tools, open-source projects, and research related to Large Language Models (LLMs) and Chatbots. It covers a wide range of topics such as AI in various domains, open-source models, chats & assistants, visual language models, evaluation tools, libraries, devices, income models, text-to-image, computer vision, audio & speech, code & math, games, robotics, typography, bio & med, military, climate, finance, and presentation. The repository provides valuable resources for researchers, developers, and enthusiasts interested in exploring the capabilities of LLMs and related technologies.
MiniCPM
MiniCPM is a series of open-source on-device large models jointly developed by ModelBest and the Tsinghua University Natural Language Processing Laboratory. The main language model, MiniCPM-2B, has only 2.4 billion (2.4B) non-embedding parameters, with a total of 2.7B parameters.
- After SFT, MiniCPM-2B performs similarly to Mistral-7B on public comprehensive evaluation sets (better in Chinese, mathematics, and code capabilities), and outperforms models such as Llama2-13B, MPT-30B, and Falcon-40B overall.
- After DPO, MiniCPM-2B also surpasses many representative open-source large models such as Llama2-70B-Chat, Vicuna-33B, Mistral-7B-Instruct-v0.1, and Zephyr-7B-alpha on MTBench, the evaluation set closest to the user experience.
- Based on MiniCPM-2B, the on-device multimodal large model MiniCPM-V 2.0 is constructed. It achieves the best performance among models below 7B on multiple test benchmarks, and surpasses larger models such as Qwen-VL-Chat 9.6B, CogVLM-Chat 17.4B, and Yi-VL 34B on the OpenCompass leaderboard. MiniCPM-V 2.0 also demonstrates leading OCR capabilities, approaching Gemini Pro in scene text recognition.
- After Int4 quantization, MiniCPM can be deployed for inference on mobile phones, with a streaming output speed slightly higher than human speech speed. MiniCPM-V has also been deployed as a multimodal large model directly on mobile phones.
- Parameter-efficient fine-tuning runs on a single 1080/2080 GPU, and full-parameter fine-tuning runs on a single 3090/4090. Continued training of MiniCPM is possible on a single machine, so the cost of secondary development is relatively low.
AnyGPT
AnyGPT is a unified multimodal language model that utilizes discrete representations for processing various modalities like speech, text, images, and music. It aligns the modalities for intermodal conversions and text processing. The AnyInstruct dataset is constructed for generative models. The model uses a generative training scheme based on the Next Token Prediction task to train a Large Language Model (LLM). It aims to compress vast multimodal data on the internet into a single model so that emergent capabilities can appear. The tool supports tasks like text-to-image, image captioning, ASR, TTS, text-to-music, and music captioning.
LLaMA-Factory
LLaMA Factory is a unified framework for fine-tuning 100+ large language models (LLMs) with various methods, including pre-training, supervised fine-tuning, reward modeling, PPO, DPO and ORPO. It features integrated algorithms like GaLore, BAdam, DoRA, LongLoRA, LLaMA Pro, LoRA+, LoftQ and Agent tuning, as well as practical tricks like FlashAttention-2, Unsloth, RoPE scaling, NEFTune and rsLoRA. LLaMA Factory provides experiment monitors like LlamaBoard, TensorBoard, Wandb, MLflow, etc., and supports faster inference with OpenAI-style API, Gradio UI and CLI with vLLM worker. Compared to ChatGLM's P-Tuning, LLaMA Factory's LoRA tuning offers up to 3.7 times faster training speed with a better Rouge score on the advertising text generation task. By leveraging 4-bit quantization technique, LLaMA Factory's QLoRA further improves the efficiency regarding the GPU memory.
20 - OpenAI GPTs
Eco Construct Pro
Leading advisor in sustainable building materials and eco-efficiency, powered by OpenAI
MTG Deck Wizard
Hello! Welcome to the realm of the planeswalkers. I am here to help you construct an MTG deck to suit your every need! Just let me know what colors or types of decks you'd like to build, and I will do my best to help you on the journey!
HouseGPT
This GPT will take a user's data and use it to construct a fake TV scene. Start by providing it with your character's Patient Profile, Diagnostic Findings, and Lab Data
Argumentum
Stephen Toulmin’s Theory of Argumentation. FIRST TIME? Start with "Good morning!" PRIMEIRA VEZ? Comece com um "Bom dia!"
PsyItemGenerator
Generates items for psychometric instruments to measure psychological constructs.
USA Contract Law Master
Expert in answering Contract Law queries for small businesses in the USA
Contract Negotiation Advisor
Facilitates efficient business operations through effective contract negotiations.
Contract Administration Advisor
Advises on contract administration to optimize procurement processes.
Contract Digitizer
Transforms regular contracts into digitized smart contracts. Response will include a diagram of the contract workflow as well as a link to easily auditable smart-contract source code ready for deployment.