Best AI tools for< Train On Codebase >

20 - AI tool Sites

TolyGPT

TolyGPT is an AI-powered chatbot that is designed to read an entire codebase and generate documentation. It is specifically trained on the Solana validator codebase, allowing users to ask questions about how the validator works. The core of TolyGPT is open source as Autodoc, and it is powered by the GPT-3.5 model. Users can apply to have TolyGPT work on their own codebase and stay updated by following Sam Hogan.

site

: 2.2k

Vize.ai

Vize.ai is a custom image recognition API provided by Ximilar, a leading company in Visual AI and Search. The tool offers powerful artificial intelligence capabilities with high accuracy using deep learning algorithms. It allows users to easily set up and implement cutting-edge vision automation without any development costs. Vize.ai enables users to train custom neural networks to recognize specific images and provides a scalable solution with continuous improvements in machine learning algorithms. The tool features an intuitive interface that requires no machine learning or coding knowledge, making it accessible for a wide range of users across industries.

site

: 4.9k

Botsonic

Botsonic is an AI chatbot application that offers custom AI chatbots for websites. It provides AI-powered automation solutions for various industries, enabling businesses to enhance customer engagement, support, sales, and more. Botsonic uses AI Copilots trained on data to deliver authentic customer experiences in multiple languages across different channels. The platform allows users to easily create, customize, and integrate AI chatbots into their websites, providing instant support and personalized interactions.

site

: 0

Lexy

Lexy is an AI chatbot application designed to enhance customer service on websites. It integrates with Notion pages to provide instant, human-like answers to customer queries. Lexy can be set up in just 5 minutes, prioritizes data security, and can be trained on specific Notion pages. The application is free to create the first bot and send 30 messages per month, with options to upgrade for more power.

site

: 2.8k

ColdIQ

ColdIQ is an AI-powered sales prospecting tool that helps B2B companies with revenue above $100k/month to build outbound systems that sell for them. The tool offers end-to-end cold outreach campaign setup and management, email infrastructure setup and warmup, audience research and targeting, data scraping and enrichment, campaigns optimization, sending automation, sales systems implementation, training on tools best practices, sales tools recommendations, free gap analysis, sales consulting, and copywriting frameworks. ColdIQ leverages AI to tailor messaging to each prospect, automate outreach, and flood calendars with opportunities.

site

: 44.6k

EmbedAI

EmbedAI is a platform that enables users to create custom AI chatbots powered by ChatGPT. Users can train the chatbot on their own data and embed it on their website. The platform offers features like AI chatbot customization, multilingual support, integration with various tools, and easy sharing options. EmbedAI aims to provide efficient management of information and automated responses to user queries.

site

: 0

Quivr

Quivr is an open-source chat-powered second brain application that transforms private and enterprise knowledge into a personal AI assistant. It continuously learns and improves at every interaction, offering AI-powered workplace search synced with user data. Quivr allows users to connect with their favorite tools, databases, and applications, and configure their 'second brain' to train on their company's unique context for improved search relevance and knowledge discovery.

site

: 17.0k

AI Profile Pictures

AI Profile Pictures is a website that allows users to generate AI-generated profile pictures. Users can purchase credits to generate 200+ images, and then upload at least 10 photos of themselves (or their subject) for the AI model to train on. Once the photos are uploaded, users can wait 2-3 hours for the AI to generate their profile pictures. Users will have 7 days to generate additional photos if they are not satisfied with their results.

site

: 6.2k

ChatCube

ChatCube is an AI-powered chatbot maker that allows users to create chatbots for their websites without coding. It uses advanced AI technology to train chatbots on any document or website within 60 seconds. ChatCube offers a range of features, including a user-friendly visual editor, lightning-fast integration, fine-tuning on specific data sources, data encryption and security, and customizable chatbots. By leveraging the power of AI, ChatCube helps businesses improve customer support efficiency and reduce support ticket reductions by up to 28%.

site

: 28

Surge AI

Surge AI is a data labeling platform that provides human-generated data for training and evaluating large language models (LLMs). It offers a global workforce of annotators who can label data in over 40 languages. Surge AI's platform is designed to be easy to use and integrates with popular machine learning tools and frameworks. The company's customers include leading AI companies, research labs, and startups.

site

: 16.2k

Cody

Cody is an intelligent AI assistant designed to boost team productivity by providing instant answers, support, troubleshooting, and idea generation. It can be trained on your business knowledge base to cater to your specific needs, making it a valuable asset for various departments such as marketing, HR, IT support, business consultancy, creative tasks, sales, training, hiring, customer support, and translation. Cody offers features like prompt manager, focus mode, conversation logs, scratchpad, and source checking, ensuring efficient and tailored assistance. With multilingual capabilities and customizable access controls, Cody prioritizes data security and user experience.

site

: 88.3k

Chunky

Chunky is an AI chatbot builder that allows users to create human-like chatbots effortlessly. With Chunky, users can automate customer support, train chatbots on their own data without any coding, and integrate chatbots seamlessly into their websites. The platform offers a user-friendly interface, fast and personal support, and the ability to customize chatbot appearance to match branding. Chunky supports multiple languages and provides a generous free plan for users to get started with AI-powered bots.

site

: 616

ChatFast

ChatFast is a platform that allows businesses to create custom GPT chatbots using their own data. These chatbots can be used to answer customer questions, capture leads, and schedule appointments. ChatFast is easy to use and requires no coding. It is trusted by thousands of businesses and provides a range of powerful features, including the ability to train chatbots on multiple data sources, revise responses, capture leads, and create smart forms.

site

: 3.9k

Digital Adoption

This website provides comprehensive information and resources on digital adoption, including its definition, platforms, solutions, strategies, and research. It also covers digital transformation, artificial intelligence, and various tools and applications related to these fields. The website offers articles, guides, comparisons, and industry insights to help businesses and individuals understand and implement digital adoption and transformation initiatives.

site

: 53.2k

MIT Sloan Teaching & Learning Technologies

MIT Sloan Teaching & Learning Technologies connects MIT Sloan to research-driven best practices, resources, and training in instructional technology and design. They help the community make an impact in the classroom and beyond. They offer various services such as trainings, practice sessions, how-to guides, consultations, and a teaching studio. Their latest news and announcements include supporting learning with AI-generated images, providing students with access to Microsoft Copilot, and making Microsoft Copilot available for faculty and staff.

site

: 59.1k

Chatflot

Chatflot is an AI chatbot application that helps businesses automate up to 95% of customer queries. It allows users to create customized AI chatbots based on the ChatGPT language model, enabling them to provide on-demand information to customers through their website. Chatflot is suitable for various industries and offers features like training the chatbot on specific data, optimizing customer interactions, and integrating seamlessly with different CMS platforms. The application aims to enhance customer service, boost sales, and streamline support processes by providing personalized assistance and relevant information to users.

site

: 0

SQLAI.ai

SQLAI.ai is a professional SQL multi-tool that leverages AI technology to generate, fix, explain, and optimize SQL queries and databases. It enables users to interact with SQL using everyday language, effortlessly train AI to understand database schemas, and benefit from AI-driven recommendations for query optimization. The platform caters to a wide range of users, from beginners to experts, by simplifying SQL tasks and providing valuable insights for database management. With features like generating SQL data, data analytics, and real-time data insights, SQLAI.ai revolutionizes the way users interact with databases, making SQL tasks simpler, more efficient, and accessible to all.

site

: 51.5k

Mirage

Mirage is a custom AI platform that builds custom LLMs to accelerate productivity. It is backed by Sequoia and offers a variety of features, including the ability to create custom AI models, train models on your own data, and deploy models to the cloud or on-premises.

site

: 35.0k

Petal

Petal is a document analysis platform powered by generative AI technology. It allows users to chat with their documents, providing fully sourced and reliable answers by linking to their own knowledge bases. Users can train AI on their documents to support their work, ensuring centralized knowledge management and document synchronization. Petal offers features such as automatic metadata extraction, file deduplication, and collaboration tools to enhance productivity and streamline workflows for researchers, faculty, and industry experts.

site

: 29.7k

SQLAI.ai

SQLAI.ai is a powerful SQL multi-tool that utilizes AI to generate, fix, explain, and optimize SQL queries and databases. It empowers users to effortlessly create complex SQL queries using everyday language, optimize queries for better performance, fix syntax errors with ease, and gain a deeper understanding of SQL queries through AI-powered explanations. Additionally, SQLAI.ai enables users to train AI on their database schema, ensuring unparalleled accuracy in AI-generated queries and optimizations.

site

: 0

20 - Open Source AI Tools

hold

This repository contains the code for HOLD, a method that jointly reconstructs hands and objects from monocular videos without assuming a pre-scanned object template. It can reconstruct 3D geometries of novel objects and hands, enabling template-free bimanual hand-object reconstruction, textureless object interaction with hands, and multiple objects interaction with hands. The repository provides instructions to download in-the-wild videos from HOLD, preprocess and train on custom videos, a volumetric rendering framework, a generalized codebase for single and two hand interaction with objects, a viewer to interact with predictions, and code to evaluate and compare with HOLD in HO3D. The repository also includes documentation for setup, training, evaluation, visualization, preprocessing custom sequences, and using HOLD on ARCTIC.

github

: 255

qlora-pipe

qlora-pipe is a pipeline parallel training script designed for efficiently training large language models that cannot fit on one GPU. It supports QLoRA, LoRA, and full fine-tuning, with efficient model loading and the ability to load any dataset that Axolotl can handle. The script allows for raw text training, resuming training from a checkpoint, logging metrics to Tensorboard, specifying a separate evaluation dataset, training on multiple datasets simultaneously, and supports various models like Llama, Mistral, Mixtral, Qwen-1.5, and Cohere (Command R). It handles pipeline- and data-parallelism using Deepspeed, enabling users to set the number of GPUs, pipeline stages, and gradient accumulation steps for optimal utilization.

github

: 61

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.

github

: 573

trackmania_rl_public

This repository contains the reinforcement learning training code for Trackmania AI with Reinforcement Learning. It is a research work-in-progress project that aims to apply reinforcement learning principles to play Trackmania. The code is constantly evolving and may not be clean or easily usable. The training hyperparameters are intentionally changed in the public repository to encourage understanding of reinforcement learning principles. The project may not receive active support for setup or usage at the moment.

github

: 110

Autonomous-Agents

github

: 359

llm-foundry

LLM Foundry is a codebase for training, finetuning, evaluating, and deploying LLMs for inference with Composer and the MosaicML platform. It is designed to be easy-to-use, efficient _and_ flexible, enabling rapid experimentation with the latest techniques. You'll find in this repo: * `llmfoundry/` - source code for models, datasets, callbacks, utilities, etc. * `scripts/` - scripts to run LLM workloads * `data_prep/` - convert text data from original sources to StreamingDataset format * `train/` - train or finetune HuggingFace and MPT models from 125M - 70B parameters * `train/benchmarking` - profile training throughput and MFU * `inference/` - convert models to HuggingFace or ONNX format, and generate responses * `inference/benchmarking` - profile inference latency and throughput * `eval/` - evaluate LLMs on academic (or custom) in-context-learning tasks * `mcli/` - launch any of these workloads using MCLI and the MosaicML platform * `TUTORIAL.md` - a deeper dive into the repo, example workflows, and FAQs

github

: 4.0k

awesome-mobile-llm

Awesome Mobile LLMs is a curated list of Large Language Models (LLMs) and related studies focused on mobile and embedded hardware. The repository includes information on various LLM models, deployment frameworks, benchmarking efforts, applications, multimodal LLMs, surveys on efficient LLMs, training LLMs on device, mobile-related use-cases, industry announcements, and related repositories. It aims to be a valuable resource for researchers, engineers, and practitioners interested in mobile LLMs.

github

: 53

UMOE-Scaling-Unified-Multimodal-LLMs

Uni-MoE is a MoE-based unified multimodal model that can handle diverse modalities including audio, speech, image, text, and video. The project focuses on scaling Unified Multimodal LLMs with a Mixture of Experts framework. It offers enhanced functionality for training across multiple nodes and GPUs, as well as parallel processing at both the expert and modality levels. The model architecture involves three training stages: building connectors for multimodal understanding, developing modality-specific experts, and incorporating multiple trained experts into LLMs using the LoRA technique on mixed multimodal data. The tool provides instructions for installation, weights organization, inference, training, and evaluation on various datasets.

github

: 728

Awesome-Code-LLM

Analyze the following text from a github repository (name and readme text at end) . Then, generate a JSON object with the following keys and provide the corresponding information for each key, in lowercase letters: 'description' (detailed description of the repo, must be less than 400 words，Ensure that no line breaks and quotation marks.),'for_jobs' (List 5 jobs suitable for this tool,in lowercase letters), 'ai_keywords' (keywords of the tool,user may use those keyword to find the tool,in lowercase letters), 'for_tasks' (list of 5 specific tasks user can use this tool to do,in lowercase letters), 'answer' (in english languages)

github

: 1.4k

llms-tools

The 'llms-tools' repository is a comprehensive collection of AI tools, open-source projects, and research related to Large Language Models (LLMs) and Chatbots. It covers a wide range of topics such as AI in various domains, open-source models, chats & assistants, visual language models, evaluation tools, libraries, devices, income models, text-to-image, computer vision, audio & speech, code & math, games, robotics, typography, bio & med, military, climate, finance, and presentation. The repository provides valuable resources for researchers, developers, and enthusiasts interested in exploring the capabilities of LLMs and related technologies.

github

: 106

rlhf_trojan_competition

This competition is organized by Javier Rando and Florian Tramèr from the ETH AI Center and SPY Lab at ETH Zurich. The goal of the competition is to create a method that can detect universal backdoors in aligned language models. A universal backdoor is a secret suffix that, when appended to any prompt, enables the model to answer harmful instructions. The competition provides a set of poisoned generation models, a reward model that measures how safe a completion is, and a dataset with prompts to run experiments. Participants are encouraged to use novel methods for red-teaming, automated approaches with low human oversight, and interpretability tools to find the trojans. The best submissions will be offered the chance to present their work at an event during the SaTML 2024 conference and may be invited to co-author a publication summarizing the competition results.

github

: 93

awesome-llm-and-aigc

github

: 500

awesome-ai-agents

github

: 8.8k

llm-course

github

: 246

llm-baselines

LLM-baselines is a modular codebase to experiment with transformers, inspired from NanoGPT. It provides a quick and easy way to train and evaluate transformer models on a variety of datasets. The codebase is well-documented and easy to use, making it a great resource for researchers and practitioners alike.

github

: 58

videogigagan-pytorch

Video GigaGAN - Pytorch is an implementation of Video GigaGAN, a state-of-the-art video upsampling technique developed by Adobe AI labs. The project aims to provide a Pytorch implementation for researchers and developers interested in video super-resolution. The codebase allows users to replicate the results of the original research paper and experiment with video upscaling techniques. The repository includes the necessary code and resources to train and test the GigaGAN model on video datasets. Researchers can leverage this implementation to enhance the visual quality of low-resolution videos and explore advancements in video super-resolution technology.

github

: 62

humanoid-gym

Humanoid-Gym is a reinforcement learning framework designed for training locomotion skills for humanoid robots, focusing on zero-shot transfer from simulation to real-world environments. It integrates a sim-to-sim framework from Isaac Gym to Mujoco for verifying trained policies in different physical simulations. The codebase is verified with RobotEra's XBot-S and XBot-L humanoid robots. It offers comprehensive training guidelines, step-by-step configuration instructions, and execution scripts for easy deployment. The sim2sim support allows transferring trained policies to accurate simulated environments. The upcoming features include Denoising World Model Learning and Dexterous Hand Manipulation. Installation and usage guides are provided along with examples for training PPO policies and sim-to-sim transformations. The code structure includes environment and configuration files, with instructions on adding new environments. Troubleshooting tips are provided for common issues, along with a citation and acknowledgment section.

github

: 388

EasyLM

EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving large language models in JAX/Flax. It simplifies the process by leveraging JAX's pjit functionality to scale up training to multiple TPU/GPU accelerators. Built on top of Huggingface's transformers and datasets, EasyLM offers an easy-to-use and customizable codebase for training large language models without the complexity found in other frameworks. It supports sharding model weights and training data across multiple accelerators, enabling multi-TPU/GPU training on a single host or across multiple hosts on Google Cloud TPU Pods. EasyLM currently supports models like LLaMA, LLaMA 2, and LLaMA 3.

github

: 2.3k

forust

Forust is a lightweight package for building gradient boosted decision tree ensembles. The algorithm code is written in Rust with a Python wrapper. It implements the same algorithm as XGBoost and provides nearly identical results. The package was developed to better understand XGBoost, as a fun project in Rust, and to experiment with adding new features to the algorithm in a simpler codebase. Forust allows training gradient boosted decision tree ensembles with multiple objective functions, predicting on datasets, inspecting model structures, calculating feature importance, and saving/loading trained boosters.

github

: 53

zeta

Zeta is a tool designed to build state-of-the-art AI models faster by providing modular, high-performance, and scalable building blocks. It addresses the common issues faced while working with neural nets, such as chaotic codebases, lack of modularity, and low performance modules. Zeta emphasizes usability, modularity, and performance, and is currently used in hundreds of models across various GitHub repositories. It enables users to prototype, train, optimize, and deploy the latest SOTA neural nets into production. The tool offers various modules like FlashAttention, SwiGLUStacked, RelativePositionBias, FeedForward, BitLinear, PalmE, Unet, VisionEmbeddings, niva, FusedDenseGELUDense, FusedDropoutLayerNorm, MambaBlock, Film, hyper_optimize, DPO, and ZetaCloud for different tasks in AI model development.

github

: 365

20 - OpenAI Gpts

AR 600-20, Army Command Policy Expert

An expert chatbot on AR 600-20, Army Command Policy

gpt

: 9

PhishCatcher

I'm PhishCatcher, an expert in identifying phishing attempts.

gpt

: 10+

Web Accessibility Navigator

Expert in web design & accessibility, offering assessments and development guidance.

gpt

: 6

The Train Traveler

Friendly train travel guide focusing on the best routes, essential travel information, and personalized travel insights, for both experienced and novice travelers.

gpt

: 40+