Best AI tools for training on a specific codebase
20 - AI tool Sites

TolyGPT
TolyGPT is an AI-powered chatbot that is designed to read an entire codebase and generate documentation. It is specifically trained on the Solana validator codebase, allowing users to ask questions about how the validator works. The core of TolyGPT is open source as Autodoc, and it is powered by the GPT-3.5 model. Users can apply to have TolyGPT work on their own codebase and stay updated by following Sam Hogan.

ChatCube
ChatCube is an AI-powered chatbot maker that allows users to create chatbots for their websites without coding. It uses advanced AI technology to train chatbots on any document or website within 60 seconds. ChatCube offers a range of features, including a user-friendly visual editor, lightning-fast integration, fine-tuning on specific data sources, data encryption and security, and customizable chatbots. By leveraging the power of AI, ChatCube helps businesses improve customer support efficiency and reduce support tickets by up to 28%.

Vize.ai
Vize.ai is a custom image recognition API provided by Ximilar, a leading company in Visual AI and Search. The tool offers powerful artificial intelligence capabilities with high accuracy using deep learning algorithms. It allows users to easily set up and implement cutting-edge vision automation without any development costs. Vize.ai enables users to train custom neural networks to recognize specific images and provides a scalable solution with continuous improvements in machine learning algorithms. The tool features an intuitive interface that requires no machine learning or coding knowledge, making it accessible for a wide range of users across industries.

Chatflot
Chatflot is an AI chatbot application that helps businesses automate up to 95% of customer queries. It allows users to create customized AI chatbots based on the ChatGPT language model, enabling them to provide on-demand information to customers through their website. Chatflot is suitable for various industries and offers features like training the chatbot on specific data, optimizing customer interactions, and integrating seamlessly with different CMS platforms. The application aims to enhance customer service, boost sales, and streamline support processes by providing personalized assistance and relevant information to users.

ChatGPT
ChatGPT is an AI-powered chatbot tool that allows users to create and train chatbots for various purposes. It uses advanced natural language processing to generate human-like responses and engage in conversations with users. With ChatGPT, you can customize the chatbot's responses, train it on specific topics, and integrate it into your website or application to provide interactive and personalized experiences for your users.

Safurai
Safurai is an AI-powered coding assistant that helps developers write code faster, safer, and better. It offers a range of features, including a textbox for asking questions and getting code suggestions, shortcuts for code optimization and unit testing, the ability to train the assistant on specific projects, and a natural language search for finding code. Safurai is compatible with various IDEs, including Visual Studio Code, IntelliJ, and PyCharm.

Cody
Cody is an intelligent AI assistant designed to boost team productivity by providing instant answers, support, troubleshooting, and idea generation. It can be trained on your business knowledge base to cater to your specific needs, making it a valuable asset for various departments such as marketing, HR, IT support, business consultancy, creative tasks, sales, training, hiring, customer support, and translation. Cody offers features like prompt manager, focus mode, conversation logs, scratchpad, and source checking, ensuring efficient and tailored assistance. With multilingual capabilities and customizable access controls, Cody prioritizes data security and user experience.

Ferdinand
Ferdinand is a platform designed to help users build their data culture and skills effortlessly. The application offers interactive mini-courses in Data & Analytics through a chatbot integrated with Slack. Users can learn from various themes such as marketing, sales, and product management. Ferdinand removes barriers to learning by delivering courses directly in Slack, allowing teams to develop their data knowledge conveniently. The platform also provides a robust payments platform with features like simplified card issuing, streamlined checkout, smart dashboard, optimized platforms, and faster transaction approval.

RunPod
RunPod is a cloud platform specifically designed for AI development and deployment. It offers a range of features to streamline the process of developing, training, and scaling AI models, including a library of pre-built templates, efficient training pipelines, and scalable deployment options. RunPod also provides access to a wide selection of GPUs, allowing users to choose the optimal hardware for their specific AI workloads.

Lamini
Lamini is an enterprise-level LLM platform that offers precise recall with Memory Tuning, enabling teams to achieve over 95% accuracy even with large amounts of specific data. It guarantees JSON output and delivers massive throughput for inference. Lamini is designed to be deployed anywhere, including air-gapped environments, and supports training and inference on Nvidia or AMD GPUs. The platform is known for its factual LLMs and reengineered decoder that ensures 100% schema accuracy in the JSON output.

Browse AI
Browse AI is an AI-powered web scraping and data monitoring tool that allows users to easily extract and monitor data from any website without the need for coding. With Browse AI, users can train robots in just 2 minutes to extract specific data in spreadsheet format, monitor data on a schedule, and browse prebuilt robots for popular use cases. The tool offers over 7,000 integrations, handles pagination and scrolling, solves captchas, and auto-adapts to site layout changes. Trusted by over 370,000 individuals and teams, Browse AI is a powerful solution for data extraction and monitoring tasks.

Helix AI
Helix AI is a private GenAI platform that enables users to build AI applications using open source models. The platform offers tools for RAG (Retrieval-Augmented Generation) and fine-tuning, allowing deployment on-premises or in a Virtual Private Cloud (VPC). Users can access curated models, utilize Helix API tools to connect internal and external APIs, embed Helix Assistants into websites/apps for chatbot functionality, write AI application logic in natural language, and benefit from the innovative RAG system for Q&A generation. Additionally, users can fine-tune models for domain-specific needs and deploy securely on Kubernetes or Docker in any cloud environment. Helix Cloud offers free and premium tiers with GPU priority, catering to individuals, students, educators, and companies of varying sizes.
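
The retrieval step of a RAG pipeline, as mentioned above, can be sketched roughly as follows. This is a toy illustration and not Helix's API: the function names (`tokenize`, `cosine`, `retrieve`, `build_prompt`) are hypothetical, and plain word-count vectors stand in for the learned embeddings a real system would likely use.

```python
import math
import re
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query: str, docs: List[str], k: int = 2) -> List[str]:
    """Return the k documents most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
    return ranked[:k]


def build_prompt(query: str, docs: List[str], k: int = 2) -> str:
    """Assemble a prompt that places the retrieved passages before the question."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


if __name__ == "__main__":
    # Made-up knowledge base entries, for illustration only.
    knowledge_base = [
        "Invoices are sent on the first day of each month.",
        "The API rate limit is 100 requests per minute.",
        "Support is available by email on weekdays.",
    ]
    print(build_prompt("What is the API rate limit?", knowledge_base, k=1))
```

The assembled prompt would then be passed to a language model; that generation step is left out here because it depends on the chosen model and serving stack.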

prompteasy.ai
Prompteasy.ai is an AI tool that allows users to fine-tune AI models in less than 5 minutes. It simplifies the process of training AI models on user data, making it as easy as having a conversation. Users can fully customize GPT by fine-tuning it to meet their specific needs. The tool offers data-driven customization, interactive AI coaching, and seamless model enhancement, providing users with a competitive edge and simplifying AI integration into their workflows.

ioni.ai
ioni.ai is an AI application that offers a ChatGPT-4 solution for customer support. It is a smart chatbot based on the latest AI technology, designed to handle general inquiries, complex questions, and user-specific requests. The application streamlines workflow with immediate responses, brings CSAT scores to a new level, and ensures human-in-the-loop verification for quality control. With self-learning capabilities, ioni.ai constantly improves its responses and provides accurate solutions to customer inquiries.

BotB9
BotB9 is an AI chatbot application that is trained with your business data to provide personalized and programmable video guides. It acts as the missing link between businesses and their customers, offering features like lead capture, order checkout, and customizable templates for various use cases. With BotB9, users can create their own AI chatbots, embed them on websites and mobile apps, and train them with specific business information to answer sales and support questions. The application allows for unlimited chats, custom branding, and theme customization, making it a versatile tool for businesses across different industries.

Support AI
Support AI is a custom AI chatbot application powered by ChatGPT that allows website owners to create personalized chatbots to provide instant answers to customers, capture leads, and enhance customer support. With Support AI, users can easily integrate AI chatbots on their websites, train them with specific content, and customize their behavior and responses. The application offers features such as capturing leads, providing accurate answers, handling bookings, collecting feedback, and offering product recommendations. Users can choose from different pricing plans based on their message volume and training content needs.

Frame AI
Frame AI is a premier Streaming AI Platform powered by STAG, designed to provide proactive insights and tools for every team by continuously querying customer data to detect traits, track trends, and trigger workflows. The platform turns unstructured data into actionable insights, helping teams stay ahead of risks and opportunities. Frame AI's architecture autonomously queries customer data based on user objectives, activating inside existing business tools to provide real-time customer data. With features like enrichments, triggers, alerts, and insights, Frame AI enables better decisions faster by combining predictive signals in customer text into task-specific scores. The platform is suitable for marketing, CX, support, and product teams, offering real-time usability feedback, demographic and psychographic trait detection, and secure data handling. Frame AI is SOC 2 Type II certified and HIPAA compliant, with a team of AI experts leading the development of AI solutions for various organizations.

Botsonic
Botsonic is an AI chatbot application that offers custom AI chatbots for websites. It provides AI-powered automation solutions for various industries, enabling businesses to enhance customer engagement, support, sales, and more. Botsonic uses AI Copilots trained on data to deliver authentic customer experiences in multiple languages across different channels. The platform allows users to easily create, customize, and integrate AI chatbots into their websites, providing instant support and personalized interactions.

ColdIQ
ColdIQ is an AI-powered sales prospecting tool that helps B2B companies with revenue above $100k/month to build outbound systems that sell for them. The tool offers end-to-end cold outreach campaign setup and management, email infrastructure setup and warmup, audience research and targeting, data scraping and enrichment, campaign optimization, sending automation, sales systems implementation, training on tool best practices, sales tool recommendations, free gap analysis, sales consulting, and copywriting frameworks. ColdIQ leverages AI to tailor messaging to each prospect, automate outreach, and flood calendars with opportunities.

EmbedAI
EmbedAI is a platform that allows users to create custom AI chatbots powered by ChatGPT using their own data. The platform enables users to train AI chatbots on various sources such as files, websites, and YouTube videos, and customize the chatbot's appearance. EmbedAI supports over 100 languages and offers easy integration with other applications via API or Zapier. Users can share their AI chatbots with others and embed them on websites. The platform aims to provide efficient management of information and automated responses to user queries.

20 - Open Source AI Tools

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.

prometheus-eval
Prometheus-Eval is a repository dedicated to evaluating large language models (LLMs) in generation tasks. It provides state-of-the-art language models such as Prometheus 2 (7B & 8x7B) that assess responses in pairwise ranking formats and achieve high correlation scores with benchmarks. The repository includes tools for training, evaluating, and using these models, along with scripts for fine-tuning on custom datasets. Prometheus aims to address issues like fairness, controllability, and affordability in evaluations by simulating human judgments and proprietary LM-based assessments.

awesome-generative-ai
A curated list of Generative AI projects, tools, artworks, and models

pr-agent
PR-Agent is a tool that helps to efficiently review and handle pull requests by providing AI feedback and suggestions. It supports various commands such as generating PR descriptions, providing code suggestions, answering questions about the PR, and updating the CHANGELOG.md file. PR-Agent can be used via CLI, GitHub Action, GitHub App, or Docker, and supports multiple git providers and models. It emphasizes real-life practical usage, with each tool having a single GPT-4 call for quick and affordable responses. The PR Compression strategy enables effective handling of both short and long PRs, while the JSON prompting strategy allows for modular and customizable tools. PR-Agent Pro, the hosted version by CodiumAI, provides additional benefits such as full management, improved privacy, priority support, and extra features.

hi-ml
The Microsoft Health Intelligence Machine Learning Toolbox is a repository that provides low-level and high-level building blocks for Machine Learning / AI researchers and practitioners. It simplifies and streamlines work on deep learning models for healthcare and life sciences by offering tested components such as data loaders, pre-processing tools, deep learning models, and cloud integration utilities. The repository includes three Python packages: 'hi-ml-azure' for helper functions in AzureML, 'hi-ml' for ML components, and 'hi-ml-cpath' for models and workflows related to histopathology images.

llm-foundry
LLM Foundry is a codebase for training, finetuning, evaluating, and deploying LLMs for inference with Composer and the MosaicML platform. It is designed to be easy-to-use, efficient _and_ flexible, enabling rapid experimentation with the latest techniques. You'll find in this repo:
* `llmfoundry/` - source code for models, datasets, callbacks, utilities, etc.
* `scripts/` - scripts to run LLM workloads
* `data_prep/` - convert text data from original sources to StreamingDataset format
* `train/` - train or finetune HuggingFace and MPT models from 125M - 70B parameters
* `train/benchmarking` - profile training throughput and MFU
* `inference/` - convert models to HuggingFace or ONNX format, and generate responses
* `inference/benchmarking` - profile inference latency and throughput
* `eval/` - evaluate LLMs on academic (or custom) in-context-learning tasks
* `mcli/` - launch any of these workloads using MCLI and the MosaicML platform
* `TUTORIAL.md` - a deeper dive into the repo, example workflows, and FAQs

awesome-ai-devtools
Awesome AI-Powered Developer Tools is a curated list of AI-powered developer tools that leverage AI to assist developers in tasks such as code completion, refactoring, debugging, documentation, and more. The repository includes a wide range of tools, from IDEs and Git clients to assistants, agents, app generators, UI generators, snippet generators, documentation tools, code generation tools, agent platforms, OpenAI plugins, search tools, and testing tools. These tools are designed to enhance developer productivity and streamline various development tasks by integrating AI capabilities.

UMOE-Scaling-Unified-Multimodal-LLMs
Uni-MoE is a MoE-based unified multimodal model that can handle diverse modalities including audio, speech, image, text, and video. The project focuses on scaling Unified Multimodal LLMs with a Mixture of Experts framework. It offers enhanced functionality for training across multiple nodes and GPUs, as well as parallel processing at both the expert and modality levels. The model architecture involves three training stages: building connectors for multimodal understanding, developing modality-specific experts, and incorporating multiple trained experts into LLMs using the LoRA technique on mixed multimodal data. The tool provides instructions for installation, weights organization, inference, training, and evaluation on various datasets.

chatgpt-universe
ChatGPT is a large language model that can generate human-like text, translate languages, write different kinds of creative content, and answer your questions in a conversational way. It is trained on a massive amount of text data, and it is able to understand and respond to a wide range of natural language prompts.

forust
Forust is a lightweight package for building gradient boosted decision tree ensembles. The algorithm code is written in Rust with a Python wrapper. It implements the same algorithm as XGBoost and provides nearly identical results. The package was developed to better understand XGBoost, as a fun project in Rust, and to experiment with adding new features to the algorithm in a simpler codebase. Forust allows training gradient boosted decision tree ensembles with multiple objective functions, predicting on datasets, inspecting model structures, calculating feature importance, and saving/loading trained boosters.
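
The general idea behind gradient boosted trees can be sketched in a few lines of plain Python. This is a simplified illustration, not Forust's API and not the exact XGBoost algorithm: it assumes a single numeric feature, squared-error loss, and depth-1 trees (stumps), and it omits the second-order gradients and regularization that XGBoost-style implementations use. All names (`BoostedModel`, `fit_stump`, `fit`) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# A stump is (threshold, value_if_x_le_threshold, value_if_x_gt_threshold).
Stump = Tuple[float, float, float]


@dataclass
class BoostedModel:
    base: float                      # initial prediction (mean of the targets)
    learning_rate: float             # shrinkage applied to every stump
    stumps: List[Stump] = field(default_factory=list)

    def predict(self, x: float) -> float:
        """Sum the base prediction and the shrunken output of every stump."""
        total = self.base
        for threshold, left, right in self.stumps:
            total += self.learning_rate * (left if x <= threshold else right)
        return total


def fit_stump(xs: List[float], residuals: List[float]) -> Stump:
    """Pick the split on x that minimizes the squared error of the residuals."""
    best: Stump = (float("inf"), sum(residuals) / len(residuals), 0.0)
    best_err = float("inf")
    for t in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        if not left or not right:
            continue  # a split must put at least one sample on each side
        lv = sum(left) / len(left)
        rv = sum(right) / len(right)
        err = sum((r - lv) ** 2 for r in left) + sum((r - rv) ** 2 for r in right)
        if err < best_err:
            best_err, best = err, (t, lv, rv)
    return best


def fit(xs: List[float], ys: List[float],
        n_rounds: int = 50, learning_rate: float = 0.3) -> BoostedModel:
    """Each round fits a stump to the residuals left by the current ensemble."""
    model = BoostedModel(base=sum(ys) / len(ys), learning_rate=learning_rate)
    for _ in range(n_rounds):
        residuals = [y - model.predict(x) for x, y in zip(xs, ys)]
        model.stumps.append(fit_stump(xs, residuals))
    return model


if __name__ == "__main__":
    # Made-up step-shaped data, for illustration only.
    xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
    ys = [1.0, 1.0, 1.0, 5.0, 5.0, 5.0]
    model = fit(xs, ys)
    print([round(model.predict(x), 3) for x in xs])
```

For squared-error loss the residuals equal the negative gradient of the loss, which is why fitting each new tree to the residuals amounts to a gradient step in function space.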

generative-models
Generative Models by Stability AI is a repository that provides various generative models for research purposes. It includes models like Stable Video 4D (SV4D) for video synthesis, Stable Video 3D (SV3D) for multi-view synthesis, SDXL-Turbo for text-to-image generation, and more. The repository focuses on modularity and implements a config-driven approach for building and combining submodules. It supports training with PyTorch Lightning and offers inference demos for different models. Users can access pre-trained models like SDXL-base-1.0 and SDXL-refiner-1.0 under a CreativeML Open RAIL++-M license. The codebase also includes tools for invisible watermark detection in generated images.

cambrian
Cambrian-1 is a fully open project focused on exploring multimodal Large Language Models (LLMs) with a vision-centric approach. It offers competitive performance across various benchmarks with models at different parameter levels. The project includes training configurations, model weights, instruction tuning data, and evaluation details. Users can interact with Cambrian-1 through a Gradio web interface for inference. The project is inspired by LLaVA and incorporates contributions from Vicuna, LLaMA, and Yi. Cambrian-1 is licensed under Apache 2.0 and utilizes datasets and checkpoints subject to their respective original licenses.

catalyst
Catalyst is a C# Natural Language Processing library designed for speed, inspired by spaCy's design. It provides pre-trained models, support for training word and document embeddings, and flexible entity recognition models. The library is fast, modern, and pure-C#, supporting .NET Standard 2.0. It is cross-platform, running on Windows, Linux, macOS, and ARM. Catalyst offers non-destructive tokenization, named entity recognition, part-of-speech tagging, language detection, and efficient binary serialization. It includes pre-built models for language packages and lemmatization. Users can store and load models using streams. Getting started with Catalyst involves installing its NuGet Package and setting the storage to use the online repository. The library supports lazy loading of models from disk or online. Users can take advantage of C# lazy evaluation and native multi-threading support to process documents in parallel. Training a new FastText word2vec embedding model is straightforward, and Catalyst also provides algorithms for fast embedding search and dimensionality reduction.

Awesome-Code-LLM

matsciml
The Open MatSci ML Toolkit is a flexible framework for machine learning in materials science. It provides a unified interface to a variety of materials science datasets, as well as a set of tools for data preprocessing, model training, and evaluation. The toolkit is designed to be easy to use for both beginners and experienced researchers, and it can be used to train models for a wide range of tasks, including property prediction, materials discovery, and materials design.

awesome-transformer-nlp
This repository contains a hand-curated list of great machine (deep) learning resources for Natural Language Processing (NLP) with a focus on Generative Pre-trained Transformer (GPT), Bidirectional Encoder Representations from Transformers (BERT), attention mechanism, Transformer architectures/networks, Chatbot, and transfer learning in NLP.

20 - OpenAI GPTs

RailwayGPT
Technical expert on locomotives, trains, signalling, and railway technology. Can answer questions and draw designs specific to transportation domain.

Web Accessibility Navigator
Expert in web design & accessibility, offering assessments and development guidance.

The Train Traveler
Friendly train travel guide focusing on the best routes, essential travel information, and personalized travel insights, for both experienced and novice travelers.

ADA Consultant
Expert on ADA compliance, here to guide you through accessibility standards and regulations. ***Not legal advice.***

Labor and Employment Law Advisor
Advises businesses on labor and employment law compliance.

Emergency Training
Provides emergency training assistance with a focus on safety and clear guidelines.

Intelligently Designed ERP
ERP expert with a focus on Program Management, Business Analysis, and Systems Analysis utilizing Agile and PMBOK principles.

The Relevance Report 2024
Learn what current and future leaders in communication think about AI's impact on our industry!

Cyber Shielder
Expert in cyber security (NIST, OWASP, NIS2, MITRE ATT&CK, DORA) and GDPR, offering clear and concise guidance.

Diversity & Inclusion Advisor
Promotes inclusive culture and diversity within the organization.

H&J Medical Supplies HIPAA Compliance Expert
Expert in HIPAA compliance for medical supplies.

ProtectED
A safeguarding advisor for schools, aligned with 'Keeping Children Safe In Education' guidelines.

弍号機 まもる ISO Guardian
Advisor well versed in ISO27001 and ISO/IEC 27002 best practices.

USA Employment Law Master
Expert in answering Employment Law queries for small businesses in the USA.