Best AI tools for < Build Large Language Models >
20 - AI tool Sites
![Sarvam AI Screenshot](/screenshots/sarvam.ai.jpg)
Sarvam AI
Sarvam AI is an AI application focused on leading transformative research in AI to develop, deploy, and distribute Generative AI applications in India. The platform aims to build efficient large language models for India's diverse linguistic culture and enable new GenAI applications through bespoke enterprise models. Sarvam AI is also developing an enterprise-grade platform for developing and evaluating GenAI apps, while contributing to open-source models and datasets to accelerate AI innovation.
![Floneum Screenshot](/screenshots/floneum.com.jpg)
Floneum
Floneum is an AI-powered tool for language tasks. It lets users build workflows around large language models through a drag-and-drop interface. Floneum can be securely extended with WebAssembly plugins, which users can write in languages such as Rust, C, Java, or Go. The tool ships with 41 built-in plugins for tasks such as text generation, search engine operations, file manipulation, Python script execution, browser automation, and more.
![LangChain Screenshot](/screenshots/python.langchain.com.jpg)
LangChain
LangChain is a framework for developing applications powered by large language models (LLMs). It simplifies every stage of the LLM application lifecycle, including development, productionization, and deployment. LangChain consists of open-source libraries such as langchain-core, langchain-community, and partner packages. It also includes LangGraph for building stateful agents and LangSmith for debugging and monitoring LLM applications.
![Arcee AI Screenshot](/screenshots/arcee.ai.jpg)
Arcee AI
Arcee AI is a platform that offers a cost-effective, secure, end-to-end solution for building and deploying Small Language Models (SLMs). It allows users to merge and train custom language models by leveraging open source models and their own data. The platform is known for its Model Merging technique, which combines the power of pre-trained Large Language Models (LLMs) with user-specific data to create high-performing models across various industries.
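As a rough illustration of what merging two models can mean, the sketch below linearly interpolates the parameters of two checkpoints that share the same architecture. This is one of the simplest merging schemes and is not claimed to be Arcee AI's own technique; the `merge_linear` helper, the `alpha` parameter, and the toy weight dictionaries are all hypothetical.

```python
from typing import Dict, List

# Hypothetical representation: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def merge_linear(a: Weights, b: Weights, alpha: float = 0.5) -> Weights:
    """Interpolate two checkpoints per parameter: alpha * a + (1 - alpha) * b."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if a.keys() != b.keys():
        raise ValueError("models must share the same parameter names")
    merged: Weights = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            raise ValueError(f"size mismatch for parameter {name!r}")
        merged[name] = [
            alpha * x + (1.0 - alpha) * y for x, y in zip(a[name], b[name])
        ]
    return merged


# Toy checkpoints standing in for a base model and a fine-tuned variant.
base = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
tuned = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = merge_linear(base, tuned, alpha=0.5)
print(merged)
```

With `alpha=0.5` each merged value is the plain average of the two inputs; moving `alpha` toward 1.0 keeps the result closer to the first checkpoint.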
![ThirdAI Screenshot](/screenshots/thirdai.com.jpg)
ThirdAI
ThirdAI is a production-ready AI platform designed for enterprises, offering a comprehensive solution for building AI applications without the need for extensive proof-of-concept cycles or manual fine-tuning. The platform provides enterprise-grade productivity tools, document search and retrieval, content creation, FAQ bots, customer live support, hyper-personalization, risk and compliance features, fraud detection, anomaly detection, and PII/sensitive data redaction. ThirdAI allows users to bring their business problems, apply them to data, and compose AI applications effortlessly. The platform supports no-code customization, turnkey deployment, and user engagement data for best-in-class accuracy.
![Imaginary Programming Screenshot](/screenshots/imaginary.dev.jpg)
Imaginary Programming
Imaginary Programming is an AI tool that allows frontend developers to leverage OpenAI's GPT engine to add human-like intelligence to their code effortlessly. By defining function prototypes in TypeScript, developers can access GPT's capabilities without the need for AI model training. The tool enables users to extract structured data, generate text, classify data based on intent or emotion, and parse unstructured language. Imaginary Programming is designed to help developers tackle new challenges and enhance their projects with AI intelligence.
![Langtail Screenshot](/screenshots/langtail.com.jpg)
Langtail
Langtail is a platform that helps developers build, test, and deploy AI-powered applications. It provides a suite of tools to help developers debug prompts, run tests, and monitor the performance of their AI models. Langtail also offers a community forum where developers can share tips and tricks, and get help from other users.
![Cohere Screenshot](/screenshots/cohere.com.jpg)
Cohere
Cohere is the leading AI platform for enterprise, offering products optimized for generative AI, search and discovery, and advanced retrieval. Their models are designed to enhance the global workforce, enabling businesses to thrive in the AI era. Cohere provides Command R+, Cohere Command, Cohere Embed, and Cohere Rerank for building efficient AI-powered applications. The platform also offers deployment options for enterprise-grade AI on any cloud or on-premises, along with developer resources like Playground, LLM University, and Developer Docs.
![Cohere Screenshot](/screenshots/cohere.ai.jpg)
Cohere
Cohere is the leading AI platform for enterprise, offering generative AI, search and discovery, and advanced retrieval solutions. Their models are designed to enhance the global workforce, empowering businesses to thrive in the AI era. With features like Cohere Command, Cohere Embed, and Cohere Rerank, the platform enables the development of scalable and efficient AI-powered applications. Cohere focuses on optimizing enterprise data through language-based models, supporting over 100 languages for enhanced accuracy and efficiency.
![LlamaIndex Screenshot](/screenshots/llamaindex.ai.jpg)
LlamaIndex
LlamaIndex is a leading data framework designed for building LLM (Large Language Model) applications. It allows enterprises to turn their data into production-ready applications by providing functionalities such as loading data from various sources, indexing data, orchestrating workflows, and evaluating application performance. The platform offers extensive documentation, community-contributed resources, and integration options to support developers in creating innovative LLM applications.
![AI Builders Summit Screenshot](/screenshots/summit.ai.jpg)
AI Builders Summit
AI Builders Summit is a 4-week virtual training event designed to equip data scientists, ML and AI engineers, and innovators with the latest advancements in large language models (LLMs), AI agents, and Retrieval-Augmented Generation (RAG). The summit emphasizes hands-on learning and real-world applications, with interactive workshops, platform credits, and direct exposure to industry-leading tools. Attendees can learn progressively over four weeks, building practical skills through expert-led sessions, cutting-edge tools, and industry insights.
![Avanzai Screenshot](/screenshots/avanz.ai.jpg)
Avanzai
Avanzai is a workflow automation tool designed specifically for financial services. It utilizes AI agents to transform financial datasets into actionable insights, simplifying financial data analysis for institutions. Users can build charts with public data, create agents for data customization, and transform data quickly with Avanzai's services. The tool offers features such as macro analysis, instrument screening, risk analytics, and factor modeling. Avanzai provides a simple pricing structure for professional and enterprise users, allowing access to a wide range of financial data and integration options.
![FutureSmart AI Screenshot](/screenshots/blog.futuresmart.ai.jpg)
FutureSmart AI
FutureSmart AI is a platform that provides custom Natural Language Processing (NLP) solutions. The platform focuses on integrating Mem0 with LangChain to enhance AI Assistants with Intelligent Memory. It offers tutorials, guides, and practical tips for building applications with large language models (LLMs) to create sophisticated and interactive systems. FutureSmart AI also features internship journeys and practical guides for mastering RAG with LangChain, catering to developers and enthusiasts in the realm of NLP and AI.
![Thales Labs AI Screenshot](/screenshots/thaleslabs.ai.jpg)
Thales Labs AI
Thales Labs is a premier AI research lab and incubator empowering entrepreneurs and domain experts to revolutionize industries with large language models and web3. They focus on fostering innovation in sectors like Insurance, Finance, Healthcare, Pharma, Law, and Journalism. The user-friendly app allows experts to build AI applications using their natural language skills, with support from skilled engineers for complex challenges. Join Thales Labs to transform industries, unlock new opportunities, and create value with AI-driven innovation.
![Allganize Screenshot](/screenshots/allganize.ai.jpg)
Allganize
Allganize Inc. is a leading provider of enterprise AI solutions. Their platform enables businesses to build and deploy custom AI applications without the need for coding. Allganize's solutions are used by a variety of industries, including financial services, healthcare, and manufacturing.
![Langtrace AI Screenshot](/screenshots/langtrace.ai.jpg)
Langtrace AI
Langtrace AI is an open-source observability tool powered by Scale3 Labs that helps monitor, evaluate, and improve LLM (Large Language Model) applications. It collects and analyzes traces and metrics to provide insights into the ML pipeline, ensuring security through SOC 2 Type II certification. Langtrace supports popular LLMs, frameworks, and vector databases, offering end-to-end observability and the ability to build and deploy AI applications with confidence.
![Surge AI Screenshot](/screenshots/surgehq.ai.jpg)
Surge AI
Surge AI is a data labeling platform that provides human-generated data for training and evaluating large language models (LLMs). It offers a global workforce of annotators who can label data in over 40 languages. Surge AI's platform is designed to be easy to use and integrates with popular machine learning tools and frameworks. The company's customers include leading AI companies, research labs, and startups.
![Intel Gaudi AI Accelerator Developer Screenshot](/screenshots/developer.habana.ai.jpg)
Intel Gaudi AI Accelerator Developer
The Intel Gaudi AI accelerator developer website provides resources, guidance, tools, and support for building, migrating, and optimizing AI models. It offers software, model references, libraries, containers, and tools for training and deploying Generative AI and Large Language Models. The site focuses on the Intel Gaudi accelerators, including tutorials, documentation, and support for developers to enhance AI model performance.
![Ragobble Screenshot](/screenshots/ragobble.com.jpg)
Ragobble
Ragobble is an audio-to-LLM data tool that converts audio files into text data that can be used to train large language models (LLMs). With Ragobble, you can quickly create high-quality training data for your LLM projects.
![YourGPT Screenshot](/screenshots/yourgpt.ai.jpg)
YourGPT
YourGPT is a suite of next-generation AI products designed to empower businesses with the potential of Large Language Models (LLMs). Its products include a no-code AI Chatbot solution for customer support and LLM Spark, a developer platform for building and deploying production-ready LLM applications. YourGPT prioritizes data security and is GDPR compliant, ensuring the privacy and protection of customer data. With over 2,000 satisfied customers, YourGPT has earned trust through its commitment to quality and customer satisfaction.
20 - Open Source AI Tools
![llmops-duke-aipi Screenshot](/screenshots_githubs/alfredodeza-llmops-duke-aipi.jpg)
llmops-duke-aipi
LLMOps Duke AIPI is a course on operationalizing Large Language Models. It teaches how to develop LLM applications using software development best practices. Topics include generative AI concepts, setting up development environments, interacting with large language models, using local large language models, applied solutions with LLMs, extensibility using plugins and functions, retrieval augmented generation, an introduction to Python web frameworks for APIs, DevOps principles, deploying machine learning APIs, LLM platforms, and final presentations. Students learn to build, share, and present portfolios using GitHub, YouTube, and LinkedIn, and to develop non-linear, lifelong learning skills. Prerequisites include basic Linux and programming skills, with coursework available in Python or Rust. Additional resources and references are provided for further learning.
![learn-generative-ai Screenshot](/screenshots_githubs/panaverse-learn-generative-ai.jpg)
learn-generative-ai
Learn Cloud Applied Generative AI Engineering (GenEng) is a course focusing on the application of generative AI technologies in various industries. The course covers topics such as the economic impact of generative AI, the role of developers in adopting and integrating generative AI technologies, and the future trends in generative AI. Students will learn about tools like OpenAI API, LangChain, and Pinecone, and how to build and deploy Large Language Models (LLMs) for different applications. The course also explores the convergence of generative AI with Web 3.0 and its potential implications for decentralized intelligence.
![Awesome-LLM Screenshot](/screenshots_githubs/Hannibal046-Awesome-LLM.jpg)
Awesome-LLM
Awesome-LLM is a curated list of resources related to large language models, focusing on papers, projects, frameworks, tools, tutorials, courses, opinions, and other useful resources in the field. It covers trending LLM projects, milestone papers, other papers, open LLM projects, LLM training frameworks, LLM evaluation frameworks, tools for deploying LLM, prompting libraries & tools, tutorials, courses, books, and opinions. The repository provides a comprehensive overview of the latest advancements and resources in the field of large language models.
![nlp-llms-resources Screenshot](/screenshots_githubs/nlpfromscratch-nlp-llms-resources.jpg)
nlp-llms-resources
The 'nlp-llms-resources' repository is a comprehensive resource list for Natural Language Processing (NLP) and Large Language Models (LLMs). It covers a wide range of topics including traditional NLP datasets, data acquisition, libraries for NLP, neural networks, sentiment analysis, optical character recognition, information extraction, semantics, topic modeling, multilingual NLP, domain-specific LLMs, vector databases, ethics, costing, books, courses, surveys, aggregators, newsletters, papers, conferences, and societies. The repository provides valuable information and resources for individuals interested in NLP and LLMs.
![llm-hosting-container Screenshot](/screenshots_githubs/awslabs-llm-hosting-container.jpg)
llm-hosting-container
The LLM Hosting Container repository provides Dockerfile and associated resources for building and hosting containers for large language models, specifically the HuggingFace Text Generation Inference (TGI) container. This tool allows users to easily deploy and manage large language models in a containerized environment, enabling efficient inference and deployment of language-based applications.
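For orientation, a TGI server accepts generation requests as JSON with an `inputs` string and an optional `parameters` object. The sketch below only builds such a request body; the `build_tgi_request` helper is hypothetical, and how the body is sent (endpoint URL, authentication, client library) depends on the deployment and is deliberately left out.

```python
import json
from typing import Any, Dict


def build_tgi_request(
    prompt: str, max_new_tokens: int = 64, temperature: float = 0.7
) -> Dict[str, Any]:
    """Build a JSON-serializable body in the TGI generate-request shape."""
    if max_new_tokens < 1:
        raise ValueError("max_new_tokens must be positive")
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
    }


payload = build_tgi_request("What is a large language model?", max_new_tokens=32)
body = json.dumps(payload)  # string to send as the HTTP request body
print(body)
```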
![LLMs-from-scratch Screenshot](/screenshots_githubs/rasbt-LLMs-from-scratch.jpg)
LLMs-from-scratch
This repository contains the code for developing, pretraining, and finetuning a GPT-like LLM and is the official code repository for the book _Build a Large Language Model (From Scratch)_. The book explains how LLMs work from the inside out, guiding the reader step by step through creating their own LLM with clear text, diagrams, and examples. The method it describes for training and developing a small but functional model for educational purposes mirrors the approach used in creating large-scale foundational models such as those behind ChatGPT.
![LLMs-from-scratch-CN Screenshot](/screenshots_githubs/MLNLP-World-LLMs-from-scratch-CN.jpg)
LLMs-from-scratch-CN
This repository is a Chinese translation of the GitHub project 'LLMs-from-scratch', including detailed markdown notes and related Jupyter code. The translation process aims to maintain the accuracy of the original content while optimizing the language and expression to better suit Chinese learners' reading habits. The repository features detailed Chinese annotations for all Jupyter code, aiding users in practical implementation. It also provides various supplementary materials to expand knowledge. The project focuses on building Large Language Models (LLMs) from scratch, covering fundamental constructions like Transformer architecture, sequence modeling, and delving into deep learning models such as GPT and BERT. Each part of the project includes detailed code implementations and learning resources to help users construct LLMs from scratch and master their core technologies.
![LLM-workshop-2024 Screenshot](/screenshots_githubs/rasbt-LLM-workshop-2024.jpg)
LLM-workshop-2024
LLM-workshop-2024 is a tutorial designed for coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from scratch in PyTorch. The tutorial covers topics such as introduction to LLMs, understanding LLM input data, coding LLM architecture, pretraining LLMs, loading pretrained weights, and finetuning LLMs using open-source libraries. Participants will learn to implement a small GPT-like LLM, including data input pipeline, core architecture components, and pretraining code.
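To make the "data input pipeline" step concrete, here is a minimal sketch of how token IDs are commonly sliced into input/target pairs for next-token prediction, with each target sequence being the input shifted by one position. It is an illustration under simplifying assumptions, not the workshop's own code: the whitespace "tokenizer", the `make_training_pairs` helper, and the `context_length` and `stride` values are all hypothetical.

```python
from typing import List, Tuple


def make_training_pairs(
    token_ids: List[int], context_length: int, stride: int
) -> List[Tuple[List[int], List[int]]]:
    """Slide a window over token_ids; targets are inputs shifted right by one."""
    pairs = []
    for start in range(0, len(token_ids) - context_length, stride):
        inputs = token_ids[start : start + context_length]
        targets = token_ids[start + 1 : start + context_length + 1]
        pairs.append((inputs, targets))
    return pairs


# Toy word-level tokenizer (real pipelines typically use subword tokenizers).
text = "the quick brown fox jumps over the lazy dog"
words = text.split()
vocab = {word: idx for idx, word in enumerate(sorted(set(words)))}
token_ids = [vocab[word] for word in words]

pairs = make_training_pairs(token_ids, context_length=4, stride=4)
for inputs, targets in pairs:
    print(inputs, "->", targets)
```

Setting `stride` equal to `context_length` yields non-overlapping windows; a smaller stride produces more, overlapping training examples from the same text.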
![awesome-LLM-resourses Screenshot](/screenshots_githubs/WangRongsheng-awesome-LLM-resourses.jpg)
awesome-LLM-resourses
A comprehensive repository of resources for Chinese large language models (LLMs), including data processing tools, fine-tuning frameworks, inference libraries, evaluation platforms, RAG engines, agent frameworks, books, courses, tutorials, and tips. The repository covers a wide range of tools and resources for working with LLMs, from data labeling and processing to model fine-tuning, inference, evaluation, and application development. It also includes resources for learning about LLMs through books, courses, and tutorials, as well as insights and strategies from building with LLMs.
![LLMLanding Screenshot](/screenshots_githubs/DayDreamChaser-LLMLanding.jpg)
LLMLanding
LLMLanding is a repository focused on practical implementation of large models, covering topics from theory to practice. It provides a structured learning path for training large models, including specific tasks like training 1B-scale models, exploring SFT, and working on specialized tasks such as code generation, NLP tasks, and domain-specific fine-tuning. The repository emphasizes a dual learning approach: quickly applying existing tools for immediate output benefits and delving into foundational concepts for long-term understanding. It offers detailed resources and pathways for in-depth learning based on individual preferences and goals, combining theory with practical application to avoid overwhelm and ensure sustained learning progress.
![ml-road-map Screenshot](/screenshots_githubs/loganthorneloe-ml-road-map.jpg)
ml-road-map
The Machine Learning Road Map is a comprehensive guide designed to take individuals from various levels of machine learning knowledge to a basic understanding of machine learning principles using high-quality, free resources. It aims to simplify the complex and rapidly growing field of machine learning by providing a structured roadmap for learning. The guide emphasizes the importance of understanding AI for everyone, the need for patience in learning machine learning due to its complexity, and the value of learning from experts in the field. It covers five different paths to learning about machine learning, catering to consumers, aspiring AI researchers, ML engineers, developers interested in building ML applications, and companies looking to implement AI solutions.
![DecryptPrompt Screenshot](/screenshots_githubs/DSXiangLi-DecryptPrompt.jpg)
DecryptPrompt
This repository is not a tool but a curated collection of resources on prompts and large language models, including paper summaries, open-source data and models, and AIGC applications. It also points academics who feel depressed or overwhelmed by the rapid advances in the field to material on how to cope with working in a fast-paced and competitive environment.
![scylla Screenshot](/screenshots_githubs/imWildCat-scylla.jpg)
scylla
Scylla is an intelligent proxy pool tool designed for humanities, enabling users to extract content from the internet and build their own Large Language Models in the AI era. It features automatic proxy IP crawling and validation, an easy-to-use JSON API, a simple web-based user interface, HTTP forward proxy server, Scrapy and requests integration, and headless browser crawling. Users can start using Scylla with just one command, making it a versatile tool for various web scraping and content extraction tasks.
![llms-from-scratch-cn Screenshot](/screenshots_githubs/datawhalechina-llms-from-scratch-cn.jpg)
llms-from-scratch-cn
This repository provides a detailed tutorial on how to build your own large language model (LLM) from scratch. It includes all the code necessary to create a GPT-like LLM, covering the encoding, pre-training, and fine-tuning processes. The tutorial is written in a clear and concise style, with plenty of examples and illustrations to help you understand the concepts involved. It is suitable for developers and researchers with some programming experience who are interested in learning more about LLMs and how to build them.
![start-llms Screenshot](/screenshots_githubs/louisfb01-start-llms.jpg)
start-llms
This repository is a comprehensive guide for individuals looking to start and improve their skills in Large Language Models (LLMs) without an advanced background in the field. It provides free resources, online courses, books, articles, and practical tips to become an expert in machine learning. The guide covers topics such as terminology, transformers, prompting, retrieval augmented generation (RAG), and more. It also includes recommendations for podcasts, YouTube videos, and communities to stay updated with the latest news in AI and LLMs.
20 - OpenAI GPTs
![VitalsGPT [V0.0.2.2] Screenshot](/screenshots_gpts/g-cL1rJdm11.jpg)
VitalsGPT [V0.0.2.2]
A simple custom GPT built on the Vitals inquiry case in Malta, intended to help journalists and citizens navigate the inquiry's large dataset in a neutral, informative fashion. Always cross-reference replies against the actual data. Do not rely solely on this LLM for verification of facts.
![Build a Brand Screenshot](/screenshots_gpts/g-ac8fkSk7r.jpg)
Build a Brand
Unique custom images based on your input. Just type ideas and the brand image is created.
![Beam Eye Tracker Extension Copilot Screenshot](/screenshots_gpts/g-D48Iu2zY3.jpg)
Beam Eye Tracker Extension Copilot
Build extensions using the Eyeware Beam eye tracking SDK
![Business Model Canvas Strategist Screenshot](/screenshots_gpts/g-lM6dmUVQm.jpg)
Business Model Canvas Strategist
Business Model Canvas Creator - Build and evaluate your business model
![League Champion Builder GPT Screenshot](/screenshots_gpts/g-3CURsOhEw.jpg)
League Champion Builder GPT
Build your own League of Legends Style Champion with Abilities, Back Story and Splash Art
![RenovaTecno Screenshot](/screenshots_gpts/g-g6dFw7dcN.jpg)
RenovaTecno
Your tech buddy helping you refurbish or build a PC from scratch, tailored to your needs, budget, and language.
![Gradle Expert Screenshot](/screenshots_gpts/g-3VN8IyOdc.jpg)
Gradle Expert
Your expert in Gradle build configuration, offering clear, practical advice.
![XRPL GPT Screenshot](/screenshots_gpts/g-Sjy2xMl6f.jpg)
XRPL GPT
Build on the XRP Ledger with assistance from this GPT trained on extensive documentation and code samples.