awesome-spring-ai
A curated list of awesome resources, tools, tutorials, and projects for building generative AI applications using Spring AI
Stars: 360
Awesome Spring AI is a curated list of resources, tools, tutorials, and projects for building generative AI applications using Spring AI. It provides a familiar developer experience for integrating Large Language Models and other AI capabilities into Spring applications, offering consistent abstractions, support for popular LLM providers, prompt engineering, caching mechanisms, vectorized storage integration, and more. The repository includes official resources, learning materials, code examples, community information, and tools for performance benchmarking.
README:
A curated list of awesome resources, tools, tutorials, and projects for building generative AI applications using Spring AI. This repository aims to help developers leverage the power of Large Language Models (LLMs) within the Spring ecosystem.
- What is Spring AI?
- Official Resources
- Learning Resources
- Code & Examples
- Community
- Tools & Performance
- Contributing
Spring AI is a project from the Spring team that provides a familiar and consistent Spring-style developer experience for building AI applications. It simplifies the integration of Large Language Models and other AI capabilities into Spring applications, offering:
- Consistent abstractions across different AI providers
- Support for popular LLM providers
- Robust prompt engineering capabilities
- Built-in caching and retry mechanisms
- Vectorized storage integration
- Streaming responses
- Customizable model parameters
- Native Spring Boot integration
- Dynamic Tool Updates in Spring AI's Model Context Protocol - How to dynamically update tools available to AI assistants using Spring AI's MCP implementation
- Spring AI Prompt Engineering Patterns - Best practices and patterns for effective prompt engineering in Spring AI applications
- Agentic AI is the future! Agentic AI is now! - Exploring agentic patterns in Spring AI for building autonomous AI systems
- Leverage the Power of 45k, free, Hugging Face Models with Spring AI and Ollama
- Supercharging Your AI Applications with Spring AI Advisors
- Spring AI with NVIDIA LLM API
- Spring AI Embraces OpenAI's Structured Outputs: Enhancing JSON Response Reliability
- Spring AI with Groq - a blazingly fast AI inference engine
- Spring AI with Ollama Tool Support
- Spring AI - Structured Output
- Spring AI - Multimodality - Orbis Sensualium Pictus
- Function Calling in Java and Spring AI using the latest Mistral AI API
- AI Meets Spring Petclinic: Implementing an AI Assistant with Spring AI (Part I)
- AI Meets Spring Petclinic: Implementing an AI Assistant with Spring AI (Part II)
- Spring Pet Klinik - Kotlin
- "Spring AI in Action" by Craig Walls (Manning)
- "Spring AI for Your Organization - GCP Vertex AI Edition" by Muthukumaran Navaneethakrishnan (Leanpub)
- "Beginning Spring AI" by Andrew Lombardi and Joseph Ottinger
- Understanding Tool/Function Calling in LLMs (Step-by-Step Examples in REST and Spring AI - Learn how to implement OpenAI-style tool calling — from raw REST to elegant Spring AI annotations (July 2025)
- Semantic search with embeddings in Spring & Kotlin - Comprehensive guide to the use of embeddings in Spring AI (April 2025)
- Spring AI in Java Applications - Vision for enterprise AI integration with Spring (March 2025)
- Configuring MCP-Client SSE using Spring AI - Technical guide for configuring Server-Sent Events with MCP clients (February 2025)
- Spring AI: A Beginner's Guide (Part 1) & Part 2 - Multi-part walkthrough of Spring AI fundamentals. Part 1 covers integrating chat models (OpenAI/Ollama) in Spring Boot; Part 2 dives into the Advisor API (December 2024)
- Why Spring AI: The Seamless Path to Generative AI - Article explaining the benefits of Spring AI and why you may consider it in favour of other AI frameworks (November 2024)
- Building a Generative AI Application with Spring AI - Project-based learning walkthrough for building a complete Spring AI application (November 2024)
- Spring Boot Meets AI - Practical guide to using OpenAI & Anthropic in a diet-planner application (October 2024)
- Getting Started with Spring AI (Java Code Geeks) - Simple introduction to Spring AI for Java developers (September 2024)
- Getting Started with Spring AI - Introduction to Spring AI's core components and model abstractions (August 2024)
- How to Write GenAI Applications with Java (Foojay.io) - Comprehensive guide covering RAG and Spring AI templates (July 2024)
- Spring AI: Beginner to Guru (Udemy) - Comprehensive course covering Spring AI fundamentals, integration with various LLM providers, prompt engineering, and building AI-powered applications
- Mastering Spring AI: Build AI with Java (Udemy) - Advanced course focused on building production-ready AI applications with Spring AI and Java
- Spring AI for Beginners: Build GenAI & LLM Apps (Udemy) - Step-by-step guide for beginners to create generative AI applications with Spring AI
- Spring AI with Neo4j: Knowledge Graph RAG (Udemy) - Specialized course on implementing Knowledge Graph RAG (Retrieval Augmented Generation) using Spring AI and Neo4j graph database
- Build AI Apps with Spring AI, OpenAI, Ollama & SpringBoot (Udemy) - Hands-on course teaching how to integrate AI capabilities into Spring Boot applications using Spring AI framework and OpenAI (August 2025)
- MCP, it's easy as ABC... -
- Bootiful Spring AI - Thanks to Devnexus for permission to represent this video- March 2025
- What's New in Spring AI M4 • Josh Long - February 2025
- Intelligent Applications with Spring AI • Patrick Baumgartner @ JFokus 2025 - February 2025
- Spring Boot and Vaadin with Spring AI MCP • Marcus Hellberg - February 2025
- Prompt Engineering with Spring AI • Josh Long on Christian Tzolov's Review - April 2025
- Spring AI Deep Dive • Mark Pollack & Josh Long @ Devnexus - April 2025
- Spring AI Course • freeCodeCamp - December 2024
- Building Agents with AWS: Complete Tutorial • Josh Long & James Ward - November 2024
- Supercharging your AI Applications with Spring AI Advisors • Spring Team - October 2024
- Spring AI Is All You Need • Christian Tzolov • GOTO Amsterdam 2024 - June 2024
- Practical GenAI with Spring AI • Rod Johnson @ YOW! 2024 - June 2024
- Introducing Spring AI by Christian Tzolov / Mark Pollack @ Spring I/O 2024 - May 2024
- Bringing GenAI to the Modern Enterprise. A production use-case. In Serverless Java !! • Dan Dobrin • Devoxx Belgium 2024 - May 2024
- Bootiful Spring Boot • Josh Long @ SpringOne 2024 - January 2024
- Bootiful Artificial Intelligence • Josh Long, Mark Pollack & Rod Johnson @ SpringOne 2024 - January 2024
- Spring AI: Seamlessly Integrating AI into Your Enterprise Java Applications - December 2023
- Overview of Spring AI @ Devoxx 2023 - November 2023
- Spring Tips: Spring AI - October 2023
- Introducing Spring AI • Add Generative AI to your Spring Applications - October 2023
- Spring AI at Spring.IO Keynotes - October 2023
- Craig Walls' Spring AI Playlist
- Dan Vega's Playlist
- Devoxx Playlist
- Telusko Spring AI Tutorial Playlist - Comprehensive tutorial series covering Spring AI implementation with OpenAI, Anthropic, and Ollama integration
- AI - Artificial Intelligence Playlist - Collection of videos covering Spring AI and general artificial intelligence concepts and implementations
- Spring AI Zero to Hero Workshop - Example applications showing how to use Spring AI to build Generative AI projects.
- (outdated) Workshop material for Azure OpenAI - contains step-by-step examples from 'hello world' to 'retrieval augmented generation'
- Gemini Workshop for Spring AI Java Developers • Dan Dobrin - workshop materials for the Java developer building Gen AI applications with Gemini models using Spring AI
- Exploring interactions with LLMs : Practical insights with Spring AI - A self-paced workshop designed to practice Spring AI basics and discover interactions with LLMs.
-
Spring AI Samples by Thomas Vitale - Extensive collection of samples showing how to build Java applications powered by Generative AI and Large Language Models (LLMs). Includes examples for different AI models, RAG implementations, and various Spring AI features.
-
Spring AI Examples by Craig Walls - Comprehensive repository with dozens of examples covering all major Spring AI capabilities, model integrations, and implementation patterns. Created by the author of "Spring AI in Action".
-
Spring AI Showcase by Piotr Minkowski - Modular demo project showcasing multiple Spring AI features including prompt templates, chat memory, structured output, function calling, RAG with Pinecone vector store, and image models. Supports multiple AI providers (OpenAI, Mistral, Ollama, Azure OpenAI) with profile-based configuration.
-
Spring PetClinic AI - The classic Spring PetClinic application enhanced with a chatbot powered by Spring AI. Demonstrates natural language interaction with application data, allowing users to query and modify pet clinic information through conversation. Supports both OpenAI and Azure OpenAI as LLM providers. Detailed in a two-part blog series on spring.io.
-
Flight Booking Assistant - Spring AI powered expert system demo that simulates a flight booking assistant. Demonstrates how to build domain-specific AI assistants using Spring AI.
-
Spring AI with QianFan - Spring AI support for various AI language models from QianFan. Shows how to interact with QianFan language models and create a multilingual conversational assistant based on QianFan models.
-
Similarity Search using Spring AI - Implementation of a simple similarity search. Demonstrating how to use Kotlin or Java with Spring-AI to generate embeddings and perform simple similarity searches (March 2025)
-
Spring AI HTMX MCP - Example of building a modern, interactive UI for Spring AI applications using HTMX. Demonstrates how to create a responsive chat interface with minimal JavaScript by leveraging HTMX's server-side rendering capabilities combined with Spring AI's Model Context Protocol.
-
Spring AI Vaadin - Integration of Spring AI with Vaadin, a Java web framework for building modern web applications. Provides components and examples for creating rich, interactive AI-powered UIs with pure Java, without requiring JavaScript or HTML knowledge.
-
DocumentGPT - A RAG-based document query system by Sergi Almar that allows users to upload documents and chat with them using Spring AI's vector search capabilities. Features a web-based user interface for document upload and interactive querying.
-
Spring AI Playground - A web UI designed to make it easy for Java developers to experiment with and integrate AI models. Provides an interactive interface for testing different prompts and models.
-
Spring AI Chat Bot CLI - Command-line chatbot with Retrieval-Augmented Generation (RAG) and conversational memory capabilities. Demonstrates how to build interactive CLI applications with Spring AI.
-
Spring AI Powered Local CLI Chat Bot - A fully local, Spring AI-powered CLI chatbot that runs entirely on your machine with no external services required. Perfect for offline development or privacy-sensitive applications.
- Spring AI Alibaba - An extension of Spring AI that provides an agentic AI framework for Java developers. Adds support for Alibaba Cloud QWen models and Dashscope services, along with additional features like conversation memory, RAG support, and function calling. Maintains compatibility with the Spring AI API while offering specialized capabilities for Alibaba Cloud's AI ecosystem.
- Arconia Ollama Dev Service - A Spring Boot development service that automatically manages Ollama instances for local LLM development. Simplifies testing and development with local models by handling container lifecycle and configuration. Integrates seamlessly with Spring AI's Ollama support.
- MCP Client Documentation - Official documentation for implementing the Model Context Protocol client in Spring AI applications.
- MCP Client Examples - Comprehensive examples showcasing the Model Context Protocol implementation in Spring AI, including client-server communication, tool discovery, filesystem operations, weather services, web search integration, and dynamic tool updates.
- MCP Annotations - Annotation-based programming model for implementing MCP servers and clients. Provides a clean, declarative approach to handling MCP operations with reduced boilerplate code. Includes core annotations that depend only on the MCP Java SDK and a Spring AI integration module.
- Spring Batch MCP Server - An MCP service for introspecting Spring Batch applications, providing AI assistants with access to batch job information.
- Spring Cloud Config MCP Server - An experimental MCP server implementation for Spring Cloud Config that exposes configuration management operations as AI tools, allowing AI assistants to retrieve, update, and refresh application configurations, as well as encrypt/decrypt sensitive values.
- JVM Diagnostics MCP - A Model Context Protocol service for obtaining JVM diagnostics, allowing AI assistants to access runtime information about Java applications.
- Kotlin Crypto Price MCP Server - A Kotlin-based Spring AI MCP server that provides real-time cryptocurrency price information from Binance.
- Spring AI MCP Database Integration Example - A practical implementation of MCP with Spring AI featuring two server applications exposing database operations (person and account data) via @Tool annotations and a client application that discovers and uses these tools with OpenAI models.
- GitHub MCP Application - A 100% Java GitHub MCP application built on Spring AI by Stephan Janssen, creator of Devoxx.
- AWS Sample MCP Demos - Collection of examples showing how to use Model Context Protocol with AWS services, including Spring AI implementations.
- Christian Tzolov
- Josh Long
- Dan Vega
- Thomas Vitale
- Dan Dobrin
- Marcus Hellberg
- Lize Roes
- Bouke Nijhuis
- Guillaume Laforge
- Brian Sam-Bodden
- Adib Saikali
- Clémentine Fourrier
- Craig Walls
- Ilja Leyberman
- This Day in AI
- Practical AI from Changelog
- Latent Space
- Your Undivided Attention
- TWIML (This Week in Machine Learning)
- Gradient Decent
- Spring Office Hours
- Bootiful Podcast
Your contributions are always welcome! Please read the contribution guidelines first.
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for awesome-spring-ai
Similar Open Source Tools
awesome-spring-ai
Awesome Spring AI is a curated list of resources, tools, tutorials, and projects for building generative AI applications using Spring AI. It provides a familiar developer experience for integrating Large Language Models and other AI capabilities into Spring applications, offering consistent abstractions, support for popular LLM providers, prompt engineering, caching mechanisms, vectorized storage integration, and more. The repository includes official resources, learning materials, code examples, community information, and tools for performance benchmarking.
spring-ai-alibaba
Spring AI Alibaba is an AI application framework for Java developers that seamlessly integrates with Alibaba Cloud QWen LLM services and cloud-native infrastructures. It provides features like support for various AI models, high-level AI agent abstraction, function calling, and RAG support. The framework aims to simplify the development, evaluation, deployment, and observability of AI native Java applications. It offers open-source framework and ecosystem integrations to support features like prompt template management, event-driven AI applications, and more.
awesome-ai-tools
Awesome AI Tools is a curated list of popular tools and resources for artificial intelligence enthusiasts. It includes a wide range of tools such as machine learning libraries, deep learning frameworks, data visualization tools, and natural language processing resources. Whether you are a beginner or an experienced AI practitioner, this repository aims to provide you with a comprehensive collection of tools to enhance your AI projects and research. Explore the list to discover new tools, stay updated with the latest advancements in AI technology, and find the right resources to support your AI endeavors.
awesome-generative-ai
Awesome Generative AI is a curated list of modern Generative Artificial Intelligence projects and services. Generative AI technology creates original content like images, sounds, and texts using machine learning algorithms trained on large data sets. It can produce unique and realistic outputs such as photorealistic images, digital art, music, and writing. The repo covers a wide range of applications in art, entertainment, marketing, academia, and computer science.
mo-ai-studio
Mo AI Studio is an enterprise-level AI agent running platform that enables the operation of customized intelligent AI agents with system-level capabilities. It supports various IDEs and programming languages, allows modification of multiple files with reasoning, cross-project context modifications, customizable agents, system-level file operations, document writing, question answering, knowledge sharing, and flexible output processors. The platform also offers various setters and a custom component publishing feature. Mo AI Studio is a fusion of artificial intelligence and human creativity, designed to bring unprecedented efficiency and innovation to enterprises.
awesome-openvino
Awesome OpenVINO is a curated list of AI projects based on the OpenVINO toolkit, offering a rich assortment of projects, libraries, and tutorials covering various topics like model optimization, deployment, and real-world applications across industries. It serves as a valuable resource continuously updated to maximize the potential of OpenVINO in projects, featuring projects like Stable Diffusion web UI, Visioncom, FastSD CPU, OpenVINO AI Plugins for GIMP, and more.
awesome-AI-driven-development
Awesome AI-Driven Development is a curated list of tools, frameworks, and resources for AI-driven development. It includes AI code editors, terminal-based coding agents, IDE plugins & extensions, multi-agent systems, code generation & templates, testing & quality assurance tools, Model Context Protocol implementations, pull request & code review tools, project management & documentation tools, language models for code, development workflows tools, code search & analysis tools, specialized tools for Git & version control, cloud & DevOps, language-specific tasks, terminal & shell utilities, prompt & context management tools, Copilot extensions & alternatives, learning & tutorials resources, and configuration & enhancement tools for AI coding assistants.
lsp-ai
LSP-AI is an open source language server designed to enhance software engineers' productivity by integrating AI-powered functionality into various text editors. It serves as a backend for completion with large language models and offers features like unified AI capabilities, simplified plugin development, enhanced collaboration, broad compatibility with editors supporting Language Server Protocol, flexible LLM backend support, and commitment to staying updated with the latest advancements in LLM-driven software development. The tool aims to centralize open-source development work, provide a collaborative platform for developers, and offer a future-ready solution for AI-powered assistants in text editors.
Midori-AI
Midori AI is a cutting-edge initiative dedicated to advancing the field of artificial intelligence through research, development, and community engagement. They focus on creating innovative AI solutions, exploring novel approaches, and empowering users to harness the power of AI. Key areas of focus include cluster-based AI, AI setup assistance, AI development for Discord bots, model serving and hosting, novel AI memory architectures, and Carly - a fully simulated human with advanced AI capabilities. They have also developed the Midori AI Subsystem to streamline AI workloads by providing simplified deployment, standardized configurations, isolation for AI systems, and a growing library of backends and tools.
llmariner
LLMariner is an extensible open source platform built on Kubernetes to simplify the management of generative AI workloads. It enables efficient handling of training and inference data within clusters, with OpenAI-compatible APIs for seamless integration with a wide range of AI-driven applications.
web-llm-chat
WebLLM Chat is a private AI chat interface that combines WebLLM with a user-friendly design, leveraging WebGPU to run large language models natively in your browser. It offers browser-native AI experience with WebGPU acceleration, guaranteed privacy as all data processing happens locally, offline accessibility, user-friendly interface with markdown support, and open-source customization. The project aims to democratize AI technology by making powerful tools accessible directly to end-users, enhancing the chatting experience and broadening the scope for deployment of self-hosted and customizable language models.
awesome-ai
Awesome AI is a curated list of artificial intelligence resources including courses, tools, apps, and open-source projects. It covers a wide range of topics such as machine learning, deep learning, natural language processing, robotics, conversational interfaces, data science, and more. The repository serves as a comprehensive guide for individuals interested in exploring the field of artificial intelligence and its applications across various domains.
awesome-generative-ai
A curated list of Generative AI projects, tools, artworks, and models
learn-modern-ai-python
This repository is part of the Certified Agentic & Robotic AI Engineer program, covering the first quarter of the course work. It focuses on Modern AI Python Programming, emphasizing static typing for robust and scalable AI development. The course includes modules on Python fundamentals, object-oriented programming, advanced Python concepts, AI-assisted Python programming, web application basics with Python, and the future of Python in AI. Upon completion, students will be able to write proficient Modern Python code, apply OOP principles, implement asynchronous programming, utilize AI-powered tools, develop basic web applications, and understand the future directions of Python in AI.
Symposium2023
Symposium2023 is a project aimed at enabling Delphi users to incorporate AI technology into their applications. It provides generalized interfaces to different AI models, making them easily accessible. The project showcases AI's versatility in tasks like language translation, human-like conversations, image generation, data analysis, and more. Users can experiment with different AI models, change providers easily, and avoid vendor lock-in. The project supports various AI features like vision support and function calling, utilizing providers like Google, Microsoft Azure, Amazon, OpenAI, and more. It includes example programs demonstrating tasks such as text-to-speech, language translation, face detection, weather querying, audio transcription, voice recognition, image generation, invoice processing, and API testing. The project also hints at potential future research areas like using embeddings for data search and integrating Python AI libraries with Delphi.
For similar tasks
Flowise
Flowise is a tool that allows users to build customized LLM flows with a drag-and-drop UI. It is open-source and self-hostable, and it supports various deployments, including AWS, Azure, Digital Ocean, GCP, Railway, Render, HuggingFace Spaces, Elestio, Sealos, and RepoCloud. Flowise has three different modules in a single mono repository: server, ui, and components. The server module is a Node backend that serves API logics, the ui module is a React frontend, and the components module contains third-party node integrations. Flowise supports different environment variables to configure your instance, and you can specify these variables in the .env file inside the packages/server folder.
nlux
nlux is an open-source Javascript and React JS library that makes it super simple to integrate powerful large language models (LLMs) like ChatGPT into your web app or website. With just a few lines of code, you can add conversational AI capabilities and interact with your favourite LLM.
generative-ai-go
The Google AI Go SDK enables developers to use Google's state-of-the-art generative AI models (like Gemini) to build AI-powered features and applications. It supports use cases like generating text from text-only input, generating text from text-and-images input (multimodal), building multi-turn conversations (chat), and embedding.
awesome-langchain-zh
The awesome-langchain-zh repository is a collection of resources related to LangChain, a framework for building AI applications using large language models (LLMs). The repository includes sections on the LangChain framework itself, other language ports of LangChain, tools for low-code development, services, agents, templates, platforms, open-source projects related to knowledge management and chatbots, as well as learning resources such as notebooks, videos, and articles. It also covers other LLM frameworks and provides additional resources for exploring and working with LLMs. The repository serves as a comprehensive guide for developers and AI enthusiasts interested in leveraging LangChain and LLMs for various applications.
Large-Language-Model-Notebooks-Course
This practical free hands-on course focuses on Large Language models and their applications, providing a hands-on experience using models from OpenAI and the Hugging Face library. The course is divided into three major sections: Techniques and Libraries, Projects, and Enterprise Solutions. It covers topics such as Chatbots, Code Generation, Vector databases, LangChain, Fine Tuning, PEFT Fine Tuning, Soft Prompt tuning, LoRA, QLoRA, Evaluate Models, Knowledge Distillation, and more. Each section contains chapters with lessons supported by notebooks and articles. The course aims to help users build projects and explore enterprise solutions using Large Language Models.
ai-chatbot
Next.js AI Chatbot is an open-source app template for building AI chatbots using Next.js, Vercel AI SDK, OpenAI, and Vercel KV. It includes features like Next.js App Router, React Server Components, Vercel AI SDK for streaming chat UI, support for various AI models, Tailwind CSS styling, Radix UI for headless components, chat history management, rate limiting, session storage with Vercel KV, and authentication with NextAuth.js. The template allows easy deployment to Vercel and customization of AI model providers.
awesome-local-llms
The 'awesome-local-llms' repository is a curated list of open-source tools for local Large Language Model (LLM) inference, covering both proprietary and open weights LLMs. The repository categorizes these tools into LLM inference backend engines, LLM front end UIs, and all-in-one desktop applications. It collects GitHub repository metrics as proxies for popularity and active maintenance. Contributions are encouraged, and users can suggest additional open-source repositories through the Issues section or by running a provided script to update the README and make a pull request. The repository aims to provide a comprehensive resource for exploring and utilizing local LLM tools.
Awesome-AI-Data-Guided-Projects
A curated list of data science & AI guided projects to start building your portfolio. The repository contains guided projects covering various topics such as large language models, time series analysis, computer vision, natural language processing (NLP), and data science. Each project provides detailed instructions on how to implement specific tasks using different tools and technologies.
For similar jobs
promptflow
**Prompt flow** is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.
deepeval
DeepEval is a simple-to-use, open-source LLM evaluation framework specialized for unit testing LLM outputs. It incorporates various metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., and runs locally on your machine for evaluation. It provides a wide range of ready-to-use evaluation metrics, allows for creating custom metrics, integrates with any CI/CD environment, and enables benchmarking LLMs on popular benchmarks. DeepEval is designed for evaluating RAG and fine-tuning applications, helping users optimize hyperparameters, prevent prompt drifting, and transition from OpenAI to hosting their own Llama2 with confidence.
MegaDetector
MegaDetector is an AI model that identifies animals, people, and vehicles in camera trap images (which also makes it useful for eliminating blank images). This model is trained on several million images from a variety of ecosystems. MegaDetector is just one of many tools that aims to make conservation biologists more efficient with AI. If you want to learn about other ways to use AI to accelerate camera trap workflows, check out our of the field, affectionately titled "Everything I know about machine learning and camera traps".
leapfrogai
LeapfrogAI is a self-hosted AI platform designed to be deployed in air-gapped resource-constrained environments. It brings sophisticated AI solutions to these environments by hosting all the necessary components of an AI stack, including vector databases, model backends, API, and UI. LeapfrogAI's API closely matches that of OpenAI, allowing tools built for OpenAI/ChatGPT to function seamlessly with a LeapfrogAI backend. It provides several backends for various use cases, including llama-cpp-python, whisper, text-embeddings, and vllm. LeapfrogAI leverages Chainguard's apko to harden base python images, ensuring the latest supported Python versions are used by the other components of the stack. The LeapfrogAI SDK provides a standard set of protobuffs and python utilities for implementing backends and gRPC. LeapfrogAI offers UI options for common use-cases like chat, summarization, and transcription. It can be deployed and run locally via UDS and Kubernetes, built out using Zarf packages. LeapfrogAI is supported by a community of users and contributors, including Defense Unicorns, Beast Code, Chainguard, Exovera, Hypergiant, Pulze, SOSi, United States Navy, United States Air Force, and United States Space Force.
llava-docker
This Docker image for LLaVA (Large Language and Vision Assistant) provides a convenient way to run LLaVA locally or on RunPod. LLaVA is a powerful AI tool that combines natural language processing and computer vision capabilities. With this Docker image, you can easily access LLaVA's functionalities for various tasks, including image captioning, visual question answering, text summarization, and more. The image comes pre-installed with LLaVA v1.2.0, Torch 2.1.2, xformers 0.0.23.post1, and other necessary dependencies. You can customize the model used by setting the MODEL environment variable. The image also includes a Jupyter Lab environment for interactive development and exploration. Overall, this Docker image offers a comprehensive and user-friendly platform for leveraging LLaVA's capabilities.
carrot
The 'carrot' repository on GitHub provides a list of free and user-friendly ChatGPT mirror sites for easy access. The repository includes sponsored sites offering various GPT models and services. Users can find and share sites, report errors, and access stable and recommended sites for ChatGPT usage. The repository also includes a detailed list of ChatGPT sites, their features, and accessibility options, making it a valuable resource for ChatGPT users seeking free and unlimited GPT services.
TrustLLM
TrustLLM is a comprehensive study of trustworthiness in LLMs, including principles for different dimensions of trustworthiness, established benchmark, evaluation, and analysis of trustworthiness for mainstream LLMs, and discussion of open challenges and future directions. Specifically, we first propose a set of principles for trustworthy LLMs that span eight different dimensions. Based on these principles, we further establish a benchmark across six dimensions including truthfulness, safety, fairness, robustness, privacy, and machine ethics. We then present a study evaluating 16 mainstream LLMs in TrustLLM, consisting of over 30 datasets. The document explains how to use the trustllm python package to help you assess the performance of your LLM in trustworthiness more quickly. For more details about TrustLLM, please refer to project website.
AI-YinMei
AI-YinMei is an AI virtual anchor Vtuber development tool (N card version). It supports fastgpt knowledge base chat dialogue, a complete set of solutions for LLM large language models: [fastgpt] + [one-api] + [Xinference], supports docking bilibili live broadcast barrage reply and entering live broadcast welcome speech, supports Microsoft edge-tts speech synthesis, supports Bert-VITS2 speech synthesis, supports GPT-SoVITS speech synthesis, supports expression control Vtuber Studio, supports painting stable-diffusion-webui output OBS live broadcast room, supports painting picture pornography public-NSFW-y-distinguish, supports search and image search service duckduckgo (requires magic Internet access), supports image search service Baidu image search (no magic Internet access), supports AI reply chat box [html plug-in], supports AI singing Auto-Convert-Music, supports playlist [html plug-in], supports dancing function, supports expression video playback, supports head touching action, supports gift smashing action, supports singing automatic start dancing function, chat and singing automatic cycle swing action, supports multi scene switching, background music switching, day and night automatic switching scene, supports open singing and painting, let AI automatically judge the content.