
llm4s
Agentic and LLM Programming in Scala
Stars: 126

LLM4S provides a simple, robust, and scalable framework for building Large Language Models (LLM) applications in Scala. It aims to leverage Scala's type safety, functional programming, JVM ecosystem, concurrency, and performance advantages to create reliable and maintainable AI-powered applications. The framework supports multi-provider integration, execution environments, error handling, Model Context Protocol (MCP) support, agent frameworks, multimodal generation, and Retrieval-Augmented Generation (RAG) workflows. It also offers observability features like detailed trace logging, monitoring, and analytics for debugging and performance insights.
README:
## Project Momentum
LLM4S provides a simple, robust, and scalable framework for building LLM applications in Scala. While most LLM work is done in Python, we believe that Scala offers a fundamentally better foundation for building reliable, maintainable AI-powered applications.
Note: This is a work in progress project and is likely to change significantly over time.
- Type Safety: Catch errors at compile time, not in production.
- Functional Programming: Immutable data and pure functions for predictable, maintainable systems.
- JVM Ecosystem: Access to mature, production-grade libraries and tooling.
- Concurrency: Advanced models for safe, efficient parallelism.
- Performance: JVM speed with functional elegance.
- Ecosystem Interoperability: Seamless integration with enterprise JVM systems and cloud-native tooling.
- Multi-Provider Support: Connect seamlessly to multiple LLM providers (OpenAI, Anthropic, etc.) with planned expansion.
- Execution Environments: Run LLM-driven operations in secure, containerized or non-containerized setups.
- Error Handling: Robust mechanisms to catch, log, and recover from failures gracefully.
- MCP Support: Integration with Model Context Protocol for richer context management.
- Agent Framework: Build single or multi-agent workflows with standardized interfaces.
- Multimodal Generation: Support for text, image, voice, and other LLM modalities.
- RAG (Retrieval-Augmented Generation): Built-in tools for search, embedding, and retrieval workflows.
- Observability: Detailed trace logging, monitoring, and analytics for debugging and performance insights.
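As a taste of the error-handling style these bullets describe, here is a minimal, self-contained sketch in plain Scala. The names (`LLMError`, `complete`, and so on) are invented for illustration and are not llm4s's actual API; the point is that failures are ordinary values, so the compiler forces callers to handle them.

```scala
// Illustrative only: errors-as-values in plain Scala.
// These names are hypothetical, not the real llm4s API.
sealed trait LLMError
final case class RateLimited(retryAfterSeconds: Int) extends LLMError
final case class InvalidRequest(message: String)     extends LLMError

// A stand-in for a provider call that can fail.
def complete(prompt: String): Either[LLMError, String] =
  if (prompt.isEmpty) Left(InvalidRequest("empty prompt"))
  else Right(s"echo: $prompt")

// The caller must deal with every branch; omitting a case in a match on a
// sealed hierarchy is a compile-time warning, not a production surprise.
val reply: String = complete("hello") match {
  case Right(text)                => text
  case Left(InvalidRequest(msg))  => s"bad request: $msg"
  case Left(RateLimited(seconds)) => s"retry in ${seconds}s"
}
```

This is the general pattern behind the "Type Safety" and "Error Handling" claims above: no exceptions crossing API boundaries, and every failure mode visible in the return type.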
┌───────────────────────────┐
│      LLM4S API Layer      │
└─────────────┬─────────────┘
              │
  Multi-Provider Connector
(OpenAI | Anthropic | Future...)
              │
    ┌─────────┴─────────┐
    │ Execution Manager │
    └─────────┬─────────┘
              │
   ┌──────────┴──────────┐
   │   Agent Framework   │
   └──────────┬──────────┘
              │
 ┌────────────┴────────────┐
 │  RAG Engine + Tooling   │
 └────────────┬────────────┘
              │
┌─────────────┴─────────────┐
│    Observability Layer    │
└───────────────────────────┘
- llm4s: Main project - contains the core LLM4S framework
- shared: Shared code between main project and workspace runner
- workspaceRunner: Code that performs the requested actions on the workspace within the docker container
- samples: Usage examples demonstrating various features
To get started with the LLM4S project, check out this teaser talk presented by Kannupriya Kalra at the Bay Area Scala Conference. This recording is essential for understanding where we’re headed:
🎥 Teaser Talk: https://www.youtube.com/watch?v=SXybj2P3_DE&ab_channel=SalarRahmanian
LLM4S was officially introduced at the Bay Area Scala Conference in San Francisco on February 25, 2025.
To ensure code quality, we use a Git pre-commit hook that automatically checks code formatting and runs tests before allowing commits:
# Install the pre-commit hook
./hooks/install.sh
# The hook will automatically:
# - Check code formatting with scalafmt
# - Compile code for both Scala 2.13 and 3
# - Run tests for both Scala versions
# To skip the hook temporarily (not recommended):
# git commit --no-verify
- JDK 21+
- SBT
- Docker (for containerized workspace)
sbt compile
# For all supported Scala versions (2.13 and 3)
sbt +compile
# Build and test all versions
sbt buildAll
You will need an API key for either OpenAI (https://platform.openai.com/) or Anthropic (https://console.anthropic.com/). Other LLM providers may be supported in the future (see the backlog).
Set the environment variables:
LLM_MODEL=openai/gpt-4o
OPENAI_API_KEY=<your_openai_api_key>
or Anthropic:
LLM_MODEL=anthropic/claude-3-7-sonnet-latest
ANTHROPIC_API_KEY=<your_anthropic_api_key>
or OpenRouter:
LLM_MODEL=openai/gpt-4o
OPENAI_API_KEY=<your_openai_api_key>
OPENAI_BASE_URL=https://openrouter.ai/api/v1
This will allow you to run the non-containerized examples.
# Using Scala 3
sbt "samples/runMain org.llm4s.samples.basic.BasicLLMCallingExample"
sbt docker:publishLocal
sbt "samples/runMain org.llm4s.samples.workspace.ContainerisedWorkspaceDemo"
# Using Scala 2.13
sbt ++2.13.14 "samples/runMain org.llm4s.samples.basic.BasicLLMCallingExample"
LLM4S supports both Scala 2.13 and Scala 3.3. The codebase is set up to handle version-specific code through source directories:
- src/main/scala: common code for both Scala 2.13 and 3.3
- src/main/scala-2.13: Scala 2.13 specific code
- src/main/scala-3: Scala 3 specific code
When you need to use version-specific features, place the code in the appropriate directory.
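For example (a hypothetical shim, not actual llm4s code), a small ADT could use a Scala 3 enum in src/main/scala-3 while src/main/scala-2.13 supplies an equivalent sealed trait, so shared code under src/main/scala compiles against the same name either way:

```scala
// src/main/scala-3/Provider.scala (Scala 3 only: enums)
enum Provider {
  case OpenAI, Anthropic
}

// src/main/scala-2.13/Provider.scala would define the same shape without enums:
//   sealed trait Provider
//   object Provider {
//     case object OpenAI    extends Provider
//     case object Anthropic extends Provider
//   }

// Shared code under src/main/scala can then use Provider without caring
// which source directory supplied the definition.
def keyEnvVar(p: Provider): String = p match {
  case Provider.OpenAI    => "OPENAI_API_KEY"
  case Provider.Anthropic => "ANTHROPIC_API_KEY"
}
```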
We've added convenient aliases for cross-compilation:
# Compile for all Scala versions
sbt compileAll
# Test all Scala versions
sbt testAll
# Both compile and test
sbt buildAll
# Publish for all versions
sbt publishAll
We use specialized test projects to verify cross-version compatibility against the published artifacts. These tests ensure that the library works correctly across different Scala versions by testing against actual published JARs rather than local target directories.
# Run tests for both Scala 2 and 3 against published JARs
sbt testCross
# Full clean, publish, and test verification
sbt fullCrossTest
Note: For detailed information about our cross-testing strategy and setup, see crossTest/README.md
Our goal is to implement Scala equivalents of popular Python LLM frameworks, with multi-provider, multimodal, and observability-first design as core principles:
- [ ] Single API access to multiple LLM providers (like LiteLLM) — llmconnect
- [ ] Comprehensive toolchain for building LLM apps (LangChain/LangGraph equivalent)
- [ ] RAG search & retrieval
- [ ] Tool calling
- [ ] Logging, tracking, and monitoring
- [ ] Agentic framework (like PydanticAI, CrewAI)
- [ ] Single-agent and multi-agent orchestration
- [ ] Tokenization utilities (Scala port of tiktoken)
- [ ] Examples for all supported modalities and workflows
- [ ] Stable platform with extensive test coverage
- [ ] Scala Coding SWE Agent — perform SWE Bench–type tasks on Scala codebases
- [ ] Code maps, code generation, and library templates
Tool calling is a critical integration — designed to work seamlessly with multi-provider support and agent frameworks.
We use ScalaMeta to auto-generate tool definitions, support dynamic mapping, and run in secure execution environments.
Tools can run:
- In containerized sandboxes for isolation and safety.
- In multi-modal pipelines where LLMs interact with text, images, and voice.
- With observability hooks for trace analysis.
Using ScalaMeta to automatically generate tool definitions from Scala methods:
/** My tool does some funky things with a & b...
* @param a The first thing
* @param b The second thing
*/
def myTool(a: Int, b: String): ToolResponse = {
  // Implementation
}
ScalaMeta extracts method parameters, types, and documentation to generate OpenAI-compatible tool definitions.
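For a sense of the target format, the definition derived from myTool would look roughly like the hand-written JSON below. It follows OpenAI's published function-calling schema; the exact output llm4s generates may differ.

```scala
// Roughly the OpenAI-style tool definition that could be derived from
// myTool's signature and scaladoc (illustrative, hand-written JSON).
val toolDefinition: String =
  """{
    |  "type": "function",
    |  "function": {
    |    "name": "myTool",
    |    "description": "My tool does some funky things with a & b...",
    |    "parameters": {
    |      "type": "object",
    |      "properties": {
    |        "a": { "type": "integer", "description": "The first thing" },
    |        "b": { "type": "string", "description": "The second thing" }
    |      },
    |      "required": ["a", "b"]
    |    }
    |  }
    |}""".stripMargin
```

Note how the parameter names, JSON types, and descriptions all come from information already present in the Scala method: its signature and its scaladoc.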
Mapping LLM tool call requests to actual method invocations through:
- Code generation
- Reflection-based approaches
- ScalaMeta-based parameter mapping
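One way to picture the code-generation approach is a typed registry of handlers (a hypothetical sketch; the names here are invented for illustration, not llm4s internals): each tool is registered as a plain function from a parsed argument map to a result, so dispatch needs no runtime reflection.

```scala
// Hypothetical dispatch sketch: tool calls resolved through a typed registry.
type Args = Map[String, Any]

final case class ToolCall(name: String, args: Args)

// Each entry validates its own arguments and returns Either for failures.
val registry: Map[String, Args => Either[String, String]] = Map(
  "myTool" -> { args =>
    for {
      a <- args.get("a").collect { case n: Int => n }
             .toRight("missing or invalid 'a'")
      b <- args.get("b").collect { case s: String => s }
             .toRight("missing or invalid 'b'")
    } yield s"myTool($a, $b) handled"
  }
)

// Unknown tool names and bad arguments surface as Left values,
// never as thrown exceptions.
def dispatch(call: ToolCall): Either[String, String] =
  registry.get(call.name)
    .toRight(s"unknown tool: ${call.name}")
    .flatMap(run => run(call.args))
```

In a code-generation setup, the registry entries themselves would be generated from the ScalaMeta-extracted signatures rather than written by hand.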
Tools run in a protected Docker container environment to prevent accidental system damage or data leakage.
Tracing isn’t just for debugging — it’s the backbone of understanding model behavior.
LLM4S’s observability layer includes:
- Detailed token usage reporting
- Multi-backend trace output (Langfuse, console, none)
- Agent state visualization
- Integration with monitoring dashboards
Configure tracing behavior using the TRACING_MODE environment variable:
# Send traces to Langfuse (default)
TRACING_MODE=langfuse
LANGFUSE_PUBLIC_KEY=pk-lf-your-key
LANGFUSE_SECRET_KEY=sk-lf-your-secret
# Print detailed traces to console with colors and token usage
TRACING_MODE=print
# Disable tracing completely
TRACING_MODE=none
import org.llm4s.trace.Tracing
// Create tracer based on TRACING_MODE environment variable
val tracer = Tracing.create()
// Trace events, completions, and token usage
tracer.traceEvent("Starting LLM operation")
tracer.traceCompletion(completion, model)
tracer.traceTokenUsage(tokenUsage, model, "chat-completion")
tracer.traceAgentState(agentState)
Note: The LLM4S template has moved to its own repository for better maintainability and independent versioning.
The llm4s.g8 starter kit helps you quickly create AI-powered applications using llm4s, with improved SDK usability and developer ergonomics. You can now spin up a fully working Scala project with a single sbt command.
The starter kit comes pre-configured with best practices, prompt execution examples, CI, formatting hooks, unit testing, documentation, and cross-platform support.
Template Repository: github.com/llm4s/llm4s.g8
Using sbt, do:
sbt new llm4s/llm4s.g8 \
  --name=<your.project.name> \
  --package=<your.organization> \
  --version=0.1.0-SNAPSHOT \
  --llm4s_version=<llm4s.version> \
  --scala_version=<scala.version> \
  --munit_version=<munit.version> \
  --directory=<your.project.name> \
  --force
# At the time of writing, the latest llm4s version is 0.1.1 and the latest
# munit version is 1.1.1; scala_version may be a 2.x.x or 3.x.x release.
to create a new project.
For more information about the template, including the compatibility matrix, visit the template repository; its comprehensive documentation will get you started with the starter kit.
LLM4S uses GitHub Actions for continuous integration to ensure code quality and compatibility across different platforms and Scala versions.
Our unified CI workflow runs on every push and pull request to main/master branches:
- Quick Checks: Fast-failing checks for code formatting and compilation
- Cross-Platform Testing: Tests run on Ubuntu and Windows with Scala 2.13.16 and 3.7.1
- Template Validation: Verifies the g8 template works correctly
- Caching: Optimized caching strategy with Coursier for faster builds
Automated AI-powered code review for pull requests:
- Automatic Reviews: Trusted PRs get automatic Claude reviews
- Security: External PRs require manual trigger by maintainers
- Manual Trigger: Maintainers can request reviews with an @claude comment
Automated release process triggered by version tags (format: v0.1.11):
- Tag Format: Must use the v prefix (e.g., v0.1.11, not 0.1.11)
- Pre-release Checks: Runs full CI suite before publishing
- GPG Signing: Artifacts are signed for security
- Maven Central: Publishes to Sonatype/Maven Central
See RELEASE.md for detailed release instructions.
You can run the same checks locally before pushing:
# Check formatting
sbt scalafmtCheckAll
# Compile all Scala versions
sbt +compile
# Run all tests
sbt +test
# Full build (compile + test)
sbt buildAll
Stay hands-on with LLM4S! Join us for interactive mob programming sessions, live debugging, and open-source collaboration. These events are great for developers, contributors, and anyone curious about Scala + GenAI.
🗓️ Weekly live coding and collaboration during LLM4S Dev Hour — join us every Sunday on Discord!
Date | Session Title | Description | Location | Hosts | Details URL | Featured In |
---|---|---|---|---|---|---|
20-Jul-2025 onwards (Weekly Sundays) | 🗓️ LLM4S Dev Hour — Weekly Live Coding & Collaboration | A weekly mob programming session where we code, debug, and learn together — open to all! 📌 Link to join is shared by the host in the #general Discord channel before each session. | Online, London, UK (9am local time) | Kannupriya Kalra, Rory Graves | LinkedIn Bluesky Mastodon X/Twitter | Scala Times – Issue #537 |
See the talks being given by maintainers and open source developers globally and witness the engagement by developers around the world.
Stay updated with talks, workshops, and presentations about LLM4S happening globally. These sessions dive into the architecture, features, and future plans of the project.
Snapshots from LLM4S talks held around the world 🌍.
Date | Event/Conference | Talk Title | Location | Speaker Name | Details URL | Recording Link URL | Featured In |
---|---|---|---|---|---|---|---|
25-Feb-2025 | Bay Area Scala | Let's Teach LLMs to Write Great Scala! (Original version) | Tubi office, San Francisco, CA, USA 🇺🇸 | Kannupriya Kalra | Event Info Reddit Discussion Mastodon Post Bluesky Post X/Twitter Post Meetup Event | Watch Recording | – |
20-Apr-2025 | Scala India | Let's Teach LLMs to Write Great Scala! (Updated from Feb 2025) | India 🇮🇳 | Kannupriya Kalra | Event Info Reddit Discussion X/Twitter Post | Watch Recording | – |
28-May-2025 | Functional World 2025 by Scalac | Let's Teach LLMs to Write Great Scala! (Updated from Apr 2025) | Gdansk, Poland 🇵🇱 | Kannupriya Kalra | LinkedIn Post 1 LinkedIn Post 2 Reddit Discussion Meetup Link X/Twitter Post | Watch Recording | Scalendar (May 2025) Scala Times 1 Scala Times 2 |
13-Jun-2025 | Dallas Scala Enthusiasts | Let's Teach LLMs to Write Great Scala! (Updated from May 2025) | Dallas, Texas, USA 🇺🇸 | Kannupriya Kalra | Meetup Event LinkedIn Post X/Twitter Post Reddit Discussion Bluesky Post Mastodon Post | Watch Recording | Scalendar (June 2025) |
13-Aug-2025 | London Scala Users Group | Scala Meets GenAI: Build the Cool Stuff with LLM4S | The Trade Desk office, London, UK 🇬🇧 | Kannupriya Kalra, Rory Graves | Meetup Event X/Twitter Post Bluesky Post LinkedIn Post | Recording will be posted once the event is done | Scalendar (August 2025) |
21-Aug-2025 | Scala Days 2025 | Scala Meets GenAI: Build the Cool Stuff with LLM4S | SwissTech Convention Center, EPFL campus, Lausanne, Switzerland 🇨🇭 | Kannupriya Kalra, Rory Graves | Talk Info LinkedIn Post X/Twitter Post Reddit Discussion Bluesky Post Mastodon Post | Recording will be posted once the event is done | Scala Days 2025: August in Lausanne – Code, Community & Innovation Scalendar (August 2025) Scala Days 2025 LinkedIn Post |
25-Aug-2025 | Zürich Scala Enthusiasts | Fork It Till You Make It: Career Building with Scala OSS | Rivero AG, ABB Historic Building, Elias-Canetti-Strasse 7, Zürich, Switzerland 🇨🇭 | Kannupriya Kalra | Meetup Event LinkedIn Post X/Twitter Post Bluesky Post Mastodon Post Reddit Discussion | Recording will be posted once the event is done | Scalendar (August 2025) |
23-25-Oct-2025 | Google Summer Of Code Mentor Summit 2025 | LLM4S x GSoC 2025: Engineering GenAI Agents in Functional Scala | Munich Marriott Hotel, Berliner Str. 93, 80805 München, Germany 🇩🇪 | Kannupriya Kalra | Event Website | Recording will be posted once the event is done | – |
📝 Want to invite us for a talk or workshop? Reach out via our respective emails or connect on Discord: https://discord.gg/4uvTPn6qww
- Build AI-powered applications in a statically typed, functional language designed for large systems.
- Help shape the Scala ecosystem’s future in the AI/LLM space.
- Learn modern LLM techniques like zero-shot prompting, tool calling, and agentic workflows.
- Collaborate with experienced Scala engineers and open-source contributors.
- Gain real-world experience working with Dockerized environments and multi-LLM providers.
- Contribute to a project that offers you the opportunity to become a mentor or contributor funded by Google through its Google Summer of Code (GSoC) program.
- Join a global developer community focused on type-safe, maintainable AI systems.
Interested in contributing? Start here:
LLM4S GitHub Issues: https://lnkd.in/eXrhwgWY
Want to be part of developing this and interact with other developers? Join our Discord community!
LLM4S Discord: https://lnkd.in/eb4ZFdtG
LLM4S was selected for GSoC 2025 under the Scala Center Organisation.
This project is also participating in Google Summer of Code (GSoC) 2025! If you're interested in contributing to the project as a contributor, check out the details here:
👉 Scala Center GSoC Ideas: https://lnkd.in/enXAepQ3
To know everything about GSoC and how it works, check out this talk:
🎥 GSoC Process Explained: https://lnkd.in/e_dM57bZ
To learn about the experience of GSoC contributors of LLM4S, check out their blogs in the section below.
📚 Explore Past GSoC Projects with Scala Center: https://www.gsocorganizations.dev/organization/scala-center/
This page includes detailed information on all GSoC projects with Scala Center from past years — including project descriptions, code repositories, contributor blogs, and mentor details.
Hello GSoCers and future GSoC aspirants! Here are some essential onboarding links to help you collaborate and stay organized within the LLM4S community.
- 🔗 LLM4S GSoC GitHub Team: You have been invited to join the LLM4S GitHub team for GSoC participants. Accepting this invite will grant you access to internal resources and coordination tools. 👉 https://github.com/orgs/llm4s/teams/gsoc/members
- 📌 Private GSoC Project Tracking Board: Once you're part of the team, you will have access to our private GSoC tracking board. This board helps you track tasks, timelines, and project deliverables throughout the GSoC period. 👉 https://github.com/orgs/llm4s/projects/3
- Contributor: Elvan Konukseven
- LinkedIn: https://www.linkedin.com/in/elvan-konukseven/ | Email: [email protected] | Discord: elvan_31441
- Mentors: Kannupriya Kalra (Email: [email protected]), Rory Graves (Email: [email protected])
- Announcement: Official Acceptance Post
- Contributor Blogs: 📌 elvankonukseven.com/blog
- Work log: 📌 GitHub Project Board
- Contributor: Gopi Trinadh Maddikunta
- LinkedIn: https://www.linkedin.com/in/gopitrinadhmaddikunta/ | Email: [email protected] | Discord: g3nadh_58439
- Mentors: Kannupriya Kalra (Email: [email protected]), Rory Graves (Email: [email protected]), Dmitry Mamonov (Email: [email protected])
- Announcement: Official Acceptance Post | Midterm evaluation post
- Contributor Blogs: 📌 Main Blog | 📌 Scala at Light Speed – Part 1 | 📌 Scala at Light Speed – Part 2
- Work log: 📌 Work Log → GitHub Project
- Contributor: Anshuman Awasthi
- LinkedIn: https://www.linkedin.com/in/let-me-try-to-fork-your-responsibilities/ | Email: [email protected] | Discord: anshuman23026
- Mentors: Kannupriya Kalra (Email: [email protected]), Rory Graves (Email: [email protected])
- Announcement: Official Acceptance Post | Midterm evaluation post
- Contributor Blogs: 📌 Anshuman's GSoC Journey
- Work Log: 📌 GitHub Project Board
- Contributor: Shubham Vishwakarma
- LinkedIn: https://www.linkedin.com/in/shubham-vish/ | Email: [email protected] | Discord: oxygen4076
- Mentors: Kannupriya Kalra (Email: [email protected]), Rory Graves (Email: [email protected]), Dmitry Mamonov (Email: [email protected])
- Announcement: Official Acceptance Post | Midterm evaluation post
- Contributor Blogs: 📌 Cracking the Code: My GSoC 2025 Story
- Work log: 📌 GitHub Project Board
Feel free to reach out to the contributors or mentors listed for any guidance or questions related to GSoC 2026.
Contributors selected across the globe for GSoC 2025 program.
We’ve got exciting news to share — Scalac, one of the leading Scala development companies, has officially partnered with LLM4S for a dedicated AI-focused blog series!
This collaboration was initiated after our talk at Functional World 2025, and it’s now evolving into a full-fledged multi-part series and an upcoming eBook hosted on Scalac’s platform. The series will combine practical Scala code, GenAI architecture, and reflections from the LLM4S team — making it accessible for Scala developers everywhere who want to build with LLMs.
📝 The first post is already drafted and under review by the Scalac editorial team. We’re working together to ensure this content is both technically insightful and visually engaging.
🎉 Thanks to Matylda Kamińska, Rafał Kruczek, and the Scalac marketing team for this opportunity and collaboration!
Stay tuned — the series will be published soon on scalac.io/blog, and we’ll link it here as it goes live.
LLM4S blogs powered by Scalac.
Our Google Summer of Code (GSoC) 2025 contributors have actively documented their journeys, sharing insights and implementation deep-dives from their projects. These blog posts offer valuable perspectives on how LLM4S is evolving from a contributor-first lens.
Contributor | Blog(s) | Project |
---|---|---|
Elvan Konukseven | elvankonukseven.com/blog | Agentic Toolkit for LLMs |
Gopi Trinadh Maddikunta | Main Blog Scala at Light Speed – Part 1 Scala at Light Speed – Part 2 | RAG in a Box |
Anshuman Awasthi | Anshuman's GSoC Journey | Multimodal LLM Support |
Shubham Vishwakarma | Cracking the Code: My GSoC 2025 Story | Tracing and Observability |
💡 These blogs reflect first-hand experience in building real-world AI tools using Scala, and are great resources for future contributors and researchers alike.
- 🌐 Main Blog
- 📝 Articles:
- 🌐 Main Blog
- 📝 Articles:
- Spark of Curiosity
- The Hunt Begins
- Enter Scala Center
- Understanding the Mission
- Locking the Vision
- The Proposal Sprint
- The Acceptance Moment
- Stepping In
- Fuel for the Mission
- Scala at Light Speed – Part 1
- Scala at Light Speed – Part 2
- From Documents to Embeddings
- Universal Extraction + Embedding Client = One step closer to RAG
- Midterm Milestone – Modular RAG in Motion
- 🌐 Main Blog: Anshuman's GSoC Journey
- 📝 Articles:
- 🌐 Main Blog: Shubham's GSoC Journey
- 📝 Articles:
Want to connect with maintainers? The LLM4S project is maintained by:
- Rory Graves - https://www.linkedin.com/in/roryjgraves/ | Email: [email protected] | Discord: rorybot1
- Kannupriya Kalra - https://www.linkedin.com/in/kannupriyakalra/ | Email: [email protected] | Discord: kannupriyakalra_46520
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for llm4s
Similar Open Source Tools

llm4s
LLM4S provides a simple, robust, and scalable framework for building Large Language Models (LLM) applications in Scala. It aims to leverage Scala's type safety, functional programming, JVM ecosystem, concurrency, and performance advantages to create reliable and maintainable AI-powered applications. The framework supports multi-provider integration, execution environments, error handling, Model Context Protocol (MCP) support, agent frameworks, multimodal generation, and Retrieval-Augmented Generation (RAG) workflows. It also offers observability features like detailed trace logging, monitoring, and analytics for debugging and performance insights.

AgentNeo
AgentNeo is an advanced, open-source Agentic AI Application Observability, Monitoring, and Evaluation Framework designed to provide deep insights into AI agents, Large Language Model (LLM) calls, and tool interactions. It offers robust logging, visualization, and evaluation capabilities to help debug and optimize AI applications with ease. With features like tracing LLM calls, monitoring agents and tools, tracking interactions, detailed metrics collection, flexible data storage, simple instrumentation, interactive dashboard, project management, execution graph visualization, and evaluation tools, AgentNeo empowers users to build efficient, cost-effective, and high-quality AI-driven solutions.

cia
CIA is a powerful open-source tool designed for data analysis and visualization. It provides a user-friendly interface for processing large datasets and generating insightful reports. With CIA, users can easily explore data, perform statistical analysis, and create interactive visualizations to communicate findings effectively. Whether you are a data scientist, analyst, or researcher, CIA offers a comprehensive set of features to streamline your data analysis workflow and uncover valuable insights.

local-deep-research
Local Deep Research is a powerful AI-powered research assistant that performs deep, iterative analysis using multiple LLMs and web searches. It can be run locally for privacy or configured to use cloud-based LLMs for enhanced capabilities. The tool offers advanced research capabilities, flexible LLM support, rich output options, privacy-focused operation, enhanced search integration, and academic & scientific integration. It also provides a web interface, command line interface, and supports multiple LLM providers and search engines. Users can configure AI models, search engines, and research parameters for customized research experiences.

paelladoc
PAELLADOC is an intelligent documentation system that uses AI to analyze code repositories and generate comprehensive technical documentation. It offers a modular architecture with MECE principles, interactive documentation process, key features like Orchestrator and Commands, and a focus on context for successful AI programming. The tool aims to streamline documentation creation, code generation, and product management tasks for software development teams, providing a definitive standard for AI-assisted development documentation.

Automodel
Automodel is a Python library for automating the process of building and evaluating machine learning models. It provides a set of tools and utilities to streamline the model development workflow, from data preprocessing to model selection and evaluation. With Automodel, users can easily experiment with different algorithms, hyperparameters, and feature engineering techniques to find the best model for their dataset. The library is designed to be user-friendly and customizable, allowing users to define their own pipelines and workflows. Automodel is suitable for data scientists, machine learning engineers, and anyone looking to quickly build and test machine learning models without the need for manual intervention.

parlant
Parlant is a structured approach to building and guiding customer-facing AI agents. It allows developers to create and manage robust AI agents, providing specific feedback on agent behavior and helping understand user intentions better. With features like guidelines, glossary, coherence checks, dynamic context, and guided tool use, Parlant offers control over agent responses and behavior. Developer-friendly aspects include instant changes, Git integration, clean architecture, and type safety. It enables confident deployment with scalability, effective debugging, and validation before deployment. Parlant works with major LLM providers and offers client SDKs for Python and TypeScript. The tool facilitates natural customer interactions through asynchronous communication and provides a chat UI for testing new behaviors before deployment.

RepoMaster
RepoMaster is an AI agent that leverages GitHub repositories to solve complex real-world tasks. It transforms how coding tasks are solved by automatically finding the right GitHub tools and making them work together seamlessly. Users can describe their tasks, and RepoMaster's AI analysis leads to auto discovery and smart execution, resulting in perfect outcomes. The tool provides a web interface for beginners and a command-line interface for advanced users, along with specialized agents for deep search, general assistance, and repository tasks.

agentneo
AgentNeo is a Python package that provides functionalities for project, trace, dataset, experiment management. It allows users to authenticate, create projects, trace agents and LangGraph graphs, manage datasets, and run experiments with metrics. The tool aims to streamline AI project management and analysis by offering a comprehensive set of features.

llamafarm
LlamaFarm is a comprehensive AI framework that empowers users to build powerful AI applications locally, with full control over costs and deployment options. It provides modular components for RAG systems, vector databases, model management, prompt engineering, and fine-tuning. Users can create differentiated AI products without needing extensive ML expertise, using simple CLI commands and YAML configs. The framework supports local-first development, production-ready components, strategy-based configuration, and deployment anywhere from laptops to the cloud.

WebMasterLog
WebMasterLog is a comprehensive repository showcasing various web development projects built with front-end and back-end technologies. It highlights interactive user interfaces, dynamic web applications, and a spectrum of web development solutions. The repository encourages contributions in areas such as adding new projects, improving existing projects, updating documentation, fixing bugs, implementing responsive design, enhancing code readability, and optimizing project functionalities. Contributors are guided to follow specific guidelines for project submissions, including directory naming conventions, README file inclusion, project screenshots, and commit practices. Pull requests are reviewed based on criteria such as proper PR template completion, originality of work, code comments for clarity, and sharing screenshots for frontend updates. The repository also participates in various open-source programs like JWOC, GSSoC, Hacktoberfest, KWOC, 24 Pull Requests, IWOC, SWOC, and DWOC, welcoming valuable contributors.

inspector
A developer tool for testing and debugging Model Context Protocol (MCP) servers. It allows users to test the compliance of their MCP servers with the latest MCP specs, supports various transports like STDIO, SSE, and Streamable HTTP, features an LLM Playground for testing server behavior against different models, provides comprehensive logging and error reporting for MCP server development, and offers a modern developer experience with multiple server connections and saved configurations. The tool is built using Next.js and integrates MCP capabilities, AI SDKs from OpenAI, Anthropic, and Ollama, and various technologies like Node.js, TypeScript, and Next.js.

Archon
Archon is an AI meta-agent designed to autonomously build, refine, and optimize other AI agents. It serves as a practical tool for developers and an educational framework showcasing the evolution of agentic systems. Through iterative development, Archon demonstrates the power of planning, feedback loops, and domain-specific knowledge in creating robust AI agents.

AgC
AgC is an open-core platform designed for deploying, running, and orchestrating AI agents at scale. It treats agents as first-class compute units, providing a modular, observable, cloud-neutral, and production-ready environment. Open Agentic Compute empowers developers and organizations to run agents like cloud-native workloads without lock-in.

lyraios
LYRAIOS (LLM-based Your Reliable AI Operating System) is an advanced AI assistant platform built with FastAPI and Streamlit, designed to serve as an operating system for AI applications. It offers core features such as AI process management, memory system, and I/O system. The platform includes built-in tools like Calculator, Web Search, Financial Analysis, File Management, and Research Tools. It also provides specialized assistant teams for Python and research tasks. LYRAIOS is built on a technical architecture comprising FastAPI backend, Streamlit frontend, Vector Database, PostgreSQL storage, and Docker support. It offers features like knowledge management, process control, and security & access control. The roadmap includes enhancements in core platform, AI process management, memory system, tools & integrations, security & access control, open protocol architecture, multi-agent collaboration, and cross-platform support.

monoscope
Monoscope is an open-source monitoring and observability platform that uses artificial intelligence to understand and monitor systems automatically. It allows users to ingest and explore logs, traces, and metrics in S3 buckets, query in natural language via LLMs, and create AI agents to detect anomalies. Key capabilities include universal data ingestion, AI-powered understanding, natural language interface, cost-effective storage, and zero configuration. Monoscope is designed to reduce alert fatigue, catch issues before they impact users, and provide visibility across complex systems.
## For similar tasks

griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.

AI-in-a-Box
AI-in-a-Box is a curated collection of solution accelerators that can help engineers establish their AI/ML environments and solutions rapidly and with minimal friction, while maintaining the highest standards of quality and efficiency. It provides essential guidance on the responsible use of AI and LLM technologies, specific security guidance for Generative AI (GenAI) applications, and best practices for scaling OpenAI applications within Azure. The available accelerators include: Azure ML Operationalization in-a-box, Edge AI in-a-box, Doc Intelligence in-a-box, Image and Video Analysis in-a-box, Cognitive Services Landing Zone in-a-box, Semantic Kernel Bot in-a-box, NLP to SQL in-a-box, Assistants API in-a-box, and Assistants API Bot in-a-box.

spring-ai
The Spring AI project provides a Spring-friendly API and abstractions for developing AI applications. It offers a portable client API for interacting with generative AI models, enabling developers to easily swap out implementations and access models from providers such as OpenAI, Azure OpenAI, and Hugging Face. Spring AI also supports prompt engineering, providing classes and interfaces for creating and parsing prompts, as well as incorporating proprietary data into generative AI without retraining the model. This is achieved through Retrieval Augmented Generation (RAG), which involves extracting, transforming, and loading data into a vector database for use by AI models. Spring AI's VectorStore abstraction allows for seamless transitions between different vector database implementations.
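The retrieve-then-prompt flow behind RAG can be illustrated with a self-contained sketch. The bag-of-words "embedding" and in-memory store below are stand-ins for a real embedding model and vector database; nothing here is Spring AI's actual API:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy "embedding": bag-of-words term counts.
    # Real systems use a model-generated dense vector instead.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# "Extract, transform, load": store each document alongside its vector.
docs = [
    "scala runs on the jvm",
    "python is popular for machine learning",
    "the jvm offers mature tooling",
]
store = [(d, embed(d)) for d in docs]

# Retrieval: find the document most similar to the query; a RAG pipeline
# would then prepend it to the LLM prompt as context.
query = embed("which runtime has mature tooling on the jvm")
best = max(store, key=lambda pair: cosine(query, pair[1]))[0]
print(best)  # "the jvm offers mature tooling"
```

Swapping the toy pieces for a real embedding model and vector store changes the components but not the shape of the flow, which is what abstractions like Spring AI's VectorStore capture.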

ragstack-ai
RAGStack is an out-of-the-box solution that simplifies Retrieval Augmented Generation (RAG) in GenAI apps. RAGStack bundles the best open-source software for implementing RAG, giving developers a comprehensive GenAI stack leveraging LangChain, CassIO, and more. RAGStack builds on the LangChain ecosystem and is fully compatible with LangSmith for monitoring your AI deployments.

breadboard
Breadboard is a library for prototyping generative AI applications. It is inspired by the hardware maker community and their boundless creativity. Breadboard makes it easy to wire prototypes and share, remix, reuse, and compose them. The library emphasizes ease and flexibility of wiring, as well as modularity and composability.

cloudflare-ai-web
Cloudflare-ai-web is a lightweight and easy-to-use tool that allows you to quickly deploy a multi-modal AI platform using Cloudflare Workers AI. It supports serverless deployment, password protection, and local storage of chat logs. At only ~638 kB gzipped, it is a great option for building AI-powered applications without the need for a dedicated server.

app-builder
AppBuilder SDK is a one-stop development tool for AI-native applications, providing basic cloud resources, an AI capability engine, the Qianfan large model, and related capability components to streamline the development of AI-native applications.

cookbook
This repository contains community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models. Contributions are welcome, and there are several ways to contribute to the Open-Source AI Cookbook: submit an idea for a desired example or guide via GitHub Issues, contribute a new notebook with a practical example, or improve existing examples by fixing issues and typos. Before contributing, check the currently open issues and pull requests to avoid duplicating work that is already in progress.
## For similar jobs

sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

BricksLLM
BricksLLM is a cloud-native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI, and vLLM. BricksLLM aims to provide enterprise-level infrastructure that can power any LLM production use case. Some use cases for BricksLLM:
- Set LLM usage limits for users on different pricing tiers
- Track LLM usage on a per-user and per-organization basis
- Block or redact requests containing PII
- Improve LLM reliability with failovers, retries, and caching
- Distribute API keys with rate limits and cost limits for internal development/production use cases
- Distribute API keys with rate limits and cost limits for students
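Per-key rate limiting of the kind a gateway enforces is commonly built on a token bucket. The sketch below is a generic illustration with made-up numbers, not BricksLLM's actual mechanism:

```python
import time

class TokenBucket:
    """Refill `rate` tokens per second up to `capacity`; each request costs one token."""
    def __init__(self, rate: float, capacity: float):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self) -> bool:
        # Credit tokens for the time elapsed since the last request.
        now = time.monotonic()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False

# One bucket per distributed API key, keyed the way a gateway might key them.
# rate=0 means no refill, so the key is good for exactly `capacity` requests.
buckets = {"student-key": TokenBucket(rate=0.0, capacity=2)}

results = [buckets["student-key"].allow() for _ in range(3)]
print(results)  # [True, True, False] - the third request is rejected
```

A production gateway layers cost accounting, failover, and PII filtering on top of this kind of admission check, but the per-key bucket is the core of the rate limit.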

uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.
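The schedule-driven behaviour described above can be sketched with plain asyncio. This toy `Agent` and its `on_interval` decorator are loosely modelled on the idea of interval handlers, not on the real uAgents API:

```python
import asyncio

class Agent:
    """Toy agent that runs registered handlers on a fixed interval."""
    def __init__(self, name: str):
        self.name = name
        self._handlers = []

    def on_interval(self, period: float):
        # Decorator: register a coroutine to fire every `period` seconds.
        def register(fn):
            self._handlers.append((period, fn))
            return fn
        return register

    async def _loop(self, period, fn, ticks):
        for _ in range(ticks):
            await fn(self)
            await asyncio.sleep(period)

    def run(self, ticks: int = 3):
        # Run every registered handler for a bounded number of ticks.
        async def main():
            await asyncio.gather(*(self._loop(p, fn, ticks)
                                   for p, fn in self._handlers))
        asyncio.run(main())

greetings = []
alice = Agent("alice")

@alice.on_interval(period=0.01)
async def say_hello(agent):
    greetings.append(f"hello from {agent.name}")

alice.run(ticks=3)
print(greetings)  # three "hello from alice" entries
```

A real agent framework adds the pieces this sketch omits: event-driven handlers, message passing between agents, and (in uAgents' case) cryptographically secured identities.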