agentpress
AI Agents API Server Starter; FastAPI, Supabase, Redis
AgentPress is a collection of simple but powerful utilities that serve as building blocks for creating AI agents. It includes core components for managing threads, registering tools, processing responses, state management, and utilizing LLMs. The tool provides a modular architecture for handling messages, LLM API calls, response processing, tool execution, and results management. Users can easily set up the environment, create custom tools with OpenAPI or XML schema, and manage conversation threads with real-time interaction. AgentPress aims to be agnostic, simple, and flexible, allowing users to customize and extend functionalities as needed.
README:
AgentPress is a collection of simple, but powerful utilities that serve as building blocks for creating AI agents. Plug, play, and customize.
See the How It Works section below for an explanation of the agent flow.
- Threads: Manage Messages[] as threads.
- Tools: Register code as callable tools with definitions in both OpenAPI and XML
- Response Processing: Support for native-LLM OpenAPI and XML-based tool calling
- State Management: Thread-safe JSON key-value state management
- LLM: 100+ LLMs using the OpenAI I/O format, powered by LiteLLM
- Install the package:
pip install agentpress

- Initialize AgentPress in your project:

agentpress init

Creates an agentpress directory with all the core utilities. Check out File Overview for explanations of the generated files.
- If you selected the example agent during initialization:
  - Creates an agent.py file with a web development agent example
  - Creates a tools directory with example tools:
    - files_tool.py: File operations (create/update files, read directory and load into state)
    - terminal_tool.py: Terminal command execution
  - Creates a workspace directory for the agent to work in
- Set up your environment variables in a .env file:
OPENAI_API_KEY=your_key_here
ANTHROPIC_API_KEY=your_key_here
GROQ_API_KEY=your_key_here
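These keys are read from the environment (AgentPress makes its provider calls through LiteLLM). If nothing in your setup loads the .env file automatically, one common approach, assumed here rather than prescribed by AgentPress, is python-dotenv:

```python
from dotenv import load_dotenv

# Load OPENAI_API_KEY / ANTHROPIC_API_KEY / GROQ_API_KEY from .env into os.environ
load_dotenv()
```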
- Create a calculator tool with OpenAPI schema:

from agentpress.tool import Tool, ToolResult, openapi_schema

class CalculatorTool(Tool):
    @openapi_schema({
        "type": "function",
        "function": {
            "name": "add",
            "description": "Add two numbers",
            "parameters": {
                "type": "object",
                "properties": {
                    "a": {"type": "number"},
                    "b": {"type": "number"}
                },
                "required": ["a", "b"]
            }
        }
    })
    async def add(self, a: float, b: float) -> ToolResult:
        try:
            result = a + b
            return self.success_response(f"The sum is {result}")
        except Exception as e:
            return self.fail_response(f"Failed to add numbers: {str(e)}")

- Or create a tool with XML schema:
from agentpress.tool import Tool, ToolResult, xml_schema

class FilesTool(Tool):
    @xml_schema(
        tag_name="create-file",
        mappings=[
            {"param_name": "file_path", "node_type": "attribute", "path": "."},
            {"param_name": "file_contents", "node_type": "content", "path": "."}
        ],
        example='''
        <create-file file_path="path/to/file">
        File contents go here
        </create-file>
        '''
    )
    async def create_file(self, file_path: str, file_contents: str) -> ToolResult:
        # Implementation here
        pass
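The mappings above describe how each parameter is pulled out of the XML tag: file_path comes from an attribute, file_contents from the element's text content. As a standalone illustration of that attribute/content mapping (not AgentPress's internal parser), the same extraction can be done with the standard library:

```python
import xml.etree.ElementTree as ET

def parse_create_file(xml_text: str) -> dict:
    """Illustrative only: map a <create-file> tag to tool parameters."""
    root = ET.fromstring(xml_text)
    return {
        "file_path": root.get("file_path"),          # node_type="attribute"
        "file_contents": (root.text or "").strip(),  # node_type="content"
    }

print(parse_create_file('<create-file file_path="index.html">Hello</create-file>'))
# {'file_path': 'index.html', 'file_contents': 'Hello'}
```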
- Use the Thread Manager with tool execution:

import asyncio
from agentpress.thread_manager import ThreadManager
from calculator_tool import CalculatorTool

async def main():
    # Initialize thread manager and add tools
    manager = ThreadManager()
    manager.add_tool(CalculatorTool)

    # Create a new thread
    thread_id = await manager.create_thread()

    # Add your message
    await manager.add_message(thread_id, {
        "role": "user",
        "content": "What's 2 + 2?"
    })

    # Run with streaming and tool execution
    response = await manager.run_thread(
        thread_id=thread_id,
        system_message={
            "role": "system",
            "content": "You are a helpful assistant with calculation abilities."
        },
        model_name="anthropic/claude-3-5-sonnet-latest",
        execute_tools=True,
        native_tool_calling=True,  # as opposed to xml_tool_calling=True
        parallel_tool_execution=True  # execute tools in parallel rather than sequentially
    )

asyncio.run(main())

- View conversation threads in a web UI:
streamlit run agentpress/thread_viewer_ui.py

Each AI agent iteration follows a clear, modular flow:
- Message & LLM Handling
  - Messages are managed in threads via ThreadManager
  - LLM API calls are made through a unified interface (llm.py)
  - Supports streaming responses for real-time interaction
- Response Processing
  - LLM returns both content and tool calls
  - Content is streamed in real-time
  - Tool calls are parsed using either:
    - Standard OpenAPI function calling
    - XML-based tool definitions
    - Custom parsers (extend ToolParserBase)
- Tool Execution
  - Tools are executed either:
    - In real-time during streaming (execute_tools_on_stream)
    - After the complete response
  - In parallel or sequential order
  - Supports both standard and XML tool formats
  - Extensible through ToolExecutorBase
- Results Management
  - Results from both content and tool executions are handled
  - Supports different result formats (standard/XML)
  - Customizable through ResultsAdderBase
This modular architecture allows you to:
- Use standard OpenAPI function calling
- Switch to XML-based tool definitions (see the sketch after this list)
- Create custom processors by extending base classes
- Mix and match different approaches
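For example, switching from native function calling to XML-based tool definitions maps onto the flags already shown in the ThreadManager example above. A sketch of that switch, assuming the two modes are exclusive (native_tool_calling=False is an assumption, not taken from this README):

```python
# Continuing inside main() from the earlier ThreadManager example:
response = await manager.run_thread(
    thread_id=thread_id,
    system_message={"role": "system", "content": "You are a helpful assistant."},
    model_name="anthropic/claude-3-5-sonnet-latest",
    execute_tools=True,
    xml_tool_calling=True,      # parse tool calls from XML tags such as <create-file>
    native_tool_calling=False,  # assumption: disable native function calling when using XML
)
```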
LLM API interface using LiteLLM. Supports 100+ LLMs with OpenAI-compatible format. Includes streaming, retry logic, and error handling.
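Since the interface is LiteLLM's OpenAI-compatible calling convention, any supported provider is addressed by a model string plus OpenAI-style messages. A minimal sketch using LiteLLM directly (not AgentPress's wrapper, whose exact function names are not shown in this README):

```python
from litellm import acompletion

async def ask(prompt: str) -> str:
    # Any LiteLLM model string works here, e.g. "gpt-4o" or
    # "anthropic/claude-3-5-sonnet-latest"; API keys come from the environment.
    response = await acompletion(
        model="anthropic/claude-3-5-sonnet-latest",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```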
Manages conversation threads with support for:
- Message history management
- Tool registration and execution
- Streaming responses
- Both OpenAPI and XML tool calling patterns
Base infrastructure for tools with:
- OpenAPI schema decorator for standard function calling
- XML schema decorator for XML-based tool calls
- Standardized ToolResult responses
Central registry for tool management:
- Registers both OpenAPI and XML tools
- Maintains tool schemas and implementations
- Provides tool lookup and validation
Thread-safe state persistence:
- JSON-based key-value storage
- Atomic operations with locking
- Automatic file handling
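Conceptually this is a JSON file guarded by a lock so concurrent reads and writes stay atomic. A minimal standalone sketch of that pattern (illustration only, not AgentPress's actual state manager or its API):

```python
import asyncio
import json
from pathlib import Path

class TinyStateStore:
    """Illustrative only: a lock-guarded JSON key-value store."""

    def __init__(self, path: str = "state.json"):
        self._path = Path(path)
        self._lock = asyncio.Lock()

    async def set(self, key: str, value) -> None:
        async with self._lock:  # atomic read-modify-write
            data = json.loads(self._path.read_text()) if self._path.exists() else {}
            data[key] = value
            self._path.write_text(json.dumps(data, indent=2))

    async def get(self, key: str, default=None):
        async with self._lock:
            data = json.loads(self._path.read_text()) if self._path.exists() else {}
            return data.get(key, default)
```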
Handles LLM response processing with support for:
- Streaming and complete responses
- Tool call extraction and execution
- Result formatting and message management
- standard_tool_parser.py: Parses OpenAPI function calls
- standard_tool_executor.py: Executes standard tool calls
- standard_results_adder.py: Manages standard results
- xml_tool_parser.py: Parses XML-formatted tool calls
- xml_tool_executor.py: Executes XML tool calls
- xml_results_adder.py: Manages XML results
- Plug & Play: Start with our defaults, then customize to your needs.
- Agnostic: Built on LiteLLM, supporting any LLM provider. Minimal opinions, maximum flexibility.
- Simplicity: Clean, readable code that's easy to understand and modify.
- No Lock-in: Take full ownership of the code. Copy what you need directly into your codebase.
We welcome contributions! Feel free to:
- Submit issues for bugs or suggestions
- Fork the repository and send pull requests
- Share how you've used AgentPress in your projects
- Clone:
git clone https://github.com/kortix-ai/agentpress
cd agentpress

- Install dependencies:
pip install poetry
poetry install

- For quick testing:
pip install -e .

Built with ❤️ by Kortix AI Corp
