
hayhooks
Easily deploy Haystack pipelines as REST APIs and MCP Tools.
Stars: 111

Hayhooks is a tool that simplifies the deployment and serving of Haystack pipelines as REST APIs. It allows users to wrap their pipelines with custom logic and expose them via HTTP endpoints, including OpenAI-compatible chat completion endpoints. With Hayhooks, users can easily convert their Haystack pipelines into API services with minimal boilerplate code.
README:
Hayhooks makes it easy to deploy and serve Haystack Pipelines and Agents.
With Hayhooks, you can:
- 📦 Deploy your Haystack pipelines and agents as REST APIs with maximum flexibility and minimal boilerplate code.
- 🛠️ Expose your Haystack pipelines and agents over the MCP protocol, making them available as tools in AI dev environments like Cursor or Claude Desktop. Under the hood, Hayhooks runs as an MCP Server, exposing each pipeline and agent as an MCP Tool.
- 💬 Integrate your Haystack pipelines and agents with open-webui as OpenAI-compatible chat completion backends with streaming support.
- 🕹️ Control Hayhooks core API endpoints through chat - deploy, undeploy, list, or run Haystack pipelines and agents by chatting with Claude Desktop, Cursor, or any other MCP client.
- Quick Start with Docker Compose
- Quick Start
- Install the package
- Configuration
- Logging
- CLI Commands
- Start Hayhooks
- Deploy a Pipeline
- Deploy an Agent
- Support file uploads
- Run pipelines from the CLI
- MCP support
- Hayhooks as an OpenAPI Tool Server in open-webui
- OpenAI Compatibility and open-webui integration
- Sending open-webui events enhancing the user experience
- Hooks
- Advanced Usage
- Deployment Guidelines
- Legacy Features
- License
To quickly get started with Hayhooks, we provide a ready-to-use Docker Compose 🐳 setup with pre-configured integration with open-webui.
It's available here.
Start by installing the package:
pip install hayhooks
If you want to use the MCP Server, you need to install the hayhooks[mcp]
package:
pip install hayhooks[mcp]
NOTE: You'll need to run at least Python 3.10+ to use the MCP Server.
Currently, you can configure Hayhooks by:
- Setting environment variables in an .env file in the root of your project.
- Passing the supported arguments and options to the hayhooks run command.
- Passing environment variables to the hayhooks command.
The following environment variables are supported:
- HAYHOOKS_HOST: The host on which the server will listen.
- HAYHOOKS_PORT: The port on which the server will listen.
- HAYHOOKS_MCP_PORT: The port on which the MCP Server will listen.
- HAYHOOKS_MCP_HOST: The host on which the MCP Server will listen.
- HAYHOOKS_PIPELINES_DIR: The path to the directory containing the pipelines.
- HAYHOOKS_ROOT_PATH: The root path of the server.
- HAYHOOKS_ADDITIONAL_PYTHON_PATH: Additional Python path to be added to the Python path.
- HAYHOOKS_DISABLE_SSL: Boolean flag to disable SSL verification when making requests from the CLI.
- HAYHOOKS_USE_HTTPS: Boolean flag to use HTTPS when using CLI commands to interact with the server (e.g. hayhooks status will call https://HAYHOOKS_HOST:HAYHOOKS_PORT/status).
- HAYHOOKS_SHOW_TRACEBACKS: Boolean flag to show tracebacks on errors during pipeline execution and deployment.
- LOG: The log level to use (default: INFO).
- HAYHOOKS_CORS_ALLOW_ORIGINS: List of allowed origins (default: ["*"]).
- HAYHOOKS_CORS_ALLOW_METHODS: List of allowed HTTP methods (default: ["*"]).
- HAYHOOKS_CORS_ALLOW_HEADERS: List of allowed headers (default: ["*"]).
- HAYHOOKS_CORS_ALLOW_CREDENTIALS: Allow credentials (default: false).
- HAYHOOKS_CORS_ALLOW_ORIGIN_REGEX: Regex pattern for allowed origins (default: null).
- HAYHOOKS_CORS_EXPOSE_HEADERS: Headers to expose in the response (default: []).
- HAYHOOKS_CORS_MAX_AGE: Maximum age for CORS preflight responses in seconds (default: 600).
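For example, a minimal .env file in the project root could look like this (the values below are illustrative, not required defaults):

HAYHOOKS_HOST=0.0.0.0
HAYHOOKS_PORT=1416
HAYHOOKS_MCP_HOST=0.0.0.0
HAYHOOKS_MCP_PORT=1417
HAYHOOKS_PIPELINES_DIR=./pipelines
HAYHOOKS_SHOW_TRACEBACKS=true
LOG=INFO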
Hayhooks comes with a default logger based on loguru.
To use it, you can import the log
object from the hayhooks
package:
from hayhooks import log
To change the log level, you can set the LOG
environment variable to one of the levels supported by loguru.
For example, to use the DEBUG
level, you can set:
LOG=DEBUG hayhooks run
# or
LOG=debug hayhooks run
# or in an .env file
LOG=debug
The hayhooks
package provides a CLI to manage the server and the pipelines.
Any command can be run with hayhooks <command> --help
to get more information.
CLI commands are basically wrappers around the HTTP API of the server. The full API reference is available at http://HAYHOOKS_HOST:HAYHOOKS_PORT/docs or http://HAYHOOKS_HOST:HAYHOOKS_PORT/redoc.
hayhooks run # Start the server
hayhooks status # Check the status of the server and show deployed pipelines
hayhooks pipeline deploy-files <path_to_dir> # Deploy a pipeline using PipelineWrapper
hayhooks pipeline deploy <pipeline_name> # Deploy a pipeline from a YAML file
hayhooks pipeline undeploy <pipeline_name> # Undeploy a pipeline
hayhooks pipeline run <pipeline_name> # Run a pipeline
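Since these commands are thin wrappers around the HTTP API, you can also call the endpoints directly. A hedged sketch, assuming the server is listening locally on http://localhost:1416:

# Same information as the "hayhooks status" command
curl http://localhost:1416/status
# The interactive API reference is served at http://localhost:1416/docs and http://localhost:1416/redoc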
Let's start Hayhooks:
hayhooks run
This will start the Hayhooks server on HAYHOOKS_HOST:HAYHOOKS_PORT
.
Now, we will deploy a pipeline to chat with a website. We have created an example in the examples/pipeline_wrappers/chat_with_website_streaming folder.
In the example folder, we have two files:
- chat_with_website.yml: The pipeline definition in YAML format.
- pipeline_wrapper.py (mandatory): A pipeline wrapper that uses the pipeline definition.
The pipeline wrapper provides a flexible foundation for deploying Haystack pipelines, agents or any other component by allowing users to:
- Choose their preferred initialization method (YAML files, Haystack templates, or inline code)
- Define custom execution logic with configurable inputs and outputs
- Optionally expose OpenAI-compatible chat endpoints with streaming support for integration with interfaces like open-webui
The pipeline_wrapper.py
file must contain an implementation of the BasePipelineWrapper
class (see here for more details).
A minimal PipelineWrapper
looks like this:
from pathlib import Path
from typing import List
from haystack import Pipeline
from hayhooks import BasePipelineWrapper


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        pipeline_yaml = (Path(__file__).parent / "chat_with_website.yml").read_text()
        self.pipeline = Pipeline.loads(pipeline_yaml)

    def run_api(self, urls: List[str], question: str) -> str:
        result = self.pipeline.run({"fetcher": {"urls": urls}, "prompt": {"query": question}})
        return result["llm"]["replies"][0]
It contains two methods:
The setup() method will be called when the pipeline is deployed. It should initialize the self.pipeline attribute as a Haystack pipeline.
You can initialize the pipeline in many ways:
- Load it from a YAML file.
- Define it inline as Haystack pipeline code (see the sketch below).
- Load it from a Haystack pipeline template.
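As a sketch of the inline option (the components, connections, and prompt template below are illustrative assumptions, not the exact chat_with_website pipeline):

from typing import List
from haystack import Pipeline
from haystack.components.fetchers import LinkContentFetcher
from haystack.components.converters import HTMLToDocument
from haystack.components.builders import PromptBuilder
from haystack.components.generators import OpenAIGenerator
from hayhooks import BasePipelineWrapper


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        # Build the pipeline in code instead of loading it from YAML
        pipeline = Pipeline()
        pipeline.add_component("fetcher", LinkContentFetcher())
        pipeline.add_component("converter", HTMLToDocument())
        pipeline.add_component("prompt", PromptBuilder(template="Given these documents: {{ documents }}, answer: {{ query }}"))
        pipeline.add_component("llm", OpenAIGenerator())  # Requires OPENAI_API_KEY in the environment
        pipeline.connect("fetcher.streams", "converter.sources")
        pipeline.connect("converter.documents", "prompt.documents")
        pipeline.connect("prompt.prompt", "llm.prompt")
        self.pipeline = pipeline

    def run_api(self, urls: List[str], question: str) -> str:
        result = self.pipeline.run({"fetcher": {"urls": urls}, "prompt": {"query": question}})
        return result["llm"]["replies"][0]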
The run_api method will be used to run the pipeline in API mode, when you call the {pipeline_name}/run endpoint.
You can define the input arguments of the method according to your needs.
def run_api(self, urls: List[str], question: str, any_other_user_defined_argument: Any) -> str:
    ...
The input arguments will be used to generate a Pydantic model that will be used to validate the request body. The same will be done for the response type.
NOTE: Since Hayhooks will dynamically create the Pydantic models, you need to make sure that the input arguments are JSON-serializable.
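For example, once the chat_with_website wrapper above is deployed, the generated endpoint could be called like this (host, port, and payload values are illustrative; the JSON body mirrors the run_api arguments):

curl -X POST http://localhost:1416/chat_with_website/run \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://haystack.deepset.ai"], "question": "What is Haystack?"}'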
The run_api_async method is the asynchronous version of run_api. It will be used to run the pipeline in API mode when you call the {pipeline_name}/run endpoint, but handles requests asynchronously for better performance under high load.
You can define the input arguments of the method according to your needs, just like with run_api
.
async def run_api_async(self, urls: List[str], question: str, any_other_user_defined_argument: Any) -> str:
    # Use async/await with AsyncPipeline or async operations
    result = await self.pipeline.run_async({"fetcher": {"urls": urls}, "prompt": {"query": question}})
    return result["llm"]["replies"][0]
This is particularly useful when:
- Working with AsyncPipeline instances that support async execution
- Integrating with async-compatible Haystack components (e.g., OpenAIChatGenerator with async support)
- Handling I/O-bound operations more efficiently
- Deploying pipelines that need to handle many concurrent requests
NOTE: You can implement either run_api
, run_api_async
, or both. Hayhooks will automatically detect which methods are implemented and route requests accordingly.
You can find complete working examples of async pipeline wrappers in the test files and async streaming examples.
To deploy the pipeline, run:
hayhooks pipeline deploy-files -n chat_with_website examples/pipeline_wrappers/chat_with_website_streaming
This will deploy the pipeline with the name chat_with_website. Any error encountered during development will be printed to the console and shown in the server logs.
During development, you can use the --overwrite
flag to redeploy your pipeline without restarting the Hayhooks server:
hayhooks pipeline deploy-files -n {pipeline_name} --overwrite {pipeline_dir}
This is particularly useful when:
- Iterating on your pipeline wrapper implementation
- Debugging pipeline setup issues
- Testing different pipeline configurations
The --overwrite
flag will:
- Remove the existing pipeline from the registry
- Delete the pipeline files from disk
- Deploy the new version of your pipeline
For even faster development iterations, you can combine --overwrite
with --skip-saving-files
to avoid writing files to disk:
hayhooks pipeline deploy-files -n {pipeline_name} --overwrite --skip-saving-files {pipeline_dir}
This is useful when:
- You're making frequent changes during development
- You want to test a pipeline without persisting it
- You're running in an environment with limited disk access
After installing the Hayhooks package, it might happen that during pipeline deployment you need to install additional dependencies in order to correctly initialize the pipeline instance when calling the wrapper's setup()
method. For instance, the chat_with_website
pipeline requires the trafilatura
package, which is not installed by default.
If a required dependency is missing, the deployment will fail; to see the full error traceback, set the HAYHOOKS_SHOW_TRACEBACKS environment variable to true or 1.
Then, assuming you've installed the Hayhooks package in a virtual environment, you will need to install the additional required dependencies yourself by running:
pip install trafilatura
Deploying a Haystack Agent is very similar to deploying a pipeline.
You simply need to create a PipelineWrapper
which will wrap the Haystack Agent instance. The following example is the bare minimum to deploy an agent and make it usable through open-webui
, supporting streaming responses:
from typing import AsyncGenerator
from haystack.components.agents import Agent
from haystack.dataclasses import ChatMessage
from haystack.components.generators.chat import OpenAIChatGenerator
from hayhooks import BasePipelineWrapper, async_streaming_generator


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        self.agent = Agent(
            chat_generator=OpenAIChatGenerator(model="gpt-4o-mini"),
            system_prompt="You're a helpful agent",
        )

    async def run_chat_completion_async(
        self, model: str, messages: list[dict], body: dict
    ) -> AsyncGenerator[str, None]:
        chat_messages = [
            ChatMessage.from_openai_dict_format(message) for message in messages
        ]

        return async_streaming_generator(
            pipeline=self.agent,
            pipeline_run_args={
                "messages": chat_messages,
            },
        )
As you can see, the run_chat_completion_async method is the one that will be used to run the agent. You can, of course, also implement the run_api or run_api_async methods if you need them.
The async_streaming_generator
function is a utility function that will handle the streaming of the agent's responses.
Hayhooks can easily handle uploaded files in your pipeline wrapper run_api
method by adding files: Optional[List[UploadFile]] = None
as an argument.
Here's a simple example:
def run_api(self, files: Optional[List[UploadFile]] = None) -> str:
    if files and len(files) > 0:
        filenames = [f.filename for f in files if f.filename is not None]
        file_contents = [f.file.read() for f in files]  # read the raw bytes of each uploaded file
        return f"Received files: {', '.join(filenames)}"
    return "No files received"
This will make Hayhooks automatically handle the file uploads (if they are present) and pass them to the run_api method.
This also means that the HTTP request needs to be a multipart/form-data
request.
Note that you can also handle both files and parameters in the same request, simply by adding them as arguments to the run_api method.
def run_api(self, files: Optional[List[UploadFile]] = None, additional_param: str = "default") -> str:
    ...
You can find a full example in the examples/rag_indexing_query folder.
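As a hedged sketch, such a multipart request could be sent with curl (the pipeline name is illustrative, and it assumes extra parameters are passed as form fields alongside the files field):

curl -X POST http://localhost:1416/my_indexing_pipeline/run \
  -F "files=@file1.pdf" \
  -F "files=@file2.pdf" \
  -F "additional_param=some value"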
You can run a pipeline by using the hayhooks pipeline run
command. Under the hood, this will call the run_api
method of the pipeline wrapper, passing parameters as the JSON body of the request.
This is convenient when you want to do a test run of the deployed pipeline from the CLI without having to write any code.
To run a pipeline from the CLI, you can use the following command:
hayhooks pipeline run <pipeline_name> --param 'question="is this recipe vegan?"'
You can also upload files when running a pipeline from the CLI. This is useful when the pipeline requires a file as input; in that case, the request will be a multipart/form-data request, and you can pass both files and parameters in the same request.
NOTE: To use this feature, you need to deploy a pipeline which is handling files (see Support file uploads and examples/rag_indexing_query for more details).
# Upload a whole directory
hayhooks pipeline run <pipeline_name> --dir files_to_index
# Upload a single file
hayhooks pipeline run <pipeline_name> --file file.pdf
# Upload multiple files
hayhooks pipeline run <pipeline_name> --dir files_to_index --file file1.pdf --file file2.pdf
# Upload a single file passing also a parameter
hayhooks pipeline run <pipeline_name> --file file.pdf --param 'question="is this recipe vegan?"'
NOTE: You'll need to run at least Python 3.10+ to use the MCP Server.
Hayhooks now supports the Model Context Protocol and can act as an MCP Server.
It will:
- Expose Core Tools to make it possible to control Hayhooks directly from an IDE like Cursor or any other MCP client.
- Expose the deployed Haystack pipelines as usable MCP Tools, using both Server-Sent Events (SSE) and (stateless) Streamable HTTP MCP transports.
(Note that SSE transport is deprecated and it's maintained only for backward compatibility).
To run the Hayhooks MCP Server, you can use the following command:
hayhooks mcp run
# Hint: check --help to see all the available options
This will start the Hayhooks MCP Server on HAYHOOKS_MCP_HOST:HAYHOOKS_MCP_PORT
.
An MCP Tool requires the following properties:
- name: The name of the tool.
- description: The description of the tool.
- inputSchema: A JSON Schema object describing the tool's input parameters.
For each deployed pipeline, Hayhooks will:
- Use the pipeline wrapper name as the MCP Tool name (always present).
- Parse the run_api method docstring:
  - If you use Google-style or reStructuredText-style docstrings, the first line is used as the MCP Tool description and the rest as parameters (if present).
  - Each parameter description will be used as the description of the corresponding Pydantic model field (if present; see the docstring sketch after the example below).
- Generate a Pydantic model for the inputSchema using the run_api method arguments as fields.
Here's an example of a PipelineWrapper implementation for the chat_with_website pipeline which can be used as an MCP Tool:
from pathlib import Path
from typing import List
from haystack import Pipeline
from hayhooks import BasePipelineWrapper


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        pipeline_yaml = (Path(__file__).parent / "chat_with_website.yml").read_text()
        self.pipeline = Pipeline.loads(pipeline_yaml)

    def run_api(self, urls: List[str], question: str) -> str:
        #
        # NOTE: The following docstring will be used as MCP Tool description
        #
        """
        Ask a question about one or more websites using a Haystack pipeline.
        """
        result = self.pipeline.run({"fetcher": {"urls": urls}, "prompt": {"query": question}})
        return result["llm"]["replies"][0]
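If you also want parameter descriptions, a Google-style docstring along these lines (a hedged sketch; the wording is illustrative) should be parsed into the descriptions of the corresponding Pydantic model fields:

def run_api(self, urls: List[str], question: str) -> str:
    """
    Ask a question about one or more websites using a Haystack pipeline.

    Args:
        urls: The URLs of the websites to fetch and analyze.
        question: The question to ask about the fetched content.
    """
    ...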
You can skip the MCP Tool listing by setting the skip_mcp
class attribute to True
in your PipelineWrapper
class.
This way, the pipeline will be deployed on Hayhooks but will not be listed as an MCP Tool when you run the hayhooks mcp run command.
class PipelineWrapper(BasePipelineWrapper):
    # This will skip the MCP Tool listing
    skip_mcp = True

    def setup(self) -> None:
        ...

    def run_api(self, urls: List[str], question: str) -> str:
        ...
As stated in Anthropic's documentation, Claude Desktop supports SSE and Streamable HTTP as MCP Transports only on "Claude.ai & Claude for Desktop for the Pro, Max, Teams, and Enterprise tiers".
If you are using the free tier, only STDIO transport is supported, so you need to use supergateway to connect to the Hayhooks MCP Server via SSE or Streamable HTTP.
After starting the Hayhooks MCP Server, open Settings → Developer in Claude Desktop and update the config file with the following examples:
Example configuration using the Streamable HTTP transport:
{
  "mcpServers": {
    "hayhooks": {
      "command": "npx",
      "args": [
        "-y",
        "supergateway",
        "--streamableHttp",
        "http://HAYHOOKS_MCP_HOST:HAYHOOKS_MCP_PORT/mcp"
      ]
    }
  }
}
Example configuration using the SSE transport:
{
  "mcpServers": {
    "hayhooks": {
      "command": "npx",
      "args": [
        "-y",
        "supergateway",
        "--sse",
        "http://HAYHOOKS_MCP_HOST:HAYHOOKS_MCP_PORT/sse"
      ]
    }
  }
}
Make sure Node.js is installed, as the npx
command depends on it.
Since the Hayhooks MCP Server provides a set of Core MCP Tools by default, you can interact with Hayhooks in an agentic manner from IDEs like Cursor or any other MCP client.
The exposed tools are:
- get_all_pipeline_statuses: Get the status of all pipelines and list available pipeline names.
- get_pipeline_status: Get the status of a specific pipeline. Requires pipeline_name as an argument.
- undeploy_pipeline: Undeploy a pipeline. Removes a pipeline from the registry, its API routes, and deletes its files. Requires pipeline_name as an argument.
- deploy_pipeline: Deploy a pipeline from files (pipeline_wrapper.py and other files). Requires name (pipeline name), files (list of file contents), save_files (boolean), and overwrite (boolean) as arguments.
From Cursor Settings -> MCP
, you can add a new MCP Server by specifying the following parameters (assuming you have Hayhooks MCP Server running on http://localhost:1417
with Streamable HTTP transport):
{
  "mcpServers": {
    "hayhooks": {
      "url": "http://localhost:1417/mcp"
    }
  }
}
Or if you need to use the SSE transport:
{
  "mcpServers": {
    "hayhooks": {
      "url": "http://localhost:1417/sse"
    }
  }
}
After adding the MCP Server, you should see the Hayhooks Core MCP Tools in the list of available tools:
Now in the Cursor chat interface you can use the Hayhooks Core MCP Tools by mentioning them in your messages.
Here's a video example of how to develop and deploy a Haystack pipeline directly from Cursor:
Since Hayhooks exposes an OpenAPI schema at /openapi.json, it can be used as an OpenAPI Tool Server.
open-webui has recently added support for OpenAPI Tool Servers, meaning that you can use the API endpoints of Hayhooks as tools in your chat interface.
You simply need to configure the OpenAPI Tool Server in the Settings -> Tools
section, adding the URL of the Hayhooks server and the path to the openapi.json
file:
Here's a video example of how to deploy a Haystack pipeline from the open-webui
chat interface:
Hayhooks can now automatically generate OpenAI-compatible endpoints if you implement the run_chat_completion method in your pipeline wrapper.
This will make Hayhooks compatible with fully-featured chat interfaces like open-webui, so you can use it as a backend for your chat interface.
Requirements:
- Ensure you have open-webui up and running (you can do it easily using docker; check their quick start guide).
- Ensure you have the Hayhooks server running somewhere. We will run it locally on http://localhost:1416.
First, you need to turn off tags
, title
and follow-up
generation from Admin settings -> Interface
:
This is needed to prevent open-webui from calling your deployed pipelines or agents to generate tags, titles, and follow-up messages (they may not be suited for this use case). Of course, if you want to use them, you can leave them enabled.
Then you have two options to connect Hayhooks as a backend.
Add a Direct Connection from Settings -> Connections
:
NOTE: Enter a random value as the API key, as it's not actually needed.
Alternatively, you can add an additional OpenAI API Connection from Admin settings -> Connections:
In this case too, remember to enter a random value as the API key.
To enable the automatic generation of OpenAI-compatible endpoints, you only need to implement the run_chat_completion method in your pipeline wrapper.
def run_chat_completion(self, model: str, messages: List[dict], body: dict) -> Union[str, Generator]:
    ...
Let's update the previous example to add an OpenAI-compatible chat completion method:
from pathlib import Path
from typing import Generator, List, Union
from haystack import Pipeline
from hayhooks import get_last_user_message, BasePipelineWrapper, log

URLS = ["https://haystack.deepset.ai", "https://www.redis.io", "https://ssi.inc"]


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        ...  # Same as before

    def run_api(self, urls: List[str], question: str) -> str:
        ...  # Same as before

    def run_chat_completion(self, model: str, messages: List[dict], body: dict) -> Union[str, Generator]:
        log.trace(f"Running pipeline with model: {model}, messages: {messages}, body: {body}")

        question = get_last_user_message(messages)
        log.trace(f"Question: {question}")

        # Plain pipeline run, will return a string
        result = self.pipeline.run({"fetcher": {"urls": URLS}, "prompt": {"query": question}})
        return result["llm"]["replies"][0]
Unlike the run_api method, run_chat_completion has a fixed signature and will be called with the arguments specified in the OpenAI-compatible endpoint:
- model: The name of the Haystack pipeline which is called.
- messages: The list of messages from the chat in the OpenAI format.
- body: The full body of the request.
Some notes:
- Since we only have the user messages as input here, the question is extracted from the last user message and the urls argument is hardcoded.
- In this example, the run_chat_completion method returns a string, so open-webui will receive a string as the response and show the pipeline output in the chat all at once.
- The body argument contains the full request body, which may be used to extract more information like the temperature or the max_tokens, as sketched below (see the OpenAI API reference for more information).
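As a hedged illustration (the key names follow the OpenAI chat completions format; the defaults and whether you forward these values to the pipeline are up to you):

def run_chat_completion(self, model: str, messages: List[dict], body: dict) -> Union[str, Generator]:
    question = get_last_user_message(messages)

    # Optional generation parameters sent by the client, if any
    temperature = body.get("temperature", 0.7)
    max_tokens = body.get("max_tokens", 512)
    log.trace(f"temperature={temperature}, max_tokens={max_tokens}")
    ...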
Finally, to use non-streaming responses in open-webui, you also need to turn off the Stream Chat Response chat setting.
Here's a video example:
The run_chat_completion_async method is the asynchronous version of run_chat_completion. It handles OpenAI-compatible chat completion requests asynchronously, which is particularly useful for streaming responses and high-concurrency scenarios.
from hayhooks import async_streaming_generator, get_last_user_message, log


async def run_chat_completion_async(self, model: str, messages: List[dict], body: dict) -> Union[str, AsyncGenerator]:
    log.trace(f"Running pipeline with model: {model}, messages: {messages}, body: {body}")

    question = get_last_user_message(messages)
    log.trace(f"Question: {question}")

    # For async streaming responses
    return async_streaming_generator(
        pipeline=self.pipeline,
        pipeline_run_args={"fetcher": {"urls": URLS}, "prompt": {"query": question}},
    )
Like run_chat_completion, this method has a fixed signature and will be called with the same arguments. The key differences are:
- It's declared as async and can use await for asynchronous operations.
- It can return an AsyncGenerator for streaming responses using async_streaming_generator.
- It provides better performance for concurrent chat requests.
- It's required when using async streaming with components that support async streaming callbacks.
NOTE: You can implement either run_chat_completion
, run_chat_completion_async
, or both. When both are implemented, Hayhooks will prefer the async version for better performance.
You can find complete working examples combining async chat completion with streaming in the async streaming test examples.
Hayhooks provides streaming_generator
and async_streaming_generator
utility functions that can be used to stream the pipeline output to the client.
Let's update the run_chat_completion
method of the previous example:
from pathlib import Path
from typing import Generator, List, Union
from haystack import Pipeline
from hayhooks import get_last_user_message, BasePipelineWrapper, log, streaming_generator

URLS = ["https://haystack.deepset.ai", "https://www.redis.io", "https://ssi.inc"]


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        ...  # Same as before

    def run_api(self, urls: List[str], question: str) -> str:
        ...  # Same as before

    def run_chat_completion(self, model: str, messages: List[dict], body: dict) -> Union[str, Generator]:
        log.trace(f"Running pipeline with model: {model}, messages: {messages}, body: {body}")

        question = get_last_user_message(messages)
        log.trace(f"Question: {question}")

        # Streaming pipeline run, will return a generator
        return streaming_generator(
            pipeline=self.pipeline,
            pipeline_run_args={"fetcher": {"urls": URLS}, "prompt": {"query": question}},
        )
Now, if you run the pipeline and call one of the following endpoints:
- {pipeline_name}/chat
- /chat/completions
- /v1/chat/completions
You will see the pipeline output being streamed in OpenAI-compatible format to the client and you'll be able to see the output in chunks.
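As a hedged sketch of a direct call (the pipeline name and prompt are illustrative; the request body follows the standard OpenAI chat completions format):

curl http://localhost:1416/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "chat_with_website",
    "messages": [{"role": "user", "content": "What is Haystack?"}],
    "stream": true
  }'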
Since output will be streamed to open-webui, there's no need to change the Stream Chat Response chat setting (leave it as Default or On).
You can find a complete working example of streaming_generator
usage in the examples/pipeline_wrappers/chat_with_website_streaming directory.
Here's a video example:
For asynchronous pipelines or agents, Hayhooks also provides an async_streaming_generator
utility function:
from pathlib import Path
from typing import AsyncGenerator, List, Union
from haystack import AsyncPipeline
from hayhooks import get_last_user_message, BasePipelineWrapper, log, async_streaming_generator

URLS = ["https://haystack.deepset.ai", "https://www.redis.io", "https://ssi.inc"]


class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        pipeline_yaml = (Path(__file__).parent / "chat_with_website.yml").read_text()
        self.pipeline = AsyncPipeline.loads(pipeline_yaml)  # Note: AsyncPipeline

    async def run_chat_completion_async(self, model: str, messages: List[dict], body: dict) -> AsyncGenerator:
        log.trace(f"Running pipeline with model: {model}, messages: {messages}, body: {body}")

        question = get_last_user_message(messages)
        log.trace(f"Question: {question}")

        # Async streaming pipeline run, will return an async generator
        return async_streaming_generator(
            pipeline=self.pipeline,
            pipeline_run_args={"fetcher": {"urls": URLS}, "prompt": {"query": question}},
        )
The async_streaming_generator function:
- Works with both Pipeline and AsyncPipeline instances
- Requires components that support async streaming callbacks (e.g., OpenAIChatGenerator instead of OpenAIGenerator)
- Provides better performance for concurrent streaming requests
- Returns an AsyncGenerator that yields chunks asynchronously
- Automatically handles async pipeline execution and cleanup
NOTE: The streaming component in your pipeline must support async streaming callbacks. If you get an error about async streaming support, either use the sync streaming_generator
or switch to async-compatible components.
Since Hayhooks is OpenAI-compatible, it can be used as a backend for the Haystack OpenAIChatGenerator.
Assuming you have a Haystack pipeline named chat_with_website_streaming
and you have deployed it using Hayhooks, here's an example script of how to use it with the OpenAIChatGenerator
:
from haystack.components.generators.chat.openai import OpenAIChatGenerator
from haystack.utils import Secret
from haystack.dataclasses import ChatMessage
from haystack.components.generators.utils import print_streaming_chunk
client = OpenAIChatGenerator(
    model="chat_with_website_streaming",
    api_key=Secret.from_token("not-relevant"),  # This is not used, you can set it to anything
    api_base_url="http://localhost:1416/v1/",
    streaming_callback=print_streaming_chunk,
)

client.run([ChatMessage.from_user("Where are the offices of SSI?")])
# > The offices of Safe Superintelligence Inc. (SSI) are located in Palo Alto, California, and Tel Aviv, Israel.
# > {'replies': [ChatMessage(_role=<ChatRole.ASSISTANT: 'assistant'>, _content=[TextContent(text='The offices of Safe Superintelligence Inc. (SSI) are located in Palo Alto, California, and Tel Aviv, Israel.')], _name=None, _meta={'model': 'chat_with_website_streaming', 'index': 0, 'finish_reason': 'stop', 'completion_start_time': '2025-02-11T15:31:44.599726', 'usage': {}})]}
Hayhooks provides support for some open-webui events to enhance the user experience.
The idea is to send events to the client before, during, or after the pipeline run.
You can use those events to:
- 🔄 Show a loading spinner
- 💬 Update the chat messages
- 🍞 Show a toast notification
You can find a complete example in the examples/pipeline_wrappers/open_webui_agent_events folder.
Here's a preview:
When using open-webui
and streaming responses, both streaming_generator
and async_streaming_generator
provide hooks to intercept tool calls.
The hooks (parameters of streaming_generator
and async_streaming_generator
) are:
- on_tool_call_start: Called when a tool call starts. It receives the following arguments:
  - tool_name: The name of the tool that is being called.
  - arguments: The arguments passed to the tool.
  - id: The id of the tool call.
- on_tool_call_end: Called when a tool call ends. It receives the following arguments:
  - tool_name: The name of the tool that is being called.
  - arguments: The arguments passed to the tool.
  - result: The result of the tool call.
  - error: Whether the tool call ended with an error.
You can find a complete example in the examples/pipeline_wrappers/open_webui_agent_on_tool_calls folder.
Here's a preview:
A Hayhooks app instance can be programmatically created by using the create_app
function. This is useful if you want to add custom routes or middleware to Hayhooks.
Here's an example script:
import uvicorn
from hayhooks.settings import settings
from fastapi import Request
from hayhooks import create_app

# Create the Hayhooks app
hayhooks = create_app()


# Add a custom route
@hayhooks.get("/custom")
async def custom_route():
    return {"message": "Hi, this is a custom route!"}


# Add a custom middleware
@hayhooks.middleware("http")
async def custom_middleware(request: Request, call_next):
    response = await call_next(request)
    response.headers["X-Custom-Header"] = "custom-header-value"
    return response


if __name__ == "__main__":
    uvicorn.run("app:hayhooks", host=settings.host, port=settings.port)
Hayhooks allows you to use your own custom code in your pipeline wrappers by adding a specific path to the Hayhooks Python path.
You can do this in three ways:
- Set the HAYHOOKS_ADDITIONAL_PYTHON_PATH environment variable to the path of the folder containing your custom code.
- Add HAYHOOKS_ADDITIONAL_PYTHON_PATH to the .env file.
- Use the --additional-python-path flag when launching Hayhooks.
For example, if you have a folder called common
with a my_custom_lib.py
module which contains the my_function
function, you can deploy your pipelines by using the following command:
export HAYHOOKS_ADDITIONAL_PYTHON_PATH='./common'
hayhooks run
Then you can use the custom code in your pipeline wrappers by importing it like this:
from my_custom_lib import my_function
Note that you can use both absolute and relative paths (relative to the current working directory).
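A minimal sketch of how the pieces fit together (the function body and the wrapper that uses it are illustrative assumptions):

# common/my_custom_lib.py
def my_function(text: str) -> str:
    # Shared helper logic reused across pipeline wrappers
    return text.strip().lower()

# pipeline_wrapper.py
from my_custom_lib import my_function
from hayhooks import BasePipelineWrapper

class PipelineWrapper(BasePipelineWrapper):
    def setup(self) -> None:
        ...

    def run_api(self, text: str) -> str:
        return my_function(text)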
You can check out a complete example in the examples/shared_code_between_wrappers folder.
We have some dedicated documentation for deployment:
- Docker-based deployments: https://docs.haystack.deepset.ai/docs/docker
- Kubernetes-based deployments: https://docs.haystack.deepset.ai/docs/kubernetes
We also have some additional deployment guidelines, see deployment_guidelines.md.
We still support the former way of deploying a pipeline.
The former hayhooks deploy command has been renamed to hayhooks pipeline deploy and can be used to deploy a pipeline from a YAML definition file only.
For example:
hayhooks pipeline deploy -n chat_with_website examples/pipeline_wrappers/chat_with_website/chat_with_website.yml
This will deploy the pipeline with the name chat_with_website from the YAML definition file examples/pipeline_wrappers/chat_with_website/chat_with_website.yml. You can then check the generated docs at http://HAYHOOKS_HOST:HAYHOOKS_PORT/docs or http://HAYHOOKS_HOST:HAYHOOKS_PORT/redoc, looking at the POST /chat_with_website endpoint.
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Similar Open Source Tools


hyper-mcp
hyper-mcp is a fast and secure MCP server that enables adding AI capabilities to applications through WebAssembly plugins. It supports writing plugins in various languages, distributing them via standard OCI registries, and running them in resource-constrained environments. The tool offers sandboxing with WASM for limiting access, cross-platform compatibility, and deployment flexibility. Security features include sandboxed plugins, memory-safe execution, secure plugin distribution, and fine-grained access control. Users can configure the tool for global or project-specific use, start the server with different transport options, and utilize available plugins for tasks like time calculations, QR code generation, hash generation, IP retrieval, and webpage fetching.

aiounifi
Aiounifi is a Python library that provides a simple interface for interacting with the Unifi Controller API. It allows users to easily manage their Unifi network devices, such as access points, switches, and gateways, through automated scripts or applications. With Aiounifi, users can retrieve device information, perform configuration changes, monitor network performance, and more, all through a convenient and efficient API wrapper. This library simplifies the process of integrating Unifi network management into custom solutions, making it ideal for network administrators, developers, and enthusiasts looking to automate and streamline their network operations.

aigne-hub
AIGNE Hub is a unified AI gateway that manages connections to multiple LLM and AIGC providers, eliminating the complexity of handling API keys, usage tracking, and billing across different AI services. It provides self-hosting capabilities, multi-provider management, unified security, usage analytics, flexible billing, and seamless integration with the AIGNE framework. The tool supports various AI providers and deployment scenarios, catering to both enterprise self-hosting and service provider modes. Users can easily deploy and configure AI providers, enable billing, and utilize core capabilities such as chat completions, image generation, embeddings, and RESTful APIs. AIGNE Hub ensures secure access, encrypted API key management, user permissions, and audit logging. Built with modern technologies like AIGNE Framework, Node.js, TypeScript, React, SQLite, and Blocklet for cloud-native deployment.

CodeWebChat
Code Web Chat is a versatile, free, and open-source AI pair programming tool with a unique web-based workflow. Users can select files, type instructions, and initialize various chatbots like ChatGPT, Gemini, Claude, and more hands-free. The tool helps users save money with free tiers and subscription-based billing and save time with multi-file edits from a single prompt. It supports chatbot initialization through the Connector browser extension and offers API tools for code completions, editing context, intelligent updates, and commit messages. Users can handle AI responses, code completions, and version control through various commands. The tool is privacy-focused, operates locally, and supports any OpenAI-API compatible provider for its utilities.

llms
LLMs is a universal LLM API transformation server designed to standardize requests and responses between different LLM providers such as Anthropic, Gemini, and Deepseek. It uses a modular transformer system to handle provider-specific API formats, supporting real-time streaming responses and converting data into standardized formats. The server transforms requests and responses to and from unified formats, enabling seamless communication between various LLM providers.

nexus
Nexus is a tool that acts as a unified gateway for multiple LLM providers and MCP servers. It allows users to aggregate, govern, and control their AI stack by connecting multiple servers and providers through a single endpoint. Nexus provides features like MCP Server Aggregation, LLM Provider Routing, Context-Aware Tool Search, Protocol Support, Flexible Configuration, Security features, Rate Limiting, and Docker readiness. It supports tool calling, tool discovery, and error handling for STDIO servers. Nexus also integrates with AI assistants, Cursor, Claude Code, and LangChain for seamless usage.

nndeploy
nndeploy is a tool that allows you to quickly build your visual AI workflow without the need for frontend technology. It provides ready-to-use algorithm nodes for non-AI programmers, including large language models, Stable Diffusion, object detection, image segmentation, etc. The workflow can be exported as a JSON configuration file, supporting Python/C++ API for direct loading and running, deployment on cloud servers, desktops, mobile devices, edge devices, and more. The framework includes mainstream high-performance inference engines and deep optimization strategies to help you transform your workflow into enterprise-level production applications.

aide
Aide is a code-first API documentation and utility library for Rust, along with other related utility crates for web-servers. It provides tools for creating API documentation and handling JSON request validation. The repository contains multiple crates that offer drop-in replacements for existing libraries, ensuring compatibility with Aide. Contributions are welcome, and the code is dual licensed under MIT and Apache-2.0. If Aide does not meet your requirements, you can explore similar libraries like paperclip, utoipa, and okapi.

mcp-server-mysql
The MCP Server for MySQL based on NodeJS is a Model Context Protocol server that provides access to MySQL databases. It enables users to inspect database schemas and execute SQL queries. The server offers tools for executing SQL queries, providing comprehensive database information, security features like SQL injection prevention, performance optimizations, monitoring, and debugging capabilities. Users can configure the server using environment variables and advanced options. The server supports multi-DB mode, schema-specific permissions, and includes troubleshooting guidelines for common issues. Contributions are welcome, and the project roadmap includes enhancing query capabilities, security features, performance optimizations, monitoring, and expanding schema information.

fastapi_mcp
FastAPI-MCP is a zero-configuration tool that automatically exposes FastAPI endpoints as Model Context Protocol (MCP) tools. It allows for direct integration with FastAPI apps, automatic discovery and conversion of endpoints to MCP tools, preservation of request and response schemas, documentation preservation similar to Swagger, and the ability to extend with custom MCP tools. Users can easily add an MCP server to their FastAPI application and customize the server creation and configuration. The tool supports connecting to the MCP server using SSE or mcp-proxy stdio for different MCP clients. FastAPI-MCP is developed and maintained by Tadata Inc.

nvim-aider
Nvim-aider is a plugin for Neovim that provides additional functionality and key mappings to enhance the user's editing experience. It offers features such as code navigation, quick access to commonly used commands, and improved text manipulation tools. With Nvim-aider, users can streamline their workflow and increase productivity while working with Neovim.

mcp-fundamentals
The mcp-fundamentals repository is a collection of fundamental concepts and examples related to microservices, cloud computing, and DevOps. It covers topics such as containerization, orchestration, CI/CD pipelines, and infrastructure as code. The repository provides hands-on exercises and code samples to help users understand and apply these concepts in real-world scenarios. Whether you are a beginner looking to learn the basics or an experienced professional seeking to refresh your knowledge, mcp-fundamentals has something for everyone.

baibot
Baibot is a versatile chatbot framework designed to simplify the process of creating and deploying chatbots. It provides a user-friendly interface for building custom chatbots with various functionalities such as natural language processing, conversation flow management, and integration with external APIs. Baibot is highly customizable and can be easily extended to suit different use cases and industries. With Baibot, developers can quickly create intelligent chatbots that can interact with users in a seamless and engaging manner, enhancing user experience and automating customer support processes.

dexto
Dexto is a lightweight runtime for creating and running AI agents that turn natural language into real-world actions. It serves as the missing intelligence layer for building AI applications, standalone chatbots, or as the reasoning engine inside larger products. Dexto features a powerful CLI and Web UI for running AI agents, supports multiple interfaces, allows hot-swapping of LLMs from various providers, connects to remote tool servers via the Model Context Protocol, is config-driven with version-controlled YAML, offers production-ready core features, extensibility for custom services, and enables multi-agent collaboration via MCP and A2A.

batteries-included
Batteries Included is an all-in-one platform for building and running modern applications, simplifying cloud infrastructure complexity. It offers production-ready capabilities through an intuitive interface, focusing on automation, security, and enterprise-grade features. The platform includes databases like PostgreSQL and Redis, AI/ML capabilities with Jupyter notebooks, web services deployment, security features like SSL/TLS management, and monitoring tools like Grafana dashboards. Batteries Included is designed to streamline infrastructure setup and management, allowing users to concentrate on application development without dealing with complex configurations.
For similar tasks

trickPrompt-engine
This repository contains a vulnerability mining engine based on GPT technology. The engine is designed to identify logic vulnerabilities in code by utilizing task-driven prompts. It does not require prior knowledge or fine-tuning and focuses on prompt design rather than model design. The tool is effective in real-world projects and should not be used for academic vulnerability testing. It supports scanning projects in various languages, with current support for Solidity. The engine is configured through prompts and environment settings, enabling users to scan for vulnerabilities in their codebase. Future updates aim to optimize code structure, add more language support, and enhance usability through command line mode. The tool has received a significant audit bounty of $50,000+ as of May 2024.

MachineSoM
MachineSoM is a code repository for the paper 'Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View'. It focuses on the emergence of intelligence from collaborative and communicative computational modules, enabling effective completion of complex tasks. The repository includes code for societies of LLM agents with different traits, collaboration processes such as debate and self-reflection, and interaction strategies for determining when and with whom to interact. It provides a coding framework compatible with various inference services like Replicate, OpenAI, Dashscope, and Anyscale, supporting models like Qwen and GPT. Users can run experiments, evaluate results, and draw figures based on the paper's content, with available datasets for MMLU, Math, and Chess Move Validity.

comfyui
ComfyUI is a highly-configurable, cloud-first AI-Dock container that allows users to run ComfyUI without bundled models or third-party configurations. Users can configure the container using provisioning scripts. The Docker image supports NVIDIA CUDA, AMD ROCm, and CPU platforms, with version tags for different configurations. Additional environment variables and Python environments are provided for customization. ComfyUI service runs on port 8188 and can be managed using supervisorctl. The tool also includes an API wrapper service and pre-configured templates for Vast.ai. The author may receive compensation for services linked in the documentation.

pyrfuniverse
pyrfuniverse is a python package used to interact with RFUniverse simulation environment. It is developed with reference to ML-Agents and produce new features. The package allows users to work with RFUniverse for simulation purposes, providing tools and functionalities to interact with the environment and create new features.

intentkit
IntentKit is an autonomous agent framework that enables the creation and management of AI agents with capabilities including blockchain interactions, social media management, and custom skill integration. It supports multiple agents, autonomous agent management, blockchain integration, social media integration, extensible skill system, and plugin system. The project is in alpha stage and not recommended for production use. It provides quick start guides for Docker and local development, integrations with Twitter and Coinbase, configuration options using environment variables or AWS Secrets Manager, project structure with core application code, entry points, configuration management, database models, skills, skill sets, and utility functions. Developers can add new skills by creating, implementing, and registering them in the skill directory.

pear-landing-page
PearAI Landing Page is an open-source AI-powered code editor managed by Nang and Pan. It is built with Next.js, Vercel, Tailwind CSS, and TypeScript. The project requires setting up environment variables for proper configuration. Users can run the project locally by starting the development server and visiting the specified URL in the browser. Recommended extensions include Prettier, ESLint, and JavaScript and TypeScript Nightly. Contributions to the project are welcomed and appreciated.

webapp-starter
webapp-starter is a modern full-stack application template built with Turborepo, featuring a Hono + Bun API backend and Next.js frontend. It provides an easy way to build a SaaS product. The backend utilizes technologies like Bun, Drizzle ORM, and Supabase, while the frontend is built with Next.js, Tailwind CSS, Shadcn/ui, and Clerk. Deployment can be done using Vercel and Render. The project structure includes separate directories for API backend and Next.js frontend, along with shared packages for the main database. Setup involves installing dependencies, configuring environment variables, and setting up services like Bun, Supabase, and Clerk. Development can be done using 'turbo dev' command, and deployment instructions are provided for Vercel and Render. Contributions are welcome through pull requests.

For similar jobs

sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.

teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.

ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.

classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.

chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.

BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students

uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.

griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.