
Pixelle-MCP
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai
Stars: 495

Pixelle-MCP is a multi-channel publishing tool designed to streamline the process of publishing content across various social media platforms. It allows users to create, schedule, and publish posts simultaneously on platforms such as Facebook, Twitter, and Instagram. With a user-friendly interface and advanced scheduling features, Pixelle-MCP helps users save time and effort in managing their social media presence. The tool also provides analytics and insights to track the performance of posts and optimize content strategy. Whether you are a social media manager, content creator, or digital marketer, Pixelle-MCP is a valuable tool to enhance your online presence and engage with your audience effectively.
README:
English | δΈζ
β¨ An AIGC solution based on the MCP protocol, seamlessly converting ComfyUI workflows into MCP tools with zero code, empowering LLM and ComfyUI integration.
https://github.com/user-attachments/assets/65422cef-96f9-44fe-a82b-6a124674c417
- β 2025-09-03: Architecture refactoring from three services to unified application; added CLI tool support; published to PyPI
- β 2025-08-12: Integrated the LiteLLM framework, adding multi-model support for Gemini, DeepSeek, Claude, Qwen, and more
- β π Full-modal Support: Supports TISV (Text, Image, Sound/Speech, Video) full-modal conversion and generation
- β π§© ComfyUI Ecosystem: Built on ComfyUI, inheriting all capabilities from the open ComfyUI ecosystem
- β π§ Zero-code Development: Defines and implements the Workflow-as-MCP Tool solution, enabling zero-code development and dynamic addition of new MCP Tools
- β ποΈ MCP Server: Based on the MCP protocol, supporting integration with any MCP client (including but not limited to Cursor, Claude Desktop, etc.)
- β π Web Interface: Developed based on the Chainlit framework, inheriting Chainlit's UI controls and supporting integration with more MCP Servers
- β π¦ One-click Deployment: Supports PyPI installation, CLI commands, Docker and other deployment methods, ready to use out of the box
- β βοΈ Simplified Configuration: Uses environment variable configuration scheme, simple and intuitive configuration
- β π€ Multi-LLM Support: Supports multiple mainstream LLMs, including OpenAI, Ollama, Gemini, DeepSeek, Claude, Qwen, and more
Pixelle MCP adopts a unified architecture design, integrating MCP server, web interface, and file services into one application, providing:
- π Web Interface: Chainlit-based chat interface supporting multimodal interaction
- π MCP Endpoint: For external MCP clients (such as Cursor, Claude Desktop) to connect
- π File Service: Handles file upload, download, and storage
- π οΈ Workflow Engine: Automatically converts ComfyUI workflows into MCP tools
Choose the deployment method that best suits your needs, from simple to complex:
π‘ Zero configuration startup, perfect for quick experience and testing
# Start with one command, no system installation required
uvx pixelle@latest
π View uvx CLI Reference β
# Install to system
pip install -U pixelle
# Start service
pixelle
π View pip CLI Reference β
After startup, it will automatically enter the configuration wizard to guide you through ComfyUI connection and LLM configuration.
π‘ Supports custom workflows and secondary development
git clone https://github.com/AIDC-AI/Pixelle-MCP.git
cd Pixelle-MCP
# Interactive mode (recommended)
uv run pixelle
π View Complete CLI Reference β
# Copy example workflows to data directory (run this in your desired project directory)
cp -r workflows/* ./data/custom_workflows/
π‘ Suitable for production environments and containerized deployment
git clone https://github.com/AIDC-AI/Pixelle-MCP.git
cd Pixelle-MCP
# Create environment configuration file
cp .env.example .env
# Edit .env file to configure your ComfyUI address and LLM settings
# Start all services in background
docker compose up -d
# View logs
docker compose logs -f
Regardless of which method you use, after startup you can access via:
-
π Web Interface: http://localhost:9004
Default username and password are bothdev
, can be modified after startup -
π MCP Endpoint: http://localhost:9004/pixelle/mcp
For MCP clients like Cursor, Claude Desktop to connect
π‘ Port Configuration: Default port is 9004, can be customized via environment variable PORT=your_port
.
On first startup, the system will automatically detect configuration status:
-
π§ ComfyUI Connection: Ensure ComfyUI service is running at
http://localhost:8188
- π€ LLM Configuration: Configure at least one LLM provider (OpenAI, Ollama, etc.)
- π Workflow Directory: System will automatically create necessary directory structure
π Need Help? Join community groups for support (see Community section below)
β‘ One workflow = One MCP Tool
-
π Build a workflow in ComfyUI for image Gaussian blur (Get it here), then set the
LoadImage
node's title to$image.image!
as shown below: -
π€ Export it as an API format file and rename it to
i_blur.json
. You can export it yourself or use our pre-exported version (Get it here) -
π Copy the exported API workflow file (must be API format), input it on the web page, and let the LLM add this Tool
-
β¨ After sending, the LLM will automatically convert this workflow into an MCP Tool
-
π¨ Now, refresh the page and send any image to perform Gaussian blur processing via LLM
The steps are the same as above, only the workflow part differs (Download workflow: UI format and API format)
The system supports ComfyUI workflows. Just design your workflow in the canvas and export it as API format. Use special syntax in node titles to define parameters and outputs.
In the ComfyUI canvas, double-click the node title to edit, and use the following DSL syntax to define parameters:
$<param_name>.[~]<field_name>[!][:<description>]
-
param_name
: The parameter name for the generated MCP tool function -
~
: Optional, indicates URL parameter upload processing, returns relative path -
field_name
: The corresponding input field in the node -
!
: Indicates this parameter is required -
description
: Description of the parameter
Required parameter example:
- Set LoadImage node title to:
$image.image!:Input image URL
- Meaning: Creates a required parameter named
image
, mapped to the node'simage
field
URL upload processing example:
- Set any node title to:
$image.~image!:Input image URL
- Meaning: Creates a required parameter named
image
, system will automatically download URL and upload to ComfyUI, returns relative path
π Note:
LoadImage
,VHS_LoadAudioUpload
,VHS_LoadVideo
and other nodes have built-in functionality, no need to add~
marker
Optional parameter example:
- Set EmptyLatentImage node title to:
$width.width:Image width, default 512
- Meaning: Creates an optional parameter named
width
, mapped to the node'swidth
field, default value is 512
The system automatically infers parameter types based on the current value of the node field:
- π’
int
: Integer values (e.g. 512, 1024) - π
float
: Floating-point values (e.g. 1.5, 3.14) - β
bool
: Boolean values (e.g. true, false) - π
str
: String values (default type)
The system will automatically detect the following common output nodes:
- πΌοΈ
SaveImage
- Image save node - π¬
SaveVideo
- Video save node - π
SaveAudio
- Audio save node - πΉ
VHS_SaveVideo
- VHS video save node - π΅
VHS_SaveAudio
- VHS audio save node
Usually used for multiple outputs Use
$output.var_name
in any node title to mark output:
- Set node title to:
$output.result
- The system will use this node's output as the tool's return value
You can add a node titled MCP
in the workflow to provide a tool description:
- Add a
String (Multiline)
or similar text node (must have a single string property, and the node field should be one of: value, text, string) - Set the node title to:
MCP
- Enter a detailed tool description in the value field
- π Parameter Validation: Optional parameters (without !) must have default values set in the node
- π Node Connections: Fields already connected to other nodes will not be parsed as parameters
- π·οΈ Tool Naming: Exported file name will be used as the tool name, use meaningful English names
- π Detailed Descriptions: Provide detailed parameter descriptions for better user experience
- π― Export Format: Must export as API format, do not export as UI format
Scan the QR codes below to join our communities for latest updates and technical support:
Discord Community | WeChat Group |
---|---|
![]() |
![]() |
We welcome all forms of contribution! Whether you're a developer, designer, or user, you can participate in the project in the following ways:
- π Submit bug reports on the Issues page
- π Please search for similar issues before submitting
- π Describe the reproduction steps and environment in detail
- π Submit feature requests in Issues
- π Describe the feature you want and its use case
- π― Explain how it improves user experience
- π΄ Fork this repo to your GitHub account
- πΏ Create a feature branch:
git checkout -b feature/your-feature-name
- π» Develop and add corresponding tests
- π Commit changes:
git commit -m "feat: add your feature"
- π€ Push to your repo:
git push origin feature/your-feature-name
- π Create a Pull Request to the main repo
- π Python code follows PEP 8 style guide
- π Add appropriate documentation and comments for new features
- π¦ Share your ComfyUI workflows with the community
- π οΈ Submit tested workflow files
- π Add usage instructions and examples for workflows
β€οΈ Sincere thanks to the following organizations, projects, and teams for supporting the development and implementation of this project.
This project is released under the MIT License (LICENSE, SPDX-License-identifier: MIT).
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for Pixelle-MCP
Similar Open Source Tools

Pixelle-MCP
Pixelle-MCP is a multi-channel publishing tool designed to streamline the process of publishing content across various social media platforms. It allows users to create, schedule, and publish posts simultaneously on platforms such as Facebook, Twitter, and Instagram. With a user-friendly interface and advanced scheduling features, Pixelle-MCP helps users save time and effort in managing their social media presence. The tool also provides analytics and insights to track the performance of posts and optimize content strategy. Whether you are a social media manager, content creator, or digital marketer, Pixelle-MCP is a valuable tool to enhance your online presence and engage with your audience effectively.

comfyui-web-viewer
The ComfyUI Web Viewer by vrch.ai is a real-time AI-generated interactive art framework that integrates realtime streaming into ComfyUI workflows. It supports keyboard control nodes, OSC control nodes, sound input nodes, and more, accessible from any device with a web browser. It enables real-time interaction with AI-generated content, ideal for interactive visual projects and enhancing ComfyUI workflows with efficient content management and display.

recommendarr
Recommendarr is a tool that generates personalized TV show and movie recommendations based on your Sonarr, Radarr, Plex, and Jellyfin libraries using AI. It offers AI-powered recommendations, media server integration, flexible AI support, watch history analysis, customization options, and dark/light mode toggle. Users can connect their media libraries and watch history services, configure AI service settings, and get personalized recommendations based on genre, language, and mood/vibe preferences. The tool works with any OpenAI-compatible API and offers various recommended models for different cost options and performance levels. It provides personalized suggestions, detailed information, filter options, watch history analysis, and one-click adding of recommended content to Sonarr/Radarr.

web-ui
WebUI is a user-friendly tool built on Gradio that enhances website accessibility for AI agents. It supports various Large Language Models (LLMs) and allows custom browser integration for seamless interaction. The tool eliminates the need for re-login and authentication challenges, offering high-definition screen recording capabilities.

action_mcp
Action MCP is a powerful tool for managing and automating your cloud infrastructure. It provides a user-friendly interface to easily create, update, and delete resources on popular cloud platforms. With Action MCP, you can streamline your deployment process, reduce manual errors, and improve overall efficiency. The tool supports various cloud providers and offers a wide range of features to meet your infrastructure management needs. Whether you are a developer, system administrator, or DevOps engineer, Action MCP can help you simplify and optimize your cloud operations.

CodeRAG
CodeRAG is an AI-powered code retrieval and assistance tool that combines Retrieval-Augmented Generation (RAG) with AI to provide intelligent coding assistance. It indexes your entire codebase for contextual suggestions based on your complete project, offering real-time indexing, semantic code search, and contextual AI responses. The tool monitors your code directory, generates embeddings for Python files, stores them in a FAISS vector database, matches user queries against the code database, and sends retrieved code context to GPT models for intelligent responses. CodeRAG also features a Streamlit web interface with a chat-like experience for easy usage.

aigne-doc-smith
AIGNE DocSmith is a powerful AI-driven documentation generation tool that automates the creation of detailed, structured, and multi-language documentation directly from source code. It intelligently analyzes codebase to generate a comprehensive document structure, populates content with high-quality AI-powered generation, supports seamless translation into 12+ languages, integrates with AIGNE Hub for large language models, offers Discuss Kit publishing, automatically updates documentation with source code changes, and allows for individual document optimization.

eliza
Eliza is a versatile AI agent operating system designed to support various models and connectors, enabling users to create chatbots, autonomous agents, handle business processes, create video game NPCs, and engage in trading. It offers multi-agent and room support, document ingestion and interaction, retrievable memory and document store, and extensibility to create custom actions and clients. Eliza is easy to use and provides a comprehensive solution for AI agent development.

aiaio
aiaio (AI-AI-O) is a lightweight, privacy-focused web UI for interacting with AI models. It supports both local and remote LLM deployments through OpenAI-compatible APIs. The tool provides features such as dark/light mode support, local SQLite database for conversation storage, file upload and processing, configurable model parameters through UI, privacy-focused design, responsive design for mobile/desktop, syntax highlighting for code blocks, real-time conversation updates, automatic conversation summarization, customizable system prompts, WebSocket support for real-time updates, Docker support for deployment, multiple API endpoint support, and multiple system prompt support. Users can configure model parameters and API settings through the UI, handle file uploads, manage conversations, and use keyboard shortcuts for efficient interaction. The tool uses SQLite for storage with tables for conversations, messages, attachments, and settings. Contributions to the project are welcome under the Apache License 2.0.

search_with_ai
Build your own conversation-based search with AI, a simple implementation with Node.js & Vue3. Live Demo Features: * Built-in support for LLM: OpenAI, Google, Lepton, Ollama(Free) * Built-in support for search engine: Bing, Sogou, Google, SearXNG(Free) * Customizable pretty UI interface * Support dark mode * Support mobile display * Support local LLM with Ollama * Support i18n * Support Continue Q&A with contexts.

ps-fuzz
The Prompt Fuzzer is an open-source tool that helps you assess the security of your GenAI application's system prompt against various dynamic LLM-based attacks. It provides a security evaluation based on the outcome of these attack simulations, enabling you to strengthen your system prompt as needed. The Prompt Fuzzer dynamically tailors its tests to your application's unique configuration and domain. The Fuzzer also includes a Playground chat interface, giving you the chance to iteratively improve your system prompt, hardening it against a wide spectrum of generative AI attacks.

DesktopCommanderMCP
Desktop Commander MCP is a server that allows the Claude desktop app to execute long-running terminal commands on your computer and manage processes through Model Context Protocol (MCP). It is built on top of MCP Filesystem Server to provide additional search and replace file editing capabilities. The tool enables users to execute terminal commands with output streaming, manage processes, perform full filesystem operations, and edit code with surgical text replacements or full file rewrites. It also supports vscode-ripgrep based recursive code or text search in folders.

better-chatbot
Better Chatbot is an open-source AI chatbot designed for individuals and teams, inspired by various AI models. It integrates major LLMs, offers powerful tools like MCP protocol and data visualization, supports automation with custom agents and visual workflows, enables collaboration by sharing configurations, provides a voice assistant feature, and ensures an intuitive user experience. The platform is built with Vercel AI SDK and Next.js, combining leading AI services into one platform for enhanced chatbot capabilities.

xGitGuard
xGitGuard is an AI-based system developed by Comcast Cybersecurity Research and Development team to detect secrets (e.g., API tokens, usernames, passwords) exposed on GitHub repositories. It uses advanced Natural Language Processing to detect secrets at scale and with appropriate velocity. The tool provides workflows for detecting credentials and keys/tokens in both enterprise and public GitHub accounts. Users can set up search patterns, configure API access, run detections with or without ML filters, and train ML models for improved detection accuracy. xGitGuard also supports custom keyword scans for targeted organizations or repositories. The tool is licensed under Apache 2.0.

Groqqle
Groqqle 2.1 is a revolutionary, free AI web search and API that instantly returns ORIGINAL content derived from source articles, websites, videos, and even foreign language sources, for ANY target market of ANY reading comprehension level! It combines the power of large language models with advanced web and news search capabilities, offering a user-friendly web interface, a robust API, and now a powerful Groqqle_web_tool for seamless integration into your projects. Developers can instantly incorporate Groqqle into their applications, providing a powerful tool for content generation, research, and analysis across various domains and languages.

next-money
Next Money Stripe Starter is a SaaS Starter project that empowers your next project with a stack of Next.js, Prisma, Supabase, Clerk Auth, Resend, React Email, Shadcn/ui, and Stripe. It seamlessly integrates these technologies to accelerate your development and SaaS journey. The project includes frameworks, platforms, UI components, hooks and utilities, code quality tools, and miscellaneous features to enhance the development experience. Created by @koyaguo in 2023 and released under the MIT license.
For similar tasks

intro-llm-rag
This repository serves as a comprehensive guide for technical teams interested in developing conversational AI solutions using Retrieval-Augmented Generation (RAG) techniques. It covers theoretical knowledge and practical code implementations, making it suitable for individuals with a basic technical background. The content includes information on large language models (LLMs), transformers, prompt engineering, embeddings, vector stores, and various other key concepts related to conversational AI. The repository also provides hands-on examples for two different use cases, along with implementation details and performance analysis.

LLM-Viewer
LLM-Viewer is a tool for visualizing Language and Learning Models (LLMs) and analyzing performance on different hardware platforms. It enables network-wise analysis, considering factors such as peak memory consumption and total inference time cost. With LLM-Viewer, users can gain valuable insights into LLM inference and performance optimization. The tool can be used in a web browser or as a command line interface (CLI) for easy configuration and visualization. The ongoing project aims to enhance features like showing tensor shapes, expanding hardware platform compatibility, and supporting more LLMs with manual model graph configuration.

llm-colosseum
llm-colosseum is a tool designed to evaluate Language Model Models (LLMs) in real-time by making them fight each other in Street Fighter III. The tool assesses LLMs based on speed, strategic thinking, adaptability, out-of-the-box thinking, and resilience. It provides a benchmark for LLMs to understand their environment and take context-based actions. Users can analyze the performance of different LLMs through ELO rankings and win rate matrices. The tool allows users to run experiments, test different LLM models, and customize prompts for LLM interactions. It offers installation instructions, test mode options, logging configurations, and the ability to run the tool with local models. Users can also contribute their own LLM models for evaluation and ranking.

eureka-ml-insights
The Eureka ML Insights Framework is a repository containing code designed to help researchers and practitioners run reproducible evaluations of generative models efficiently. Users can define custom pipelines for data processing, inference, and evaluation, as well as utilize pre-defined evaluation pipelines for key benchmarks. The framework provides a structured approach to conducting experiments and analyzing model performance across various tasks and modalities.

Pixelle-MCP
Pixelle-MCP is a multi-channel publishing tool designed to streamline the process of publishing content across various social media platforms. It allows users to create, schedule, and publish posts simultaneously on platforms such as Facebook, Twitter, and Instagram. With a user-friendly interface and advanced scheduling features, Pixelle-MCP helps users save time and effort in managing their social media presence. The tool also provides analytics and insights to track the performance of posts and optimize content strategy. Whether you are a social media manager, content creator, or digital marketer, Pixelle-MCP is a valuable tool to enhance your online presence and engage with your audience effectively.

trae-agent
Trae-agent is a Python library for building and training reinforcement learning agents. It provides a simple and flexible framework for implementing various reinforcement learning algorithms and experimenting with different environments. With Trae-agent, users can easily create custom agents, define reward functions, and train them on a variety of tasks. The library also includes utilities for visualizing agent performance and analyzing training results, making it a valuable tool for both beginners and experienced researchers in the field of reinforcement learning.

dataset-viewer
Dataset Viewer is a modern, high-performance tool built with Tauri, React, and TypeScript, designed to handle massive datasets from multiple sources with efficient streaming for large files (100GB+) and lightning-fast search capabilities. It supports instant large file opening, real-time search, direct archive preview, multi-protocol and multi-format support, and features a modern interface with dark/light themes and responsive design. The tool is perfect for data scientists, log analysis, archive management, remote access, and performance-critical tasks.

basehub
JavaScript / TypeScript SDK for BaseHub, the first AI-native content hub. **Features:** * β¨ Infers types from your BaseHub repository... _meaning IDE autocompletion works great._ * ποΈ No dependency on graphql... _meaning your bundle is more lightweight._ * π Works everywhere `fetch` is supported... _meaning you can use it anywhere._
For similar jobs

LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

daily-poetry-image
Daily Chinese ancient poetry and AI-generated images powered by Bing DALL-E-3. GitHub Action triggers the process automatically. Poetry is provided by Today's Poem API. The website is built with Astro.

exif-photo-blog
EXIF Photo Blog is a full-stack photo blog application built with Next.js, Vercel, and Postgres. It features built-in authentication, photo upload with EXIF extraction, photo organization by tag, infinite scroll, light/dark mode, automatic OG image generation, a CMD-K menu with photo search, experimental support for AI-generated descriptions, and support for Fujifilm simulations. The application is easy to deploy to Vercel with just a few clicks and can be customized with a variety of environment variables.

SillyTavern
SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters you or the community create. SillyTavern is a fork of TavernAI 1.2.8 which is under more active development and has added many major features. At this point, they can be thought of as completely independent programs.

Twitter-Insight-LLM
This project enables you to fetch liked tweets from Twitter (using Selenium), save it to JSON and Excel files, and perform initial data analysis and image captions. This is part of the initial steps for a larger personal project involving Large Language Models (LLMs).

AISuperDomain
Aila Desktop Application is a powerful tool that integrates multiple leading AI models into a single desktop application. It allows users to interact with various AI models simultaneously, providing diverse responses and insights to their inquiries. With its user-friendly interface and customizable features, Aila empowers users to engage with AI seamlessly and efficiently. Whether you're a researcher, student, or professional, Aila can enhance your AI interactions and streamline your workflow.

ChatGPT-On-CS
This project is an intelligent dialogue customer service tool based on a large model, which supports access to platforms such as WeChat, Qianniu, Bilibili, Douyin Enterprise, Douyin, Doudian, Weibo chat, Xiaohongshu professional account operation, Xiaohongshu, Zhihu, etc. You can choose GPT3.5/GPT4.0/ Lazy Treasure Box (more platforms will be supported in the future), which can process text, voice and pictures, and access external resources such as operating systems and the Internet through plug-ins, and support enterprise AI applications customized based on their own knowledge base.

obs-localvocal
LocalVocal is a live-streaming AI assistant plugin for OBS that allows you to transcribe audio speech into text and perform various language processing functions on the text using AI / LLMs (Large Language Models). It's privacy-first, with all data staying on your machine, and requires no GPU, cloud costs, network, or downtime.