ComfyUI-OllamaGemini

ComfyUI-OllamaGemini

AI-api text generation

Stars: 160

Visit
 screenshot

ComfyUI GeminiOllama Extension integrates Google's Gemini API, OpenAI (ChatGPT), Anthropic's Claude, Ollama, Qwen, and image processing tools into ComfyUI for leveraging powerful models and features directly within workflows. Features include multiple AI API integrations, advanced prompt engineering, Gemini image generation, background removal, SVG conversion, FLUX resolutions, ComfyUI Styler, smart prompt generator, and more. The extension offers comprehensive API integration, advanced prompt engineering with researched templates, high-quality tools like Smart Prompt Generator and BRIA RMBG, and supports video & audio processing. It provides a single interface to access powerful AI models, transform prompts into detailed instructions, and use various tools for image processing, styling, and content generation.

README:


Tagline



Stars   Forks   Version   License



🤖 Gemini  ·  🧠 OpenAI  ·  🎭 Claude  ·  🦙 Ollama  ·  🌐 Qwen

Get Started    Star    Sponsor



   




Happy Creators

Hours Saved Daily

AI Providers

Prompt Templates





Demo $\huge\textsf{\textcolor{ff6b35}{See It In Action}}$


https://github.com/user-attachments/assets/6ffba8bc-47e9-42c5-be98-5849ffb03547









🎞️ View More Examples

500+ Styles

FLUX Resolutions

SVG Conversion





Diamond $\huge\textsf{\textcolor{c41e3a}{Why Creators Love Us}}$


   

$\textsf{\textcolor{c41e3a}{😫 Before}}$

- 5 different extensions to manage
- 5 different config files
- Inconsistent prompt formats
- Hours wasted switching tools
- Frequent compatibility issues

$\textsf{\textcolor{7ed321}{✨ After}}$

+ ONE unified extension
+ ONE config for all APIs
+ Smart prompt optimization
+ Instant provider switching
+ Always up-to-date





Rocket $\huge\textsf{\textcolor{ffd700}{Powerful Features}}$





Gemini
2.0 Pro • Flash • 1.5






ChatGPT
GPT-4o • 4-Turbo • 3.5






Claude
3.7 • 3.5 Sonnet • Opus






Ollama
Any Local Model






Qwen
Max • Plus • Turbo








Veo 3.1 Video
Text/Image to Video + Extend




Background Removal
BRIA RMBG hair-level detail




Imagen 4
Google's latest image model




Gemini Banana Pro
Advanced image editing




FLUX Resolutions
Perfect sizing for every model




500+ Art Styles
🎨 Curated artistic presets




Smart Prompts
AI-enhanced engineering




Multi Prompt
Batch processing workflows



Latest AI for Object Detection & Segmentation
SAM3 Text Prompts
Detect "sun", "lake", "shadow"
YOLOE-26 + SAM2.1
Auto-download detection
BiRefNet Matting
Hair-level edge quality
Smart Model Paths
Auto-finds in models/





Art $\huge\textsf{\textcolor{ffd700}{500+ Curated Art Styles}}$


Styles




Cinema
80+ styles

Fine Art
120+ styles

Gaming
60+ styles

Photo
90+ styles

Fantasy
100+ styles

🔥 View Popular Style Categories
Category Styles Examples
🎬 Cinematic 80+ Film Noir, Blade Runner, Spielberg, Nolan, Wes Anderson
🖼️ Fine Art 120+ Van Gogh, Monet, Picasso, Rembrandt, Caravaggio
🎮 Digital Art 60+ Cyberpunk, Synthwave, Vaporwave, Pixel Art, 3D Render
📸 Photography 90+ Portrait, Landscape, Street, Fashion, Product
Fantasy 100+ Epic Fantasy, Dark Fantasy, Fairy Tale, Mythological
🎌 Anime 50+ Studio Ghibli, Makoto Shinkai, Trigger, Mappa





Bolt $\huge\textsf{\textcolor{f5a623}{Quick Start}}$


⚡ Install in under 30 seconds


$\textsf{\textcolor{7ed321}{▶ Recommended}}$

ComfyUI Manager (One-Click)

1. Open ComfyUI Manager
2. Search "OllamaGemini"  
3. Click Install ✓
4. Restart ComfyUI

$\textsf{\textcolor{ff6b35}{▷ Manual}}$

Git Clone

cd ComfyUI/custom_nodes
git clone https://github.com/al-swaiti/ComfyUI-OllamaGemini.git
pip install -r requirements.txt

🔑 API Configuration
{
  "GEMINI_API_KEY": "your_key",      // 🆓 aistudio.google.com
  "OPENAI_API_KEY": "your_key",      // 💰 platform.openai.com
  "ANTHROPIC_API_KEY": "your_key",   // ⚠️ console.anthropic.com
  "OLLAMA_URL": "http://localhost:11434",  // 🆓 Local
  "QWEN_API_KEY": "your_key"         // ⚠️ dashscope.console.aliyun.com
}





Scroll $\huge\textsf{\textcolor{c41e3a}{20+ Prompt Templates}}$

Extensively researched • Model-optimized • Professional results


🎬 Video Generation
Template Description
Veo3-TextToVideo Google Veo 3.1 with composition, camera, subject, action & native audio
Veo3-ReferenceImages Reference image video preserving subject appearance
Veo3-Interpolation First-to-last frame interpolation with motion paths
VideoGen Professional cinematography: subject, action, lighting, style
⚡ FLUX Models
Template Description
FLUX.1-dev Hyper-detailed cinematographic with lighting & camera specs
FLUX.2-dev Natural language following official BFL guide
FLUX.2-dev-Edit Multi-reference editing for up to 10 images
FLUX.2-dev-JSON Structured JSON for complex scenes
FLUXKontext Context-aware editing with character consistency
🎨 Image Generation
Template Description
SDXL Premium comma-separated tags with artistic medium
Imagen4 Structured, layered prompts for Google Imagen 4
Z-Image-Turbo 6B diffusion transformer for concept fusion
Qwen-Image-2512 Photorealistic eliminating "AI look"
Upscale Sharpness-maximizing enhancement
🍌 Gemini Nano Banana Pro
Template Description
GeminiNanaBananaEdit Mask-free contextual editing
NanaBananaPro Gemini 3 Pro Image with narrative style
NanaBananaPro-Edit Advanced editing with multi-image composition
NanaBananaPro-Pro Professional 4K asset production





Heart $\huge\textsf{\textcolor{c41e3a}{Support This Project}}$


500+ hours of development

💝 Your support keeps it FREE for everyone

Every star & donation means the world 💖





Abdallah Al-Swaiti
🇯🇴 Amman, Jordan

"I built this because I was frustrated switching between 5 different AI tools.
Now, 150+ creators use it daily. If this helps your workflow, consider supporting!"



       





Link $\large\textsf{\textcolor{f5a623}{Connect}}$


       






CTA





FREEOpen SourceMIT License
Made with ❤️ in Jordan 🇯🇴



For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for ComfyUI-OllamaGemini

Similar Open Source Tools

For similar tasks

For similar jobs