
ComfyUI_Yvann-Nodes
Audio Reactivity Nodes for ComfyUI. Create AI-generated, audio-driven animations. Compatible with IPAdapter, ControlNets, AnimateDiff...
Stars: 340

ComfyUI_Yvann-Nodes is a pack of custom nodes that enable audio reactivity within ComfyUI, allowing users to create AI-driven animations that sync with music. Users can generate audio-reactive AI videos and control AI generation style, content, and composition with any audio input. The tool is simple to use: drop a workflow into ComfyUI and specify your audio and visual inputs. It is flexible and works with existing ComfyUI AI tech and nodes such as IPAdapter, AnimateDiff, and ControlNet. Users pick a workflow for Images → Video or Video → Video, download the corresponding .json file, drop it into ComfyUI, install missing custom nodes, set the inputs, and generate an audio-reactive animation.
README:
Made with the help of Lilien
A pack of custom nodes that enable audio reactivity within ComfyUI, allowing you to generate AI-driven animations that sync with music
- Create audio-reactive AI videos: control AI generation style, content, and composition with any audio
- Simple: just drop one of our workflows into ComfyUI and specify your audio and visual inputs
- Flexible: works with existing ComfyUI AI tech and nodes (e.g., IPAdapter, AnimateDiff, ControlNet)
- Install ComfyUI and ComfyUI-Manager
Images → Video
- Takes a set of images plus an audio track.
- Watch Tutorial:
- Example Render (Sound On): https://github.com/user-attachments/assets/1e6590fc-e0d7-42d7-a205-433adf6c405c
Video → Video
- Takes a source video plus an audio track.
- Watch Tutorial:
- Example Render (Sound On): https://github.com/user-attachments/assets/6b0aa544-aa20-4257-b6be-28673082c7ef
- Download the .json file for the workflow you picked.
- Drop the .json file into the ComfyUI window.
- Open the "🧩 Manager" → "Install Missing Custom Nodes".
- Install each pack of nodes that appears.
- Restart ComfyUI if prompted.
Set Your Inputs & Generate
- Provide the inputs needed (everything is explained here).
- Click the Queue button to produce your audio-reactive animation!
That's it! Have fun playing with the different settings! (If you have any questions or problems, check my YouTube tutorials.)
Node-by-Node Reference
Audio Analysis: analyzes audio to generate reactive weights for each frame.
Node Parameters
- audio_sep_model: Model from "Load Audio Separation Model"
- audio: Input audio file
- batch_size: Frames to associate with audio weights
- fps: Frame rate for the analysis
Parameters:
- analysis_mode: e.g., Drums Only, Vocals, Full Audio
- threshold: Minimum weight pass-through
- multiply: Amplification factor
Outputs:
- graph_audio (image preview)
- processed_audio, original_audio
- audio_weights (list of values)
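For intuition, here is a minimal sketch (not the node's actual code) of how per-frame audio weights could be derived from a waveform given an fps, a frame count, a threshold, and a multiplier; all names are illustrative.

```python
# Illustrative sketch only: derive one reactive weight per animation frame
# from a mono waveform, then gate and amplify it (names are assumptions).
import numpy as np

def audio_weights(samples: np.ndarray, sample_rate: int, fps: float,
                  batch_size: int, threshold: float = 0.0,
                  multiply: float = 1.0) -> list:
    hop = int(sample_rate / fps)               # audio samples covered by one frame
    weights = []
    for i in range(batch_size):
        chunk = samples[i * hop:(i + 1) * hop]
        rms = float(np.sqrt(np.mean(chunk ** 2))) if chunk.size else 0.0
        weights.append(rms)
    peak = max(weights) or 1.0                 # normalize to the loudest frame
    weights = [w / peak for w in weights]
    # threshold gates quiet frames, multiply amplifies what passes (clamped to 1)
    return [min(w * multiply, 1.0) if w >= threshold else 0.0 for w in weights]
```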
Load Audio Separation Model: loads or downloads an audio separation model (e.g., HybridDemucs, OpenUnmix).
Node Parameters
- model: Choose between HybridDemucs / OpenUnmix.
- Outputs: audio_sep_model (connect to Audio Analysis or Remixer).
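As a rough idea of what such a model does, the sketch below loads torchaudio's pretrained Hybrid Demucs pipeline and separates a track into stems; the node itself may fetch and cache models differently, and "song.wav" is a hypothetical input file.

```python
# Sketch: obtaining a Hybrid Demucs source-separation model via torchaudio's
# pretrained pipelines (not necessarily how this node downloads its weights).
import torch
import torchaudio

bundle = torchaudio.pipelines.HDEMUCS_HIGH_MUSDB_PLUS
model = bundle.get_model().eval()

waveform, sr = torchaudio.load("song.wav")            # hypothetical stereo input
waveform = torchaudio.functional.resample(waveform, sr, bundle.sample_rate)
with torch.inference_mode():
    stems = model(waveform.unsqueeze(0))               # (batch, source, channel, time)
# stems are typically ordered drums / bass / other / vocals; check the model's
# documented source order before indexing into them.
```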
Audio Peaks Detection: identifies peaks in the audio weights to trigger transitions or events.
Node Parameters
- peaks_threshold: Sensitivity.
- min_peaks_distance: Minimum gap in frames between peaks.
- Outputs: Binary peak list, alternate list, peak indices/count, graph.
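A minimal sketch of this style of peak picking, with a threshold and a minimum frame distance between detected peaks (the node's exact logic may differ):

```python
# Sketch: threshold + minimum-distance peak picking over the audio weights.
def detect_peaks(weights, peaks_threshold, min_peaks_distance):
    peaks = [0] * len(weights)                 # binary list: 1 where a peak fires
    last = -min_peaks_distance
    for i, w in enumerate(weights):
        if w >= peaks_threshold and i - last >= min_peaks_distance:
            peaks[i] = 1
            last = i
    alternate, state = [], 0                   # toggles 0/1 at every detected peak
    for p in peaks:
        state = 1 - state if p else state
        alternate.append(state)
    indices = [i for i, p in enumerate(peaks) if p]
    return peaks, alternate, indices, len(indices)
```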
Manages transitions between images based on peaks. Great for stable or style transitions.
Node Parameters
- images: Batch of images.
- peaks_weights: From "Audio Peaks Detection".
- blend_mode, transitions_length, min_IPA_weight, etc.
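One way to picture what such a node produces: for each frame, the index of the active image plus a blend factor that ramps from 0 to 1 over transitions_length frames after each peak. This is an illustrative guess at the behaviour, not the node's implementation:

```python
# Sketch: peaks advance to the next image and start a crossfade that ramps
# over `transitions_length` frames; `min_IPA_weight` floors the blend factor.
def image_schedule(peaks, transitions_length, min_IPA_weight=0.0):
    schedule, image_index, ramp = [], 0, None
    for p in peaks:
        if p:
            image_index += 1                   # a peak switches to the next image
            ramp = 0                           # and starts a new crossfade
        if ramp is not None:
            blend = min(ramp / max(transitions_length, 1), 1.0)
            ramp = ramp + 1 if blend < 1.0 else None
        else:
            blend = 1.0
        schedule.append((image_index, max(blend, min_IPA_weight)))
    return schedule
```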
Links text prompts to peak indices.
Node Parameters
- peaks_index: Indices from peaks detection.
- prompts: multiline string.
- Outputs: mapped schedule string.
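A small sketch of how prompts could be paired with peak indices to build a schedule string; the exact output format expected by downstream prompt-scheduling nodes may differ:

```python
# Sketch: one prompt per detected peak, emitted as `"frame": "prompt"` pairs.
def prompt_schedule(peaks_index, prompts):
    lines = [p.strip() for p in prompts.splitlines() if p.strip()]
    pairs = []
    for n, frame in enumerate(peaks_index):
        prompt = lines[n % len(lines)]         # cycle prompts if peaks outnumber them
        pairs.append(f'"{frame}": "{prompt}"')
    return ",\n".join(pairs)

# e.g. prompt_schedule([0, 24, 60], "a misty forest\na neon city at night")
```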
Remixer: adjusts volume levels (drums, vocals, bass, others) in a track.
Node Parameters
- drums_volume, vocals_volume, bass_volume, others_volume
- Outputs: single merged audio track.
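Conceptually this amounts to scaling each separated stem and summing the result back into one track; a rough sketch, assuming the stems share shape and sample rate:

```python
# Sketch: per-stem gains, summed back into a single track, with a simple
# peak clamp to avoid clipping (not the node's actual mixing code).
import numpy as np

def remix(drums, vocals, bass, others,
          drums_volume=1.0, vocals_volume=1.0, bass_volume=1.0, others_volume=1.0):
    mix = (drums * drums_volume + vocals * vocals_volume
           + bass * bass_volume + others * others_volume)
    peak = float(np.max(np.abs(mix)))
    return mix / peak if peak > 1.0 else mix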
Repeats a set of images N times.
Node Parameters
- mask: Mask input.
- Outputs: Repeated images.
Flips sign of float values.
Node Parameters
- floats: list of floats.
- Outputs: inverted list.
Plots float values as a graph.
Node Parameters
- floats (and optional second/third).
- Outputs: visual graph image.
Converts a mask into a single float value.
Node Parameters
- mask: input.
- Outputs: float.
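The utility nodes above reduce to very small operations; a rough sketch of what they might look like (the mask-to-float reduction is assumed here to be a mean, which may not match the node):

```python
# Rough sketches of the utility nodes above (illustrative only).
import numpy as np

def invert_floats(floats):                     # flip the sign of each float
    return [-f for f in floats]

def mask_to_float(mask: np.ndarray) -> float:  # assumed: mean coverage of the mask
    return float(mask.mean())

def repeat_images(images, count):              # repeat a batch of images N times
    return list(images) * count
```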
Transforms float lists into an IPAdapter "weight strategy".
Node Parameters
- floats: list of floats.
- Outputs: dictionary with strategy info.
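Purely as an illustration, such a dictionary could look like the following; the real key names used by IPAdapter nodes may differ:

```python
# Hypothetical packaging of a float list into a "weights strategy" dict;
# key names here are placeholders, not the exact keys IPAdapter expects.
def floats_to_weights_strategy(floats, frames_per_image=1):
    return {
        "weights": list(floats),                           # per-frame weights
        "frames": len(floats),                             # total frame count
        "image_count": max(1, len(floats) // max(frames_per_image, 1)),
    }
```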
Please give a ⭐ on GitHub; it helps us enhance our tool, and it's free for you! (:
Similar Open Source Tools


ito
Ito is an intelligent voice assistant that provides seamless voice dictation to any application on your computer. It works in any app, offers global keyboard shortcuts, real-time transcription, and instant text insertion. It is smart and adaptive with features like custom dictionary, context awareness, multi-language support, and intelligent punctuation. Users can customize trigger keys, audio preferences, and privacy controls. It also offers data management features like a notes system, interaction history, cloud sync, and export capabilities. Ito is built as a modern Electron application with a multi-process architecture and utilizes technologies like React, TypeScript, Rust, gRPC, and AWS CDK.

J.A.R.V.I.S.2.0
J.A.R.V.I.S. 2.0 is an AI-powered assistant designed for voice commands, capable of tasks like providing weather reports, summarizing news, sending emails, and more. It features voice activation, speech recognition, AI responses, and handles multiple tasks including email sending, weather reports, news reading, image generation, database functions, phone call automation, AI-based task execution, website & application automation, and knowledge-based interactions. The assistant also includes timeout handling, automatic input processing, and the ability to call multiple functions simultaneously. It requires Python 3.9 or later and specific API keys for weather, news, email, and AI access. The tool integrates Gemini AI for function execution and Ollama as a fallback mechanism. It utilizes a RAG-based knowledge system and ADB integration for phone automation. Future enhancements include deeper mobile integration, advanced AI-driven automation, improved NLP-based command execution, and multi-modal interactions.

ToolNeuron
ToolNeuron is a secure, offline AI ecosystem for Android devices that allows users to run private AI models and dynamic plugins fully offline, with hardware-grade encryption ensuring maximum privacy. It enables users to have an offline-first experience, add capabilities without app updates through pluggable tools, and ensures security by design with strict plugin validation and sandboxing.

ComfyUI-Ollama-Describer
ComfyUI-Ollama-Describer is an extension for ComfyUI that enables the use of LLM models provided by Ollama, such as Gemma, Llava (multimodal), Llama2, Llama3, or Mistral. It requires the Ollama library for interacting with large-scale language models, supporting GPUs using CUDA and AMD GPUs on Windows, Linux, and Mac. The extension allows users to run Ollama through Docker and utilize NVIDIA GPUs for faster processing. It provides nodes for image description, text description, image captioning, and text transformation, with various customizable parameters for model selection, API communication, response generation, and model memory management.

AGiXT
AGiXT is a dynamic Artificial Intelligence Automation Platform engineered to orchestrate efficient AI instruction management and task execution across a multitude of providers. Our solution infuses adaptive memory handling with a broad spectrum of commands to enhance AI's understanding and responsiveness, leading to improved task completion. The platform's smart features, like Smart Instruct and Smart Chat, seamlessly integrate web search, planning strategies, and conversation continuity, transforming the interaction between users and AI. By leveraging a powerful plugin system that includes web browsing and command execution, AGiXT stands as a versatile bridge between AI models and users. With an expanding roster of AI providers, code evaluation capabilities, comprehensive chain management, and platform interoperability, AGiXT is consistently evolving to drive a multitude of applications, affirming its place at the forefront of AI technology.

pocketpal-ai
PocketPal AI is a versatile virtual assistant tool designed to streamline daily tasks and enhance productivity. It leverages artificial intelligence technology to provide personalized assistance in managing schedules, organizing information, setting reminders, and more. With its intuitive interface and smart features, PocketPal AI aims to simplify users' lives by automating routine activities and offering proactive suggestions for optimal time management and task prioritization.

Zettelgarden
Zettelgarden is a human-centric, open-source personal knowledge management system that helps users develop and maintain their understanding of the world. It focuses on creating and connecting atomic notes, thoughtful AI integration, and scalability from personal notes to company knowledge bases. The project is actively evolving, with features subject to change based on community feedback and development priorities.

ai-dj
OBSIDIAN-Neural is a real-time AI music generation VST3 plugin designed for live performance. It allows users to type words and instantly receive musical loops, enhancing creative flow. The plugin features an 8-track sampler with MIDI triggering, 4 pages per track for easy variation switching, perfect DAW sync, real-time generation without pre-recorded samples, and stems separation for isolated drums, bass, and vocals. Users can generate music by typing specific keywords and trigger loops with MIDI while jamming. The tool offers different setups for server + GPU, local models for offline use, and a free API option with no setup required. OBSIDIAN-Neural is actively developed and has received over 110 GitHub stars, with ongoing updates and bug fixes. It is dual licensed under GNU Affero General Public License v3.0 and offers a commercial license option for interested parties.

system-prompts-and-models-of-ai-tools
This repository contains a significant portion of the FULL official v0, Manus, and Cursor system prompts and AI models. It includes over 5,000+ lines of insights into their structure and functionality. The available files include FULL v0, v0 model.txt, v0 tools.txt, Cursor (with cursor agent.txt, cursor ask.txt, cursor edit.txt), and Manus Folder with multiple files inside.

forge
Forge is a free and open-source digital collectible card game (CCG) engine written in Java. It is designed to be easy to use and extend, and it comes with a variety of features that make it a great choice for developers who want to create their own CCGs. Forge is used by a number of popular CCGs, including Ascension, Dominion, and Thunderstone.

structured-prompt-builder
A lightweight, browser-first tool for designing well-structured AI prompts with a clean UI, live previews, a local Prompt Library, and optional Gemini-powered prompt optimization. It supports structured fields like Role, Task, Audience, Style, Tone, Constraints, Steps, Inputs, and Few-shot examples. Users can copy/download prompts in Markdown, JSON, and YAML formats, and utilize model parameters like Temperature, Top-p, Max tokens, Presence & Frequency penalties. The tool also features a Local Prompt Library for saving, loading, duplicating, and deleting prompts, as well as a Gemini Optimizer for cleaning grammar/clarity without altering the schema. It offers dark/light friendly styles and a focused reading mode for long prompts.

llmchat
LLMChat is an all-in-one AI chat interface that supports multiple language models, offers a plugin library for enhanced functionality, enables web search capabilities, allows customization of AI assistants, provides text-to-speech conversion, ensures secure local data storage, and facilitates data import/export. It also includes features like knowledge spaces, prompt library, personalization, and can be installed as a Progressive Web App (PWA). The tech stack includes Next.js, TypeScript, Pglite, LangChain, Zustand, React Query, Supabase, Tailwind CSS, Framer Motion, Shadcn, and Tiptap. The roadmap includes upcoming features like speech-to-text and knowledge spaces.

AionUi
AionUi is a user interface library for building modern and responsive web applications. It provides a set of customizable components and styles to create visually appealing user interfaces. With AionUi, developers can easily design and implement interactive web interfaces that are both functional and aesthetically pleasing. The library is built using the latest web technologies and follows best practices for performance and accessibility. Whether you are working on a personal project or a professional application, AionUi can help you streamline the UI development process and deliver a seamless user experience.

pluely
Pluely is a versatile and user-friendly tool for managing tasks and projects. It provides a simple interface for creating, organizing, and tracking tasks, making it easy to stay on top of your work. With features like task prioritization, due date reminders, and collaboration options, Pluely helps individuals and teams streamline their workflow and boost productivity. Whether you're a student juggling assignments, a professional managing multiple projects, or a team coordinating tasks, Pluely is the perfect solution to keep you organized and efficient.

layra
LAYRA is the world's first visual-native AI automation engine that sees documents like a human, preserves layout and graphical elements, and executes arbitrarily complex workflows with full Python control. It empowers users to build next-generation intelligent systems with no limits or compromises. Built for Enterprise-Grade deployment, LAYRA features a modern frontend, high-performance backend, decoupled service architecture, visual-native multimodal document understanding, and a powerful workflow engine.
For similar jobs

latentbox
Latent Box is a curated collection of resources for AI, creativity, and art. It aims to bridge the information gap with high-quality content, promote diversity and interdisciplinary collaboration, and maintain updates through community co-creation. The website features a wide range of resources, including articles, tutorials, tools, and datasets, covering various topics such as machine learning, computer vision, natural language processing, generative art, and creative coding.

LeaferJS
LeaferJS is a colorful HTML5 Canvas 2D graphics rendering engine that can be combined with AI drawing to generate interfaces. It gives you the superpower to instantly create 1 million graphics, free and open source, easy to learn and use, with rich scenes.

ComfyUI_VLM_nodes
ComfyUI_VLM_nodes is a repository containing various nodes for utilizing Vision Language Models (VLMs) and Language Models (LLMs). The repository provides nodes for tasks such as structured output generation, image to music conversion, LLM prompt generation, automatic prompt generation, and more. Users can integrate different models like InternLM-XComposer2-VL, UForm-Gen2, Kosmos-2, moondream1, moondream2, JoyTag, and Chat Musician. The nodes support features like extracting keywords, generating prompts, suggesting prompts, and obtaining structured outputs. The repository includes examples and instructions for using the nodes effectively.

amazon-sagemaker-generativeai
Repository for training and deploying Generative AI models, including text-text, text-to-image generation, prompt engineering playground and chain of thought examples using SageMaker Studio. The tool provides a platform for users to experiment with generative AI techniques, enabling them to create text and image outputs based on input data. It offers a range of functionalities for training and deploying models, as well as exploring different generative AI applications.

awesome-ai-painting
This repository, named 'awesome-ai-painting', is a comprehensive collection of resources related to AI painting, curated by an AI painting enthusiast with a background in the AIGC industry. The repository aims to help more people learn AI painting and also documents the curator's goal of creating 100 AI products, with current progress at 4/100. It includes information on various AI painting products, tutorials, tools, and models, providing a valuable resource for individuals interested in AI painting and related technologies.

nodetool
NodeTool is a platform designed for AI enthusiasts, developers, and creators, providing a visual interface to access a variety of AI tools and models. It simplifies access to advanced AI technologies, offering resources for content creation, data analysis, automation, and more. With features like a visual editor, seamless integration with leading AI platforms, model manager, and API integration, NodeTool caters to both newcomers and experienced users in the AI field.


comfyui-web-viewer
The ComfyUI Web Viewer by vrch.ai is a real-time AI-generated interactive art framework that integrates realtime streaming into ComfyUI workflows. It supports keyboard control nodes, OSC control nodes, sound input nodes, and more, accessible from any device with a web browser. It enables real-time interaction with AI-generated content, ideal for interactive visual projects and enhancing ComfyUI workflows with efficient content management and display.