Pallaidium

Pallaidium

PALLAIDIUM - a generative AI movie studio integrated in the Blender video editor.

Stars: 963

Visit
 screenshot

Pallaidium is a generative AI movie studio integrated into the Blender video editor. It allows users to AI-generate video, image, and audio from text prompts or existing media files. The tool provides various features such as text to video, text to audio, text to speech, text to image, image to image, image to video, video to video, image to text, and more. It requires a Windows system with a CUDA-supported Nvidia card and at least 6 GB VRAM. Pallaidium offers batch processing capabilities, text to audio conversion using Bark, and various performance optimization tips. Users can install the tool by downloading the add-on and following the installation instructions provided. The tool comes with a set of restrictions on usage, prohibiting the generation of harmful, pornographic, violent, or false content.

README:

X (formerly Twitter) Follow GitHub Repo stars

PALLAIDIUM - a generative AI movie studio integrated into the Blender video editor.

AI-generate video, image, and audio from text prompts or video, image, or text strips.

PallAIdium

Features

Text to video Text to audio
Text to speech Text to image
Image to image Image to video
Video to video Image to text
ControlNet OpenPose
ADetailer IP Adapter Face/Style
Canny Illusion
Multiple LoRAs Segmind distilled SDXL
Seed Quality steps
Frames Word power
Style selector Strip power
Batch conversion Batch refinement of images.
Batch upscale & refinement of movies. Model card selector.
Render-to-path selector. Render finished notification.
Model Cards One-click install and uninstall dependencies.
User-defined file path for generated files. Seed and prompt added to strip name.

image

Requirements

  • Windows (Unsupported: Linux and MacOS).
  • A CUDA-supported Nvidia card with at least 6 GB VRAM.
  • CUDA: 12.4
  • 20+ GB HDD. (Each model is 6+ GB).

For Mac and Linux, we'll have to rely on contributor support. So, post your issues here for Mac: https://github.com/tin2tin/Pallaidium/issues/106 and here for Linux: https://github.com/tin2tin/Pallaidium/issues/105, and hope some contributor wants to help you out.

How to install

  • First, download and install git (must be on PATH): https://git-scm.com/downloads

  • Download the add-on: https://github.com/tin2tin/text_to_video/archive/refs/heads/main.zip

  • On Windows, right-click on the Blender icon and "Run Blender as Administrator"(or you'll get write permission errors).

  • Install the add-on as usual: Preferences > Add-ons > Install > select file > enable the add-on.

  • In the Generative AI add-on preferences, hit the "Uninstall Dependencies" button (to clear out any incompatible libs).

  • Restart Blender via "Run as Administrator".

  • In the Generative AI add-on preferences, hit the "Install Dependencies" button.

  • Restart the computer and run Blender via "Run as Administrator".

  • Open the add-on UI in the Sequencer > Sidebar > Generative AI.

  • The first time any model is executed, 5-10 GB will have to be downloaded first.

Tip
If any Python modules are missing, use this add-on to install them manually:
https://github.com/tin2tin/blender_pip

Change Log

  • 2024-9-19: Add: Image to Video for CogVideoX
  • 2024-9-15: Add: LoRA import for Flux
  • 2024-9-14: Add: Flux Inpaint & Img2img.
  • 2024-9-4: Add: Florence 2 (Image Caption), AudioLDM2-Large, CogVideox-2b, flash_attn on Win.
  • 2024-9-2: Add: Vid2vid for CogVideoX-5b and Parler TTS
  • 2024-8-28: Make CogVideox-5b run on 6 GB VRAM & Flux on 2 GB VRAM
  • 2024-8-27: Add: CogVideoX-5b Remove: Low-quality models
  • 2024-8-5: Add: Flux Dev - NB. needs update of dependencies and 24 GB VRAM
  • 2024-8-2: Add: Flux Schnell - NB. needs update of dependencies and 24 GB VRAM
  • 2024-7-12: Add: Kwai/Kolors (txt2img & img2img)
  • 2024-6-13: Add: SD3 - A "Read" token from HuggingFace must be entered, it's free (img2img). Fix: Installation of Dependencies
  • 2024-6-6: Add: Stable Audio Open, Frame:-1 will inherit duration.
  • 2024-6-1: IP Adapter(When using SDXL): Face (Image or folder), Style (image or folder) New image models: Mobius, OpenVision, Juggernaut X Hyper
  • 2024-4-29: Add: PixArt Sigma 2k, PixArt 1024 and RealViz V4
  • 2024-2-23: Add: Proteus Lightning and Dreamshaper XL Lightning
  • 2024-2-21: Add: SDXL-Lightning 2 Step & Proteus v. 0.3
  • 2024-1-02: Add: WhisperSpeech
  • 2024-01-01: Fix installation and Bark bugs.
  • 2024-01-31: Add OpenDalle, Speed option, SDXL, and LoRA support for Canny and OpenPose, including OpenPose rig images. Prune old models including SD.
  • 2023-12-18: Add: Bark audio enhance, Segmind Vega.
  • 2023-12-1: Add SD Turbo & MusicGen Medium, MPS device for MacOS.
  • 2023-11-30: Add: SVD, SVD-XT, SDXL Turbo

Location

Install Dependencies, and set Sound Notification in the add-on preferences:

image

Video Sequence Editor > Sidebar > Generative AI:

image

Styles:

image

See SDXL handling most of the styles here: https://stable-diffusion-art.com/sdxl-styles/

Prompting:

https://replicate.com/blog/get-the-best-from-stable-diffusion-3

https://github.com/invoke-ai/InvokeAI/blob/main/docs/features/PROMPTS.md

https://stablediffusion.fr/prompts

https://blog.segmind.com/generating-photographic-images-with-stable-diffusion/

Tip
If the image of your renders breaks, use the resolution from the Model Card in the Preferences.
Tip
If the image of your playback stutters, then select a strip > Menu > Strip > Movie Strip > Set Render Size.
Tip
If you get the message that CUDA is out of memory, restart Blender to free up memory and make it stable again.

Batch Processing

Select multiple strips and hit Generate. When doing this, the file name, and if found the seed value, are automatically inserted into the prompt and seed value. However, in the add-on preferences, this behavior can be switched off.

https://github.com/tin2tin/Pallaidium/assets/1322593/28098eb6-3a93-4bcb-bd6f-53b71faabd8d

Text to Audio

Bark

Find Bark documentation here: https://github.com/suno-ai/bark

  • [laughter]
  • [laughs]
  • [sighs]
  • [music]
  • [gasps]
  • [clears throat]
  • — or ... for hesitations
  • ♪ for song lyrics
  • capitalization for emphasis on a word
  • MAN/WOMAN: for bias towards the speaker

Speaker Library: https://suno-ai.notion.site/8b8e8749ed514b0cbf3f699013548683?v=bc67cff786b04b50b3ceb756fd05f68c

Tip
If the audio breaks up, try processing longer sentences.

Performance

The performance can be improved by following this guide: https://nvidia.custhelp.com/app/answers/detail/a_id/5490/~/system-memory-fallback-for-stable-diffusion

New to Blender?

Watch this tutorial: https://youtu.be/4_MIaxzjh5Y?feature=shared

Uninstall

Hugging Face Diffusers models are downloaded from the hub and saved to a local cache directory. By default, the cache directory is located at:

On Linux and macOS: ~/.cache/huggingface/hub

On Windows: %userprofile%\.cache\huggingface\hub

Here you can locate and delete the individual models.

Useful add-ons

Add Rendered Strips

Since the Generative AI add-on can only input images or movie strips, you'll need to convert other strip types to movie-strip. For this purpose, this add-on can be used:

https://github.com/tin2tin/Add_Rendered_Strips

image

VSE Masking Tools

For creating a mask on top of a clip in the Sequencer, this add-on can be used to input the clip as background in the Blender Image Editor. The created mask can then be added to the VSE as a strip, and converted to video with the above add-on:

https://github.com/tin2tin/vse_masking_tools

image

Subtitle Editor

Edit and navigate in the generated text strips.

https://github.com/tin2tin/Subtitle_Editor

Screenwriter Assistant

Get chatGPT to generate stories, which can be used as prompts.

https://github.com/tin2tin/Blender_Screenwriter_Assistant_chat_GPT

Text to Strip

Convert text from the Text Editor to strips which can be used as prompts for batch generation.

https://github.com/tin2tin/text_to_strip

Useful Projects

Trainer for LoRAs: https://github.com/Nerogar/OneTrainer https://github.com/johnman3032/simple-lora-dreambooth-trainer

HD Horizon(LoRA for making SD 1.5 work at higher resolutions): https://civitai.com/models/238891/hd-horizon-the-resolution-frontier-multi-resolution-high-resolution-native-inferencing

Triton for manual installation on Windows: https://huggingface.co/madbuda/triton-windows-builds

Video Examples

VID2VID & TXT2VID

Watch the video

Image to Text

https://github.com/tin2tin/Pallaidium/assets/1322593/91eb17e4-72d6-4c69-8e5c-a3d38af5a770

Illusion Diffusion

https://github.com/tin2tin/Pallaidium/assets/1322593/42eadfd8-3ebf-4747-b8e0-7b79fe8626b6

Scribble

https://github.com/tin2tin/Pallaidium/assets/1322593/c74a4e38-8b16-423b-be78-aadfbfe284dc

Styles

https://github.com/tin2tin/Pallaidium/assets/1322593/b80812b4-e3be-40b0-a73b-bc55b7eeadf7

Canny

https://github.com/tin2tin/Pallaidium/assets/1322593/a1e94e09-0147-40ae-b4c2-4ce0671b1289

OpenPose

https://github.com/tin2tin/Pallaidium/assets/1322593/ac9f278e-9fc9-46fc-a4e7-562ff041964f

Screenplay to Film

Watch the video

Img to Txt to Audio

Watch the video

Zeroscope

Watch the video

Würstchen

Watch the video

Bark

Watch the video

Batch from Text Strips

Watch the video

Video to video:

https://github.com/tin2tin/Generative_AI/assets/1322593/c044a0b0-95c2-4b54-af0b-45bc0c670c89

https://github.com/tin2tin/Generative_AI/assets/1322593/0105cd35-b3b2-49cf-91c1-0633dd484177

Img2img:

https://github.com/tin2tin/Generative_AI/assets/1322593/2dd2d2f1-a1f6-4562-8116-ffce872b79c3

Painting

https://github.com/tin2tin/Generative_AI/assets/1322593/7cd69cd0-5842-40f0-b41f-455c77443535

Enhancement Info

LCM

https://huggingface.co/blog/lcm_lora

Restrictions on using Pallaidium:

  • It is prohibited to use Pallaidium to generate content that is demeaning or harmful to people, their environment, culture, religion, etc.
  • It is prohibited to use Pallaidium for pornographic, violent, and bloody content generation.
  • It is prohibited to use Pallaidium for error and false information generation.

Restrictions on using the AI models:

  • Pallaidium does not include any genAI models(weights). If the user decides to use a model, it is downloaded from HuggingFace.
  • In general, the models can only be used for non-commercial purposes and are meant for research purposes.
  • Consult the individual models on HuggingFace to read up on their licenses and ex. if they can be used commercially.

Via the Diffusers lib the following weights are accesible through Pallaidium:

Video:

Image:

Audio:

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for Pallaidium

Similar Open Source Tools

For similar tasks

For similar jobs