Mind-Video

Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity

Monthly visits: 7979

Description:

Mind-Video is a two-module pipeline designed to bridge the gap between image and video brain decoding, that is, between reconstructing still images and reconstructing continuous video from fMRI recordings. The first module is an fMRI encoder that learns brain features in multiple stages, including multimodal contrastive learning with spatiotemporal attention over windowed fMRI. The second module is an augmented Stable Diffusion model tailored for video generation under fMRI guidance. Mind-Video is reported to outperform previous state-of-the-art approaches on both semantic and pixel-level metrics. Its attention analysis reveals mappings to the visual cortex and higher cognitive networks, which suggests that the model is biologically plausible and interpretable.
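
To make two of the ideas above more concrete, here is a minimal NumPy sketch of (1) cutting an fMRI time series into overlapping windows and (2) a contrastive loss that pulls each fMRI embedding toward its paired embedding from another modality. This is an illustration, not the Mind-Video implementation: the function names, the symmetric InfoNCE (CLIP-style) form of the loss, the temperature of 0.07, and the window length are all assumptions chosen for the example.

```python
import numpy as np

def sliding_windows(fmri, window):
    """Split a series of shape (T, V) into overlapping windows.

    Returns an array of shape (T - window + 1, window, V).
    `window` is a hypothetical parameter, not a value from Mind-Video.
    """
    fmri = np.asarray(fmri, dtype=float)
    n = fmri.shape[0] - window + 1
    if n < 1:
        raise ValueError("window is longer than the series")
    return np.stack([fmri[i:i + window] for i in range(n)])

def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))

def contrastive_loss(fmri_emb, other_emb, temperature=0.07):
    """Symmetric InfoNCE loss (assumed form): row i of `fmri_emb` is the
    positive pair of row i of `other_emb`; all other rows are negatives."""
    a = l2_normalize(np.asarray(fmri_emb, dtype=float))
    b = l2_normalize(np.asarray(other_emb, dtype=float))
    logits = a @ b.T / temperature          # (N, N) similarity matrix
    idx = np.arange(len(a))
    loss_ab = -log_softmax(logits, axis=1)[idx, idx].mean()  # fMRI -> other
    loss_ba = -log_softmax(logits, axis=0)[idx, idx].mean()  # other -> fMRI
    return 0.5 * (loss_ab + loss_ba)
```

In this sketch, correctly paired embeddings give a loss near zero, while shuffled pairs give a large loss, which is the signal a contrastive objective uses to align the two embedding spaces.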

Features

Advantages

  • Outperforms previous state-of-the-art approaches in terms of semantic and pixel metrics
  • Attention analysis reveals mapping to the visual cortex and higher cognitive networks
  • Can be used to study the neural basis of visual perception
  • Has potential applications in brain-computer interfaces, neuroimaging, and neuroscience

Disadvantages

  • Offers no pixel-level control over the reconstructed video
  • fMRI recordings are subject to factors that cannot be controlled during the scan
  • Requires a large amount of data to train
