Related papers: Controllable Mind Visual Diffusion Model

Decoding Realistic Images from Brain Activity with Contrastive Self-supervision and Latent Diffusion

Reconstructing visual stimuli from human brain activities provides a promising opportunity to advance our understanding of the brain's visual system and its connection with computer vision models. Although deep generative models have been…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Jingyuan Sun, Mingxiao Li, Marie-Francine Moens

MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion

Reconstructing visual stimuli from brain recordings has been a meaningful and challenging task. Especially, the achievement of precise and controllable image reconstruction bears great significance in propelling the progress and utilization…

Computer Vision and Pattern Recognition · Computer Science 2023-08-09 Yizhuo Lu, Changde Du, Qiongyi Zhou, Dianpeng Wang, Huiguang He

MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion

Reconstructing visual stimuli from measured functional magnetic resonance imaging (fMRI) has been a meaningful and challenging task. Previous studies have successfully achieved reconstructions with structures similar to the original images,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-27 Yizhuo Lu, Changde Du, Dianpeng Wang, Huiguang He

Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance

Understanding how humans process visual information is one of the crucial steps for unraveling the underlying mechanism of brain activity. Recently, this curiosity has motivated the fMRI-to-image reconstruction task; given the fMRI data…

Computer Vision and Pattern Recognition · Computer Science 2024-09-19 Jaehoon Joo, Taejin Jeong, Seongjae Hwang

Diffusion Models for Computational Neuroimaging: A Survey

Computational neuroimaging involves analyzing brain images or signals to provide mechanistic insights and predictive tools for human cognition and behavior. While diffusion models have shown stability and high-quality generation in natural…

Computer Vision and Pattern Recognition · Computer Science 2025-02-11 Haokai Zhao, Haowei Lou, Lina Yao, Wei Peng, Ehsan Adeli, Kilian M Pohl, Yu Zhang

NeuralDiffuser: Neuroscience-inspired Diffusion Guidance for fMRI Visual Reconstruction

Reconstructing visual stimuli from functional Magnetic Resonance Imaging (fMRI) enables fine-grained retrieval of brain activity. However, the accurate reconstruction of diverse details, including structure, background, texture, color, and…

Neural and Evolutionary Computing · Computer Science 2025-01-09 Haoyu Li, Hao Wu, Badong Chen

VGDM: Vision-Guided Diffusion Model for Brain Tumor Detection and Segmentation

Accurate detection and segmentation of brain tumors from magnetic resonance imaging (MRI) are essential for diagnosis, treatment planning, and clinical monitoring. While convolutional architectures such as U-Net have long been the backbone…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Arman Behnam

DCBM: Data-Efficient Visual Concept Bottleneck Models

Concept Bottleneck Models (CBMs) enhance the interpretability of neural networks by basing predictions on human-understandable concepts. However, current CBMs typically rely on concept sets extracted from large language models or extensive…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Katharina Prasse, Patrick Knab, Sascha Marton, Christian Bartelt, Margret Keuper

A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models

Text-to-image diffusion models have made significant advancements in generating high-quality, diverse images from text prompts. However, the inherent limitations of textual signals often prevent these models from fully capturing specific…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Ziqiang Li, Jun Li, Lizhi Xiong, Zhangjie Fu, Zechao Li

Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI

Understanding how the brain encodes visual information is a central challenge in neuroscience and machine learning. A promising approach is to reconstruct visual stimuli, essentially images, from functional Magnetic Resonance Imaging (fMRI)…

Computer Vision and Pattern Recognition · Computer Science 2026-05-15 Zheng Huang, Enpei Zhang, Weikang Qiu, Yinghao Cai, Carl Yang, Elynn Chen, Xiang Zhang, Rex Ying, Dawei Zhou, Yujun Yan

FDDM: Unsupervised Medical Image Translation with a Frequency-Decoupled Diffusion Model

Diffusion models have demonstrated significant potential in producing high-quality images in medical image translation to aid disease diagnosis, localization, and treatment. Nevertheless, current diffusion models have limited success in…

Image and Video Processing · Electrical Eng. & Systems 2024-11-26 Yunxiang Li, Hua-Chieh Shao, Xiaoxue Qian, You Zhang

Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation

Visual information has been introduced for enhancing machine translation (MT), and its effectiveness heavily relies on the availability of large amounts of bilingual parallel sentence pairs with manual image annotations. In this paper, we…

Computation and Language · Computer Science 2025-01-07 Andong Chen, Yuchen Song, Kehai Chen, Muyun Yang, Tiejun Zhao, Min Zhang

SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model

Semantic Communication (SC) is an emerging technology that has attracted much attention in the sixth-generation (6G) mobile communication systems. However, few studies have fully considered the perceptual quality of the reconstructed…

Image and Video Processing · Electrical Eng. & Systems 2024-10-04 Kexin Zhang, Lixin Li, Wensheng Lin, Yuna Yan, Wenchi Cheng, Zhu Han

Semantic Brain Decoding: from fMRI to conceptually similar image reconstruction of visual stimuli

Brain decoding is a field of computational neuroscience that uses measurable brain activity to infer mental states or internal representations of perceptual inputs. Therefore, we propose a novel approach to brain decoding that also relies…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Matteo Ferrante, Tommaso Boccato, Nicola Toschi

Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review

The diffusion model has recently emerged as a potent approach in computer vision, demonstrating remarkable performances in the field of generative artificial intelligence. Capable of producing high-quality synthetic images, diffusion models…

Image and Video Processing · Electrical Eng. & Systems 2025-05-14 Abdullah, Tao Huang, Ickjai Lee, Euijoon Ahn

MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation

We introduce MVControl, a novel neural network architecture that enhances existing pre-trained multi-view 2D diffusion models by incorporating additional input conditions, e.g. edge maps. Our approach enables the generation of controllable…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Zhiqi Li, Yiming Chen, Lingzhe Zhao, Peidong Liu

Diffusion-based Blind Text Image Super-Resolution

Recovering degraded low-resolution text images is challenging, especially for Chinese text images with complex strokes and severe degradation in real-world scenarios. Ensuring both text fidelity and style realness is crucial for…

Computer Vision and Pattern Recognition · Computer Science 2024-03-05 Yuzhe Zhang, Jiawei Zhang, Hao Li, Zhouxia Wang, Luwei Hou, Dongqing Zou, Liheng Bian

MDDM: A Multi-view Discriminative Enhanced Diffusion-based Model for Speech Enhancement

With the development of deep learning, speech enhancement has been greatly optimized in terms of speech quality. Previous methods typically focus on the discriminative supervised learning or generative modeling, which tends to introduce…

Audio and Speech Processing · Electrical Eng. & Systems 2025-10-31 Nan Xu, Zhaolong Huang, Xiaonan Zhi

Invertible Diffusion Models for Compressed Sensing

While deep neural networks (NN) significantly advance image compressed sensing (CS) by improving reconstruction quality, the necessity of training current CS NNs from scratch constrains their effectiveness and hampers rapid deployment.…

Computer Vision and Pattern Recognition · Computer Science 2025-02-04 Bin Chen, Zhenyu Zhang, Weiqi Li, Chen Zhao, Jiwen Yu, Shijie Zhao, Jie Chen, Jian Zhang

Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images

Spatial control methods using additional modules on pretrained diffusion models have gained attention for enabling conditional generation in natural images. These methods guide the generation process with new conditions while leveraging the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Suhyun Ahn, Wonjung Park, Jihoon Cho, Seunghyuck Park, Jinah Park