Related papers: GuideFlow3D: Optimization-Guided Rectified Flow Fo…

GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects

Despite the progress of learning-based methods for 6D object pose estimation, the trade-off between accuracy and scalability for novel objects still exists. Specifically, previous methods for novel objects do not make good use of the target…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Sungphill Moon , Hyeontae Son , Dongcheol Hur , Sangwook Kim

FG-Portrait: 3D Flow Guided Editable Portrait Animation

Motion transfer from the driving to the source portrait remains a key challenge in the portrait animation. Current diffusion-based approaches condition only on the driving motion, which fails to capture source-to-driving correspondences and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Yating Xu , Yunqi Miao , Evangelos Ververas , Jiankang Deng , Jifei Song

Text-to-Image Rectified Flow as Plug-and-Play Priors

Large-scale diffusion models have achieved remarkable performance in generative tasks. Beyond their initial training applications, these models have proven their ability to function as versatile plug-and-play priors. For instance, 2D…

Computer Vision and Pattern Recognition · Computer Science 2025-02-21 Xiaofeng Yang , Cheng Chen , Xulei Yang , Fayao Liu , Guosheng Lin

DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving

In autonomous driving, vision-centric 3D object detection recognizes and localizes 3D objects from RGB images. However, due to high annotation costs and diverse outdoor scenes, training data often fails to cover all possible test scenarios,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Hongbin Lin , Yiming Yang , Chaoda Zheng , Yifan Zhang , Shuaicheng Niu , Zilu Guo , Yafeng Li , Gui Gui , Shuguang Cui , Zhen Li

Inversion-Free Style Transfer with Dual Rectified Flows

Style transfer, a pivotal task in image processing, synthesizes visually compelling images by seamlessly blending realistic content with artistic styles, enabling applications in photo editing and creative design. While mainstream…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yingying Deng , Xiangyu He , Fan Tang , Weiming Dong , Xucheng Yin

DiffStyle3D: Consistent 3D Gaussian Stylization via Attention Optimization

3D style transfer enables the creation of visually expressive 3D content, enriching the visual appearance of 3D scenes and objects. However, existing VGG- and CLIP-based methods struggle to model multi-view consistency within the model…

Computer Vision and Pattern Recognition · Computer Science 2026-01-28 Yitong Yang , Xuexin Liu , Yinglin Wang , Jing Wang , Hao Dou , Changshuo Wang , Shuting He

PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

Exploiting pre-trained diffusion models for restoration has recently become a favored alternative to the traditional task-specific training approach. Previous works have achieved noteworthy success by limiting the solution space using…

Computer Vision and Pattern Recognition · Computer Science 2023-09-20 Peiqing Yang , Shangchen Zhou , Qingyi Tao , Chen Change Loy

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos. Rectified flow is a…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Patrick Esser , Sumith Kulal , Andreas Blattmann , Rahim Entezari , Jonas Müller , Harry Saini , Yam Levi , Dominik Lorenz , Axel Sauer , Frederic Boesel , Dustin Podell , Tim Dockhorn , Zion English , Kyle Lacey , Alex Goodwin , Yannik Marek , Robin Rombach

Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation

Generating high-quality 3D objects from textual descriptions remains a challenging problem due to computational cost, the scarcity of 3D data, and complex 3D representations. We introduce Geometry Image Diffusion (GIMDiffusion), a novel…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Slava Elizarov , Ciara Rowles , Simon Donné

G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation

Recent advances in imitation learning for 3D robotic manipulation have shown promising results with diffusion-based policies. However, achieving human-level dexterity requires seamless integration of geometric precision and semantic…

Robotics · Computer Science 2025-06-24 Tianxing Chen , Yao Mu , Zhixuan Liang , Zanxin Chen , Shijia Peng , Qiangyu Chen , Mingkun Xu , Ruizhen Hu , Hongyuan Zhang , Xuelong Li , Ping Luo

Guide3D: Create 3D Avatars from Text and Image Guidance

Recently, text-to-image generation has exhibited remarkable advancements, with the ability to produce visually impressive results. In contrast, text-to-3D generation has not yet reached a comparable level of quality. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Yukang Cao , Yan-Pei Cao , Kai Han , Ying Shan , Kwan-Yee K. Wong

Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control

Recent advances in text-driven 3D scene editing and stylization, which leverage the powerful capabilities of 2D generative models, have demonstrated promising outcomes. However, challenges remain in ensuring high-quality stylization and…

Graphics · Computer Science 2026-03-03 Haruo Fujiwara , Yusuke Mukuta , Tatsuya Harada

RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Customizing diffusion models to generate identity-preserving images from user-provided reference images is an intriguing new problem. The prevalent approaches typically require training on extensive domain-specific images to achieve…

Computer Vision and Pattern Recognition · Computer Science 2024-12-11 Zhicheng Sun , Zhenhao Yang , Yang Jin , Haozhe Chi , Kun Xu , Kun Xu , Liwei Chen , Hao Jiang , Yang Song , Kun Gai , Yadong Mu

From Diffusion to Flow: Efficient Motion Generation in MotionGPT3

Recent text-driven motion generation methods span both discrete token-based approaches and continuous-latent formulations. MotionGPT3 exemplifies the latter paradigm, combining a learned continuous motion latent space with a diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Jaymin Ban , JiHong Jeon , SangYeop Jeong

Unsupervised Pose Flow Learning for Pose Guided Synthesis

Pose guided synthesis aims to generate a new image in an arbitrary target pose while preserving the appearance details from the source image. Existing approaches rely on either hard-coded spatial transformations or 3D body modeling. They…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Haitian Zheng , Lele Chen , Chenliang Xu , Jiebo Luo

On the Guidance of Flow Matching

Flow matching has shown state-of-the-art performance in various generative tasks, ranging from image generation to decision-making, where generation under energy guidance (abbreviated as guidance in the following) is pivotal. However, the…

Machine Learning · Computer Science 2025-05-27 Ruiqi Feng , Chenglei Yu , Wenhao Deng , Peiyan Hu , Tailin Wu

FROMAT: Multiview Material Appearance Transfer via Few-Shot Self-Attention Adaptation

Multiview diffusion models have rapidly emerged as a powerful tool for content creation with spatial consistency across viewpoints, offering rich visual realism without requiring explicit geometry and appearance representation. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Hubert Kompanowski , Varun Jampani , Aaryaman Vasishta , Binh-Son Hua

ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation

Rectified Flow text-to-image models surpass diffusion models in image quality and text alignment, but adapting ReFlow for real-image editing remains challenging. We propose a new real-image editing method for ReFlow by analyzing the…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Jimyeong Kim , Jungwon Park , Yeji Song , Nojun Kwak , Wonjong Rhee

Dense Intrinsic Appearance Flow for Human Pose Transfer

We present a novel approach for the task of human pose transfer, which aims at synthesizing a new image of a person from an input image of that person and a target pose. We address the issues of limited correspondences identified between…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Yining Li , Chen Huang , Chen Change Loy

Pose Guided Image Generation from Misaligned Sources via Residual Flow Based Correction

Generating new images with desired properties (e.g. new view/poses) from source images has been enthusiastically pursued recently, due to its wide range of potential applications. One way to ensure high-quality generation is to use multiple…

Computer Vision and Pattern Recognition · Computer Science 2022-02-03 Jiawei Lu , He Wang , Tianjia Shao , Yin Yang , Kun Zhou