English
Related papers

Related papers: GuideFlow3D: Optimization-Guided Rectified Flow Fo…

200 papers

Despite the progress of learning-based methods for 6D object pose estimation, the trade-off between accuracy and scalability for novel objects still exists. Specifically, previous methods for novel objects do not make good use of the target…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Sungphill Moon , Hyeontae Son , Dongcheol Hur , Sangwook Kim

Motion transfer from the driving to the source portrait remains a key challenge in the portrait animation. Current diffusion-based approaches condition only on the driving motion, which fails to capture source-to-driving correspondences and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Yating Xu , Yunqi Miao , Evangelos Ververas , Jiankang Deng , Jifei Song

Large-scale diffusion models have achieved remarkable performance in generative tasks. Beyond their initial training applications, these models have proven their ability to function as versatile plug-and-play priors. For instance, 2D…

Computer Vision and Pattern Recognition · Computer Science 2025-02-21 Xiaofeng Yang , Cheng Chen , Xulei Yang , Fayao Liu , Guosheng Lin

In autonomous driving, vision-centric 3D object detection recognizes and localizes 3D objects from RGB images. However, due to high annotation costs and diverse outdoor scenes, training data often fails to cover all possible test scenarios,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Hongbin Lin , Yiming Yang , Chaoda Zheng , Yifan Zhang , Shuaicheng Niu , Zilu Guo , Yafeng Li , Gui Gui , Shuguang Cui , Zhen Li

Style transfer, a pivotal task in image processing, synthesizes visually compelling images by seamlessly blending realistic content with artistic styles, enabling applications in photo editing and creative design. While mainstream…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yingying Deng , Xiangyu He , Fan Tang , Weiming Dong , Xucheng Yin

3D style transfer enables the creation of visually expressive 3D content, enriching the visual appearance of 3D scenes and objects. However, existing VGG- and CLIP-based methods struggle to model multi-view consistency within the model…

Computer Vision and Pattern Recognition · Computer Science 2026-01-28 Yitong Yang , Xuexin Liu , Yinglin Wang , Jing Wang , Hao Dou , Changshuo Wang , Shuting He

Exploiting pre-trained diffusion models for restoration has recently become a favored alternative to the traditional task-specific training approach. Previous works have achieved noteworthy success by limiting the solution space using…

Computer Vision and Pattern Recognition · Computer Science 2023-09-20 Peiqing Yang , Shangchen Zhou , Qingyi Tao , Chen Change Loy

Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos. Rectified flow is a…

Generating high-quality 3D objects from textual descriptions remains a challenging problem due to computational cost, the scarcity of 3D data, and complex 3D representations. We introduce Geometry Image Diffusion (GIMDiffusion), a novel…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Slava Elizarov , Ciara Rowles , Simon Donné

Recent advances in imitation learning for 3D robotic manipulation have shown promising results with diffusion-based policies. However, achieving human-level dexterity requires seamless integration of geometric precision and semantic…

Recently, text-to-image generation has exhibited remarkable advancements, with the ability to produce visually impressive results. In contrast, text-to-3D generation has not yet reached a comparable level of quality. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Yukang Cao , Yan-Pei Cao , Kai Han , Ying Shan , Kwan-Yee K. Wong

Recent advances in text-driven 3D scene editing and stylization, which leverage the powerful capabilities of 2D generative models, have demonstrated promising outcomes. However, challenges remain in ensuring high-quality stylization and…

Graphics · Computer Science 2026-03-03 Haruo Fujiwara , Yusuke Mukuta , Tatsuya Harada

Customizing diffusion models to generate identity-preserving images from user-provided reference images is an intriguing new problem. The prevalent approaches typically require training on extensive domain-specific images to achieve…

Computer Vision and Pattern Recognition · Computer Science 2024-12-11 Zhicheng Sun , Zhenhao Yang , Yang Jin , Haozhe Chi , Kun Xu , Kun Xu , Liwei Chen , Hao Jiang , Yang Song , Kun Gai , Yadong Mu

Recent text-driven motion generation methods span both discrete token-based approaches and continuous-latent formulations. MotionGPT3 exemplifies the latter paradigm, combining a learned continuous motion latent space with a diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Jaymin Ban , JiHong Jeon , SangYeop Jeong

Pose guided synthesis aims to generate a new image in an arbitrary target pose while preserving the appearance details from the source image. Existing approaches rely on either hard-coded spatial transformations or 3D body modeling. They…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Haitian Zheng , Lele Chen , Chenliang Xu , Jiebo Luo

Flow matching has shown state-of-the-art performance in various generative tasks, ranging from image generation to decision-making, where generation under energy guidance (abbreviated as guidance in the following) is pivotal. However, the…

Machine Learning · Computer Science 2025-05-27 Ruiqi Feng , Chenglei Yu , Wenhao Deng , Peiyan Hu , Tailin Wu

Multiview diffusion models have rapidly emerged as a powerful tool for content creation with spatial consistency across viewpoints, offering rich visual realism without requiring explicit geometry and appearance representation. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Hubert Kompanowski , Varun Jampani , Aaryaman Vasishta , Binh-Son Hua

Rectified Flow text-to-image models surpass diffusion models in image quality and text alignment, but adapting ReFlow for real-image editing remains challenging. We propose a new real-image editing method for ReFlow by analyzing the…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Jimyeong Kim , Jungwon Park , Yeji Song , Nojun Kwak , Wonjong Rhee

We present a novel approach for the task of human pose transfer, which aims at synthesizing a new image of a person from an input image of that person and a target pose. We address the issues of limited correspondences identified between…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Yining Li , Chen Huang , Chen Change Loy

Generating new images with desired properties (e.g. new view/poses) from source images has been enthusiastically pursued recently, due to its wide range of potential applications. One way to ensure high-quality generation is to use multiple…

Computer Vision and Pattern Recognition · Computer Science 2022-02-03 Jiawei Lu , He Wang , Tianjia Shao , Yin Yang , Kun Zhou
‹ Prev 1 2 3 10 Next ›