English
Related papers

Related papers: Relation Rectification in Diffusion Model

200 papers

Using reviews to learn user and item representations is important for recommender system. Current review based methods can be divided into two categories: (1) the Convolution Neural Network (CNN) based models that extract n-gram features…

Information Retrieval · Computer Science 2020-11-30 Hansi Zeng , Qingyao Ai

Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio. Due to their adaptiveness in iterative refinement, they provide a strong potential for achieving better non-autoregressive…

Computation and Language · Computer Science 2024-02-26 Yuxuan Liu , Tianchi Yang , Shaohan Huang , Zihan Zhang , Haizhen Huang , Furu Wei , Weiwei Deng , Feng Sun , Qi Zhang

The conditional text-to-image diffusion models have garnered significant attention in recent years. However, the precision of these models is often compromised mainly for two reasons, ambiguous condition input and inadequate condition…

Computer Vision and Pattern Recognition · Computer Science 2024-03-28 Sicheng Li , Keqiang Sun , Zhixin Lai , Xiaoshi Wu , Feng Qiu , Haoran Xie , Kazunori Miyata , Hongsheng Li

Graph matching pairs corresponding nodes across two or more graphs. The problem is difficult as it is hard to capture the structural similarity across graphs, especially on large graphs. We propose to incorporate high-order information for…

Artificial Intelligence · Computer Science 2020-10-12 Hui Xu , Liyao Xiang , Youmin Le , Xiaoying Gan , Yuting Jia , Luoyi Fu , Xinbing Wang

Text-to-image synthesis aims to automatically generate images according to text descriptions given by users, which is a highly challenging task. The main issues of text-to-image synthesis lie in two gaps: the heterogeneous and homogeneous…

Computer Vision and Pattern Recognition · Computer Science 2018-08-22 Mingkuan Yuan , Yuxin Peng

Arbitrary shape text detection is a challenging task due to the high variety and complexity of scenes texts. In this paper, we propose a novel unified relational reasoning graph network for arbitrary shape text detection. In our method, an…

Computer Vision and Pattern Recognition · Computer Science 2020-09-01 Shi-Xue Zhang , Xiaobin Zhu , Jie-Bo Hou , Chang Liu , Chun Yang , Hongfa Wang , Xu-Cheng Yin

Most real-world networks are noisy and incomplete samples from an unknown target distribution. Refining them by correcting corruptions or inferring unobserved regions typically improves downstream performance. Inspired by the impressive…

Personalized text-to-image models allow users to generate varied styles of images (specified with a sentence) for an object (specified with a set of reference images). While remarkable results have been achieved using diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Fanyue Wei , Wei Zeng , Zhenyang Li , Dawei Yin , Lixin Duan , Wen Li

We present HICNet, a reference-guided exposure correction framework. A lightweight, content-agnostic encoder distills each image into a compact illumination embedding capturing regional brightness, edge contrast, and higher-order luminance…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Hao Ren , Zetong Bi , Zhaoliang Wan , Hui Cheng

Graph neural networks (GNNs) have been broadly studied on dynamic graphs for their representation learning, majority of which focus on graphs with homogeneous structures in the spatial domain. However, many real-world graphs - i.e.,…

Machine Learning · Computer Science 2021-10-27 Yujie Fan , Mingxuan Ju , Chuxu Zhang , Liang Zhao , Yanfang Ye

Despite the latest remarkable advances in generative modeling, efficient generation of high-quality 3D assets from textual prompts remains a difficult task. A key challenge lies in data scarcity: the most extensive 3D datasets encompass…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Antoine Mercier , Ramin Nakhli , Mahesh Reddy , Rajeev Yasarla , Hong Cai , Fatih Porikli , Guillaume Berger

Motion retargeting for specific robot from existing motion datasets is one critical step in transferring motion patterns from human behaviors to and across various robots. However, inconsistencies in topological structure, geometrical…

Robotics · Computer Science 2025-05-28 Zhefeng Cao , Ben Liu , Sen Li , Wei Zhang , Hua Chen

Hypergraph neural networks (HGNN) have recently become attractive and received significant attention due to their excellent performance in various domains. However, most existing HGNNs rely on first-order approximations of hypergraph…

Artificial Intelligence · Computer Science 2024-01-11 Maolin Wang , Yaoming Zhen , Yu Pan , Yao Zhao , Chenyi Zhuang , Zenglin Xu , Ruocheng Guo , Xiangyu Zhao

Reinforcement learning (RL) has improved guided image generation with diffusion models by directly optimizing rewards that capture image quality, aesthetics, and instruction following capabilities. However, the resulting generative policies…

Computer Vision and Pattern Recognition · Computer Science 2024-06-25 Owen Oertell , Jonathan D. Chang , Yiyi Zhang , Kianté Brantley , Wen Sun

Recent text-to-image models have achieved impressive results. However, since they require large-scale datasets of text-image pairs, it is impractical to train them on new domains where data is scarce or not labeled. In this work, we propose…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Shelly Sheynin , Oron Ashual , Adam Polyak , Uriel Singer , Oran Gafni , Eliya Nachmani , Yaniv Taigman

Graph Convolutional Networks (GCNs) have been widely applied in various fields due to their significant power on processing graph-structured data. Typical GCN and its variants work under a homophily assumption (i.e., nodes with same class…

Machine Learning · Computer Science 2021-12-28 Tao Wang , Rui Wang , Di Jin , Dongxiao He , Yuxiao Huang

We introduce a diffusion-based cross-domain image translator in the absence of paired training data. Unlike GAN-based methods, our approach integrates diffusion models to learn the image translation process, allowing for more coverable…

Computer Vision and Pattern Recognition · Computer Science 2026-01-30 Shilong Zou , Yuhang Huang , Renjiao Yi , Chenyang Zhu , Kai Xu

While diffusion models dominate the field of visual generation, they are computationally inefficient, applying a uniform computational effort regardless of different complexity. In contrast, autoregressive (AR) models are inherently…

Computer Vision and Pattern Recognition · Computer Science 2026-04-15 Jian Han , Jinlai Liu , Jiahuan Wang , Bingyue Peng , Zehuan Yuan

Traffic forecasting is significant for urban traffic management, intelligent route planning, and real-time flow monitoring. Recent advances in spatial-temporal models have markedly improved the modeling of intricate spatial-temporal…

Machine Learning · Computer Science 2025-09-03 Xinyu Ji , Chengcheng Yan , Jibiao Yuan , Fiefie Zhao

Image stitching from different captures often results in non-rectangular boundaries, which is often considered unappealing. To solve non-rectangular boundaries, current solutions involve cropping, which discards image content, inpainting,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Tianhao Zhou , Haipeng Li , Ziyi Wang , Ao Luo , Chen-Lin Zhang , Jiajun Li , Bing Zeng , Shuaicheng Liu