English
Related papers

Related papers: Efficient Semantic Image Synthesis via Class-Adapt…

200 papers

Spatially-adaptive normalization is remarkably successful recently in conditional semantic image synthesis, which modulates the normalized activation with spatially-varying transformations learned from semantic layouts, to preserve the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Zhentao Tan , Dongdong Chen , Qi Chu , Menglei Chai , Jing Liao , Mingming He , Lu Yuan , Nenghai Yu

We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then…

Computer Vision and Pattern Recognition · Computer Science 2019-11-06 Taesung Park , Ming-Yu Liu , Ting-Chun Wang , Jun-Yan Zhu

In semantic image synthesis the state of the art is dominated by methods that use customized variants of the SPatially-Adaptive DE-normalization (SPADE) layers, which allow for good visual generation quality and editing versatility. By…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Tomaso Fontanini , Claudio Ferrari , Giuseppe Lisanti , Massimo Bertozzi , Andrea Prati

Recent advancements in large-scale pre-trained text-to-image models have led to remarkable progress in semantic image synthesis. Nevertheless, synthesizing high-quality images with consistent semantics and layout remains a challenge. In…

Computer Vision and Pattern Recognition · Computer Science 2024-03-05 Zhengyao Lv , Yuxiang Wei , Wangmeng Zuo , Kwan-Yee K. Wong

Semantic image synthesis, translating semantic layouts to photo-realistic images, is a one-to-many mapping problem. Though impressive progress has been recently made, diverse semantic synthesis that can efficiently produce semantic-level…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Zhentao Tan , Menglei Chai , Dongdong Chen , Jing Liao , Qi Chu , Bin Liu , Gang Hua , Nenghai Yu

Semantic image editing utilizes local semantic label maps to generate the desired content in the edited region. A recent work borrows SPADE block to achieve semantic image editing. However, it cannot produce pleasing results due to style…

Computer Vision and Pattern Recognition · Computer Science 2022-07-14 Wuyang Luo , Su Yang , Hong Wang , Bo Long , Weishan Zhang

Recent years have witnessed substantial progress in semantic image synthesis, it is still challenging in synthesizing photo-realistic images with rich details. Most previous methods focus on exploiting the given semantic map, which just…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Zhengyao Lv , Xiaoming Li , Zhenxing Niu , Bing Cao , Wangmeng Zuo

Driving scene parsing is critical for autonomous vehicles to operate reliably in complex real-world traffic environments. To reduce the reliance on costly pixel-level annotations, synthetic datasets with automatically generated labels have…

Computer Vision and Pattern Recognition · Computer Science 2026-03-18 Jiahe Fan , Xiao Ma , Sergey Vityazev , George Giakos , Shaolong Shu , Rui Fan

Domain generalization for semantic segmentation aims to mitigate the degradation in model performance caused by domain shifts. However, in many real-world scenarios, we are unable to access the model parameters and architectural details due…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Qingmei Li , Yang Zhang , Peifeng Zhang , Haohuan Fu , Juepeng Zheng

Domain generalization aims to enhance model robustness against unseen domains with embedding distribution shifts. While large-scale vision-language models like CLIP exhibit strong generalization, their direct image-text embedding alignment…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Kai Gan , Tong Wei

Panoptic Scene Graph Generation (PSG) integrates instance segmentation with relation understanding to capture pixel-level structural relationships in complex scenes. Although recent approaches leveraging pre-trained vision-language models…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Xin Hu , Ke Qin , Guiduo Duan , Ming Li , Yuan-Fang Li , Tao He

The SPLADE (SParse Lexical AnD Expansion) model is a highly effective approach to learned sparse retrieval, where documents are represented by term impact scores derived from large language models. During training, SPLADE applies…

Information Retrieval · Computer Science 2023-06-30 Joel Mackenzie , Shengyao Zhuang , Guido Zuccon

Multilayer-perceptrons (MLP) are known to struggle with learning functions of high-frequencies, and in particular cases with wide frequency bands. We present a spatially adaptive progressive encoding (SAPE) scheme for input signals of MLP…

Machine Learning · Computer Science 2021-05-31 Amir Hertz , Or Perel , Raja Giryes , Olga Sorkine-Hornung , Daniel Cohen-Or

The rapid growth of digital pathology and advances in self-supervised deep learning have enabled the development of foundational models for various pathology tasks across diverse diseases. While multimodal approaches integrating diverse…

Computer Vision and Pattern Recognition · Computer Science 2025-10-15 Ekaterina Redekop , Mara Pleasure , Zichen Wang , Kimberly Flores , Anthony Sisk , William Speier , Corey W. Arnold

The need for large amounts of training and validation data is a huge concern in scaling AI algorithms for autonomous driving. Semantic Image Synthesis (SIS), or label-to-image translation, promises to address this issue by translating…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 George Eskandar , Diandian Guo , Karim Guirguis , Bin Yang

Learning light-weight yet expressive deep networks in both image synthesis and image recognition remains a challenging problem. Inspired by a more recent observation that it is the data-specificity that makes the multi-head self-attention…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Jianghao Shen , Tianfu Wu

Vision transformers have demonstrated significant advantages in computer vision tasks due to their ability to capture long-range dependencies and contextual relationships through self-attention. However, existing position encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Xi Chen , Shiyang Zhou , Muqi Huang , Jiaxu Feng , Yun Xiong , Kun Zhou , Biao Yang , Yuhui Zhang , Huishuai Bao , Sijia Peng , Chuan Li , Feng Shi

Semantic segmentation has recently achieved notable advances by exploiting "class-level" contextual information during learning. However, these approaches simply concatenate class-level information to pixel features to boost the pixel…

Computer Vision and Pattern Recognition · Computer Science 2023-01-12 Ye Huang , Di Kang , Liang Chen , Wenjing Jia , Xiangjian He , Lixin Duan , Xuefei Zhe , Linchao Bao

In this work, we propose CLUDA, a simple, yet novel method for performing unsupervised domain adaptation (UDA) for semantic segmentation by incorporating contrastive losses into a student-teacher learning paradigm, that makes use of…

Computer Vision and Pattern Recognition · Computer Science 2022-11-09 Midhun Vayyat , Jaswin Kasi , Anuraag Bhattacharya , Shuaib Ahmed , Rahul Tallamraju

Domain shift is a very challenging problem for semantic segmentation. Any model can be easily trained on synthetic data, where images and labels are artificially generated, but it will perform poorly when deployed on real environments. In…

Computer Vision and Pattern Recognition · Computer Science 2020-09-03 Luigi Musto , Andrea Zinelli
‹ Prev 1 2 3 10 Next ›