Related papers: Efficient Semantic Image Synthesis via Class-Adapt…

Rethinking Spatially-Adaptive Normalization

Spatially-adaptive normalization is remarkably successful recently in conditional semantic image synthesis, which modulates the normalized activation with spatially-varying transformations learned from semantic layouts, to preserve the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Zhentao Tan , Dongdong Chen , Qi Chu , Menglei Chai , Jing Liao , Mingming He , Lu Yuan , Nenghai Yu

Semantic Image Synthesis with Spatially-Adaptive Normalization

We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then…

Computer Vision and Pattern Recognition · Computer Science 2019-11-06 Taesung Park , Ming-Yu Liu , Ting-Chun Wang , Jun-Yan Zhu

Semantic Image Synthesis via Class-Adaptive Cross-Attention

In semantic image synthesis the state of the art is dominated by methods that use customized variants of the SPatially-Adaptive DE-normalization (SPADE) layers, which allow for good visual generation quality and editing versatility. By…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Tomaso Fontanini , Claudio Ferrari , Giuseppe Lisanti , Massimo Bertozzi , Andrea Prati

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

Recent advancements in large-scale pre-trained text-to-image models have led to remarkable progress in semantic image synthesis. Nevertheless, synthesizing high-quality images with consistent semantics and layout remains a challenge. In…

Computer Vision and Pattern Recognition · Computer Science 2024-03-05 Zhengyao Lv , Yuxiang Wei , Wangmeng Zuo , Kwan-Yee K. Wong

Diverse Semantic Image Synthesis via Probability Distribution Modeling

Semantic image synthesis, translating semantic layouts to photo-realistic images, is a one-to-many mapping problem. Though impressive progress has been recently made, diverse semantic synthesis that can efficiently produce semantic-level…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Zhentao Tan , Menglei Chai , Dongdong Chen , Jing Liao , Qi Chu , Bin Liu , Gang Hua , Nenghai Yu

Context-Consistent Semantic Image Editing with Style-Preserved Modulation

Semantic image editing utilizes local semantic label maps to generate the desired content in the edited region. A recent work borrows SPADE block to achieve semantic image editing. However, it cannot produce pleasing results due to style…

Computer Vision and Pattern Recognition · Computer Science 2022-07-14 Wuyang Luo , Su Yang , Hong Wang , Bo Long , Weishan Zhang

Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis

Recent years have witnessed substantial progress in semantic image synthesis, it is still challenging in synthesizing photo-realistic images with rich details. Most previous methods focus on exploiting the given semantic map, which just…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Zhengyao Lv , Xiaoming Li , Zhenxing Niu , Bing Cao , Wangmeng Zuo

Structured prototype regularization for synthetic-to-real driving scene parsing

Driving scene parsing is critical for autonomous vehicles to operate reliably in complex real-world traffic environments. To reduce the reliance on costly pixel-level annotations, synthetic datasets with automatically generated labels have…

Computer Vision and Pattern Recognition · Computer Science 2026-03-18 Jiahe Fan , Xiao Ma , Sergey Vityazev , George Giakos , Shaolong Shu , Rui Fan

SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains

Domain generalization for semantic segmentation aims to mitigate the degradation in model performance caused by domain shifts. However, in many real-world scenarios, we are unable to access the model parameters and architectural details due…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Qingmei Li , Yang Zhang , Peifeng Zhang , Haohuan Fu , Juepeng Zheng

SHED: Style-Homogenized Embedding Alignment for Domain Generalization

Domain generalization aims to enhance model robustness against unseen domains with embedding distribution shifts. While large-scale vision-language models like CLIP exhibit strong generalization, their direct image-text embedding alignment…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Kai Gan , Tong Wei

SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning

Panoptic Scene Graph Generation (PSG) integrates instance segmentation with relation understanding to capture pixel-level structural relationships in complex scenes. Although recent approaches leveraging pre-trained vision-language models…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Xin Hu , Ke Qin , Guiduo Duan , Ming Li , Yuan-Fang Li , Tao He

Exploring the Representation Power of SPLADE Models

The SPLADE (SParse Lexical AnD Expansion) model is a highly effective approach to learned sparse retrieval, where documents are represented by term impact scores derived from large language models. During training, SPLADE applies…

Information Retrieval · Computer Science 2023-06-30 Joel Mackenzie , Shengyao Zhuang , Guido Zuccon

SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization

Multilayer-perceptrons (MLP) are known to struggle with learning functions of high-frequencies, and in particular cases with wide frequency bands. We present a spatially adaptive progressive encoding (SAPE) scheme for input signals of MLP…

Machine Learning · Computer Science 2021-05-31 Amir Hertz , Or Perel , Raja Giryes , Olga Sorkine-Hornung , Daniel Cohen-Or

SPADE: Spatial Transcriptomics and Pathology Alignment Using a Mixture of Data Experts for an Expressive Latent Space

The rapid growth of digital pathology and advances in self-supervised deep learning have enabled the development of foundational models for various pathology tasks across diverse diseases. While multimodal approaches integrating diverse…

Computer Vision and Pattern Recognition · Computer Science 2025-10-15 Ekaterina Redekop , Mara Pleasure , Zichen Wang , Kimberly Flores , Anthony Sisk , William Speier , Corey W. Arnold

Towards Pragmatic Semantic Image Synthesis for Urban Scenes

The need for large amounts of training and validation data is a huge concern in scaling AI algorithms for autonomous driving. Semantic Image Synthesis (SIS), or label-to-image translation, promises to address this issue by translating…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 George Eskandar , Diandian Guo , Karim Guirguis , Bin Yang

Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition

Learning light-weight yet expressive deep networks in both image synthesis and image recognition remains a challenging problem. Inspired by a more recent observation that it is the data-specificity that makes the multi-head self-attention…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Jianghao Shen , Tianfu Wu

A 2D Semantic-Aware Position Encoding for Vision Transformers

Vision transformers have demonstrated significant advantages in computer vision tasks due to their ability to capture long-range dependencies and contextual relationships through self-attention. However, existing position encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Xi Chen , Shiyang Zhou , Muqi Huang , Jiaxu Feng , Yun Xiong , Kun Zhou , Biao Yang , Yuhui Zhang , Huishuai Bao , Sijia Peng , Chuan Li , Feng Shi

CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder

Semantic segmentation has recently achieved notable advances by exploiting "class-level" contextual information during learning. However, these approaches simply concatenate class-level information to pixel features to boost the pixel…

Computer Vision and Pattern Recognition · Computer Science 2023-01-12 Ye Huang , Di Kang , Liang Chen , Wenjing Jia , Xiangjian He , Lixin Duan , Xuefei Zhe , Linchao Bao

CLUDA : Contrastive Learning in Unsupervised Domain Adaptation for Semantic Segmentation

In this work, we propose CLUDA, a simple, yet novel method for performing unsupervised domain adaptation (UDA) for semantic segmentation by incorporating contrastive losses into a student-teacher learning paradigm, that makes use of…

Computer Vision and Pattern Recognition · Computer Science 2022-11-09 Midhun Vayyat , Jaswin Kasi , Anuraag Bhattacharya , Shuaib Ahmed , Rahul Tallamraju

Semantically Adaptive Image-to-image Translation for Domain Adaptation of Semantic Segmentation

Domain shift is a very challenging problem for semantic segmentation. Any model can be easily trained on synthetic data, where images and labels are artificially generated, but it will perform poorly when deployed on real environments. In…

Computer Vision and Pattern Recognition · Computer Science 2020-09-03 Luigi Musto , Andrea Zinelli