Related papers: SimCon Loss with Multiple Views for Text Supervise…

ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency

Recently, great success has been made in learning visual representations from text supervision, facilitating the emergence of text-supervised semantic segmentation. However, existing works focus on pixel grouping and cross-modal semantic…

Computer Vision and Pattern Recognition · Computer Science 2023-02-22 Pengzhen Ren , Changlin Li , Hang Xu , Yi Zhu , Guangrun Wang , Jianzhuang Liu , Xiaojun Chang , Xiaodan Liang

A Weakly-Supervised Semantic Segmentation Approach based on the Centroid Loss: Application to Quality Control and Inspection

It is generally accepted that one of the critical parts of current vision algorithms based on deep learning and convolutional neural networks is the annotation of a sufficient number of images to achieve competitive performance. This is…

Computer Vision and Pattern Recognition · Computer Science 2021-03-05 Kai Yao , Alberto Ortiz , Francisco Bonnin-Pascual

Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook

Semantic image segmentation, the process of classifying each pixel in an image into a particular class, plays an important role in many visual understanding systems. As the predominant criterion for evaluating the performance of statistical…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Reza Azad , Moein Heidary , Kadir Yilmaz , Michael Hüttemann , Sanaz Karimijafarbigloo , Yuli Wu , Anke Schmeink , Dorit Merhof

Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization

There is extensive interest in metric learning methods for image retrieval. Many metric learning loss functions focus on learning a correct ranking of training samples, but strongly overfit semantically inconsistent labels and require a…

Machine Learning · Computer Science 2023-06-05 Christopher Liao , Theodoros Tsiligkaridis , Brian Kulis

Learning to Segment using Summary Statistics and Weak Supervision

Medical experts often manually segment images to obtain diagnostic statistics and discard the resulting annotations. We aim to train segmentation models to alleviate this burden, but constrained to the retained summary statistics (e.g., the…

Computer Vision and Pattern Recognition · Computer Science 2026-05-06 Omkar Kulkarni , Edward Raff , Tim Oates

Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation

We introduce a new loss function for the weakly-supervised training of semantic image segmentation models based on three guiding principles: to seed with weak localization cues, to expand objects based on the information about which classes…

Computer Vision and Pattern Recognition · Computer Science 2016-08-09 Alexander Kolesnikov , Christoph H. Lampert

Correlation Maximized Structural Similarity Loss for Semantic Segmentation

Most semantic segmentation models treat semantic segmentation as a pixel-wise classification task and use a pixel-wise classification error as their optimization criterions. However, the pixel-wise error ignores the strong dependencies…

Computer Vision and Pattern Recognition · Computer Science 2019-10-22 Shuai Zhao , Boxi Wu , Wenqing Chu , Yao Hu , Deng Cai

Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings

We propose an approach to semantic segmentation that achieves state-of-the-art supervised performance when applied in a zero-shot setting. It thus achieves results equivalent to those of the supervised methods, on each of the major semantic…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Wei Yin , Yifan Liu , Chunhua Shen , Baichuan Sun , Anton van den Hengel

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

This paper addresses text-supervised semantic segmentation, aiming to learn a model capable of segmenting arbitrary visual concepts within images by using only image-text pairs without dense annotations. Existing methods have demonstrated…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Ji-Jia Wu , Andy Chia-Hao Chang , Chieh-Yu Chuang , Chun-Pei Chen , Yu-Lun Liu , Min-Hung Chen , Hou-Ning Hu , Yung-Yu Chuang , Yen-Yu Lin

The Contextual Loss for Image Transformation with Non-Aligned Data

Feed-forward CNNs trained for image transformation problems rely on loss functions that measure the similarity between the generated image and a target image. Most of the common loss functions assume that these images are spatially aligned…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Roey Mechrez , Itamar Talmi , Lihi Zelnik-Manor

Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation

To minimize the annotation costs associated with the training of semantic segmentation models, researchers have extensively investigated weakly-supervised segmentation approaches. In the current weakly-supervised segmentation methods, the…

Computer Vision and Pattern Recognition · Computer Science 2019-11-13 Wataru Shimoda , Keiji Yanai

Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning

Recent work shows that documents from encyclopedias serve as helpful auxiliary information for zero-shot learning. Existing methods align the entire semantics of a document with corresponding images to transfer knowledge. However, they…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Xiangyan Qu , Jing Yu , Keke Gai , Jiamin Zhuang , Yuanmin Tang , Gang Xiong , Gaopeng Gou , Qi Wu

A survey of loss functions for semantic segmentation

Image Segmentation has been an active field of research as it has a wide range of applications, ranging from automated disease detection to self-driving cars. In the past five years, various papers came up with different objective loss…

Image and Video Processing · Electrical Eng. & Systems 2020-12-15 Shruti Jadon

Unsupervised Total Variation Loss for Semi-supervised Deep Learning of Semantic Segmentation

We introduce a novel unsupervised loss function for learning semantic segmentation with deep convolutional neural nets (ConvNet) when densely labeled training images are not available. More specifically, the proposed loss function penalizes…

Computer Vision and Pattern Recognition · Computer Science 2018-08-09 Mehran Javanmardi , Mehdi Sajjadi , Ting Liu , Tolga Tasdizen

Improving Visual-Semantic Embeddings by Learning Semantically-Enhanced Hard Negatives for Cross-modal Information Retrieval

Visual Semantic Embedding (VSE) aims to extract the semantics of images and their descriptions, and embed them into the same latent space for cross-modal information retrieval. Most existing VSE networks are trained by adopting a hard…

Computer Vision and Pattern Recognition · Computer Science 2023-02-15 Yan Gong , Georgina Cosma

Combo Loss: Handling Input and Output Imbalance in Multi-Organ Segmentation

Simultaneous segmentation of multiple organs from different medical imaging modalities is a crucial task as it can be utilized for computer-aided diagnosis, computer-assisted surgery, and therapy planning. Thanks to the recent advances in…

Computer Vision and Pattern Recognition · Computer Science 2021-09-16 Saeid Asgari Taghanaki , Yefeng Zheng , S. Kevin Zhou , Bogdan Georgescu , Puneet Sharma , Daguang Xu , Dorin Comaniciu , Ghassan Hamarneh

TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework

Semantic segmentation and stereo matching, respectively analogous to the ventral and dorsal streams in our human brain, are two key components of autonomous driving perception systems. Addressing these two tasks with separate networks is no…

Computer Vision and Pattern Recognition · Computer Science 2025-07-04 Guanfeng Tang , Zhiyuan Wu , Jiahang Li , Ping Zhong , We Ye , Xieyuanli Chen , Huiming Lu , Rui Fan

ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution

Existing Multimodal Large Language Models (MLLMs) suffer from increased inference costs due to the additional vision tokens introduced by image inputs. In this work, we propose Visual Consistency Learning (ViCO), a novel training algorithm…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Long Cui , Weiyun Wang , Jie Shao , Zichen Wen , Gen Luo , Linfeng Zhang , Yanting Zhang , Yu Qiao , Wenhai Wang

Learning to Generate Images with Perceptual Similarity Metrics

Deep networks are increasingly being applied to problems involving image synthesis, e.g., generating images from textual descriptions and reconstructing an input image from a compact representation. Supervised training of image-synthesis…

Machine Learning · Computer Science 2017-01-25 Jake Snell , Karl Ridgeway , Renjie Liao , Brett D. Roads , Michael C. Mozer , Richard S. Zemel

Self-supervised Learning with Local Contrastive Loss for Detection and Semantic Segmentation

We present a self-supervised learning (SSL) method suitable for semi-global tasks such as object detection and semantic segmentation. We enforce local consistency between self-learned features, representing corresponding image locations of…

Computer Vision and Pattern Recognition · Computer Science 2022-12-09 Ashraful Islam , Ben Lundell , Harpreet Sawhney , Sudipta Sinha , Peter Morales , Richard J. Radke