Related papers: Interpretable Convolutional SyncNet

Symmetrization Weighted Binary Cross-Entropy: Modeling Perceptual Asymmetry for Human-Consistent Neural Edge Detection

Edge detection (ED) is a fundamental perceptual process in computer vision, forming the structural basis for high-level reasoning tasks such as segmentation, recognition, and scene understanding. Despite substantial progress achieved by…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Hao Shu

InfoSyncNet: Information Synchronization Temporal Convolutional Network for Visual Speech Recognition

Estimating spoken content from silent videos is crucial for applications in Assistive Technology (AT) and Augmented Reality (AR). However, accurately mapping lip movement sequences in videos to words poses significant challenges due to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Junxiao Xue , Xiaozhen Liu , Xuecheng Wu , Fei Yu , Jun Wang

Towards noise contrastive estimation with soft targets for conditional models

Soft targets combined with the cross-entropy loss have shown to improve generalization performance of deep neural networks on supervised classification tasks. The standard cross-entropy loss however assumes data to be categorically…

Machine Learning · Computer Science 2024-07-16 Johannes Hugger , Virginie Uhlmann

SINCERE: Supervised Information Noise-Contrastive Estimation REvisited

The information noise-contrastive estimation (InfoNCE) loss function provides the basis of many self-supervised deep learning methods due to its strong empirical results and theoretic motivation. Previous work suggests a supervised…

Computer Vision and Pattern Recognition · Computer Science 2024-11-11 Patrick Feeney , Michael C. Hughes

Learning Compatible Embeddings

Achieving backward compatibility when rolling out new models can highly reduce costs or even bypass feature re-encoding of existing gallery images for in-production visual retrieval systems. Previous related works usually leverage losses…

Computer Vision and Pattern Recognition · Computer Science 2021-08-05 Qiang Meng , Chixiang Zhang , Xiaoqiang Xu , Feng Zhou

CS-MCNet:A Video Compressive Sensing Reconstruction Network with Interpretable Motion Compensation

In this paper, a deep neural network with interpretable motion compensation called CS-MCNet is proposed to realize high-quality and real-time decoding of video compressive sensing. Firstly, explicit multi-hypothesis motion compensation is…

Image and Video Processing · Electrical Eng. & Systems 2020-10-09 Bowen Huang , Jinjia Zhou , Xiao Yan , Ming'e Jing , Rentao Wan , Yibo Fan

Learning Broken Symmetries with Approximate Invariance

Recognizing symmetries in data allows for significant boosts in neural network training, which is especially important where training data are limited. In many cases, however, the exact underlying symmetry is present only in an idealized…

High Energy Physics - Phenomenology · Physics 2025-04-07 Seth Nabat , Aishik Ghosh , Edmund Witkowski , Gregor Kasieczka , Daniel Whiteson

Bit Error and Block Error Rate Training for ML-Assisted Communication

Even though machine learning (ML) techniques are being widely used in communications, the question of how to train communication systems has received surprisingly little attention. In this paper, we show that the commonly used binary…

Information Theory · Computer Science 2023-03-08 Reinhard Wiesmayr , Gian Marti , Chris Dick , Haochuan Song , Christoph Studer

Contrastive representation learning has proven to be an effective self-supervised learning method for images and videos. Most successful approaches are based on Noise Contrastive Estimation (NCE) and use different views of an instance as…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Julien Denize , Jaonary Rabarisoa , Astrid Orcesi , Romain Hérault

A Comparative Analysis Of Latent Regressor Losses For Singing Voice Conversion

Previous research has shown that established techniques for spoken voice conversion (VC) do not perform as well when applied to singing voice conversion (SVC). We propose an alternative loss component in a loss function that is otherwise…

Sound · Computer Science 2023-02-28 Brendan O'Connor , Simon Dixon

Contrastive representation learning has proven to be an effective self-supervised learning method. Most successful approaches are based on Noise Contrastive Estimation (NCE) and use different views of an instance as positives that should be…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Julien Denize , Jaonary Rabarisoa , Astrid Orcesi , Romain Hérault , Stéphane Canu

Understanding InfoNCE: Transition Probability Matrix Induced Feature Clustering

Contrastive learning has emerged as a cornerstone of unsupervised representation learning across vision, language, and graph domains, with InfoNCE as its dominant objective. Despite its empirical success, the theoretical underpinnings of…

Machine Learning · Computer Science 2025-11-18 Ge Cheng , Shuo Wang , Yun Zhang

Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric

In typical multimodal contrastive learning, such as CLIP, encoders produce one point in the latent representation space for each input. However, one-point representation has difficulty in capturing the relationship and the similarity…

Machine Learning · Computer Science 2025-03-04 Toshimitsu Uesaka , Taiji Suzuki , Yuhta Takida , Chieh-Hsin Lai , Naoki Murata , Yuki Mitsufuji

InfoNCE Loss Provably Learns Cluster-Preserving Representations

The goal of contrasting learning is to learn a representation that preserves underlying clusters by keeping samples with similar content, e.g. the ``dogness'' of a dog, close to each other in the space generated by the representation. A…

Machine Learning · Computer Science 2023-02-17 Advait Parulekar , Liam Collins , Karthikeyan Shanmugam , Aryan Mokhtari , Sanjay Shakkottai

Smoothed Contrastive Learning for Unsupervised Sentence Embedding

Contrastive learning has been gradually applied to learn high-quality unsupervised sentence embedding. Among the previous un-supervised methods, the latest state-of-the-art method, as far as we know, is unsupervised SimCSE (unsup-SimCSE).…

Computation and Language · Computer Science 2022-09-13 Xing Wu , Chaochen Gao , Yipeng Su , Jizhong Han , Zhongyuan Wang , Songlin Hu

On Network-Error Correcting Convolutional Codes under the BSC Edge Error Model

Convolutional network-error correcting codes (CNECCs) are known to provide error correcting capability in acyclic instantaneous networks within the network coding paradigm under small field size conditions. In this work, we investigate the…

Information Theory · Computer Science 2010-01-08 K. Prasad , B. Sundar Rajan

InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings

Contrastive learning has been extensively studied in sentence embedding learning, which assumes that the embeddings of different views of the same sentence are closer. The constraint brought by this assumption is weak, and a good sentence…

Computation and Language · Computer Science 2022-10-17 Xing Wu , Chaochen Gao , Zijia Lin , Jizhong Han , Zhongyuan Wang , Songlin Hu

CSHNet: A Novel Information Asymmetric Image Translation Method

Despite advancements in cross-domain image translation, challenges persist in asymmetric tasks such as SAR-to-Optical and Sketch-to-Instance conversions, which involve transforming data from a less detailed domain into one with richer…

Computer Vision and Pattern Recognition · Computer Science 2025-01-20 Xi Yang , Haoyuan Shi , Zihan Wang , Nannan Wang , Xinbo Gao

Learning Convolutional Networks for Content-weighted Image Compression

Lossy image compression is generally formulated as a joint rate-distortion optimization to learn encoder, quantizer, and decoder. However, the quantizer is non-differentiable, and discrete entropy estimation usually is required for rate…

Computer Vision and Pattern Recognition · Computer Science 2017-09-20 Mu Li , Wangmeng Zuo , Shuhang Gu , Debin Zhao , David Zhang

SINET: Sparsity-driven Interpretable Neural Network for Underwater Image Enhancement

Improving the quality of underwater images is essential for advancing marine research and technology. This work introduces a sparsity-driven interpretable neural network (SINET) for the underwater image enhancement (UIE) task. Unlike pure…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Gargi Panda , Soumitra Kundu , Saumik Bhattacharya , Aurobinda Routray