Related papers: Localization with Sampling-Argmax

The Sampling-Gaussian for stereo matching

The soft-argmax operation is widely adopted in neural network-based stereo matching methods to enable differentiable regression of disparity. However, network trained with soft-argmax is prone to being multimodal due to absence of explicit…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Baiyu Pan , jichao jiao , Bowen Yao , Jianxin Pang , Jun Cheng

Heatmap Regression without Soft-Argmax for Facial Landmark Detection

Facial landmark detection is an important task in computer vision with numerous applications, such as head pose estimation, expression analysis, face swapping, etc. Heatmap regression-based methods have been widely used to achieve…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Chiao-An Yang , Raymond A. Yeh

r-softmax: Generalized Softmax with Controllable Sparsity Rate

Nowadays artificial neural network models achieve remarkable results in many disciplines. Functions mapping the representation provided by the model to the probability distribution are the inseparable aspect of deep learning solutions.…

Machine Learning · Computer Science 2023-04-24 Klaudia Bałazy , Łukasz Struski , Marek Śmieja , Jacek Tabor

MultiMax: Sparse and Multi-Modal Attention Learning

SoftMax is a ubiquitous ingredient of modern machine learning algorithms. It maps an input vector onto a probability simplex and reweights the input by concentrating the probability mass at large entries. Yet, as a smooth approximation to…

Machine Learning · Computer Science 2025-01-09 Yuxuan Zhou , Mario Fritz , Margret Keuper

Online Ecological Gearshift Strategy via Neural Network with Soft-Argmax Operator

This paper presents a neural network optimizer with soft-argmax operator to achieve an ecological gearshift strategy in real-time. The strategy is reformulated as the mixed-integer model predictive control (MIMPC) problem to minimize energy…

Systems and Control · Electrical Eng. & Systems 2024-02-29 Xi Luo , Shiying Dong , Jinlong Hong , Bingzhao Gao , Hong Chen

Sparse-softmax: A Simpler and Faster Alternative Softmax Transformation

The softmax function is widely used in artificial neural networks for the multiclass classification problems, where the softmax transformation enforces the output to be positive and sum to one, and the corresponding loss function allows to…

Machine Learning · Computer Science 2021-12-24 Shaoshi Sun , Zhenyuan Zhang , BoCheng Huang , Pengbin Lei , Jianlin Su , Shengfeng Pan , Jiarun Cao

Adaptive Sampled Softmax with Kernel Based Sampling

Softmax is the most commonly used output function for multiclass problems and is widely used in areas such as vision, natural language processing, and recommendation. A softmax model has linear costs in the number of classes which makes it…

Machine Learning · Computer Science 2018-08-03 Guy Blanc , Steffen Rendle

Diverse Human Motion Prediction via Gumbel-Softmax Sampling from an Auxiliary Space

Diverse human motion prediction aims at predicting multiple possible future pose sequences from a sequence of observed poses. Previous approaches usually employ deep generative networks to model the conditional distribution of data, and…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Lingwei Dang , Yongwei Nie , Chengjiang Long , Qing Zhang , Guiqing Li

Soft Random Sampling: A Theoretical and Empirical Analysis

Soft random sampling (SRS) is a simple yet effective approach for efficient training of large-scale deep neural networks when dealing with massive data. SRS selects a subset uniformly at random with replacement from the full data set in…

Machine Learning · Computer Science 2023-11-27 Xiaodong Cui , Ashish Mittal , Songtao Lu , Wei Zhang , George Saon , Brian Kingsbury

Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity

Training neural network models with discrete (categorical or structured) latent variables can be computationally challenging, due to the need for marginalization over large or combinatorial sets. To circumvent this issue, one typically…

Machine Learning · Computer Science 2020-12-29 Gonçalo M. Correia , Vlad Niculae , Wilker Aziz , André F. T. Martins

Learning a Weight Map for Weakly-Supervised Localization

In the weakly supervised localization setting, supervision is given as an image-level label. We propose to employ an image classifier $f$ and to train a generative network $g$ that outputs, given the input image, a per-pixel weight map that…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Tal Shaharabany , Lior Wolf

DropMax: Adaptive Variational Softmax

We propose DropMax, a stochastic version of softmax classifier which at each iteration drops non-target classes according to dropout probabilities adaptively decided for each instance. Specifically, we overlay binary masking variables over…

Machine Learning · Computer Science 2018-11-05 Hae Beom Lee , Juho Lee , Saehoon Kim , Eunho Yang , Sung Ju Hwang

Rethinking Approximate Gaussian Inference in Classification

In classification tasks, softmax functions are ubiquitously used as output activations to produce predictive probabilities. Such outputs only capture aleatoric uncertainty. To capture epistemic uncertainty, approximate Gaussian inference…

Machine Learning · Computer Science 2026-02-12 Bálint Mucsányi , Nathaël Da Costa , Philipp Hennig

Optimal sensor placement for the reconstruction of ocean states using differentiable Gumbel-Softmax sampling operator

Accurately reconstructing and forecasting ocean fields from sparse observations is critical for both operational and scientific purposes. Optimizing sensor placement to maximize reconstruction skill remains challenging due to evolving ocean…

Atmospheric and Oceanic Physics · Physics 2026-04-27 Oscar Chapron , Ronan Fablet , Yann Stéphan

Reparameterizable Subset Sampling via Continuous Relaxations

Many machine learning tasks require sampling a subset of items from a collection based on a parameterized distribution. The Gumbel-softmax trick can be used to sample a single item, and allows for low-variance reparameterized gradients with…

Machine Learning · Computer Science 2021-03-02 Sang Michael Xie , Stefano Ermon

One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities

The softmax representation of probabilities for categorical variables plays a prominent role in modern machine learning with numerous applications in areas such as large scale classification, neural language modeling and recommendation…

Machine Learning · Statistics 2016-11-01 Michalis K. Titsias

Conformalized Non-uniform Sampling Strategies for Accelerated Sampling-based Motion Planning

Sampling-based motion planners (SBMPs) are widely used to compute dynamically feasible robot paths. However, their reliance on uniform sampling often leads to poor efficiency and slow planning in complex environments. We introduce a novel…

Robotics · Computer Science 2025-11-10 Shubham Natraj , Bruno Sinopoli , Yiannis Kantaros

Active Target Localization using Low-Rank Matrix Completion and Unimodal Regression

The detection and localization of a target from samples of its generated field is a problem of interest in a broad range of applications. Often, the target field admits structural properties that enable the design of lower sample detection…

Information Theory · Computer Science 2016-01-28 Sunav Choudhary , Naveen Kumar , Srikanth Narayanan , Urbashi Mitra

Approximation Guarantees of Local Search Algorithms via Localizability of Set Functions

This paper proposes a new framework for providing approximation guarantees of local search algorithms. Local search is a basic algorithm design technique and is widely used for various combinatorial optimization problems. To analyze local…

Data Structures and Algorithms · Computer Science 2020-06-03 Kaito Fujii

Efficient Supernet Training with Orthogonal Softmax for Scalable ASR Model Compression

ASR systems are deployed across diverse environments, each with specific hardware constraints. We use supernet training to jointly train multiple encoders of varying sizes, enabling dynamic model size adjustment to fit hardware constraints…

Computation and Language · Computer Science 2025-02-05 Jingjing Xu , Eugen Beck , Zijian Yang , Ralf Schlüter