Related papers: Regularized Multi-Task Learning for Multi-Dimensio…

Clustering via Mode Seeking by Direct Estimation of the Gradient of a Log-Density

Mean shift clustering finds the modes of the data probability density by identifying the zero points of the density gradient. Since it does not require to fix the number of clusters in advance, the mean shift has been a popular clustering…

Machine Learning · Statistics 2014-04-22 Hiroaki Sasaki , Aapo Hyvärinen , Masashi Sugiyama

Mode-Seeking Clustering and Density Ridge Estimation via Direct Estimation of Density-Derivative-Ratios

Modes and ridges of the probability density function behind observed data are useful geometric features. Mode-seeking clustering assigns cluster labels by associating data samples with the nearest modes, and estimation of density ridges…

Machine Learning · Statistics 2018-04-03 Hiroaki Sasaki , Takafumi Kanamori , Aapo Hyvärinen , Gang Niu , Masashi Sugiyama

Direct Density-Derivative Estimation and Its Application in KL-Divergence Approximation

Estimation of density derivatives is a versatile tool in statistical data analysis. A naive approach is to first estimate the density and then compute its derivative. However, such a two-step approach does not work well because a good…

Machine Learning · Statistics 2014-07-01 Hiroaki Sasaki , Yung-Kyun Noh , Masashi Sugiyama

Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning

As machine learning becomes more prominent there is a growing demand to perform several inference tasks in parallel. Running a dedicated model for each task is computationally expensive and therefore there is a great interest in multi-task…

Machine Learning · Computer Science 2024-05-14 Idan Achituve , Idit Diamant , Arnon Netzer , Gal Chechik , Ethan Fetaya

Multi-task learning via robust regularized clustering with non-convex group penalties

Multi-task learning (MTL) aims to improve estimation and prediction performance by sharing common information among related tasks. One natural assumption in MTL is that tasks are classified into clusters based on their characteristics.…

Methodology · Statistics 2024-05-28 Akira Okazaki , Shuichi Kawano

Multitask Learning with Single Gradient Step Update for Task Balancing

Multitask learning is a methodology to boost generalization performance and also reduce computational intensity and memory usage. However, learning multiple tasks simultaneously can be more difficult than learning a single task because it…

Machine Learning · Computer Science 2020-06-03 Sungjae Lee , Youngdoo Son

Semisoft Task Clustering for Multi-Task Learning

Multi-task learning (MTL) aims to improve the performance of multiple related prediction tasks by leveraging useful information from them. Due to their flexibility and ability to reduce unknown coefficients substantially, the…

Machine Learning · Computer Science 2022-12-01 Yuzhao Zhang , Yifan Sun

Fast Multivariate Log-Concave Density Estimation

A novel computational approach to log-concave density estimation is proposed. Previous approaches utilize the piecewise-affine parametrization of the density induced by the given sample set. The number of parameters as well as non-smooth…

Computation · Statistics 2019-02-21 Fabian Rathke , Christoph Schnörr

Dual-Balancing for Multi-Task Learning

Multi-task learning aims to learn multiple related tasks simultaneously and has achieved great success in various fields. However, the disparity in loss and gradient scales among tasks often leads to performance compromises, and the…

Machine Learning · Computer Science 2025-11-27 Baijiong Lin , Weisen Jiang , Feiyang Ye , Yu Zhang , Pengguang Chen , Ying-Cong Chen , Shu Liu , Ivor W. Tsang , James T. Kwok

Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

One of the key factors of enabling machine learning models to comprehend and solve real-world tasks is to leverage multimodal data. Unfortunately, annotation of multimodal data is challenging and expensive. Recently, self-supervised…

Computer Vision and Pattern Recognition · Computer Science 2020-12-11 Elad Amrani , Rami Ben-Ari , Daniel Rotman , Alex Bronstein

Towards Provable Log Density Policy Gradient

Policy gradient methods are a vital ingredient behind the success of modern reinforcement learning. Modern policy gradient methods, although successful, introduce a residual error in gradient estimation. In this work, we argue that this…

Machine Learning · Computer Science 2024-03-05 Pulkit Katdare , Anant Joshi , Katherine Driggs-Campbell

A supervised deep learning method for nonparametric density estimation

Nonparametric density estimation is an unsupervised learning problem. In this work we propose a two-step procedure that casts the density estimation problem in the first step into a supervised regression problem. The advantage is that we…

Statistics Theory · Mathematics 2024-06-04 Thijs Bos , Johannes Schmidt-Hieber

Learning Representation for Multitask learning through Self Supervised Auxiliary learning

Multi-task learning is a popular machine learning approach that enables simultaneous learning of multiple related tasks, improving algorithmic efficiency and effectiveness. In the hard parameter sharing approach, an encoder shared through…

Machine Learning · Statistics 2024-09-26 Seokwon Shin , Hyungrok Do , Youngdoo Son

Randomized subspace gradient method for constrained optimization

We propose randomized subspace gradient methods for high-dimensional constrained optimization. While there have been similarly purposed studies on unconstrained optimization problems, there have been few on constrained optimization problems…

Optimization and Control · Mathematics 2023-07-10 Ryota Nozawa , Pierre-Louis Poirion , Akiko Takeda

Implicit Regularization in Nonconvex Statistical Estimation: Gradient Descent Converges Linearly for Phase Retrieval, Matrix Completion, and Blind Deconvolution

Recent years have seen a flurry of activities in designing provably efficient nonconvex procedures for solving statistical estimation problems. Due to the highly nonconvex nature of the empirical loss, state-of-the-art procedures often…

Machine Learning · Computer Science 2020-06-09 Cong Ma , Kaizheng Wang , Yuejie Chi , Yuxin Chen

Dataset Distillation Efficiently Encodes Low-Dimensional Representations from Gradient-Based Learning of Non-Linear Tasks

Dataset distillation, a training-aware data compression technique, has recently attracted increasing attention as an effective tool for mitigating costs of optimization and data storage. However, progress remains largely empirical.…

Machine Learning · Computer Science 2026-03-31 Yuri Kinoshita , Naoki Nishikawa , Taro Toyoizumi

Overcoming Saturation in Density Ratio Estimation by Iterated Regularization

Estimating the ratio of two probability densities from finitely many samples, is a central task in machine learning and statistics. In this work, we show that a large class of kernel methods for density ratio estimation suffers from error…

Machine Learning · Computer Science 2024-06-04 Lukas Gruber , Markus Holzleitner , Johannes Lehner , Sepp Hochreiter , Werner Zellinger

Robust modal regression with direct log-density derivative estimation

Modal regression is aimed at estimating the global mode (i.e., global maximum) of the conditional density function of the output variable given input variables, and has led to regression methods robust against heavy-tailed or skewed noises.…

Machine Learning · Statistics 2019-10-21 Hiroaki Sasaki , Tomoya Sakai , Takafumi Kanamori

Fast Line Search for Multi-Task Learning

Multi-task learning is a powerful method for solving several tasks jointly by learning robust representation. Optimization of the multi-task learning model is a more complex task than a single-task due to task conflict. Based on theoretical…

Machine Learning · Computer Science 2021-10-05 Andrey Filatov , Daniil Merkulov

Scalable Multitask Learning Using Gradient-based Estimation of Task Affinity

Multitask learning is a widely used paradigm for training models on diverse tasks, with applications ranging from graph neural networks to language model fine-tuning. Since tasks may interfere with each other, a key notion for modeling…

Machine Learning · Computer Science 2024-11-22 Dongyue Li , Aneesh Sharma , Hongyang R. Zhang