Related papers: Learning conditional distributions on continuous s…

Learning Neural Models for End-to-End Clustering

We propose a novel end-to-end neural network architecture that, once trained, directly outputs a probabilistic clustering of a batch of input examples in one pass. It estimates a distribution over the number of clusters $k$, and for each $1…

Machine Learning · Computer Science 2018-07-12 Benjamin Bruno Meier , Ismail Elezi , Mohammadreza Amirian , Oliver Durr , Thilo Stadelmann

Learning Counterfactual Distributions via Kernel Nearest Neighbors

Consider a setting with multiple units (e.g., individuals, cohorts, geographic locations) and outcomes (e.g., treatments, times, items), where the goal is to learn a multivariate distribution for each unit-outcome entry, such as the…

Machine Learning · Statistics 2025-10-21 Kyuseong Choi , Jacob Feitelberg , Caleb Chin , Anish Agarwal , Raaz Dwivedi

Learning with Clustering Structure

We study supervised learning problems using clustering constraints to impose structure on either features or samples, seeking to help both prediction and interpretation. The problem of clustering features arises naturally in text…

Machine Learning · Computer Science 2016-09-20 Vincent Roulet , Fajwel Fogel , Alexandre d'Aspremont , Francis Bach

Distributionally Robust Weighted $k$-Nearest Neighbors

Learning a robust classifier from a few samples remains a key challenge in machine learning. A major thrust of research has been focused on developing $k$-nearest neighbor ($k$-NN) based algorithms combined with metric learning that…

Machine Learning · Statistics 2022-02-17 Shixiang Zhu , Liyan Xie , Minghe Zhang , Rui Gao , Yao Xie

Learning from Conditional Distributions via Dual Embeddings

Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample $x$ itself is…

Machine Learning · Computer Science 2017-01-03 Bo Dai , Niao He , Yunpeng Pan , Byron Boots , Le Song

Unsupervised feature selection via self-paced learning and low-redundant regularization

Much more attention has been paid to unsupervised feature selection nowadays due to the emergence of massive unlabeled data. The distribution of samples and the latent effect of training a learning method using samples in more effective…

Machine Learning · Computer Science 2021-12-15 Weiyi Li , Hongmei Chen , Tianrui Li , Jihong Wan , Binbin Sang

Computationally Efficient Learning of Statistical Manifolds

Analyzing high-dimensional data with manifold learning algorithms often requires searching for the nearest neighbors of all observations. This presents a computational bottleneck in statistical manifold learning when observations of…

Machine Learning · Computer Science 2022-03-11 Fan Cheng , Anastasios Panagiotelis , Rob J Hyndman

Improving classifier decision boundaries using nearest neighbors

Neural networks are not learning optimal decision boundaries. We show that decision boundaries are situated in areas of low training data density. They are impacted by few training samples which can easily lead to overfitting. We provide a…

Machine Learning · Computer Science 2023-10-09 Johannes Schneider

Contextual Preference Distribution Learning

Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization pipeline to learn preference distributions and…

Machine Learning · Computer Science 2026-03-19 Benjamin Hudson , Laurent Charlin , Emma Frejinger

Probabilistic spatial clustering based on the Self Discipline Learning (SDL) model of autonomous learning

Unsupervised clustering algorithm can effectively reduce the dimension of high-dimensional unlabeled data, thus reducing the time and space complexity of data processing. However, the traditional clustering algorithm needs to set the upper…

Machine Learning · Computer Science 2022-01-17 Zecang Gu , Xiaoqi Sun , Yuan Sun , Fuquan Zhang

Unsupervised Feature Selection based on Adaptive Similarity Learning and Subspace Clustering

Feature selection methods have an important role on the readability of data and the reduction of complexity of learning algorithms. In recent years, a variety of efforts are investigated on feature selection problems based on unsupervised…

Machine Learning · Computer Science 2019-12-12 Mohsen Ghassemi Parsa , Hadi Zare , Mehdi Ghatee

Wald-Kernel: Learning to Aggregate Information for Sequential Inference

Sequential hypothesis testing is a desirable decision making strategy in any time sensitive scenario. Compared with fixed sample-size testing, sequential testing is capable of achieving identical probability of error requirements using less…

Machine Learning · Statistics 2017-11-17 Diyan Teng , Emre Ertin

Simultaneously Learning Neighborship and Projection Matrix for Supervised Dimensionality Reduction

Explicitly or implicitly, most of dimensionality reduction methods need to determine which samples are neighbors and the similarity between the neighbors in the original highdimensional space. The projection matrix is then learned on the…

Computer Vision and Pattern Recognition · Computer Science 2017-09-12 Yanwei Pang , Bo Zhou , Feiping Nie

Adaptive transfer learning

In transfer learning, we wish to make inference about a target population when we have access to data both from the distribution itself, and from a different but related source distribution. We introduce a flexible framework for transfer…

Machine Learning · Statistics 2021-09-03 Henry W. J. Reeve , Timothy I. Cannings , Richard J. Samworth

Learning Uniformly Distributed Embedding Clusters of Stylistic Skills for Physically Simulated Characters

Learning natural and diverse behaviors from human motion datasets remains challenging in physics-based character control. Existing conditional adversarial models often suffer from tight and biased embedding distributions where embeddings…

Graphics · Computer Science 2024-11-12 Nian Liu , Libin Liu , Zilong Zhang , Zi Wang , Hongzhao Xie , Tengyu Liu , Xinyi Tong , Yaodong Yang , Zhaofeng He

Proximal Multitask Learning over Networks with Sparsity-inducing Coregularization

In this work, we consider multitask learning problems where clusters of nodes are interested in estimating their own parameter vector. Cooperation among clusters is beneficial when the optimal models of adjacent clusters have a good number…

Systems and Control · Computer Science 2016-11-03 Roula Nassif , Cédric Richard , André Ferrari , Ali H. Sayed

Sample Complexity of Robust Learning against Evasion Attacks

It is becoming increasingly important to understand the vulnerability of machine learning models to adversarial attacks. One of the fundamental problems in adversarial machine learning is to quantify how much training data is needed in the…

Machine Learning · Computer Science 2023-08-24 Pascale Gourdeau

A Kernel-Based Conditional Two-Sample Test Using Nearest Neighbors (with Applications to Calibration, Regression Curves, and Simulation-Based Inference)

In this paper we introduce a kernel-based measure for detecting differences between two conditional distributions. Using the `kernel trick' and nearest-neighbor graphs, we propose a consistent estimate of this measure which can be computed…

Methodology · Statistics 2024-08-30 Anirban Chatterjee , Ziang Niu , Bhaswar B. Bhattacharya

Explaining the Success of Nearest Neighbor Methods in Prediction

Many modern methods for prediction leverage nearest neighbor search to find past training examples most similar to a test example, an idea that dates back in text to at least the 11th century and has stood the test of time. This monograph…

Machine Learning · Computer Science 2025-02-25 George H. Chen , Devavrat Shah

Generative Conditional Distributions by Neural (Entropic) Optimal Transport

Learning conditional distributions is challenging because the desired outcome is not a single distribution but multiple distributions that correspond to multiple instances of the covariates. We introduce a novel neural entropic optimal…

Machine Learning · Computer Science 2024-06-05 Bao Nguyen , Binh Nguyen , Hieu Trung Nguyen , Viet Anh Nguyen