Related papers: Gibbs Max-margin Topic Models with Data Augmentati…

Learning latent variable structured prediction models with Gaussian perturbations

The standard margin-based structured prediction commonly uses a maximum loss over all possible structured outputs. The large-margin formulation including latent variables not only results in a non-convex formulation but also increases the…

Machine Learning · Computer Science 2019-06-25 Kevin Bello , Jean Honorio

MedLDA: A General Framework of Maximum Margin Supervised Topic Models

Supervised topic models utilize document's side information for discovering predictive low dimensional representations of documents. Existing models apply the likelihood-based estimation. In this paper, we present a general framework of…

Machine Learning · Statistics 2013-04-09 Jun Zhu , Amr Ahmed , Eric P. Xing

Margin Maximization as Lossless Maximal Compression

The ultimate goal of a supervised learning algorithm is to produce models constructed on the training data that can generalize well to new examples. In classification, functional margin maximization -- correctly classifying as many training…

Machine Learning · Computer Science 2020-01-29 Nikolaos Nikolaou , Henry Reeve , Gavin Brown

Max-Margin based Discriminative Feature Learning

In this paper, we propose a new max-margin based discriminative feature learning method. Specifically, we aim at learning a low-dimensional feature representation, so as to maximize the global margin of the data and make the samples from…

Machine Learning · Computer Science 2017-04-04 Changsheng Li , Qingshan Liu , Weishan Dong , Xin Zhang , Lin Yang

Max-Margin Nonparametric Latent Feature Models for Link Prediction

Link prediction is a fundamental task in statistical network analysis. Recent advances have been made on learning flexible nonparametric Bayesian latent feature models for link prediction. In this paper, we present a max-margin learning…

Machine Learning · Computer Science 2016-02-25 Jun Zhu , Jiaming Song , Bei Chen

Improved Bayesian Logistic Supervised Topic Models with Data Augmentation

Supervised topic models with a logistic likelihood have two issues that potentially limit their practical use: 1) response variables are usually over-weighted by document word counts; and 2) existing variational inference methods make…

Machine Learning · Computer Science 2013-10-10 Jun Zhu , Xun Zheng , Bo Zhang

Max-Margin Deep Generative Models for (Semi-)Supervised Learning

Deep generative models (DGMs) are effective on learning multilayered representations of complex data and performing inference of input data by exploring the generative ability. However, it is relatively insufficient to empower the…

Computer Vision and Pattern Recognition · Computer Science 2016-11-23 Chongxuan Li , Jun Zhu , Bo Zhang

Max-Margin Nonparametric Latent Feature Models for Link Prediction

We present a max-margin nonparametric latent feature model, which unites the ideas of max-margin learning and Bayesian nonparametrics to discover discriminative latent features for link prediction and automatically infer the unknown latent…

Machine Learning · Computer Science 2012-06-22 Jun Zhu

Learning Topic Models: Identifiability and Finite-Sample Analysis

Topic models provide a useful text-mining tool for learning, extracting, and discovering latent structures in large text corpora. Although a plethora of methods have been proposed for topic modeling, lacking in the literature is a formal…

Machine Learning · Statistics 2022-08-12 Yinyin Chen , Shishuang He , Yun Yang , Feng Liang

Discriminative Relational Topic Models

Many scientific and engineering fields involve analyzing network data. For document networks, relational topic models (RTMs) provide a probabilistic generative process to describe both the link structure and document contents, and they have…

Machine Learning · Computer Science 2013-10-10 Ning Chen , Jun Zhu , Fei Xia , Bo Zhang

Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Supervisory signals have the potential to make low-dimensional data representations, like those learned by mixture and topic models, more interpretable and useful. We propose a framework for training latent variable models that explicitly…

Machine Learning · Statistics 2017-11-15 Michael C. Hughes , Leah Weiner , Gabriel Hope , Thomas H. McCoy , Roy H. Perlis , Erik B. Sudderth , Finale Doshi-Velez

Bayesian Semisupervised Learning with Deep Generative Models

Neural network based generative models with discriminative components are a powerful approach for semi-supervised learning. However, these techniques a) cannot account for model uncertainty in the estimation of the model's discriminative…

Machine Learning · Statistics 2017-06-30 Jonathan Gordon , José Miguel Hernández-Lobato

The Implicit Bias of Gradient Descent on Separable Data

We examine gradient descent on unregularized logistic regression problems, with homogeneous linear predictors on linearly separable datasets. We show the predictor converges to the direction of the max-margin (hard margin SVM) solution. The…

Machine Learning · Statistics 2024-10-29 Daniel Soudry , Elad Hoffer , Mor Shpigel Nacson , Suriya Gunasekar , Nathan Srebro

The generalization error of max-margin linear classifiers: Benign overfitting and high dimensional asymptotics in the overparametrized regime

Modern machine learning classifiers often exhibit vanishing classification error on the training set. They achieve this by learning nonlinear representations of the inputs that maps the data into linearly separable classes. Motivated by…

Statistics Theory · Mathematics 2023-03-23 Andrea Montanari , Feng Ruan , Youngtak Sohn , Jun Yan

The Performance Analysis of Generalized Margin Maximizer (GMM) on Separable Data

Logistic models are commonly used for binary classification tasks. The success of such models has often been attributed to their connection to maximum-likelihood estimators. It has been shown that gradient descent algorithm, when applied on…

Machine Learning · Statistics 2020-10-30 Fariborz Salehi , Ehsan Abbasi , Babak Hassibi

Learning beyond Predefined Label Space via Bayesian Nonparametric Topic Modelling

In real world machine learning applications, testing data may contain some meaningful new categories that have not been seen in labeled training data. To simultaneously recognize new data categories and assign most appropriate category…

Machine Learning · Computer Science 2019-10-11 Changying Du , Fuzhen Zhuang , Jia He , Qing He , Guoping Long

Linear Maximum Margin Classifier for Learning from Uncertain Data

In this paper, we propose a maximum margin classifier that deals with uncertainty in data input. More specifically, we reformulate the SVM framework such that each training example can be modeled by a multi-dimensional Gaussian distribution…

Machine Learning · Computer Science 2017-11-21 Christos Tzelepis , Vasileios Mezaris , Ioannis Patras

On Smoothing and Inference for Topic Models

Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling high-dimensional sparse count data. Various learning algorithms have been developed in recent years, including collapsed Gibbs sampling,…

Machine Learning · Computer Science 2012-05-14 Arthur Asuncion , Max Welling , Padhraic Smyth , Yee Whye Teh

Learning Topic Models and Latent Bayesian Networks Under Expansion Constraints

Unsupervised estimation of latent variable models is a fundamental problem central to numerous applications of machine learning and statistics. This work presents a principled approach for estimating broad classes of such models, including…

Machine Learning · Statistics 2013-05-27 Animashree Anandkumar , Daniel Hsu , Adel Javanmard , Sham M. Kakade

Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity

Training neural network models with discrete (categorical or structured) latent variables can be computationally challenging, due to the need for marginalization over large or combinatorial sets. To circumvent this issue, one typically…

Machine Learning · Computer Science 2020-12-29 Gonçalo M. Correia , Vlad Niculae , Wilker Aziz , André F. T. Martins