English
Related papers

Related papers: Bayesian Hierarchical Words Representation Learnin…

200 papers

We consider a dictionary learning problem whose objective is to design a dictionary such that the signals admits a sparse or an approximate sparse representation over the learned dictionary. Such a problem finds a variety of applications…

Machine Learning · Computer Science 2015-03-10 Linxiao Yang , Jun Fang , Hong Cheng , Hongbin Li

We propose a new method for learning word representations using hierarchical regularization in sparse coding inspired by the linguistic study of word meanings. We show an efficient learning algorithm based on stochastic proximal methods…

Computation and Language · Computer Science 2014-11-07 Dani Yogatama , Manaal Faruqui , Chris Dyer , Noah A. Smith

To improve word representation learning, we propose a probabilistic prior which can be seamlessly integrated with word embedding models. Different from previous methods, word embedding is taken as a probabilistic generative model, and it…

Computation and Language · Computer Science 2023-09-22 Shaogang Ren , Dingcheng Li , Ping Li

The recently developed variational autoencoders (VAEs) have proved to be an effective confluence of the rich representational power of neural networks with Bayesian methods. However, most work on VAEs use a rather simple prior over the…

Machine Learning · Computer Science 2017-08-29 Prasoon Goyal , Zhiting Hu , Xiaodan Liang , Chenyu Wang , Eric Xing

Datasets in engineering applications are often limited and contaminated, mainly due to unavoidable measurement noise and signal distortion. Thus, using conventional data-driven approaches to build a reliable discriminative model, and…

Machine Learning · Statistics 2020-04-14 Xihaier Luo , Ahsan Kareem

A common problem in Machine Learning and statistics consists in detecting whether the current sample in a stream of data belongs to the same distribution as previous ones, is an isolated outlier or inaugurates a new distribution of data. We…

Machine Learning · Statistics 2018-05-16 Vincent Moens

We propose a new model for learning bilingual word representations from non-parallel document-aligned data. Following the recent advances in word representation learning, our model learns dense real-valued word vectors, that is, bilingual…

Computation and Language · Computer Science 2016-03-01 Ivan Vulić , Marie-Francine Moens

The meaning of a word often varies depending on its usage in different domains. The standard word embedding models struggle to represent this variation, as they learn a single global representation for a word. We propose a method to learn…

Computation and Language · Computer Science 2019-10-22 Lahari Poddar , Gyorgy Szarvas , Lea Frermann

State-of-the-art neural language models (LMs) represented by Transformers are highly complex. Their use of fixed, deterministic parameter estimates fail to account for model uncertainty and lead to over-fitting and poor generalization when…

Computation and Language · Computer Science 2021-02-10 Boyang Xue , Jianwei Yu , Junhao Xu , Shansong Liu , Shoukang Hu , Zi Ye , Mengzhe Geng , Xunying Liu , Helen Meng

Interpretability is crucial for machine learning in many scenarios such as quantitative finance, banking, healthcare, etc. Symbolic regression (SR) is a classic interpretable machine learning method by bridging X and Y using mathematical…

Methodology · Statistics 2020-01-17 Ying Jin , Weilin Fu , Jian Kang , Jiadong Guo , Jian Guo

Many Natural Language Processing applications nowadays rely on pre-trained word representations estimated from large text corpora such as news collections, Wikipedia and Web Crawl. In this paper, we show how to train high-quality word…

Computation and Language · Computer Science 2017-12-29 Tomas Mikolov , Edouard Grave , Piotr Bojanowski , Christian Puhrsch , Armand Joulin

State-of-the-art neural network language models (NNLMs) represented by long short term memory recurrent neural networks (LSTM-RNNs) and Transformers are becoming highly complex. They are prone to overfitting and poor generalization when…

Computation and Language · Computer Science 2022-08-30 Boyang Xue , Shoukang Hu , Junhao Xu , Mengzhe Geng , Xunying Liu , Helen Meng

We propose a Bayesian generative model for incorporating prior domain knowledge into hierarchical topic modeling. Although embedded topic models (ETMs) and its variants have gained promising performance in text analysis, they mainly focus…

Computation and Language · Computer Science 2022-09-29 Dongsheng Wang , Yishi Xu , Miaoge Li , Zhibin Duan , Chaojie Wang , Bo Chen , Mingyuan Zhou

Offline handwriting recognition (HWR) has improved significantly with the advent of deep learning architectures in recent years. Nevertheless, it remains a challenging problem and practical applications often rely on post-processing…

Computer Vision and Pattern Recognition · Computer Science 2023-09-20 Andrey Totev , Tomas Ward

We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data. The proposed approach infers probability distributions over the atoms of a discriminative dictionary using a Beta Process. It also…

Computer Vision and Pattern Recognition · Computer Science 2015-03-30 Naveed Akhtar , Faisal Shafait , Ajmal Mian

We use Bayesian optimization to learn curricula for word representation learning, optimizing performance on downstream tasks that depend on the learned representations as features. The curricula are modeled by a linear ranking function…

Computation and Language · Computer Science 2016-06-22 Yulia Tsvetkov , Manaal Faruqui , Wang Ling , Brian MacWhinney , Chris Dyer

Bayesian inference is known to provide a general framework for incorporating prior knowledge or specific properties into machine learning models via carefully choosing a prior distribution. In this work, we propose a new type of prior…

Machine Learning · Statistics 2019-02-20 Andrei Atanov , Arsenii Ashukha , Kirill Struminsky , Dmitry Vetrov , Max Welling

Collected data, which is used for analysis or prediction tasks, often have a hierarchical structure, for example, data from various people performing the same task. Modeling the data's structure can improve the reliability of the derived…

Applications · Statistics 2018-11-12 Dennis Becker

We propose Bayesian Hierarchical Invariant Prediction (BHIP) reframing Invariant Causal Prediction (ICP) through the lens of Hierarchical Bayes. We leverage the hierarchical structure to explicitly test invariance of causal mechanisms under…

Items in modern recommender systems are often organized in hierarchical structures. These hierarchical structures and the data within them provide valuable information for building personalized recommendation systems. In this paper, we…

Machine Learning · Computer Science 2019-08-21 Zitao Liu , Zhexuan Xu , Yan Yan
‹ Prev 1 2 3 10 Next ›