English
Related papers

Related papers: Learning by stochastic serializations

200 papers

A number of machine learning models have been proposed with the goal of achieving systematic generalization: the ability to reason about new situations by combining aspects of previous experiences. These models leverage compositional…

Machine Learning · Computer Science 2024-09-24 Devon Jarvis , Richard Klein , Benjamin Rosman , Andrew M. Saxe

We propose and analyze a regularization approach for structured prediction problems. We characterize a large class of loss functions that allows to naturally embed structured outputs in a linear space. We exploit this fact to design…

Machine Learning · Computer Science 2017-07-31 Carlo Ciliberto , Alessandro Rudi , Lorenzo Rosasco

In forecasting multiple time series, accounting for the individual features of each sequence can be challenging. To address this, modern deep learning methods for time series analysis combine a shared (global) model with local layers,…

Machine Learning · Computer Science 2025-02-14 Luca Butera , Giovanni De Felice , Andrea Cini , Cesare Alippi

This paper investigates a new learning formulation called structured sparsity, which is a natural extension of the standard sparsity concept in statistical learning and compressive sensing. By allowing arbitrary structures on the feature…

Methodology · Statistics 2009-05-05 Junzhou Huang , Tong Zhang , Dimitris Metaxas

In this paper we propose a Bayesian method for estimating architectural parameters of neural networks, namely layer size and network depth. We do this by learning concrete distributions over these parameters. Our results show that regular…

Machine Learning · Statistics 2019-01-29 Georgi Dikov , Patrick van der Smagt , Justin Bayer

Despite the remarkable success of large large-scale neural networks, we still lack unified notation for thinking about and describing their representational spaces. We lack methods to reliably describe how their representations are…

Machine Learning · Computer Science 2025-06-02 Henry Conklin

Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered…

Machine Learning · Computer Science 2020-06-25 Jary Pomponi , Simone Scardapane , Vincenzo Lomonaco , Aurelio Uncini

Continual Learning is a learning paradigm where learning systems are trained with sequential or streaming tasks. Two notable directions among the recent advances in continual learning with neural networks are ($i$) variational Bayes based…

Machine Learning · Computer Science 2020-02-24 Abhishek Kumar , Sunabha Chatterjee , Piyush Rai

There is growing body of learning problems for which it is natural to organize the parameters into matrix, so as to appropriately regularize the parameters under some matrix norm (in order to impose some more sophisticated prior knowledge).…

Machine Learning · Computer Science 2010-10-19 Sham M. Kakade , Shai Shalev-Shwartz , Ambuj Tewari

Recent systems on structured prediction focus on increasing the level of structural dependencies within the model. However, our study suggests that complex structures entail high overfitting risks. To control the structure-based…

Machine Learning · Computer Science 2017-11-29 Xu Sun , Weiwei Sun , Shuming Ma , Xuancheng Ren , Yi Zhang , Wenjie Li , Houfeng Wang

There is mounting evidence that existing neural network models, in particular the very popular sequence-to-sequence architecture, struggle to systematically generalize to unseen compositions of seen components. We demonstrate that one of…

Computation and Language · Computer Science 2022-03-23 Hao Zheng , Mirella Lapata

Many prediction problems, such as those that arise in the context of robotics, have a simplifying underlying structure that, if known, could accelerate learning. In this paper, we present a strategy for learning a set of neural network…

Machine Learning · Computer Science 2019-05-06 Ferran Alet , Tomás Lozano-Pérez , Leslie P. Kaelbling

The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to…

Computation and Language · Computer Science 2017-07-26 Jonas Gehring , Michael Auli , David Grangier , Denis Yarats , Yann N. Dauphin

We propose a novel probabilistic dimensionality reduction framework that can naturally integrate the generative model and the locality information of data. Based on this framework, we present a new model, which is able to learn a smooth…

Machine Learning · Statistics 2016-10-18 Li Wang

Estimating graphical model structure from high-dimensional and undersampled data is a fundamental problem in many scientific fields. Existing approaches, such as GLASSO, latent variable GLASSO, and latent tree models, suffer from high…

Machine Learning · Statistics 2019-09-18 Greg Ver Steeg , Hrayr Harutyunyan , Daniel Moyer , Aram Galstyan

Key to structured prediction is exploiting the problem structure to simplify the learning process. A major challenge arises when data exhibit a local structure (e.g., are made by "parts") that can be leveraged to better approximate the…

Machine Learning · Statistics 2019-06-03 Carlo Ciliberto , Francis Bach , Alessandro Rudi

Set prediction is about learning to predict a collection of unordered variables with unknown interrelations. Training such models with set losses imposes the structure of a metric space over sets. We focus on stochastic and underdefined…

Machine Learning · Computer Science 2021-02-23 David W. Zhang , Gertjan J. Burghouts , Cees G. M. Snoek

Recursive Neural Networks are non-linear adaptive models that are able to learn deep structured information. However, these models have not yet been broadly accepted. This fact is mainly due to its inherent complexity. In particular, not…

Neural and Evolutionary Computing · Computer Science 2009-11-18 Alejandro Chinea

Collaborative learning has successfully applied knowledge transfer to guide a pool of small student networks towards robust local minima. However, previous approaches typically struggle with drastically aggravated student homogenization…

Machine Learning · Computer Science 2021-02-23 Shaoxiong Feng , Hongshen Chen , Xuancheng Ren , Zhuoye Ding , Kan Li , Xu Sun

Many scientific datasets are of high dimension, and the analysis usually requires visual manipulation by retaining the most important structures of data. Principal curve is a widely used approach for this purpose. However, many existing…

Artificial Intelligence · Computer Science 2016-01-19 Qi Mao , Li Wang , Ivor W. Tsang , Yijun Sun
‹ Prev 1 2 3 10 Next ›