Related papers: Multilinear Map Layer: Prediction Regularization b…

200 papers

Various natural language processing tasks are structured prediction problems where outputs are constructed with multiple interdependent decisions. Past work has shown that domain knowledge, framed as constraints over the output space, can…

Computation and Language · Computer Science 2020-06-03 Xingyuan Pan, Maitrey Mehta, Vivek Srikumar

In this paper, we propose and study a technique to reduce the number of parameters and computation time in convolutional neural networks. We use the Kronecker product to exploit the local structures within convolution and fully-connected…

Computer Vision and Pattern Recognition · Computer Science 2016-02-05 Shuchang Zhou, Jia-Nan Wu, Yuxin Wu, Xinyu Zhou

In this paper, we propose and study a technique to reduce the number of parameters and computation time in fully-connected layers of neural networks using the Kronecker product, at a mild cost in prediction quality. The technique proceeds by…

Neural and Evolutionary Computing · Computer Science 2015-07-23 Shuchang Zhou, Jia-Nan Wu

Neural networks can emulate nonlinear physical systems with high accuracy, yet they may produce physically-inconsistent results when violating fundamental constraints. Here, we introduce a systematic way of enforcing nonlinear analytic…

Computational Physics · Physics 2021-03-10 Tom Beucler, Michael Pritchard, Stephan Rasp, Jordan Ott, Pierre Baldi, Pierre Gentine

In this paper, we are interested in constructing general graph-based regularizers for multiple kernel learning (MKL) given a structure which is used to describe the way of combining basis kernels. Such structures are represented by…

Machine Learning · Statistics 2014-02-14 Ziming Zhang

A new computationally simple method of imposing hard convex constraints on the neural network output values is proposed. The key idea behind the method is to map a vector of hidden parameters of the network to a point that is guaranteed to…

Machine Learning · Computer Science 2023-07-21 Andrei V. Konstantinov, Lev V. Utkin

Psychiatric neuroscience is increasingly aware of the need to define psychopathology in terms of abnormal neural computation. The central tool in this endeavour is the fitting of computational models to behavioural data. The most prominent…

Quantitative Methods · Quantitative Biology 2018-03-28 Abraham Nunes, Alexander Rudiuk

Design Structure Matrix (DSM) modularization, the task of partitioning system elements into cohesive modules, is a fundamental combinatorial challenge in engineering design. Traditional methods treat modularization as a pure graph…

Computational Engineering, Finance, and Science · Computer Science 2026-05-01 Shuo Jiang, Jianxi Luo

Through the lens of multilevel model (MLM) specification and regularization, this is a connect-the-dots introductory summary of Small Area Estimation, e.g. small group prediction informed by a complex sampling design. While a comprehensive…

Methodology · Statistics 2018-05-23 Michael Tzen

This paper presents a deep learning-based de-homogenization method for structural compliance minimization. By using a convolutional neural network to parameterize the mapping from a set of lamination parameters on a coarse mesh to a…

Machine Learning · Computer Science 2021-11-03 Martin O. Elingaard, Niels Aage, J. Andreas Bærentzen, Ole Sigmund

Neural language modeling (LM) has led to significant improvements in several applications, including Automatic Speech Recognition. However, they typically require large amounts of training data, which is not available for many domains and…

Computation and Language · Computer Science 2019-06-05 Navid Rekabsaz, Nikolaos Pappas, James Henderson, Banriskhem K. Khonglah, Srikanth Madikeri

Deep learning using neural networks is an effective technique for generating models of complex data. However, training such models can be expensive when networks have large model capacity resulting from a large number of layers and nodes.…

Machine Learning · Computer Science 2023-01-19 Jarom D. Hogue, Robert M. Kirby, Akil Narayan

Due to the huge number of parameters, fine-tuning of pretrained language models (PLMs) is prone to overfitting in low-resource scenarios. In this work, we present a novel method that operates on the hidden representations of a PLM to…

Computation and Language · Computer Science 2023-05-29 Linlin Liu, Xingxuan Li, Megh Thakkar, Xin Li, Shafiq Joty, Luo Si, Lidong Bing

Deep learning models have proven to be exceptionally useful in performing many machine learning tasks. However, for each new dataset, choosing an effective size and structure of the model can be a time-consuming process of trial and error.…

Machine Learning · Computer Science 2019-08-08 Roozbeh Yousefzadeh, Dianne P O'Leary

LayerNorm and RMSNorm impose fundamentally different geometric constraints on their outputs - and this difference has a precise, quantifiable consequence for model complexity. We prove that LayerNorm's mean-centering step, by confining data…

Machine Learning · Computer Science 2026-03-31 Sungbae Chun

We seek to impose linear equality constraints in feedforward neural networks. As top layer predictors are usually nonlinear, this is a difficult task if we seek to deploy standard convex optimization methods and strong duality. To overcome…

Machine Learning · Computer Science 2023-01-10 Anand Rangarajan, Pan He, Jaemoon Lee, Tania Banerjee, Sanjay Ranka

Regularization is an effective way to promote the generalization performance of machine learning models. In this paper, we focus on label smoothing, a form of output distribution regularization that prevents overfitting of a neural network…

Machine Learning · Computer Science 2020-07-07 Weizhi Li, Gautam Dasarathy, Visar Berisha

Deep learning models are increasingly deployed in safety-critical tasks where predictions must satisfy hard constraints, such as physical laws, fairness requirements, or safety limits. However, standard architectures lack built-in…

Machine Learning · Computer Science 2025-11-26 Gonzalo E. Constante-Flores, Hao Chen, Can Li

We propose the Limited Multi-Label (LML) projection layer as a new primitive operation for end-to-end learning systems. The LML layer provides a probabilistic way of modeling multi-label predictions limited to having exactly k labels. We…

Machine Learning · Computer Science 2019-10-15 Brandon Amos, Vladlen Koltun, J. Zico Kolter

Unsupervised spectral unmixing consists of representing each observed pixel as a combination of several pure materials called endmembers with their corresponding abundance fractions. Beyond the linear assumption, various nonlinear unmixing…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Tingting Fang, Fei Zhu, Jie Chen