Related papers: Multilinear Map Layer: Prediction Regularization b…

Learning Constraints for Structured Prediction Using Rectifier Networks

Various natural language processing tasks are structured prediction problems where outputs are constructed with multiple interdependent decisions. Past work has shown that domain knowledge, framed as constraints over the output space, can…

Computation and Language · Computer Science 2020-06-03 Xingyuan Pan , Maitrey Mehta , Vivek Srikumar

Exploiting Local Structures with the Kronecker Layer in Convolutional Networks

In this paper, we propose and study a technique to reduce the number of parameters and computation time in convolutional neural networks. We use Kronecker product to exploit the local structures within convolution and fully-connected…

Computer Vision and Pattern Recognition · Computer Science 2016-02-05 Shuchang Zhou , Jia-Nan Wu , Yuxin Wu , Xinyu Zhou

Compression of Fully-Connected Layer in Neural Network by Kronecker Product

In this paper we propose and study a technique to reduce the number of parameters and computation time in fully-connected layers of neural networks using Kronecker product, at a mild cost of the prediction quality. The technique proceeds by…

Neural and Evolutionary Computing · Computer Science 2015-07-23 Shuchang Zhou , Jia-Nan Wu

Enforcing Analytic Constraints in Neural-Networks Emulating Physical Systems

Neural networks can emulate nonlinear physical systems with high accuracy, yet they may produce physically-inconsistent results when violating fundamental constraints. Here, we introduce a systematic way of enforcing nonlinear analytic…

Computational Physics · Physics 2021-03-10 Tom Beucler , Michael Pritchard , Stephan Rasp , Jordan Ott , Pierre Baldi , Pierre Gentine

Regularization for Multiple Kernel Learning via Sum-Product Networks

In this paper, we are interested in constructing general graph-based regularizers for multiple kernel learning (MKL) given a structure which is used to describe the way of combining basis kernels. Such structures are represented by…

Machine Learning · Statistics 2014-02-14 Ziming Zhang

A New Computationally Simple Approach for Implementing Neural Networks with Output Hard Constraints

A new computationally simple method of imposing hard convex constraints on the neural network output values is proposed. The key idea behind the method is to map a vector of hidden parameters of the network to a point that is guaranteed to…

Machine Learning · Computer Science 2023-07-21 Andrei V. Konstantinov , Lev V. Utkin

The Importance of Constraint Smoothness for Parameter Estimation in Computational Cognitive Modeling

Psychiatric neuroscience is increasingly aware of the need to define psychopathology in terms of abnormal neural computation. The central tool in this endeavour is the fitting of computational models to behavioural data. The most prominent…

Quantitative Methods · Quantitative Biology 2018-03-28 Abraham Nunes , Alexander Rudiuk

Design Structure Matrix Modularization with Large Language Models

Design Structure Matrix (DSM) modularization, the task of partitioning system elements into cohesive modules, is a fundamental combinatorial challenge in engineering design. Traditional methods treat modularization as a pure graph…

Computational Engineering, Finance, and Science · Computer Science 2026-05-01 Shuo Jiang , Jianxi Luo

Multilevel Models Allow Modular Specification of What and Where to Regularize, Especially in Small Area Estimation

Through the lense of multilevel model (MLM) specification and regularization, this is a connect-the-dots introductory summary of Small Area Estimation, e.g. small group prediction informed by a complex sampling design. While a comprehensive…

Methodology · Statistics 2018-05-23 Michael Tzen

De-homogenization using Convolutional Neural Networks

This paper presents a deep learning-based de-homogenization method for structural compliance minimization. By using a convolutional neural network to parameterize the mapping from a set of lamination parameters on a coarse mesh to a…

Machine Learning · Computer Science 2021-11-03 Martin O. Elingaard , Niels Aage , J. Andreas Bærentzen , Ole Sigmund

Regularization Advantages of Multilingual Neural Language Models for Low Resource Domains

Neural language modeling (LM) has led to significant improvements in several applications, including Automatic Speech Recognition. However, they typically require large amounts of training data, which is not available for many domains and…

Computation and Language · Computer Science 2019-06-05 Navid Rekabsaz , Nikolaos Pappas , James Henderson , Banriskhem K. Khonglah , Srikanth Madikeri

Weight Matrix Dimensionality Reduction in Deep Learning via Kronecker Multi-layer Architectures

Deep learning using neural networks is an effective technique for generating models of complex data. However, training such models can be expensive when networks have large model capacity resulting from a large number of layers and nodes.…

Machine Learning · Computer Science 2023-01-19 Jarom D. Hogue , Robert M. Kirby , Akil Narayan

Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations

Due to the huge amount of parameters, fine-tuning of pretrained language models (PLMs) is prone to overfitting in the low resource scenarios. In this work, we present a novel method that operates on the hidden representations of a PLM to…

Computation and Language · Computer Science 2023-05-29 Linlin Liu , Xingxuan Li , Megh Thakkar , Xin Li , Shafiq Joty , Luo Si , Lidong Bing

Refining the Structure of Neural Networks Using Matrix Conditioning

Deep learning models have proven to be exceptionally useful in performing many machine learning tasks. However, for each new dataset, choosing an effective size and structure of the model can be a time-consuming process of trial and error.…

Machine Learning · Computer Science 2019-08-08 Roozbeh Yousefzadeh , Dianne P O'Leary

The Geometric Cost of Normalization: Affine Bounds on the Bayesian Complexity of Neural Networks

LayerNorm and RMSNorm impose fundamentally different geometric constraints on their outputs - and this difference has a precise, quantifiable consequence for model complexity. We prove that LayerNorm's mean-centering step, by confining data…

Machine Learning · Computer Science 2026-03-31 Sungbae Chun

Expressing linear equality constraints in feedforward neural networks

We seek to impose linear, equality constraints in feedforward neural networks. As top layer predictors are usually nonlinear, this is a difficult task if we seek to deploy standard convex optimization methods and strong duality. To overcome…

Machine Learning · Computer Science 2023-01-10 Anand Rangarajan , Pan He , Jaemoon Lee , Tania Banerjee , Sanjay Ranka

Regularization via Structural Label Smoothing

Regularization is an effective way to promote the generalization performance of machine learning models. In this paper, we focus on label smoothing, a form of output distribution regularization that prevents overfitting of a neural network…

Machine Learning · Computer Science 2020-07-07 Weizhi Li , Gautam Dasarathy , Visar Berisha

Enforcing Hard Linear Constraints in Deep Learning Models with Decision Rules

Deep learning models are increasingly deployed in safety-critical tasks where predictions must satisfy hard constraints, such as physical laws, fairness requirements, or safety limits. However, standard architectures lack built-in…

Machine Learning · Computer Science 2025-11-26 Gonzalo E. Constante-Flores , Hao Chen , Can Li

The Limited Multi-Label Projection Layer

We propose the Limited Multi-Label (LML) projection layer as a new primitive operation for end-to-end learning systems. The LML layer provides a probabilistic way of modeling multi-label predictions limited to having exactly k labels. We…

Machine Learning · Computer Science 2019-10-15 Brandon Amos , Vladlen Koltun , J. Zico Kolter

Nonlinear Hyperspectral Unmixing based on Multilinear Mixing Model using Convolutional Autoencoders

Unsupervised spectral unmixing consists of representing each observed pixel as a combination of several pure materials called endmembers with their corresponding abundance fractions. Beyond the linear assumption, various nonlinear unmixing…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Tingting Fang , Fei Zhu , Jie Chen