Related papers: Learning advanced mathematical computations from e…

Deep transformation models: Tackling complex regression problems with neural network based transformation models

We present a deep transformation model for probabilistic regression. Deep learning is known for outstandingly accurate predictions on complex data but in regression tasks, it is predominantly used to just predict a single number. This…

Machine Learning · Statistics 2020-04-02 Beate Sick, Torsten Hothorn, Oliver Dürr

Neural Networks for Parameter Estimation in Intractable Models

We propose to use deep learning to estimate parameters in statistical models when standard likelihood estimation methods are computationally infeasible. We show how to estimate parameters from max-stable processes, where inference is…

Methodology · Statistics 2021-08-02 Amanda Lenzi, Julie Bessac, Johann Rudi, Michael L. Stein

Nonlinear Systems Identification Using Deep Dynamic Neural Networks

Neural networks are known to be effective function approximators. Recently, deep neural networks have proven to be very effective in pattern recognition, classification tasks and human-level control to model highly nonlinear real-world…

Neural and Evolutionary Computing · Computer Science 2016-10-06 Olalekan Ogunmolu, Xuejun Gu, Steve Jiang, Nicholas Gans

A Selective Overview of Deep Learning

Deep learning has arguably achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks…

Machine Learning · Statistics 2019-04-16 Jianqing Fan, Cong Ma, Yiqiao Zhong

Emergent properties with repeated examples

We study the performance of transformers as a function of the number of repetitions of training examples with algorithmically generated datasets. On three problems of mathematics: the greatest common divisor, modular multiplication, and…

Machine Learning · Computer Science 2024-10-10 François Charton, Julia Kempe

Representation learning for a generalized, quantitative comparison of complex model outputs

Computational models are quantitative representations of systems. By analyzing and comparing the outputs of such models, it is possible to gain a better understanding of the system itself. Though as the complexity of model outputs…

Machine Learning · Computer Science 2022-12-13 Colin G. Cess, Stacey D. Finley

Learning stochastic dynamics and predicting emergent behavior using transformers

We show that a neural network originally designed for language processing can learn the dynamical rules of a stochastic system by observation of a single dynamical trajectory of the system, and can accurately predict its emergent behavior…

Statistical Mechanics · Physics 2022-02-18 Corneel Casert, Isaac Tamblyn, Stephen Whitelam

Modeling rapid language learning by distilling Bayesian priors into artificial neural networks

Humans can learn languages from remarkably little experience. Developing computational models that explain this ability has been a major challenge in cognitive science. Bayesian models that build in strong inductive biases - factors that…

Computation and Language · Computer Science 2023-05-25 R. Thomas McCoy, Thomas L. Griffiths

Intriguing properties of neural networks

Deep neural networks are highly expressive models that have recently achieved state of the art performance on speech and visual recognition tasks. While their expressiveness is the reason they succeed, it also causes them to learn…

Computer Vision and Pattern Recognition · Computer Science 2014-02-20 Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, Rob Fergus

Stretched and measured neural predictions of complex network dynamics

Differential equations are a ubiquitous tool to study dynamics, ranging from physical systems to complex systems, where a large number of agents interact through a graph with non-trivial topological features. Data-driven approximations of…

Statistical Mechanics · Physics 2024-04-26 Vaiva Vasiliauskaite, Nino Antulov-Fantulin

Constructing Neural Network-Based Models for Simulating Dynamical Systems

Dynamical systems see widespread use in natural sciences like physics, biology, chemistry, as well as engineering disciplines such as circuit analysis, computational fluid dynamics, and control. For simple systems, the differential…

Machine Learning · Computer Science 2022-07-26 Christian Møldrup Legaard, Thomas Schranz, Gerald Schweiger, Ján Drgoňa, Basak Falay, Cláudio Gomes, Alexandros Iosifidis, Mahdi Abkar, Peter Gorm Larsen

The Unreasonable Effectiveness of Solving Inverse Problems with Neural Networks

Finding model parameters from data is an essential task in science and engineering, from weather and climate forecasts to plasma control. Previous works have employed neural networks to greatly accelerate finding solutions to inverse…

Machine Learning · Computer Science 2024-08-16 Philipp Holl, Nils Thuerey

How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model

Deep learning algorithms demonstrate a surprising ability to learn high-dimensional tasks from limited examples. This is commonly attributed to the depth of neural networks, enabling them to build a hierarchy of abstract, low-dimensional…

Machine Learning · Computer Science 2024-07-04 Francesco Cagnetta, Leonardo Petrini, Umberto M. Tomasini, Alessandro Favero, Matthieu Wyart

A model is worth tens of thousands of examples

Traditional signal processing methods relying on mathematical data generation models have been cast aside in favour of deep neural networks, which require vast amounts of data. Since the theoretical sample complexity is nearly impossible to…

Machine Learning · Computer Science 2023-03-21 Thomas Dagès, Laurent D. Cohen, Alfred M. Bruckstein

Modular Networks: Learning to Decompose Neural Computation

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number…

Machine Learning · Computer Science 2018-11-14 Louis Kirsch, Julius Kunze, David Barber

Learning Constitutive Relations from Indirect Observations Using Deep Neural Networks

We present a new approach for predictive modeling and its uncertainty quantification for mechanical systems, where coarse-grained models such as constitutive relations are derived directly from observation data. We explore the use of a…

Numerical Analysis · Mathematics 2020-06-24 Daniel Z. Huang, Kailai Xu, Charbel Farhat, Eric Darve

Mathematics of Deep Learning

Recently there has been a dramatic increase in the performance of recognition systems due to the introduction of deep architectures for representation learning and classification. However, the mathematical reasons for this success remain…

Machine Learning · Computer Science 2017-12-14 Rene Vidal, Joan Bruna, Raja Giryes, Stefano Soatto

Machine learning phases of matter

Neural networks can be used to identify phases and phase transitions in condensed matter systems via supervised machine learning. Readily programmable through modern software libraries, we show that a standard feed-forward neural network…

Strongly Correlated Electrons · Physics 2017-05-24 Juan Carrasquilla, Roger G. Melko

Deep Neural Networks for Pattern Recognition

In the field of pattern recognition research, the method of using deep neural networks based on improved computing hardware recently attracted attention because of their superior accuracy compared to conventional methods. Deep neural…

Computer Vision and Pattern Recognition · Computer Science 2018-09-27 Kyongsik Yun, Alexander Huyen, Thomas Lu

Learning Randomized Algorithms with Transformers

Randomization is a powerful tool that endows algorithms with remarkable properties. For instance, randomized algorithms excel in adversarial settings, often surpassing the worst-case performance of deterministic algorithms with large…

Machine Learning · Computer Science 2024-08-21 Johannes von Oswald, Seijin Kobayashi, Yassir Akram, Angelika Steger