Related papers: Neural Arithmetic Units

iNALU: Improved Neural Arithmetic Logic Unit

Neural networks have to capture mathematical relationships in order to learn various tasks. They approximate these relations implicitly and therefore often do not generalize well. The recently proposed Neural Arithmetic Logic Unit (NALU) is…

Neural and Evolutionary Computing · Computer Science 2020-03-18 Daniel Schlör , Markus Ring , Andreas Hotho

Neural Power Units

Conventional Neural Networks can approximate simple arithmetic operations, but fail to generalize beyond the range of numbers that were seen during training. Neural Arithmetic Units aim to overcome this difficulty, but current arithmetic…

Machine Learning · Computer Science 2020-12-18 Niklas Heim , Tomáš Pevný , Václav Šmídl

Neural Arithmetic Logic Units

Neural networks can learn to represent and manipulate numerical information, but they seldom generalize well outside of the range of numerical values encountered during training. To encourage more systematic numerical extrapolation, we…

Neural and Evolutionary Computing · Computer Science 2018-08-03 Andrew Trask , Felix Hill , Scott Reed , Jack Rae , Chris Dyer , Phil Blunsom

The Domain Mixed Unit: A New Neural Arithmetic Layer

The Domain Mixed Unit (DMU) is a new neural arithmetic unit that learns a single parameter gate that mixes between log-space and linear-space representations while performing either addition (DMU add) or subtraction (DMU sub). Two…

Machine Learning · Computer Science 2025-09-16 Paul Curry

A Primer for Neural Arithmetic Logic Modules

Neural Arithmetic Logic Modules have become a growing area of interest, though remain a niche field. These modules are neural networks which aim to achieve systematic generalisation in learning arithmetic and/or logic operations such as…

Neural and Evolutionary Computing · Computer Science 2022-08-09 Bhumika Mistry , Katayoun Farrahi , Jonathon Hare

Measuring Arithmetic Extrapolation Performance

The Neural Arithmetic Logic Unit (NALU) is a neural network layer that can learn exact arithmetic operations between the elements of a hidden state. The goal of NALU is to learn perfect extrapolation, which requires learning the exact…

Machine Learning · Computer Science 2019-11-11 Andreas Madsen , Alexander Rosenberg Johansen

Learning Division with Neural Arithmetic Logic Modules

To achieve systematic generalisation, it first makes sense to master simple tasks such as arithmetic. Of the four fundamental arithmetic operations (+,-,$\times$,$\div$), division is considered the most difficult for both humans and…

Neural and Evolutionary Computing · Computer Science 2021-10-13 Bhumika Mistry , Katayoun Farrahi , Jonathon Hare

Improving the Robustness of Neural Multiplication Units with Reversible Stochasticity

Multilayer Perceptrons struggle to learn certain simple arithmetic tasks. Specialist neural modules for arithmetic can outperform classical architectures with gains in extrapolation, interpretability and convergence speeds, but are highly…

Machine Learning · Computer Science 2022-11-11 Bhumika Mistry , Katayoun Farrahi , Jonathon Hare

Teaching Neural Module Networks to Do Arithmetic

Answering complex questions that require multi-step multi-type reasoning over raw text is challenging, especially when conducting numerical reasoning. Neural Module Networks(NMNs), follow the programmer-interpreter framework and design…

Computation and Language · Computer Science 2022-10-07 Jiayi Chen , Xiao-Yu Guo , Yuan-Fang Li , Gholamreza Haffari

Neural Network Approximation

Neural Networks (NNs) are the method of choice for building learning algorithms. Their popularity stems from their empirical success on several challenging learning problems. However, most scholars agree that a convincing theoretical…

Numerical Analysis · Mathematics 2021-01-01 Ronald DeVore , Boris Hanin , Guergana Petrova

Systematically designing better instance counting models on cell images with Neural Arithmetic Logic Units

The big problem for neural network models which are trained to count instances is that whenever test range goes high training range generalization error increases i.e. they are not good generalizers outside training range. Consider the case…

Machine Learning · Computer Science 2020-06-16 Ashish Rana , Taranveer Singh , Harpreet Singh , Neeraj Kumar , Prashant Singh Rana

Learning Numeracy: Binary Arithmetic with Neural Turing Machines

One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their recurrent connections. Neural Turing Machines (NTMs) attempt to mitigate this issue by…

Neural and Evolutionary Computing · Computer Science 2024-12-20 Jacopo Castellini

A Corrective View of Neural Networks: Representation, Memorization and Learning

We develop a corrective mechanism for neural network approximation: the total available non-linear units are divided into multiple groups and the first group approximates the function under consideration, the second group approximates the…

Machine Learning · Computer Science 2020-06-23 Guy Bresler , Dheeraj Nagaraj

Universal Adder Neural Networks

Compared with cheap addition operation, multiplication operation is of much higher computation complexity. The widely-used convolutions in deep neural networks are exactly cross-correlation to measure the similarity between input feature…

Computer Vision and Pattern Recognition · Computer Science 2021-06-30 Hanting Chen , Yunhe Wang , Chang Xu , Chao Xu , Chunjing Xu , Tong Zhang

Neural Programmer: Inducing Latent Programs with Gradient Descent

Deep neural networks have achieved impressive supervised classification performance in many tasks including image recognition, speech recognition, and sequence to sequence learning. However, this success has not been translated to…

Machine Learning · Computer Science 2016-08-05 Arvind Neelakantan , Quoc V. Le , Ilya Sutskever

Neural Network Processing Neural Networks: An efficient way to learn higher order functions

Functions are rich in meaning and can be interpreted in a variety of ways. Neural networks were proven to be capable of approximating a large class of functions[1]. In this paper, we propose a new class of neural networks called "Neural…

Machine Learning · Computer Science 2020-01-16 Firat Tuna

Deep vs. shallow networks : An approximation theory perspective

The paper briefy reviews several recent results on hierarchical architectures for learning from examples, that may formally explain the conditions under which Deep Convolutional Neural Networks perform much better in function approximation…

Machine Learning · Computer Science 2016-08-12 Hrushikesh Mhaskar , Tomaso Poggio

Arithmetic addition of two integers by deep image classification networks: experiments to quantify their autonomous reasoning ability

The unprecedented performance achieved by deep convolutional neural networks for image classification is linked primarily to their ability of capturing rich structural features at various layers within networks. Here we design a series of…

Computer Vision and Pattern Recognition · Computer Science 2019-12-11 Shuaicheng Liu , Zehao Zhang , Kai Song , Bing Zeng

Diverse Neural Network Learns True Target Functions

Neural networks are a powerful class of functions that can be trained with simple gradient descent to achieve state-of-the-art performance on a variety of applications. Despite their practical success, there is a paucity of results that…

Machine Learning · Computer Science 2017-03-06 Bo Xie , Yingyu Liang , Le Song

Can Neural Networks Learn Small Algebraic Worlds? An Investigation Into the Group-theoretic Structures Learned By Narrow Models Trained To Predict Group Operations

While a real-world research program in mathematics may be guided by a motivating question, the process of mathematical discovery is typically open-ended. Ideally, exploration needed to answer the original question will reveal new…

Machine Learning · Computer Science 2026-01-30 Henry Kvinge , Andrew Aguilar , Nayda Farnsworth , Grace O'Brien , Robert Jasper , Sarah Scullen , Helen Jenne