Related papers: iNALU: Improved Neural Arithmetic Logic Unit

200 papers

Neural networks can learn to represent and manipulate numerical information, but they seldom generalize well outside of the range of numerical values encountered during training. To encourage more systematic numerical extrapolation, we…

Neural and Evolutionary Computing · Computer Science 2018-08-03 Andrew Trask, Felix Hill, Scott Reed, Jack Rae, Chris Dyer, Phil Blunsom

Neural Arithmetic Logic Modules have become a growing area of interest, though remain a niche field. These modules are neural networks which aim to achieve systematic generalisation in learning arithmetic and/or logic operations such as…

Neural and Evolutionary Computing · Computer Science 2022-08-09 Bhumika Mistry, Katayoun Farrahi, Jonathon Hare

The Neural Arithmetic Logic Unit (NALU) is a neural network layer that can learn exact arithmetic operations between the elements of a hidden state. The goal of NALU is to learn perfect extrapolation, which requires learning the exact…

Machine Learning · Computer Science 2019-11-11 Andreas Madsen, Alexander Rosenberg Johansen

Neural networks can approximate complex functions, but they struggle to perform exact arithmetic operations over real numbers. The lack of inductive bias for arithmetic operations leaves neural networks without the underlying logic…

Neural and Evolutionary Computing · Computer Science 2020-01-16 Andreas Madsen, Alexander Rosenberg Johansen

The main problem for neural network models trained to count instances is that generalization error increases whenever the test range exceeds the training range, i.e. they are not good generalizers outside the training range. Consider the case…

Machine Learning · Computer Science 2020-06-16 Ashish Rana, Taranveer Singh, Harpreet Singh, Neeraj Kumar, Prashant Singh Rana

Neural networks, as currently designed, fall short of achieving true logical intelligence. Modern AI models rely on standard neural computation (inner-product-based transformations and nonlinear activations) to approximate patterns from data.…

Artificial Intelligence · Computer Science 2025-02-05 Youngsung Kim

Conventional Neural Networks can approximate simple arithmetic operations, but fail to generalize beyond the range of numbers that were seen during training. Neural Arithmetic Units aim to overcome this difficulty, but current arithmetic…

Machine Learning · Computer Science 2020-12-18 Niklas Heim, Tomáš Pevný, Václav Šmídl

Recent years have witnessed the great success of deep neural networks in many research areas. The fundamental idea behind the design of most neural networks is to learn similarity patterns from data for prediction and inference, which lacks…

Artificial Intelligence · Computer Science 2019-10-22 Shaoyun Shi, Hanxiong Chen, Min Zhang, Yongfeng Zhang

Deep Neural Networks (DNNs) have provably enhanced the state-of-the-art Neural Machine Translation (NMT) with their capability in modeling complex functions and capturing complex linguistic structures. However, NMT systems with deep…

Computation and Language · Computer Science 2017-05-03 Mingxuan Wang, Zhengdong Lu, Jie Zhou, Qun Liu

We propose the Neural Logic Machine (NLM), a neural-symbolic architecture for both inductive learning and logic reasoning. NLMs exploit the power of both neural networks, as function approximators, and logic programming, as a symbolic…

Artificial Intelligence · Computer Science 2019-04-29 Honghua Dong, Jiayuan Mao, Tian Lin, Chong Wang, Lihong Li, Denny Zhou

The Domain Mixed Unit (DMU) is a new neural arithmetic unit that learns a single parameter gate that mixes between log-space and linear-space representations while performing either addition (DMU add) or subtraction (DMU sub). Two…

Machine Learning · Computer Science 2025-09-16 Paul Curry

Neural Networks (NNs) are the method of choice for building learning algorithms. Their popularity stems from their empirical success on several challenging learning problems. However, most scholars agree that a convincing theoretical…

Numerical Analysis · Mathematics 2021-01-01 Ronald DeVore, Boris Hanin, Guergana Petrova

Natural Language Understanding (NLU) is a branch of Natural Language Processing (NLP) that uses intelligent computer software to understand texts that encode human knowledge. Recent years have witnessed notable progress across various NLU…

Computation and Language · Computer Science 2022-03-01 Xinliang Frederick Zhang

Chain-of-Thought (CoT) prompting has shown promise in enhancing the reasoning capabilities of large language models (LLMs) by generating natural language (NL) rationales that lead to the final answer. However, it struggles with numerical…

Artificial Intelligence · Computer Science 2025-02-13 Cheryl Li, Tianyuan Xu, Yiwen Guo

We propose the Moderate Adaptive Linear Unit (MoLU), a novel activation function for deep neural networks, defined analytically as: f(x) = x × (1 + tanh(x))/2. MoLU combines mathematical elegance with empirical effectiveness, exhibiting…

Machine Learning · Computer Science 2025-07-16 Hankyul Koh, Joon-hyuk Ko, Wonho Jhe

As computers handle more and more complicated tasks in variable environments, it becomes an urgent requirement that artificial intelligence be able to judge and decide automatically according to numerous specific…

Neural and Evolutionary Computing · Computer Science 2017-08-03 Gang Wang

Inferring the relations between two images is an important class of tasks in computer vision. Examples of such tasks include computing optical flow and stereo disparity. We treat the relation inference tasks as a machine learning problem…

Computer Vision and Pattern Recognition · Computer Science 2019-03-13 Yao Lu, Zhirong Yang, Juho Kannala, Samuel Kaski

Recent years have witnessed the success of deep neural networks in many research areas. The fundamental idea behind the design of most neural networks is to learn similarity patterns from data for prediction and inference, which lacks the…

Machine Learning · Computer Science 2020-08-24 Shaoyun Shi, Hanxiong Chen, Weizhi Ma, Jiaxin Mao, Min Zhang, Yongfeng Zhang

Neural networks (NNs) have been successfully deployed in various fields. In NNs, a large number of multiply-accumulate (MAC) operations need to be performed. Most existing digital hardware platforms rely on parallel MAC units to accelerate…

Systems and Control · Electrical Eng. & Systems 2023-09-20 Kangwei Xu, Grace Li Zhang, Ulf Schlichtmann, Bing Li

Logic-based problems such as planning, theorem proving, or puzzles typically involve combinatoric search and structured knowledge representation. Artificial neural networks are very successful statistical learners; however, for many years,…

Machine Learning · Computer Science 2017-12-11 Gadi Pinkas, Shimon Cohen