Related papers: Sequence to General Tree: Knowledge-Guided Geometr…

Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem

The celebrated Seq2Seq technique and its numerous variants achieve excellent performance on many tasks such as neural machine translation, semantic parsing, and math word problem solving. However, these models either only consider input…

Computation and Language · Computer Science 2020-10-07 Shucheng Li, Lingfei Wu, Shiwei Feng, Fangli Xu, Fengyuan Xu, Sheng Zhong

Translating a Math Word Problem to an Expression Tree

Sequence-to-sequence (SEQ2SEQ) models have been successfully applied to automatic math word problem solving. Despite its simplicity, a drawback still remains: a math word problem can be correctly solved by more than one equation. This…

Computation and Language · Computer Science 2018-11-16 Lei Wang, Yan Wang, Deng Cai, Dongxiang Zhang, Xiaojiang Liu

A Self-explaining Neural Architecture for Generalizable Concept Learning

With the wide proliferation of Deep Neural Networks in high-stake applications, there is a growing demand for explainability behind their decision-making process. Concept learning models attempt to learn high-level 'concepts' - abstract…

Machine Learning · Computer Science 2024-05-07 Sanchit Sinha, Guangzhi Xiong, Aidong Zhang

Deep Trees for (Un)structured Data: Tractability, Performance, and Interpretability

Decision Trees have remained a popular machine learning method for tabular datasets, mainly due to their interpretability. However, they lack the expressiveness needed to handle highly nonlinear or unstructured datasets. Motivated by recent…

Machine Learning · Computer Science 2024-10-30 Dimitris Bertsimas, Lisa Everest, Jiayi Gu, Matthew Peroni, Vasiliki Stoumpou

Empowering Decision Trees via Shape Function Branching

Decision trees are prized for their interpretability and strong performance on tabular data. Yet, their reliance on simple axis-aligned linear splits often forces deep, complex structures to capture non-linear feature effects, undermining…

Machine Learning · Computer Science 2025-10-23 Nakul Upadhya, Eldan Cohen

Learning Representations for Reasoning: Generalizing Across Diverse Structures

Reasoning, the ability to logically draw conclusions from existing knowledge, is a hallmark of humans. Together with perception, they constitute the two major themes of artificial intelligence. While deep learning has pushed the limit of…

Artificial Intelligence · Computer Science 2024-10-18 Zhaocheng Zhu

Compositional Generalization without Trees using Multiset Tagging and Latent Permutations

Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation. We phrase semantic parsing as a two-step…

Computation and Language · Computer Science 2023-05-29 Matthias Lindemann, Alexander Koller, Ivan Titov

Towards Emergent Language Symbolic Semantic Segmentation and Model Interpretability

Recent advances in methods focused on the grounding problem have resulted in techniques that can be used to construct a symbolic language associated with a specific domain. Inspired by how humans communicate complex ideas through language,…

Artificial Intelligence · Computer Science 2020-08-06 Alberto Santamaria-Pang, James Kubricht, Aritra Chowdhury, Chitresh Bhushan, Peter Tu

On Tree-Based Neural Sentence Modeling

Neural networks with tree-based sentence encoders have shown better results on many downstream tasks. Most existing tree-based encoders adopt syntactic parsing trees as the explicit structure prior. To study the effectiveness of…

Computation and Language · Computer Science 2018-08-30 Haoyue Shi, Hao Zhou, Jiaze Chen, Lei Li

Solving General Arithmetic Word Problems

This paper presents a novel approach to automatically solving arithmetic word problems. This is the first algorithmic approach that can handle arithmetic problems with multiple steps and operations, without depending on additional…

Computation and Language · Computer Science 2016-08-23 Subhro Roy, Dan Roth

Weak-to-Strong Generalization with Failure Trajectories: A Tree-based Approach to Elicit Optimal Policy in Strong Models

Weak-to-Strong generalization (W2SG) is a new trend to elicit the full capabilities of a strong model with supervision from a weak model. While existing W2SG studies focus on simple tasks like binary classification, we extend this paradigm…

Machine Learning · Computer Science 2026-03-10 Ruimeng Ye, Zihan Wang, Yang Xiao, Zinan Ling, Manling Li, Bo Hui

Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing

Sequence-to-Sequence (S2S) models have achieved remarkable success on various text generation tasks. However, learning complex structures with S2S models remains challenging as external neural modules and additional lexicons are often…

Computation and Language · Computer Science 2023-02-07 Han He, Jinho D. Choi

Generative Deep Neural Networks for Dialogue: A Short Review

Researchers have recently started investigating deep neural networks for dialogue applications. In particular, generative sequence-to-sequence (Seq2Seq) models have shown promising results for unstructured tasks, such as word-level dialogue…

Computation and Language · Computer Science 2016-11-21 Iulian Vlad Serban, Ryan Lowe, Laurent Charlin, Joelle Pineau

Binary Tree Arithmetic with Generalized Constructors

We describe arithmetic computations in terms of operations on some well known free algebras (S1S, S2S and ordered rooted binary trees) while emphasizing the common structure present in all them when seen as isomorphic with the set of…

Mathematical Software · Computer Science 2013-01-03 Paul Tarau

Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification

Long document classification presents challenges in capturing both local and global dependencies due to their extensive content and complex structure. Existing methods often struggle with token limits and fail to adequately model…

Computation and Language · Computer Science 2024-10-07 Sudipta Singha Roy, Xindi Wang, Robert E. Mercer, Frank Rudzicz

Sequence-to-Sequence Learning with Latent Neural Grammars

Sequence-to-sequence learning with neural networks has become the de facto standard for sequence prediction tasks. This approach typically models the local distribution over the next word with a powerful neural network that can condition on…

Computation and Language · Computer Science 2021-11-17 Yoon Kim

Sequence-to-Sequence Models for Data-to-Text Natural Language Generation: Word- vs. Character-based Processing and Output Diversity

We present a comparison of word-based and character-based sequence-to-sequence models for data-to-text natural language generation, which generate natural language descriptions for structured inputs. On the datasets of two recent generation…

Computation and Language · Computer Science 2018-10-12 Glorianna Jagfeld, Sabrina Jenne, Ngoc Thang Vu

Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings

We present a natural language generator based on the sequence-to-sequence approach that can be trained to produce natural language strings as well as deep syntax dependency trees from input dialogue acts, and we use it to directly compare…

Computation and Language · Computer Science 2017-09-18 Ondřej Dušek, Filip Jurčíček

Tackling Math Word Problems with Fine-to-Coarse Abstracting and Reasoning

Math Word Problems (MWP) is an important task that requires the ability of understanding and reasoning over mathematical text. Existing approaches mostly formalize it as a generation task by adopting Seq2Seq or Seq2Tree models to encode an…

Computation and Language · Computer Science 2022-05-18 Ailisi Li, Xueyao Jiang, Bang Liu, Jiaqing Liang, Yanghua Xiao

Disentangled Sequence to Sequence Learning for Compositional Generalization

There is mounting evidence that existing neural network models, in particular the very popular sequence-to-sequence architecture, struggle to systematically generalize to unseen compositions of seen components. We demonstrate that one of…

Computation and Language · Computer Science 2022-03-23 Hao Zheng, Mirella Lapata