Related papers: Differentiable Tree Operations Promote Composition…

Characterizing Intrinsic Compositionality in Transformers with Tree Projections

When trained on language data, do transformers learn some arbitrary computation that utilizes the full capacity of the architecture or do they learn a simpler, tree-like computation, hypothesized to underlie compositional meaning systems…

Computation and Language · Computer Science 2022-11-07 Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning

Terminating Differentiable Tree Experts

We advance the recently proposed neuro-symbolic Differentiable Tree Machine, which learns tree operations using a combination of transformers and Tensor Product Representations. We investigate the architecture and propose two key…

Machine Learning · Computer Science 2024-07-03 Jonathan Thomm, Michael Hersche, Giacomo Camposampiero, Aleksandar Terzić, Bernhard Schölkopf, Abbas Rahimi

Compositional Generalization Across Distributional Shifts with Sparse Tree Operations

Neural networks continue to struggle with compositional generalization, and this issue is exacerbated by a lack of massive pre-training. One successful approach for developing neural systems which exhibit human-like compositional…

Artificial Intelligence · Computer Science 2024-12-19 Paul Soulos, Henry Conklin, Mattia Opper, Paul Smolensky, Jianfeng Gao, Roland Fernandez

R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling

Human language understanding operates at multiple levels of granularity (e.g., words, phrases, and sentences) with increasing levels of abstraction that can be hierarchically combined. However, existing deep models with stacked layers do…

Computation and Language · Computer Science 2022-03-04 Xiang Hu, Haitao Mi, Zujie Wen, Yafang Wang, Yi Su, Jing Zheng, Gerard de Melo

Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision

Syntactic Language Models (SLMs) can be trained efficiently to reach relatively high performance; however, they have trouble with inference efficiency due to the explicit generation of syntactic structures. In this paper, we propose a new…

Computation and Language · Computer Science 2025-08-20 Ryo Yoshida, Taiga Someya, Yohei Oseki

Dynamic Compositional Neural Networks over Tree Structure

Tree-structured neural networks have proven to be effective in learning semantic representations by exploiting syntactic information. In spite of their success, most existing models suffer from the underfitting problem: they recursively use…

Computation and Language · Computer Science 2017-05-12 Pengfei Liu, Xipeng Qiu, Xuanjing Huang

Decision Tree Embedding by Leaf-Means

Decision trees and random forests remain highly competitive for classification on medium-sized, standard datasets due to their robustness, minimal preprocessing requirements, and interpretability. However, a single tree suffers from high…

Machine Learning · Statistics 2025-12-02 Cencheng Shen, Yuexiao Dong, Carey E. Priebe

Statistical Machine Translation by Generalized Parsing

Designers of statistical machine translation (SMT) systems have begun to employ tree-structured translation models. Systems involving tree-structured translation models tend to be complex. This article aims to reduce the conceptual…

Computation and Language · Computer Science 2007-05-23 I. Dan Melamed, Wei Wang

A Tree-based Decoder for Neural Machine Translation

Recent advances in Neural Machine Translation (NMT) show that adding syntactic information to NMT systems can improve the quality of their translations. Most existing work utilizes some specific types of linguistically-inspired tree…

Computation and Language · Computer Science 2018-08-29 Xinyi Wang, Hieu Pham, Pengcheng Yin, Graham Neubig

Forming Trees with Treeformers

Human language is known to exhibit a nested, hierarchical structure, allowing us to form complex sentences out of smaller pieces. However, many state-of-the-art neural network models such as Transformers have no explicit hierarchical…

Computation and Language · Computer Science 2023-07-12 Nilay Patel, Jeffrey Flanigan

Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs

We introduce a neural network that represents sentences by composing their words according to induced binary parse trees. We use Tree-LSTM as our composition function, applied along a tree structure found by a fully differentiable natural…

Computation and Language · Computer Science 2020-01-16 Jean Maillard, Stephen Clark, Dani Yogatama

Inducing Constituency Trees through Neural Machine Translation

Latent tree learning (LTL) methods learn to parse sentences using only indirect supervision from a downstream task. Recent advances in latent tree learning have made it possible to recover moderately high quality tree structures by training…

Computation and Language · Computer Science 2019-09-24 Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman

Tree-structured composition in neural networks without tree-structured architectures

Tree-structured neural networks encode a particular tree geometry for a sentence in the network design. However, these models have at best only slightly outperformed simpler sequence-based models. We hypothesize that neural sequence models…

Computation and Language · Computer Science 2015-11-10 Samuel R. Bowman, Christopher D. Manning, Christopher Potts

Learnable Tree Filter for Structure-preserving Feature Transform

Learning discriminative global features plays a vital role in semantic segmentation. And most of the existing methods adopt stacks of local convolutions or non-local blocks to capture long-range context. However, due to the absence of…

Computer Vision and Pattern Recognition · Computer Science 2019-09-30 Lin Song, Yanwei Li, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng

Towards Neural Machine Translation with Latent Tree Attention

Building models that take advantage of the hierarchical structure of language without a priori annotation is a longstanding goal in natural language processing. We introduce such a model for the task of machine translation, pairing a…

Computation and Language · Computer Science 2017-09-07 James Bradbury, Richard Socher

Inducing Transformer's Compositional Generalization Ability via Auxiliary Sequence Prediction Tasks

Systematic compositionality is an essential mechanism in human language, allowing the recombination of known parts to create novel expressions. However, existing neural models have been shown to lack this basic ability in learning symbolic…

Computation and Language · Computer Science 2021-10-01 Yichen Jiang, Mohit Bansal

A Novel Architecture for Symbolic Reasoning with Decision Trees and LLM Agents

We propose a hybrid architecture that integrates decision tree-based symbolic reasoning with the generative capabilities of large language models (LLMs) within a coordinated multi-agent framework. Unlike prior approaches that loosely couple…

Artificial Intelligence · Computer Science 2025-08-08 Andrew Kiruluta

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of…

Computation and Language · Computer Science 2015-06-02 Kai Sheng Tai, Richard Socher, Christopher D. Manning

Inherently Interpretable Tree Ensemble Learning

Tree ensemble models like random forests and gradient boosting machines are widely used in machine learning due to their excellent predictive performance. However, a high-performance ensemble consisting of a large number of decision trees…

Machine Learning · Statistics 2024-10-28 Zebin Yang, Agus Sudjianto, Xiaoming Li, Aijun Zhang

Dynamic Compositionality in Recursive Neural Networks with Structure-aware Tag Representations

Most existing recursive neural network (RvNN) architectures utilize only the structure of parse trees, ignoring syntactic tags which are provided as by-products of parsing. We present a novel RvNN architecture that can provide dynamic…

Computation and Language · Computer Science 2018-11-27 Taeuk Kim, Jihun Choi, Daniel Edmiston, Sanghwan Bae, Sang-goo Lee