Related papers: Randomized Deep Structured Prediction for Discours…

Learning Robust, Transferable Sentence Representations for Text Classification

Despite deep recurrent neural networks (RNNs) demonstrate strong performance in text classification, training RNN models are often expensive and requires an extensive collection of annotated data which may not be available. To overcome the…

Computation and Language · Computer Science 2018-10-02 Wasi Uddin Ahmad , Xueying Bai , Nanyun Peng , Kai-Wei Chang

Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation

Syntactic structures used to play a vital role in natural language processing (NLP), but since the deep learning revolution, NLP has been gradually dominated by neural models that do not consider syntactic structures in their design. One…

Computation and Language · Computer Science 2023-11-28 Haoyi Wu , Kewei Tu

Dependency Sensitive Convolutional Neural Networks for Modeling Sentences and Documents

The goal of sentence and document modeling is to accurately represent the meaning of sentences and documents for various Natural Language Processing tasks. In this work, we present Dependency Sensitive Convolutional Neural Networks (DSCNN)…

Computation and Language · Computer Science 2016-11-09 Rui Zhang , Honglak Lee , Dragomir Radev

Deep Structured Prediction with Nonlinear Output Transformations

Deep structured models are widely used for tasks like semantic segmentation, where explicit correlations between variables provide important prior information which generally helps to reduce the data needs of deep nets. However, current…

Machine Learning · Computer Science 2018-11-02 Colin Graber , Ofer Meshi , Alexander Schwing

Sentence Ordering and Coherence Modeling using Recurrent Neural Networks

Modeling the structure of coherent texts is a key NLP problem. The task of coherently organizing a given set of sentences has been commonly used to build and evaluate models that understand such structure. We propose an end-to-end…

Computation and Language · Computer Science 2017-12-25 Lajanugen Logeswaran , Honglak Lee , Dragomir Radev

Learning Constraints for Structured Prediction Using Rectifier Networks

Various natural language processing tasks are structured prediction problems where outputs are constructed with multiple interdependent decisions. Past work has shown that domain knowledge, framed as constraints over the output space, can…

Computation and Language · Computer Science 2020-06-03 Xingyuan Pan , Maitrey Mehta , Vivek Srikumar

Designing Interpretable Approximations to Deep Reinforcement Learning

In an ever expanding set of research and application areas, deep neural networks (DNNs) set the bar for algorithm performance. However, depending upon additional constraints such as processing power and execution time limits, or…

Machine Learning · Computer Science 2021-06-22 Nathan Dahlin , Krishna Chaitanya Kalagarla , Nikhil Naik , Rahul Jain , Pierluigi Nuzzo

Structured Prediction in NLP -- A survey

Over the last several years, the field of Structured prediction in NLP has had seen huge advancements with sophisticated probabilistic graphical models, energy-based networks, and its combination with deep learning-based approaches. This…

Computation and Language · Computer Science 2021-10-06 Chauhan Dev , Naman Biyani , Nirmal P. Suthar , Prashant Kumar , Priyanshu Agarwal

Modeling Bilingual Sentence Processing: Evaluating RNN and Transformer Architectures for Cross-Language Structural Priming

This study evaluates the performance of Recurrent Neural Network (RNN) and Transformer models in replicating cross-language structural priming, a key indicator of abstract grammatical representations in human language processing. Focusing…

Computation and Language · Computer Science 2024-10-17 Demi Zhang , Bushi Xiao , Chao Gao , Sangpil Youm , Bonnie J Dorr

Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks

There is a lot of research interest in encoding variable length sentences into fixed length vectors, in a way that preserves the sentence meanings. Two common methods include representations based on averaging word vectors, and…

Computation and Language · Computer Science 2017-02-10 Yossi Adi , Einat Kermany , Yonatan Belinkov , Ofer Lavi , Yoav Goldberg

Deep Sequence Models for Text Classification Tasks

The exponential growth of data generated on the Internet in the current information age is a driving force for the digital economy. Extraction of information is the major value in an accumulated big data. Big data dependency on statistical…

Computation and Language · Computer Science 2022-07-20 Saheed Salahudeen Abdullahi , Sun Yiming , Shamsuddeen Hassan Muhammad , Abdulrasheed Mustapha , Ahmad Muhammad Aminu , Abdulkadir Abdullahi , Musa Bello , Saminu Mohammad Aliyu

Unsupervised Learning of Explainable Parse Trees for Improved Generalisation

Recursive neural networks (RvNN) have been shown useful for learning sentence representations and helped achieve competitive performance on several natural language inference tasks. However, recent RvNN-based models fail to learn simple…

Computation and Language · Computer Science 2021-04-13 Atul Sahay , Ayush Maheshwari , Ritesh Kumar , Ganesh Ramakrishnan , Manjesh Kumar Hanawal , Kavi Arya

Evaluating Distributed Representations for Multi-Level Lexical Semantics: A Research Proposal

Modern neural networks (NNs), trained on extensive raw sentence data, construct distributed representations by compressing individual words into dense, continuous, high-dimensional vectors. These representations are expected to capture…

Computation and Language · Computer Science 2024-12-04 Zhu Liu

EduBERT: Pretrained Deep Language Models for Learning Analytics

The use of large pretrained neural networks to create contextualized word embeddings has drastically improved performance on several natural language processing (NLP) tasks. These computationally expensive models have begun to be applied to…

Computers and Society · Computer Science 2019-12-03 Benjamin Clavié , Kobi Gal

Modelling Sentence Pairs with Tree-structured Attentive Encoder

We describe an attentive encoder that combines tree-structured recursive neural networks and sequential recurrent neural networks for modelling sentence pairs. Since existing attentive models exert attention on the sequential structure, we…

Computation and Language · Computer Science 2016-10-11 Yao Zhou , Cong Liu , Yan Pan

Learning to Compose Words into Sentences with Reinforcement Learning

We use reinforcement learning to learn tree-structured neural networks for computing representations of natural language sentences. In contrast with prior work on tree-structured models in which the trees are either provided as input or…

Computation and Language · Computer Science 2016-11-29 Dani Yogatama , Phil Blunsom , Chris Dyer , Edward Grefenstette , Wang Ling

General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference

The state of the art on many NLP tasks is currently achieved by large pre-trained language models, which require a considerable amount of computation. We explore a setting where many different predictions are made on a single piece of text.…

Computation and Language · Computer Science 2020-04-30 Jingfei Du , Myle Ott , Haoran Li , Xing Zhou , Veselin Stoyanov

How Does a Deep Neural Network Look at Lexical Stress in English Words?

Despite their success in speech processing, neural networks often operate as black boxes, prompting the question: what informs their decisions, and how can we interpret them? This work examines this issue in the context of lexical stress. A…

Computation and Language · Computer Science 2026-02-13 Itai Allouche , Itay Asael , Rotem Rousso , Vered Dassa , Ann Bradlow , Seung-Eun Kim , Matthew Goldrick , Joseph Keshet

Assessing Phrasal Representation and Composition in Transformers

Deep transformer models have pushed performance on NLP tasks to new limits, suggesting sophisticated treatment of complex linguistic inputs, such as phrases. However, we have limited understanding of how these models handle representation…

Computation and Language · Computer Science 2020-10-15 Lang Yu , Allyson Ettinger

Discovering Useful Sentence Representations from Large Pretrained Language Models

Despite the extensive success of pretrained language models as encoders for building NLP systems, they haven't seen prominence as decoders for sequence generation tasks. We explore the question of whether these models can be adapted to be…

Computation and Language · Computer Science 2020-08-21 Nishant Subramani , Nivedita Suresh