Related papers: FormNet: Structural Encoding beyond Sequential Mod…

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Document structure extraction has been a widely researched area for decades with recent works performing it as a semantic segmentation task over document images using fully-convolution networks. Such methods are limited by image resolution…

Machine Learning · Computer Science 2021-07-12 Milan Aggarwal , Hiresh Gupta , Mausoom Sarkar , Balaji Krishnamurthy

Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition

Encoder-decoder models have become an effective approach for sequence learning tasks like machine translation, image captioning and speech recognition, but have yet to show competitive results for handwritten text recognition. To this end,…

Computer Vision and Pattern Recognition · Computer Science 2019-07-16 Johannes Michael , Roger Labahn , Tobias Grüning , Jochen Zöllner

Finding the Needle in a Haystack: Unsupervised Rationale Extraction from Long Text Classifiers

Long-sequence transformers are designed to improve the representation of longer texts by language models and their performance on downstream document-level tasks. However, not much is understood about the quality of token-level predictions…

Computation and Language · Computer Science 2023-03-15 Kamil Bujel , Andrew Caines , Helen Yannakoudakis , Marek Rei

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Scaling sequence length has become a critical demand in the era of large language models. However, existing methods struggle with either computational complexity or model expressivity, rendering the maximum sequence length restricted. To…

Computation and Language · Computer Science 2023-07-20 Jiayu Ding , Shuming Ma , Li Dong , Xingxing Zhang , Shaohan Huang , Wenhui Wang , Nanning Zheng , Furu Wei

SemToken: Semantic-Aware Tokenization for Efficient Long-Context Language Modeling

Tokenization plays a critical role in language modeling, yet existing approaches such as Byte-Pair Encoding (BPE) or WordPiece operate purely on frequency statistics, ignoring the underlying semantic structure of text. This leads to…

Computation and Language · Computer Science 2025-08-22 Dong Liu , Yanxuan Yu

SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning

Synthesizing SQL queries from natural language is a long-standing open problem and has been attracting considerable interest recently. Toward solving the problem, the de facto approach is to employ a sequence-to-sequence-style model. Such…

Computation and Language · Computer Science 2017-11-15 Xiaojun Xu , Chang Liu , Dawn Song

A Sequence-to-Set Network for Nested Named Entity Recognition

Named entity recognition (NER) is a widely studied task in natural language processing. Recently, a growing number of studies have focused on the nested NER. The span-based methods, considering the entity recognition as a span…

Computation and Language · Computer Science 2021-06-22 Zeqi Tan , Yongliang Shen , Shuai Zhang , Weiming Lu , Yueting Zhuang

WebFormer: The Web-page Transformer for Structure Information Extraction

Structure information extraction refers to the task of extracting structured text fields from web pages, such as extracting a product offer from a shopping page including product title, description, brand and price. It is an important…

Computation and Language · Computer Science 2022-02-02 Qifan Wang , Yi Fang , Anirudh Ravula , Fuli Feng , Xiaojun Quan , Dongfang Liu

A Convolutional Attention Network for Extreme Summarization of Source Code

Attention mechanisms in neural networks have proved useful for problems in which the input and output do not have fixed dimension. Often there exist features that are locally translation invariant and would be valuable for directing the…

Machine Learning · Computer Science 2016-05-26 Miltiadis Allamanis , Hao Peng , Charles Sutton

Sentence Ordering and Coherence Modeling using Recurrent Neural Networks

Modeling the structure of coherent texts is a key NLP problem. The task of coherently organizing a given set of sentences has been commonly used to build and evaluate models that understand such structure. We propose an end-to-end…

Computation and Language · Computer Science 2017-12-25 Lajanugen Logeswaran , Honglak Lee , Dragomir Radev

FoR-Net: Learning to Focus on Hard Regions for Efficient Semantic Segmentation

We present FoR-Net, an efficient semantic segmentation framework that focuses on identifying and enhancing hard regions. Instead of relying on heavy global modeling, FoR-Net adopts an efficient strategy that selectively emphasizes…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Sheng-Wei Chan , Hsin-Jui Pan , Chun-Po Shen , Yung-Che Wang , Meng-Qian Li , Chia-Min Lin , Jen-Shiun Chiang

Structured Attention Networks

Attention networks have proven to be an effective approach for embedding categorical inference within a deep neural network. However, for many tasks we may want to model richer structural dependencies without abandoning end-to-end training.…

Computation and Language · Computer Science 2017-02-17 Yoon Kim , Carl Denton , Luong Hoang , Alexander M. Rush

End-to-End Hierarchical Relation Extraction for Generic Form Understanding

Form understanding is a challenging problem which aims to recognize semantic entities from the input document and their hierarchical relations. Previous approaches face significant difficulty dealing with the complexity of the task, thus…

Artificial Intelligence · Computer Science 2021-06-03 Tuan-Anh Nguyen Dang , Duc-Thanh Hoang , Quang-Bach Tran , Chih-Wei Pan , Thanh-Dat Nguyen

Semi-Structured Object Sequence Encoders

In this paper we explore the task of modeling semi-structured object sequences; in particular, we focus our attention on the problem of developing a structure-aware input representation for such sequences. Examples of such data include user…

Computer Vision and Pattern Recognition · Computer Science 2023-05-24 Rudra Murthy , Riyaz Bhat , Chulaka Gunasekara , Siva Sankalp Patel , Hui Wan , Tejas Indulal Dhamecha , Danish Contractor , Marina Danilevsky

TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling

In this paper, we describe the use of recurrent neural networks to capture sequential information from the self-attention representations to improve the Transformers. Although self-attention mechanism provides a means to exploit long…

Computation and Language · Computer Science 2021-04-06 Tze Yuang Chong , Xuyang Wang , Lin Yang , Junjie Wang

Online Segment to Segment Neural Transduction

We introduce an online neural sequence to sequence model that learns to alternate between encoding and decoding segments of the input as it is read. By independently tracking the encoding and decoding representations our algorithm permits…

Computation and Language · Computer Science 2016-09-28 Lei Yu , Jan Buys , Phil Blunsom

Topic-Aware Encoding for Extractive Summarization

Document summarization provides an instrument for faster understanding the collection of text documents and has several real-life applications. With the growth of online text data, numerous summarization models have been proposed recently.…

Computation and Language · Computer Science 2022-04-01 Mingyang Song , Liping Jing

Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model

A sequence-to-sequence model is a neural network module for mapping two sequences of different lengths. The sequence-to-sequence model has three core modules: encoder, decoder, and attention. Attention is the bridge that connects the…

Computation and Language · Computer Science 2018-07-24 Andros Tjandra , Sakriani Sakti , Satoshi Nakamura

MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models

Machine Comprehension (MC) is one of the core problems in natural language processing, requiring both understanding of the natural language and knowledge about the world. Rapid progress has been made since the release of several benchmark…

Computation and Language · Computer Science 2019-08-07 Boyuan Pan , Yazheng Yang , Hao Li , Zhou Zhao , Yueting Zhuang , Deng Cai , Xiaofei He

Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens

Can attention- or gradient-based visualization techniques be used to infer token-level labels for binary sequence tagging problems, using networks trained only on sentence-level labels? We construct a neural network architecture based on…

Computation and Language · Computer Science 2018-05-08 Marek Rei , Anders Søgaard