Related papers: Seq2Seq-Vis: A Visual Debugging Tool for Sequence-…

Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking

In this paper, we introduce a new sequence-to-sequence learning framework for RGB-based and multi-modal object tracking. First, we present SeqTrack for RGB-based tracking. It casts visual tracking as a sequence generation task, forecasting…

Computer Vision and Pattern Recognition · Computer Science 2024-03-28 Xin Chen, Ben Kang, Jiawen Zhu, Dong Wang, Houwen Peng, Huchuan Lu

Sequence-to-Sequence Learning as Beam-Search Optimization

Sequence-to-Sequence (seq2seq) modeling has rapidly become an important general-purpose NLP tool that has proven effective for many text-generation and sequence-labeling tasks. Seq2seq builds on deep neural language modeling and inherits…

Computation and Language · Computer Science 2016-11-11 Sam Wiseman, Alexander M. Rush

Neural Machine Translation and Sequence-to-sequence Models: A Tutorial

This tutorial introduces a new and powerful set of techniques variously called "neural machine translation" or "neural sequence-to-sequence models". These techniques have been used in a number of tasks regarding the handling of human…

Computation and Language · Computer Science 2017-03-07 Graham Neubig

Data2Vis: Automatic Generation of Data Visualizations Using Sequence to Sequence Recurrent Neural Networks

Rapidly creating effective visualizations using expressive grammars is challenging for users who have limited time and limited skills in statistics and data visualization. Even high-level, dedicated visualization tools often require users…

Human-Computer Interaction · Computer Science 2018-11-06 Victor Dibia, Çağatay Demiralp

Sequence-to-sequence neural network models for transliteration

Transliteration is a key component of machine translation systems and software internationalization. This paper demonstrates that neural sequence-to-sequence models obtain state of the art or close to state of the art results on existing…

Computation and Language · Computer Science 2016-11-01 Mihaela Rosca, Thomas Breuel

Sequence to Sequence Mixture Model for Diverse Machine Translation

Sequence to sequence (SEQ2SEQ) models often lack diversity in their generated translations. This can be attributed to the limitation of SEQ2SEQ models in capturing lexical and syntactic variations in a parallel corpus resulting from…

Computation and Language · Computer Science 2018-10-18 Xuanli He, Gholamreza Haffari, Mohammad Norouzi

Cseq2seq: Cyclic Sequence-to-Sequence Learning

The vanilla sequence-to-sequence learning (seq2seq) reads and encodes a source sequence into a fixed-length vector only once, suffering from its insufficiency in modeling structural correspondence between the source and target sequence.…

Computation and Language · Computer Science 2018-11-27 Biao Zhang, Deyi Xiong, Jinsong Su

Sequence-to-Sequence Contrastive Learning for Text Recognition

We propose a framework for sequence-to-sequence contrastive learning (SeqCLR) of visual representations, which we apply to text recognition. To account for the sequence-to-sequence structure, each feature map is divided into different…

Computer Vision and Pattern Recognition · Computer Science 2020-12-22 Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona

Grammatical Sequence Prediction for Real-Time Neural Semantic Parsing

While sequence-to-sequence (seq2seq) models achieve state-of-the-art performance in many natural language processing tasks, they can be too slow for real-time applications. One performance bottleneck is predicting the most likely next token…

Computation and Language · Computer Science 2019-07-26 Chunyang Xiao, Christoph Teichmann, Konstantine Arkoudas

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining

We introduce a novel sequence-to-sequence (seq2seq) voice conversion (VC) model based on the Transformer architecture with text-to-speech (TTS) pretraining. Seq2seq VC models are attractive owing to their ability to convert prosody. While…

Audio and Speech Processing · Electrical Eng. & Systems 2019-12-17 Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda

Sequential Modeling Enables Scalable Learning for Large Vision Models

We introduce a novel sequential modeling approach which enables learning a Large Vision Model (LVM) without making use of any linguistic data. To do this, we define a common format, "visual sentences", in which we can represent raw images…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Yutong Bai, Xinyang Geng, Karttikeya Mangalam, Amir Bar, Alan Yuille, Trevor Darrell, Jitendra Malik, Alexei A Efros

Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model

TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions. In this…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Yixuan Qiao, Hao Chen, Jun Wang, Shanshan Zhao, Yihao Chen, Xianbin Ye, Ziliang Li, Xianbiao Qi, Peng Gao, Guotong Xie

A Gray Box Interpretable Visual Debugging Approach for Deep Sequence Learning Model

Deep Learning algorithms are often used as black box type learning and they are too complex to understand. The widespread usability of Deep Learning algorithms to solve various machine learning problems demands deep and transparent…

Machine Learning · Computer Science 2018-11-21 Md Mofijul Islam, Amar Debnath, Tahsin Al Sayeed, Jyotirmay Nag Setu, Md Mahmudur Rahman, Md Sadman Sakib, Md Abdur Razzaque, Md. Mosaddek Khan, Swakkhar Shatabda

Deconvolution-Based Global Decoding for Neural Machine Translation

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order. As the studies of linguistics have…

Computation and Language · Computer Science 2018-06-14 Junyang Lin, Xu Sun, Xuancheng Ren, Shuming Ma, Jinsong Su, Qi Su

Predicting the time-evolution of multi-physics systems with sequence-to-sequence models

In this work, sequence-to-sequence (seq2seq) models, originally developed for language translation, are used to predict the temporal evolution of complex, multi-physics computer simulations. The predictive performance of seq2seq models is…

Machine Learning · Computer Science 2018-11-15 K. D. Humbird, J. L. Peterson, R. G. McClarren

Non-autoregressive Sequence-to-Sequence Vision-Language Models

Sequence-to-sequence vision-language models are showing promise, but their applicability is limited by their inference latency due to their autoregressive way of generating predictions. We propose a parallel decoding sequence-to-sequence…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Kunyu Shi, Qi Dong, Luis Goncalves, Zhuowen Tu, Stefano Soatto

Copy that! Editing Sequences by Copying Spans

Neural sequence-to-sequence models are finding increasing use in editing of documents, for example in correcting a text document or repairing source code. In this paper, we argue that common seq2seq models (with a facility to copy single…

Machine Learning · Computer Science 2020-12-15 Sheena Panthaplackel, Miltiadis Allamanis, Marc Brockschmidt

nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning

Text-to-Visualization (Text2VIS) enables users to create visualizations from natural language queries, making data insights more accessible. However, Text2VIS faces challenges in interpreting ambiguous queries, as users often express their…

Computation and Language · Computer Science 2026-01-06 Tianqi Luo, Chuhan Huang, Leixian Shen, Boyan Li, Shuyu Shen, Wei Zeng, Nan Tang, Yuyu Luo

CopyNext: Explicit Span Copying and Alignment in Sequence to Sequence Models

Copy mechanisms are employed in sequence to sequence models (seq2seq) to generate reproductions of words from the input to the output. These frameworks, operating at the lexical type level, fail to provide an explicit alignment that records…

Computation and Language · Computer Science 2020-10-30 Abhinav Singh, Patrick Xia, Guanghui Qin, Mahsa Yarmohammadi, Benjamin Van Durme

Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition

Encoder-decoder models have become an effective approach for sequence learning tasks like machine translation, image captioning and speech recognition, but have yet to show competitive results for handwritten text recognition. To this end,…

Computer Vision and Pattern Recognition · Computer Science 2019-07-16 Johannes Michael, Roger Labahn, Tobias Grüning, Jochen Zöllner