Related papers: Tiny Neural Models for Seq2Seq

Distilling Large Language Models into Tiny and Effective Students using pQRNN

Large pre-trained multilingual models like mBERT, XLM-R achieve state of the art results on language understanding tasks. However, they are not well suited for latency critical applications on both servers and edge devices. It's important…

Computation and Language · Computer Science 2021-01-25 Prabhu Kaliamoorthi , Aditya Siddhant , Edward Li , Melvin Johnson

Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog

Semantic parsing using sequence-to-sequence models allows parsing of deeper representations compared to traditional word tagging based models. In spite of these advantages, widespread adoption of these models for real-time conversational…

Computation and Language · Computer Science 2021-04-13 Arun Babu , Akshat Shrivastava , Armen Aghajanyan , Ahmed Aly , Angela Fan , Marjan Ghazvininejad

Seq2Seq-SC: End-to-End Semantic Communication Systems with Pre-trained Language Model

In this work, we propose a realistic semantic network called seq2seq-SC, designed to be compatible with 5G NR and capable of working with generalized text datasets using a pre-trained language model. The goal is to achieve unprecedented…

Signal Processing · Electrical Eng. & Systems 2023-10-19 Ju-Hyung Lee , Dong-Ho Lee , Eunsoo Sheen , Thomas Choi , Jay Pujara

Deconvolution-Based Global Decoding for Neural Machine Translation

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order. As the studies of linguistics have…

Computation and Language · Computer Science 2018-06-14 Junyang Lin , Xu Sun , Xuancheng Ren , Shuming Ma , Jinsong Su , Qi Su

Exploiting the Potential of Seq2Seq Models as Robust Few-Shot Learners

In-context learning, which offers substantial advantages over fine-tuning, is predominantly observed in decoder-only models, while encoder-decoder (i.e., seq2seq) models excel in methods that rely on weight updates. Recently, a few studies…

Computation and Language · Computer Science 2024-08-28 Jihyeon Lee , Dain Kim , Doohae Jung , Boseop Kim , Kyoung-Woon On

Improving AMR Parsing with Sequence-to-Sequence Pre-training

In the literature, the research on abstract meaning representation (AMR) parsing is much restricted by the size of human-curated dataset which is critical to build an AMR parser with good performance. To alleviate such data size…

Computation and Language · Computer Science 2020-10-06 Dongqin Xu , Junhui Li , Muhua Zhu , Min Zhang , Guodong Zhou

Deep Text-to-Speech System with Seq2Seq Model

Recent trends in neural network based text-to-speech/speech synthesis pipelines have employed recurrent Seq2seq architectures that can synthesize realistic sounding speech directly from text characters. These systems however have complex…

Computation and Language · Computer Science 2019-03-19 Gary Wang

Grammatical Sequence Prediction for Real-Time Neural Semantic Parsing

While sequence-to-sequence (seq2seq) models achieve state-of-the-art performance in many natural language processing tasks, they can be too slow for real-time applications. One performance bottleneck is predicting the most likely next token…

Computation and Language · Computer Science 2019-07-26 Chunyang Xiao , Christoph Teichmann , Konstantine Arkoudas

pNLP-Mixer: an Efficient all-MLP Architecture for Language

Large pre-trained language models based on transformer architecture have drastically changed the natural language processing (NLP) landscape. However, deploying those models for on-device applications in constrained devices such as smart…

Computation and Language · Computer Science 2023-05-26 Francesco Fusco , Damian Pascual , Peter Staar , Diego Antognini

Sequence-to-Sequence Spanish Pre-trained Language Models

In recent years, significant advancements in pre-trained language models have driven the creation of numerous non-English language variants, with a particular emphasis on encoder-only and decoder-only architectures. While Spanish language…

Computation and Language · Computer Science 2024-03-22 Vladimir Araujo , Maria Mihaela Trusca , Rodrigo Tufiño , Marie-Francine Moens

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation

The past year has witnessed rapid advances in sequence-to-sequence (seq2seq) modeling for Machine Translation (MT). The classic RNN-based approaches to MT were first out-performed by the convolutional seq2seq model, which was then…

Computation and Language · Computer Science 2018-04-30 Mia Xu Chen , Orhan Firat , Ankur Bapna , Melvin Johnson , Wolfgang Macherey , George Foster , Llion Jones , Niki Parmar , Mike Schuster , Zhifeng Chen , Yonghui Wu , Macduff Hughes

QNet: A Quantum-native Sequence Encoder Architecture

This work proposes QNet, a novel sequence encoder model that entirely inferences on the quantum computer using a minimum number of qubits. Let $n$ and $d$ represent the length of the sequence and the embedding size, respectively. The…

Machine Learning · Computer Science 2023-08-29 Wei Day , Hao-Sheng Chen , Min-Te Sun

Text Classification for Task-based Source Code Related Questions

There is a key demand to automatically generate code for small tasks for developers. Websites such as StackOverflow provide a simplistic way by offering solutions in small snippets which provide a complete answer to whatever task question…

Software Engineering · Computer Science 2021-11-02 Sairamvinay Vijayaraghavan , Jinxiao Song , David Tomassi , Siddhartha Punj , Jailan Sabet

Efficient Machine Translation with a BiLSTM-Attention Approach

With the rapid development of Natural Language Processing (NLP) technology, the accuracy and efficiency of machine translation have become hot topics of research. This paper proposes a novel Seq2Seq model aimed at improving translation…

Computation and Language · Computer Science 2024-11-01 Yuxu Wu , Yiren Xing

Tied & Reduced RNN-T Decoder

Previous works on the Recurrent Neural Network-Transducer (RNN-T) models have shown that, under some conditions, it is possible to simplify its prediction network with little or no loss in recognition accuracy (arXiv:2003.07705 [eess.AS],…

Computation and Language · Computer Science 2021-09-17 Rami Botros , Tara N. Sainath , Robert David , Emmanuel Guzman , Wei Li , Yanzhang He

Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing

An effective recipe for building seq2seq, non-autoregressive, task-oriented parsers to map utterances to semantic frames proceeds in three steps: encoding an utterance $x$, predicting a frame's length |y|, and decoding a |y|-sized frame…

Computation and Language · Computer Science 2021-09-16 Akshat Shrivastava , Pierce Chuang , Arun Babu , Shrey Desai , Abhinav Arora , Alexander Zotov , Ahmed Aly

Top-down Tree Structured Decoding with Syntactic Connections for Neural Machine Translation and Parsing

The addition of syntax-aware decoding in Neural Machine Translation (NMT) systems requires an effective tree-structured neural network, a syntax-aware attention model and a language generation model that is sensitive to sentence structure.…

Computation and Language · Computer Science 2018-09-07 Jetic Gū , Hassan S. Shavarani , Anoop Sarkar

Logical Parsing from Natural Language Based on a Neural Translation Model

Semantic parsing has emerged as a significant and powerful paradigm for natural language interface and question answering systems. Traditional methods of building a semantic parser rely on high-quality lexicons, hand-crafted grammars and…

Computation and Language · Computer Science 2017-05-10 Liang Li , Pengyu Li , Yifan Liu , Tao Wan , Zengchang Qin

Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study

Neural Machine Translation (NMT) remains a formidable challenge, especially when dealing with low-resource languages. Pre-trained sequence-to-sequence (seq2seq) multi-lingual models, such as mBART-50, have demonstrated impressive…

Computation and Language · Computer Science 2024-07-10 Aniruddha Roy , Pretam Ray , Ayush Maheshwari , Sudeshna Sarkar , Pawan Goyal

Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem

The celebrated Seq2Seq technique and its numerous variants achieve excellent performance on many tasks such as neural machine translation, semantic parsing, and math word problem solving. However, these models either only consider input…

Computation and Language · Computer Science 2020-10-07 Shucheng Li , Lingfei Wu , Shiwei Feng , Fangli Xu , Fengyuan Xu , Sheng Zhong