English
Related papers

Related papers: Tiny Neural Models for Seq2Seq

200 papers

Large pre-trained multilingual models like mBERT, XLM-R achieve state of the art results on language understanding tasks. However, they are not well suited for latency critical applications on both servers and edge devices. It's important…

Computation and Language · Computer Science 2021-01-25 Prabhu Kaliamoorthi , Aditya Siddhant , Edward Li , Melvin Johnson

Semantic parsing using sequence-to-sequence models allows parsing of deeper representations compared to traditional word tagging based models. In spite of these advantages, widespread adoption of these models for real-time conversational…

Computation and Language · Computer Science 2021-04-13 Arun Babu , Akshat Shrivastava , Armen Aghajanyan , Ahmed Aly , Angela Fan , Marjan Ghazvininejad

In this work, we propose a realistic semantic network called seq2seq-SC, designed to be compatible with 5G NR and capable of working with generalized text datasets using a pre-trained language model. The goal is to achieve unprecedented…

Signal Processing · Electrical Eng. & Systems 2023-10-19 Ju-Hyung Lee , Dong-Ho Lee , Eunsoo Sheen , Thomas Choi , Jay Pujara

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order. As the studies of linguistics have…

Computation and Language · Computer Science 2018-06-14 Junyang Lin , Xu Sun , Xuancheng Ren , Shuming Ma , Jinsong Su , Qi Su

In-context learning, which offers substantial advantages over fine-tuning, is predominantly observed in decoder-only models, while encoder-decoder (i.e., seq2seq) models excel in methods that rely on weight updates. Recently, a few studies…

Computation and Language · Computer Science 2024-08-28 Jihyeon Lee , Dain Kim , Doohae Jung , Boseop Kim , Kyoung-Woon On

In the literature, the research on abstract meaning representation (AMR) parsing is much restricted by the size of human-curated dataset which is critical to build an AMR parser with good performance. To alleviate such data size…

Computation and Language · Computer Science 2020-10-06 Dongqin Xu , Junhui Li , Muhua Zhu , Min Zhang , Guodong Zhou

Recent trends in neural network based text-to-speech/speech synthesis pipelines have employed recurrent Seq2seq architectures that can synthesize realistic sounding speech directly from text characters. These systems however have complex…

Computation and Language · Computer Science 2019-03-19 Gary Wang

While sequence-to-sequence (seq2seq) models achieve state-of-the-art performance in many natural language processing tasks, they can be too slow for real-time applications. One performance bottleneck is predicting the most likely next token…

Computation and Language · Computer Science 2019-07-26 Chunyang Xiao , Christoph Teichmann , Konstantine Arkoudas

Large pre-trained language models based on transformer architecture have drastically changed the natural language processing (NLP) landscape. However, deploying those models for on-device applications in constrained devices such as smart…

Computation and Language · Computer Science 2023-05-26 Francesco Fusco , Damian Pascual , Peter Staar , Diego Antognini

In recent years, significant advancements in pre-trained language models have driven the creation of numerous non-English language variants, with a particular emphasis on encoder-only and decoder-only architectures. While Spanish language…

Computation and Language · Computer Science 2024-03-22 Vladimir Araujo , Maria Mihaela Trusca , Rodrigo Tufiño , Marie-Francine Moens

The past year has witnessed rapid advances in sequence-to-sequence (seq2seq) modeling for Machine Translation (MT). The classic RNN-based approaches to MT were first out-performed by the convolutional seq2seq model, which was then…

This work proposes QNet, a novel sequence encoder model that entirely inferences on the quantum computer using a minimum number of qubits. Let $n$ and $d$ represent the length of the sequence and the embedding size, respectively. The…

Machine Learning · Computer Science 2023-08-29 Wei Day , Hao-Sheng Chen , Min-Te Sun

There is a key demand to automatically generate code for small tasks for developers. Websites such as StackOverflow provide a simplistic way by offering solutions in small snippets which provide a complete answer to whatever task question…

Software Engineering · Computer Science 2021-11-02 Sairamvinay Vijayaraghavan , Jinxiao Song , David Tomassi , Siddhartha Punj , Jailan Sabet

With the rapid development of Natural Language Processing (NLP) technology, the accuracy and efficiency of machine translation have become hot topics of research. This paper proposes a novel Seq2Seq model aimed at improving translation…

Computation and Language · Computer Science 2024-11-01 Yuxu Wu , Yiren Xing

Previous works on the Recurrent Neural Network-Transducer (RNN-T) models have shown that, under some conditions, it is possible to simplify its prediction network with little or no loss in recognition accuracy (arXiv:2003.07705 [eess.AS],…

Computation and Language · Computer Science 2021-09-17 Rami Botros , Tara N. Sainath , Robert David , Emmanuel Guzman , Wei Li , Yanzhang He

An effective recipe for building seq2seq, non-autoregressive, task-oriented parsers to map utterances to semantic frames proceeds in three steps: encoding an utterance $x$, predicting a frame's length |y|, and decoding a |y|-sized frame…

Computation and Language · Computer Science 2021-09-16 Akshat Shrivastava , Pierce Chuang , Arun Babu , Shrey Desai , Abhinav Arora , Alexander Zotov , Ahmed Aly

The addition of syntax-aware decoding in Neural Machine Translation (NMT) systems requires an effective tree-structured neural network, a syntax-aware attention model and a language generation model that is sensitive to sentence structure.…

Computation and Language · Computer Science 2018-09-07 Jetic Gū , Hassan S. Shavarani , Anoop Sarkar

Semantic parsing has emerged as a significant and powerful paradigm for natural language interface and question answering systems. Traditional methods of building a semantic parser rely on high-quality lexicons, hand-crafted grammars and…

Computation and Language · Computer Science 2017-05-10 Liang Li , Pengyu Li , Yifan Liu , Tao Wan , Zengchang Qin

Neural Machine Translation (NMT) remains a formidable challenge, especially when dealing with low-resource languages. Pre-trained sequence-to-sequence (seq2seq) multi-lingual models, such as mBART-50, have demonstrated impressive…

Computation and Language · Computer Science 2024-07-10 Aniruddha Roy , Pretam Ray , Ayush Maheshwari , Sudeshna Sarkar , Pawan Goyal

The celebrated Seq2Seq technique and its numerous variants achieve excellent performance on many tasks such as neural machine translation, semantic parsing, and math word problem solving. However, these models either only consider input…

Computation and Language · Computer Science 2020-10-07 Shucheng Li , Lingfei Wu , Shiwei Feng , Fangli Xu , Fengyuan Xu , Sheng Zhong
‹ Prev 1 2 3 10 Next ›