Related papers: Improving Minimal Gated Unit for Sequential Data

Minimal Gated Unit for Recurrent Neural Networks

Recently recurrent neural networks (RNN) has been very successful in handling sequence data. However, understanding RNN and finding the best practices for RNN is a difficult task, partly because there are many competing and complex hidden…

Neural and Evolutionary Computing · Computer Science 2016-04-01 Guo-Bing Zhou , Jianxin Wu , Chen-Lin Zhang , Zhi-Hua Zhou

Joint Optimization of Tokenization and Downstream Model

Since traditional tokenizers are isolated from a downstream task and model, they cannot output an appropriate tokenization depending on the task and model, although recent studies imply that the appropriate tokenization improves the…

Computation and Language · Computer Science 2021-05-27 Tatsuya Hiraoka , Sho Takase , Kei Uchiumi , Atsushi Keyaki , Naoaki Okazaki

Sequence Modeling using Gated Recurrent Neural Networks

In this paper, we have used Recurrent Neural Networks to capture and model human motion data and generate motions by prediction of the next immediate data point at each time-step. Our RNN is armed with recently proposed Gated Recurrent…

Neural and Evolutionary Computing · Computer Science 2015-01-05 Mohammad Pezeshki

Incremental Learning of Structured Memory via Closed-Loop Transcription

This work proposes a minimal computational model for learning structured memories of multiple object classes in an incremental setting. Our approach is based on establishing a closed-loop transcription between the classes and a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-08 Shengbang Tong , Xili Dai , Ziyang Wu , Mingyang Li , Brent Yi , Yi Ma

Towards Abstraction from Extraction: Multiple Timescale Gated Recurrent Unit for Summarization

In this work, we introduce temporal hierarchies to the sequence to sequence (seq2seq) model to tackle the problem of abstractive summarization of scientific articles. The proposed Multiple Timescale model of the Gated Recurrent Unit (MTGRU)…

Computation and Language · Computer Science 2016-07-05 Minsoo Kim , Moirangthem Dennis Singh , Minho Lee

Simplified Minimal Gated Unit Variations for Recurrent Neural Networks

Recurrent neural networks with various types of hidden units have been used to solve a diverse range of problems involving sequence data. Two of the most recent proposals, gated recurrent units (GRU) and minimal gated units (MGU), have…

Neural and Evolutionary Computing · Computer Science 2017-01-13 Joel Heck , Fathi M. Salem

Selective Encoding for Abstractive Sentence Summarization

We propose a selective encoding model to extend the sequence-to-sequence framework for abstractive sentence summarization. It consists of a sentence encoder, a selective gate network, and an attention equipped decoder. The sentence encoder…

Computation and Language · Computer Science 2017-07-31 Qingyu Zhou , Nan Yang , Furu Wei , Ming Zhou

Downstream Task-Oriented Neural Tokenizer Optimization with Vocabulary Restriction as Post Processing

This paper proposes a method to optimize tokenization for the performance improvement of already trained downstream models. Our method generates tokenization results attaining lower loss values of a given downstream model on the training…

Computation and Language · Computer Science 2023-04-24 Tatsuya Hiraoka , Tomoya Iwakura

Sequential minimum optimization algorithm with small sample size estimators

Sequential minimum optimization is a machine-learning global search training algorithm. It is applicable when the functional dependence of the cost function on a tunable parameter given the other parameters can be cheaply determined. This…

Quantum Physics · Physics 2023-03-03 Wojciech Roga , Takafumi Ono , Masahiro Takeoka

G-IFT: A Gated Linear Unit adapter with Iterative Fine-Tuning for Low-Resource Children's Speaker Verification

Speaker Verification (SV) systems trained on adults speech often underperform on children's SV due to the acoustic mismatch, and limited children speech data makes fine-tuning not very effective. In this paper, we propose an innovative…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-12 Vishwas M. Shetty , Jiusi Zheng , Abeer Alwan

A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

An important problem in machine auditory perception is to recognize and detect sound events. In this paper, we propose a sequential self-teaching approach to learning sounds. Our main proposition is that it is harder to learn sounds in…

Sound · Computer Science 2020-07-02 Anurag Kumar , Vamsi Krishna Ithapu

Gated Subspace Inference for Transformer Acceleration

A method is presented for accelerating inference in transformer language models by exploiting the low effective rank of the token activation manifold at each layer. The method decomposes each activation vector into a subspace component and…

Machine Learning · Computer Science 2026-05-06 Stephen J. Thomas

ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis

Representing speech as discretized units has numerous benefits in supporting downstream spoken language processing tasks. However, the approach has been less explored in speech synthesis of tonal languages like Mandarin Chinese. Our…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-04 Dehua Tao , Daxin Tan , Yu Ting Yeung , Xiao Chen , Tan Lee

Gated Recurrent Unit Based Acoustic Modeling with Future Context

The use of future contextual information is typically shown to be helpful for acoustic modeling. However, for the recurrent neural network (RNN), it's not so easy to model the future temporal context effectively, meanwhile keep lower model…

Computation and Language · Computer Science 2018-05-21 Jie Li , Xiaorui Wang , Yuanyuan Zhao , Yan Li

Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond

We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive…

Computation and Language · Computer Science 2023-11-14 Siyang Liu , Naihao Deng , Sahand Sabour , Yilin Jia , Minlie Huang , Rada Mihalcea

Iterative Batch Back-Translation for Neural Machine Translation: A Conceptual Model

An effective method to generate a large number of parallel sentences for training improved neural machine translation (NMT) systems is the use of back-translations of the target-side monolingual data. Recently, iterative back-translation…

Computation and Language · Computer Science 2020-12-11 Idris Abdulmumin , Bashir Shehu Galadanci , Abubakar Isa

GLoT: A Novel Gated-Logarithmic Transformer for Efficient Sign Language Translation

Machine Translation has played a critical role in reducing language barriers, but its adaptation for Sign Language Machine Translation (SLMT) has been less explored. Existing works on SLMT mostly use the Transformer neural network which…

Computation and Language · Computer Science 2025-02-19 Nada Shahin , Leila Ismail

Learning to Start for Sequence to Sequence Architecture

The sequence to sequence architecture is widely used in the response generation and neural machine translation to model the potential relationship between two sentences. It typically consists of two parts: an encoder that reads from the…

Computation and Language · Computer Science 2016-08-22 Qingfu Zhu , Weinan Zhang , Lianqiang Zhou , Ting Liu

Fusing Recency into Neural Machine Translation with an Inter-Sentence Gate Model

Neural machine translation (NMT) systems are usually trained on a large amount of bilingual sentence pairs and translate one sentence at a time, ignoring inter-sentence information. This may make the translation of a sentence ambiguous or…

Computation and Language · Computer Science 2018-06-13 Shaohui Kuang , Deyi Xiong

Temporally Correlated Task Scheduling for Sequence Learning

Sequence learning has attracted much research attention from the machine learning community in recent years. In many applications, a sequence learning task is usually associated with multiple temporally correlated auxiliary tasks, which are…

Computation and Language · Computer Science 2021-07-05 Xueqing Wu , Lewen Wang , Yingce Xia , Weiqing Liu , Lijun Wu , Shufang Xie , Tao Qin , Tie-Yan Liu