English
Related papers

Related papers: Improving Minimal Gated Unit for Sequential Data

200 papers

Recently recurrent neural networks (RNN) has been very successful in handling sequence data. However, understanding RNN and finding the best practices for RNN is a difficult task, partly because there are many competing and complex hidden…

Neural and Evolutionary Computing · Computer Science 2016-04-01 Guo-Bing Zhou , Jianxin Wu , Chen-Lin Zhang , Zhi-Hua Zhou

Since traditional tokenizers are isolated from a downstream task and model, they cannot output an appropriate tokenization depending on the task and model, although recent studies imply that the appropriate tokenization improves the…

Computation and Language · Computer Science 2021-05-27 Tatsuya Hiraoka , Sho Takase , Kei Uchiumi , Atsushi Keyaki , Naoaki Okazaki

In this paper, we have used Recurrent Neural Networks to capture and model human motion data and generate motions by prediction of the next immediate data point at each time-step. Our RNN is armed with recently proposed Gated Recurrent…

Neural and Evolutionary Computing · Computer Science 2015-01-05 Mohammad Pezeshki

This work proposes a minimal computational model for learning structured memories of multiple object classes in an incremental setting. Our approach is based on establishing a closed-loop transcription between the classes and a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-08 Shengbang Tong , Xili Dai , Ziyang Wu , Mingyang Li , Brent Yi , Yi Ma

In this work, we introduce temporal hierarchies to the sequence to sequence (seq2seq) model to tackle the problem of abstractive summarization of scientific articles. The proposed Multiple Timescale model of the Gated Recurrent Unit (MTGRU)…

Computation and Language · Computer Science 2016-07-05 Minsoo Kim , Moirangthem Dennis Singh , Minho Lee

Recurrent neural networks with various types of hidden units have been used to solve a diverse range of problems involving sequence data. Two of the most recent proposals, gated recurrent units (GRU) and minimal gated units (MGU), have…

Neural and Evolutionary Computing · Computer Science 2017-01-13 Joel Heck , Fathi M. Salem

We propose a selective encoding model to extend the sequence-to-sequence framework for abstractive sentence summarization. It consists of a sentence encoder, a selective gate network, and an attention equipped decoder. The sentence encoder…

Computation and Language · Computer Science 2017-07-31 Qingyu Zhou , Nan Yang , Furu Wei , Ming Zhou

This paper proposes a method to optimize tokenization for the performance improvement of already trained downstream models. Our method generates tokenization results attaining lower loss values of a given downstream model on the training…

Computation and Language · Computer Science 2023-04-24 Tatsuya Hiraoka , Tomoya Iwakura

Sequential minimum optimization is a machine-learning global search training algorithm. It is applicable when the functional dependence of the cost function on a tunable parameter given the other parameters can be cheaply determined. This…

Quantum Physics · Physics 2023-03-03 Wojciech Roga , Takafumi Ono , Masahiro Takeoka

Speaker Verification (SV) systems trained on adults speech often underperform on children's SV due to the acoustic mismatch, and limited children speech data makes fine-tuning not very effective. In this paper, we propose an innovative…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-12 Vishwas M. Shetty , Jiusi Zheng , Abeer Alwan

An important problem in machine auditory perception is to recognize and detect sound events. In this paper, we propose a sequential self-teaching approach to learning sounds. Our main proposition is that it is harder to learn sounds in…

Sound · Computer Science 2020-07-02 Anurag Kumar , Vamsi Krishna Ithapu

A method is presented for accelerating inference in transformer language models by exploiting the low effective rank of the token activation manifold at each layer. The method decomposes each activation vector into a subspace component and…

Machine Learning · Computer Science 2026-05-06 Stephen J. Thomas

Representing speech as discretized units has numerous benefits in supporting downstream spoken language processing tasks. However, the approach has been less explored in speech synthesis of tonal languages like Mandarin Chinese. Our…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-04 Dehua Tao , Daxin Tan , Yu Ting Yeung , Xiao Chen , Tan Lee

The use of future contextual information is typically shown to be helpful for acoustic modeling. However, for the recurrent neural network (RNN), it's not so easy to model the future temporal context effectively, meanwhile keep lower model…

Computation and Language · Computer Science 2018-05-21 Jie Li , Xiaorui Wang , Yuanyuan Zhao , Yan Li

We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive…

Computation and Language · Computer Science 2023-11-14 Siyang Liu , Naihao Deng , Sahand Sabour , Yilin Jia , Minlie Huang , Rada Mihalcea

An effective method to generate a large number of parallel sentences for training improved neural machine translation (NMT) systems is the use of back-translations of the target-side monolingual data. Recently, iterative back-translation…

Computation and Language · Computer Science 2020-12-11 Idris Abdulmumin , Bashir Shehu Galadanci , Abubakar Isa

Machine Translation has played a critical role in reducing language barriers, but its adaptation for Sign Language Machine Translation (SLMT) has been less explored. Existing works on SLMT mostly use the Transformer neural network which…

Computation and Language · Computer Science 2025-02-19 Nada Shahin , Leila Ismail

The sequence to sequence architecture is widely used in the response generation and neural machine translation to model the potential relationship between two sentences. It typically consists of two parts: an encoder that reads from the…

Computation and Language · Computer Science 2016-08-22 Qingfu Zhu , Weinan Zhang , Lianqiang Zhou , Ting Liu

Neural machine translation (NMT) systems are usually trained on a large amount of bilingual sentence pairs and translate one sentence at a time, ignoring inter-sentence information. This may make the translation of a sentence ambiguous or…

Computation and Language · Computer Science 2018-06-13 Shaohui Kuang , Deyi Xiong

Sequence learning has attracted much research attention from the machine learning community in recent years. In many applications, a sequence learning task is usually associated with multiple temporally correlated auxiliary tasks, which are…

Computation and Language · Computer Science 2021-07-05 Xueqing Wu , Lewen Wang , Yingce Xia , Weiqing Liu , Lijun Wu , Shufang Xie , Tao Qin , Tie-Yan Liu
‹ Prev 1 2 3 10 Next ›