Related papers: Multi-view Subword Regularization

Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates

Subword units are an effective way to alleviate the open vocabulary problems in neural machine translation (NMT). While sentences are usually converted into unique subword sequences, subword segmentation is potentially ambiguous and…

Computation and Language · Computer Science 2018-05-01 Taku Kudo

Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation

Subword regularizations use multiple subword segmentations during training to improve the robustness of neural machine translation models. In previous subword regularizations, we use multiple segmentations in the training process but use…

Computation and Language · Computer Science 2022-03-28 Sho Takase, Tatsuya Hiraoka, Naoaki Okazaki

A Systematic Analysis of Subwords and Cross-Lingual Transfer in Multilingual Translation

Multilingual modelling can improve machine translation for low-resource languages, partly through shared subword representations. This paper studies the role of subword segmentation in cross-lingual transfer. We systematically compare the…

Computation and Language · Computer Science 2024-04-01 Francois Meyer, Jan Buys

Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition

Subwords are the most widely used output units in end-to-end speech recognition. They combine the best of two worlds by modeling the majority of frequent words directly and at the same time allow open vocabulary speech recognition by…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-11 Egor Lakomkin, Jahn Heymann, Ilya Sklyar, Simon Wiesler

Towards Robust Text-to-Image Person Retrieval: Multi-View Reformulation for Semantic Compensation

In text-to-image person retrieval tasks, the diversity of natural language expressions and the implicitness of visual semantics often lead to the problem of Expression Drift, where semantically equivalent texts exhibit significant feature…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Chao Yuan, Yujian Zhao, Haoxuan Xu, Guanglin Niu

Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models

While Multimodal Large Language Models (MLLMs) excel at vision-language tasks, the cost of their language-driven training on internal visual foundational competence remains unclear. In this paper, we conduct a detailed diagnostic analysis…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Enguang Wang, Qiang Wang, Yuanchen Wu, Ke Yan, Xinbin Yuan, Shouhong Ding, Xialei Liu, Ming-Ming Cheng

Learning to Scale Multilingual Representations for Vision-Language Tasks

Current multilingual vision-language models either require a large number of additional parameters for each supported language, or suffer performance degradation as languages are added. In this paper, we propose a Scalable Multilingual…

Computer Vision and Pattern Recognition · Computer Science 2020-08-31 Andrea Burns, Donghyun Kim, Derry Wijaya, Kate Saenko, Bryan A. Plummer

Regularization Advantages of Multilingual Neural Language Models for Low Resource Domains

Neural language modeling (LM) has led to significant improvements in several applications, including Automatic Speech Recognition. However, they typically require large amounts of training data, which is not available for many domains and…

Computation and Language · Computer Science 2019-06-05 Navid Rekabsaz, Nikolaos Pappas, James Henderson, Banriskhem K. Khonglah, Srikanth Madikeri

Refinement of Unsupervised Cross-Lingual Word Embeddings

Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing to learn multilingual word representations even without using any direct bilingual signal. The lion's share of the methods are…

Computation and Language · Computer Science 2020-09-03 Magdalena Biesialska, Marta R. Costa-jussà

Consistency Regularization for Cross-Lingual Fine-Tuning

Fine-tuning pre-trained cross-lingual language models can transfer task-specific supervision from one language to the others. In this work, we propose to improve cross-lingual fine-tuning with consistency regularization. Specifically, we…

Computation and Language · Computer Science 2021-06-16 Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei

Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation

This paper explores sentence-level multilingual Visual Speech Recognition (VSR) that can recognize different languages with a single trained model. As the massive multilingual modeling of visual data requires huge computational costs, we…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-19 Minsu Kim, Jeong Hun Yeo, Se Jin Park, Hyeongseop Rha, Yong Man Ro

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization

Multilingual sentence representations are the foundation for similarity-based bitext mining, which is crucial for scaling multilingual neural machine translation (NMT) system to more languages. In this paper, we introduce MuSR: a…

Computation and Language · Computer Science 2023-06-13 Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang

Maximum-Variance-Reduction Stratification for Improved Subsampling

Subsampling is a widely used and effective approach for addressing the computational challenges posed by massive datasets. Substantial progress has been made in developing non-uniform, probability-based subsampling schemes that prioritize…

Methodology · Statistics 2026-05-07 Dingyi Wang, Haiying Wang, Qingpei Hu

Accurate Word Representations with Universal Visual Guidance

Word representation is a fundamental component in neural language understanding models. Recently, pre-trained language models (PrLMs) offer a new performant method of contextualized word representations by leveraging the sequence-level…

Computation and Language · Computer Science 2021-01-01 Zhuosheng Zhang, Haojie Yu, Hai Zhao, Rui Wang, Masao Utiyama

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment

Despite the promising results of current cross-lingual models for spoken language understanding systems, they still suffer from imperfect cross-lingual representation alignments between the source and target languages, which makes the…

Computation and Language · Computer Science 2020-10-01 Zihan Liu, Genta Indra Winata, Peng Xu, Zhaojiang Lin, Pascale Fung

Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks

Unsupervised cross-lingual speech representation learning (XLSR) has recently shown promising results in speech recognition by leveraging vast amounts of unlabeled data across multiple languages. However, standard XLSR model suffers from…

Audio and Speech Processing · Electrical Eng. & Systems 2022-03-10 Yizhou Lu, Mingkun Huang, Xinghua Qu, Pengfei Wei, Zejun Ma

ReWE: Regressing Word Embeddings for Regularization of Neural Machine Translation Systems

Regularization of neural machine translation is still a significant problem, especially in low-resource settings. To mollify this problem, we propose regressing word embeddings (ReWE) as a new regularization technique in a system that is…

Computation and Language · Computer Science 2019-04-05 Inigo Jauregi Unanue, Ehsan Zare Borzeshi, Nazanin Esmaili, Massimo Piccardi

Margin-Based Regularization and Selective Sampling in Deep Neural Networks

We derive a new margin-based regularization formulation, termed multi-margin regularization (MMR), for deep neural networks (DNNs). The MMR is inspired by principles that were applied in margin analysis of shallow linear classifiers, e.g.,…

Machine Learning · Computer Science 2020-09-15 Berry Weinstein, Shai Fine, Yacov Hel-Or

Visual Agreement Regularized Training for Multi-Modal Machine Translation

Multi-modal machine translation aims at translating the source sentence into a different language in the presence of the paired image. Previous work suggests that additional visual information only provides dispensable help to translation,…

Computation and Language · Computer Science 2019-12-30 Pengcheng Yang, Boxing Chen, Pei Zhang, Xu Sun

MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation

Conditional masked language models (CMLM) have shown impressive progress in non-autoregressive machine translation (NAT). They learn the conditional translation model by predicting the random masked subset in the target sentence. Based on…

Computation and Language · Computer Science 2021-08-20 Pan Xie, Zexian Li, Xiaohui Hu