Related papers: Enhancing Context Through Contrast

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?

Word alignment has proven to benefit many-to-many neural machine translation (NMT). However, high-quality ground-truth bilingual dictionaries were used for pre-editing in previous methods, which are unavailable for most language pairs.…

Computation and Language · Computer Science 2022-04-27 Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi

Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

Supervised learning in Neural Machine Translation (NMT) typically follows a teacher forcing paradigm where reference tokens constitute the conditioning context in the model's prediction, instead of its own previous predictions. In order to…

Computation and Language · Computer Science 2023-07-18 Nathaniel Berger, Miriam Exel, Matthias Huck, Stefan Riezler

Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings

Semantic representation learning for sentences is an important and well-studied problem in NLP. The current trend for this task involves training a Transformer-based sentence encoder through a contrastive objective with text, i.e.,…

Computation and Language · Computer Science 2022-09-21 Yiren Jian, Chongyang Gao, Soroush Vosoughi

A Large-Scale Test Set for the Evaluation of Context-Aware Pronoun Translation in Neural Machine Translation

The translation of pronouns presents a special challenge to machine translation to this day, since it often requires context outside the current sentence. Recent work on models that have access to information across sentence boundaries has…

Computation and Language · Computer Science 2019-03-07 Mathias Müller, Annette Rios, Elena Voita, Rico Sennrich

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation

Existing multilingual machine translation approaches mainly focus on English-centric directions, while the non-English directions still lag behind. In this work, we aim to build a many-to-many translation system with an emphasis on the…

Computation and Language · Computer Science 2021-07-23 Xiao Pan, Mingxuan Wang, Liwei Wu, Lei Li

Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment

Multilingual pretraining typically lacks explicit alignment signals, leading to suboptimal cross-lingual alignment in the representation space. In this work, we show that training standard pretrained models for cross-lingual alignment with…

Computation and Language · Computer Science 2026-02-26 Barah Fazili, Koustava Goswami

Neural Machine Translation with Contrastive Translation Memories

Retrieval-augmented Neural Machine Translation models have been successful in many translation scenarios. Different from previous works that make use of mutually similar but redundant translation memories (TMs), we propose a new…

Computation and Language · Computer Science 2022-12-07 Xin Cheng, Shen Gao, Lemao Liu, Dongyan Zhao, Rui Yan

Encouraging Neural Machine Translation to Satisfy Terminology Constraints

We present a new approach to encourage neural machine translation to satisfy lexical constraints. Our method acts at the training step, thereby avoiding the introduction of any extra computational overhead at the inference step. The proposed…

Computation and Language · Computer Science 2021-06-08 Melissa Ailem, Jinghsu Liu, Raheel Qader

Context-Aware Learning for Neural Machine Translation

Interest in larger-context neural machine translation, including document-level and multi-modal translation, has been growing. Multiple works have proposed new network architectures or evaluation schemes, but potentially helpful context is…

Computation and Language · Computer Science 2019-03-13 Sébastien Jean, Kyunghyun Cho

Multilingual Multi-modal Embeddings for Natural Language Processing

We propose a novel discriminative model that learns embeddings from multilingual and multi-modal data, meaning that our model can take advantage of images and descriptions in multiple languages to improve embedding quality. To that end, we…

Computation and Language · Computer Science 2017-02-06 Iacer Calixto, Qun Liu, Nick Campbell

Improving Neural Machine Translation with Pre-trained Representation

Monolingual data has been demonstrated to be helpful in improving the translation quality of neural machine translation (NMT). The current methods stay at the usage of word-level knowledge, such as generating synthetic parallel data or…

Computation and Language · Computer Science 2019-08-22 Rongxiang Weng, Heng Yu, Shujian Huang, Weihua Luo, Jiajun Chen

Mutual Information and Diverse Decoding Improve Neural Machine Translation

Sequence-to-sequence neural translation models learn semantic and syntactic relations between sentence pairs by optimizing the likelihood of the target given the source, i.e., $p(y|x)$, an objective that ignores other potentially useful…

Computation and Language · Computer Science 2016-03-24 Jiwei Li, Dan Jurafsky

Improving Language Model Integration for Neural Machine Translation

The integration of language models for neural machine translation has been extensively studied in the past. It has been shown that an external language model, trained on additional target-side monolingual data, can help improve translation…

Computation and Language · Computer Science 2023-06-09 Christian Herold, Yingbo Gao, Mohammad Zeineldeen, Hermann Ney

Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework

Contrastive learning has gained popularity and pushes state-of-the-art performance across numerous large-scale benchmarks. In contrastive learning, the contrastive loss function plays a pivotal role in discerning similarities between…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Haojin Deng, Yimin Yang

Contrastive Learning for Context-aware Neural Machine Translation Using Coreference Information

Context-aware neural machine translation (NMT) incorporates contextual information from surrounding texts, which can improve the translation quality of document-level machine translation. Many existing works on context-aware NMT have focused…

Computation and Language · Computer Science 2021-09-14 Yongkeun Hwang, Hyungu Yun, Kyomin Jung

Exploiting Cross-Sentence Context for Neural Machine Translation

In translation, considering the document as a whole can help to resolve ambiguities and inconsistencies. In this paper, we propose a cross-sentence context-aware approach and investigate the influence of historical contextual information on…

Computation and Language · Computer Science 2017-07-25 Longyue Wang, Zhaopeng Tu, Andy Way, Qun Liu

Improving In-context Learning of Multilingual Generative Language Models with Cross-lingual Alignment

Multilingual generative models obtain remarkable cross-lingual in-context learning capabilities through pre-training on large-scale corpora. However, they still exhibit a performance bias toward high-resource languages and learn isolated…

Computation and Language · Computer Science 2024-06-13 Chong Li, Shaonan Wang, Jiajun Zhang, Chengqing Zong

Does Neural Machine Translation Benefit from Larger Context?

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when…

Machine Learning · Statistics 2017-04-19 Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Learning semantically meaningful sentence embeddings is an open problem in natural language processing. In this work, we propose a sentence embedding learning approach that exploits both visual and textual information via a multimodal…

Computation and Language · Computer Science 2022-04-26 Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow

Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings

The many-to-many multilingual neural machine translation can translate between language pairs unseen during training, i.e., zero-shot translation. Improving zero-shot translation requires the model to learn universal representations and…

Computation and Language · Computer Science 2022-10-31 Shuhao Gu, Yang Feng