Related papers: Mask-Align: Self-Supervised Neural Word Alignment

Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment

Word alignment, which aims to extract lexicon translation equivalents between source and target sentences, serves as a fundamental tool for natural language processing. Recent studies in this area have yielded substantial improvements by…

Computation and Language · Computer Science 2022-10-11 Siyu Lai, Zhen Yang, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou

MirrorAlign: A Super Lightweight Unsupervised Word Alignment Model via Cross-Lingual Contrastive Learning

Word alignment is essential for the downstream cross-lingual language understanding and generation tasks. Recently, the performance of the neural word alignment models has exceeded that of statistical models. However, they heavily rely on…

Computation and Language · Computer Science 2022-05-11 Di Wu, Liang Ding, Shuo Yang, Mingyang Li

Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment

The cross-lingual language models are typically pretrained with masked language modeling on multilingual text or parallel sentences. In this paper, we introduce denoising word alignment as a new cross-lingual pre-training task.…

Computation and Language · Computer Science 2021-09-14 Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang, Furu Wei

Third-Party Aligner for Neural Word Alignments

Word alignment is to find translationally equivalent words between source and target sentences. Previous work has demonstrated that self-training can achieve competitive word alignment results. In this paper, we propose to use word…

Computation and Language · Computer Science 2022-11-09 Jinpeng Zhang, Chuanqi Dong, Xiangyu Duan, Yuqi Zhang, Min Zhang

Neural Machine Translation with Supervised Attention

The attention mechanism is appealing for neural machine translation, since it is able to dynamically encode a source sentence by generating an alignment between a target word and source words. Unfortunately, it has been proved to be worse…

Computation and Language · Computer Science 2016-09-15 Lemao Liu, Masao Utiyama, Andrew Finch, Eiichiro Sumita

Alignment Attention by Matching Key and Query Distributions

The neural attention mechanism has been incorporated into deep neural networks to achieve state-of-the-art performance in various domains. Most such models use multi-head self-attention which is appealing for the ability to attend to…

Machine Learning · Computer Science 2021-10-26 Shujian Zhang, Xinjie Fan, Huangjie Zheng, Korawat Tanwisuth, Mingyuan Zhou

Supervised Attention in Sequence-to-Sequence Models for Speech Recognition

Attention mechanism in sequence-to-sequence models is designed to model the alignments between acoustic features and output tokens in speech recognition. However, attention weights produced by models trained end to end do not always…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-27 Gene-Ping Yang, Hao Tang

Neural Baselines for Word Alignment

Word alignments identify translational correspondences between words in a parallel sentence pair and are used, for instance, to learn bilingual dictionaries, to train statistical machine translation systems, or to perform quality…

Computation and Language · Computer Science 2020-09-29 Anh Khoa Ngo Ho, François Yvon

Unsupervised Attention-based Sentence-Level Meta-Embeddings from Contextualised Language Models

A variety of contextualised language models have been proposed in the NLP community, which are trained on diverse corpora to produce numerous Neural Language Models (NLMs). However, different NLMs have reported different levels of…

Computation and Language · Computer Science 2022-04-19 Keigo Takahashi, Danushka Bollegala

Contrastive Unsupervised Word Alignment with Non-Local Features

Word alignment is an important natural language processing task that indicates the correspondence between natural languages. Recently, unsupervised learning of log-linear models for word alignment has received considerable attention as it…

Computation and Language · Computer Science 2014-10-13 Yang Liu, Maosong Sun

Relaxing Positional Alignment in Masked Diffusion Language Models

Masked diffusion language models (MDLMs) have emerged as a promising alternative to dominant autoregressive approaches. Although they achieve competitive performance on several tasks, a substantial gap remains in open-ended text generation.…

Computation and Language · Computer Science 2026-02-02 Mengyu Ye, Ryosuke Takahashi, Keito Kudo, Jun Suzuki

Leveraging Neural Machine Translation for Word Alignment

The most common tools for word-alignment rely on a large amount of parallel sentences, which are then usually processed according to one of the IBM model algorithms. The training data is, however, the same as for machine translation (MT)…

Computation and Language · Computer Science 2021-04-01 Vilém Zouhar, Daria Pylypenko

Supervised Attentions for Neural Machine Translation

In this paper, we improve the attention or alignment accuracy of neural machine translation by utilizing the alignments of training sentence pairs. We simply compute the distance between the machine attentions and the "true" alignments, and…

Computation and Language · Computer Science 2016-08-02 Haitao Mi, Zhiguo Wang, Abe Ittycheriah

Multilevel Text Alignment with Cross-Document Attention

Text alignment finds application in tasks such as citation recommendation and plagiarism detection. Existing alignment methods operate at a single, predefined level and cannot learn to align texts at, for example, sentence and document…

Computation and Language · Computer Science 2020-10-06 Xuhui Zhou, Nikolaos Pappas, Noah A. Smith

Adding Interpretable Attention to Neural Translation Models Improves Word Alignment

Multi-layer models with multiple attention heads per layer provide superior translation quality compared to simpler and shallower models, but determining what source context is most relevant to each target word is more challenging as a…

Computation and Language · Computer Science 2019-02-01 Thomas Zenkel, Joern Wuebker, John DeNero

Progressive Self-Supervised Attention Learning for Aspect-Level Sentiment Analysis

In aspect-level sentiment classification (ASC), it is prevalent to equip dominant neural models with attention mechanisms, for the sake of acquiring the importance of each context word on the given aspect. However, such a mechanism tends to…

Computation and Language · Computer Science 2019-06-07 Jialong Tang, Ziyao Lu, Jinsong Su, Yubin Ge, Linfeng Song, Le Sun, Jiebo Luo

Unsupervised Cross-lingual Word Embedding by Multilingual Neural Language Models

We propose an unsupervised method to obtain cross-lingual embeddings without any parallel data or pre-trained word embeddings. The proposed model, which we call multilingual neural language models, takes sentences of multiple languages as…

Computation and Language · Computer Science 2018-09-10 Takashi Wada, Tomoharu Iwata

SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining

Vision-language models (VLMs) have made significant strides in cross-modal understanding through large-scale paired datasets. However, in the fashion domain, datasets often exhibit a disparity between the information conveyed in image and text.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon, Shunghyun Choi, Yeong Hyeon Gu

Learning Concepts, Not Tokens: Self-Supervised Semantic Alignment for Language Models

The next-token prediction (NTP) objective trains language models to predict a single token at each step, even though many continuations can express the same meaning. For example, in the sentence "this sticker can be placed here",…

Computation and Language · Computer Science 2026-05-26 Christine Zhang, Dan Jurafsky, Chen Shani

Refinement of Unsupervised Cross-Lingual Word Embeddings

Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing multilingual word representations to be learned even without using any direct bilingual signal. The lion's share of the methods are…

Computation and Language · Computer Science 2020-09-03 Magdalena Biesialska, Marta R. Costa-jussà