Related papers: Graph Algorithms for Multiparallel Word Alignment

Graph Neural Networks for Multiparallel Word Alignment

After a period of decrease, interest in word alignments is increasing again for their usefulness in domains such as typological research, cross-lingual annotation projection, and machine translation. Generally, alignment algorithms only use…

Computation and Language · Computer Science 2022-08-11 Ayyoob Imani, Lütfi Kerem Şenel, Masoud Jalili Sabet, François Yvon, Hinrich Schütze

Word Alignment by Fine-tuning Embeddings on Parallel Corpora

Word alignment over parallel corpora has a wide variety of applications, including learning translation lexicons, cross-lingual transfer of language processing tools, and automatic evaluation or analysis of translation outputs. The great…

Computation and Language · Computer Science 2021-08-13 Zi-Yi Dou, Graham Neubig

Multilingual Word Embeddings using Multigraphs

We present a family of neural-network-inspired models for computing continuous word representations, specifically designed to exploit both monolingual and multilingual text. This framework allows us to perform unsupervised training of…

Computation and Language · Computer Science 2016-12-15 Radu Soricut, Nan Ding

Random Graph Matching with Improved Noise Robustness

Graph matching, also known as network alignment, refers to finding a bijection between the vertex sets of two given graphs so as to maximally align their edges. This fundamental computational problem arises frequently in multiple fields…

Data Structures and Algorithms · Computer Science 2021-08-10 Cheng Mao, Mark Rudelson, Konstantin Tikhomirov

A Survey of Embedding Space Alignment Methods for Language and Knowledge Graphs

Neural embedding approaches have become a staple in the fields of computer vision, natural language processing, and more recently, graph analytics. Given the pervasive nature of these algorithms, the natural question becomes how to exploit…

Computation and Language · Computer Science 2020-10-27 Alexander Kalinowski, Yuan An

A Model for Fine-Grained Alignment of Multilingual Texts

While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level in-between are a useful resource for many purposes.…

Computation and Language · Computer Science 2007-05-23 Lea Cyrus, Hendrik Feddes

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?

Word alignment has proven to benefit many-to-many neural machine translation (NMT). However, high-quality ground-truth bilingual dictionaries were used for pre-editing in previous methods, which are unavailable for most language pairs.…

Computation and Language · Computer Science 2022-04-27 Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context

Word embeddings, which represent a word as a point in a vector space, have become ubiquitous to several NLP tasks. A recent line of work uses bilingual (two languages) corpora to learn a different vector for each sense of a word, by…

Computation and Language · Computer Science 2017-06-27 Shyam Upadhyay, Kai-Wei Chang, Matt Taddy, Adam Kalai, James Zou

Inducing Alignment Structure with Gated Graph Attention Networks for Sentence Matching

Sentence matching is a fundamental task of natural language processing with various applications. Most recent approaches adopt attention-based neural models to build word- or phrase-level alignment between two sentences. However, these…

Computation and Language · Computer Science 2021-10-22 Peng Cui, Le Hu, Yuanchao Liu

Neural Baselines for Word Alignment

Word alignments identify translational correspondences between words in a parallel sentence pair and are used, for instance, to learn bilingual dictionaries, to train statistical machine translation systems, or to perform quality…

Computation and Language · Computer Science 2020-09-29 Anh Khoa Ngo Ho, François Yvon

Towards Neural Machine Translation with Partially Aligned Corpora

While neural machine translation (NMT) has become the new paradigm, the parameter optimization requires large-scale parallel data which is scarce in many domains and language pairs. In this paper, we address a new translation scenario in…

Computation and Language · Computer Science 2017-11-06 Yining Wang, Yang Zhao, Jiajun Zhang, Chengqing Zong, Zhengshan Xue

An Algorithm for Aligning Sentences in Bilingual Corpora Using Lexical Information

In this paper we describe an algorithm for aligning sentences with their translations in a bilingual corpus using lexical information of the languages. Existing efficient algorithms ignore word identities and consider only the sentence…

Computation and Language · Computer Science 2007-05-23 Akshar Bharati, V. Sriram, A. Vamshi Krishna, Rajeev Sangal, S. M. Bendre

Graph matching between bipartite and unipartite networks: to collapse, or not to collapse, that is the question

Graph matching consists of aligning the vertices of two unlabeled graphs in order to maximize the shared structure across networks; when the graphs are unipartite, this is commonly formulated as minimizing their edge disagreements. In this…

Machine Learning · Statistics 2021-04-13 Jesús Arroyo, Carey E. Priebe, Vince Lyzinski

Adaptative Bilingual Aligning Using Multilingual Sentence Embedding

In this paper, we present an adaptive bitextual alignment system called AIlign. This aligner relies on sentence embeddings to extract reliable anchor points that can guide the alignment path, even for texts whose parallelism is fragmentary…

Computation and Language · Computer Science 2024-03-19 Olivier Kraif

Emerging Language Spaces Learned From Massively Multilingual Corpora

Translations capture important information about languages that can be used as implicit supervision in learning linguistic properties and semantic representations. In an information-centric view, translated texts may be considered as…

Computation and Language · Computer Science 2018-02-02 Jörg Tiedemann

Enhance Graph Alignment for Large Language Models

Graph-structured data is prevalent in the real world. Recently, due to their powerful emergent capabilities, Large Language Models (LLMs) have shown promising performance in modeling graphs. The key to effectively applying LLMs on graphs is…

Computation and Language · Computer Science 2024-10-16 Haitong Luo, Xuying Meng, Suhang Wang, Tianxiang Zhao, Fali Wang, Hanyun Cao, Yujun Zhang

Conformal Prediction: A Theoretical Note and Benchmarking Transductive Node Classification in Graphs

Conformal prediction has become increasingly popular for quantifying the uncertainty associated with machine learning models. Recent work in graph uncertainty quantification has built upon this approach for conformal graph prediction. The…

Machine Learning · Computer Science 2025-05-21 Pranav Maneriker, Aditya T. Vadlamani, Anutam Srinivasan, Yuntian He, Ali Payani, Srinivasan Parthasarathy

A Sentence Meaning Based Alignment Method for Parallel Text Corpora Preparation

Text alignment is crucial to the accuracy of Machine Translation (MT) systems, some NLP tools or any other text processing tasks requiring bilingual data. This research proposes a language independent sentence alignment approach based on…

Computation and Language · Computer Science 2015-10-01 Krzysztof Wołk, Krzysztof Marasek

Exploring the Potential of Large Language Models for Heterophilic Graphs

Large language models (LLMs) have presented significant opportunities to enhance various machine learning applications, including graph neural networks (GNNs). By leveraging the vast open-world knowledge within LLMs, we can more effectively…

Machine Learning · Computer Science 2025-02-18 Yuxia Wu, Shujie Li, Yuan Fang, Chuan Shi

Multilingual Distributed Representations without Word Alignment

Distributed representations of meaning are a natural way to encode covariance relationships between words and phrases in NLP. By overcoming data sparsity problems, as well as providing information about semantic relatedness which is not…

Computation and Language · Computer Science 2014-03-21 Karl Moritz Hermann, Phil Blunsom