Related papers: New Confidence Measures for Statistical Machine Tr…

Using Source-Side Confidence Estimation for Reliable Translation into Unfamiliar Languages

We present an interactive machine translation (MT) system designed for users who are not proficient in the target language. It aims to improve trustworthiness and explainability by identifying potentially mistranslated words and allowing…

Computation and Language · Computer Science 2025-04-01 Kenneth J. Sible, David Chiang

Learning Confidence for Transformer-based Neural Machine Translation

Confidence estimation aims to quantify the confidence of the model prediction, providing an expectation of success. A well-calibrated confidence estimate enables accurate failure prediction and proper risk measurement when given noisy…

Computation and Language · Computer Science 2022-03-23 Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu, Mu Li

Evaluating Machine Translation Quality with Conformal Predictive Distributions

This paper presents a new approach for assessing uncertainty in machine translation by simultaneously evaluating translation quality and providing a reliable confidence score. Our approach utilizes conformal predictive distributions to…

Computation and Language · Computer Science 2023-06-05 Patrizio Giovannotti

Modeling Voting for System Combination in Machine Translation

System combination is an important technique for combining the hypotheses of different machine translation systems to improve translation performance. Although early statistical approaches to system combination have been proven effective in…

Computation and Language · Computer Science 2020-07-15 Xuancheng Huang, Jiacheng Zhang, Zhixing Tan, Derek F. Wong, Huanbo Luan, Jingfang Xu, Maosong Sun, Yang Liu

Exploring Prediction Uncertainty in Machine Translation Quality Estimation

Machine Translation Quality Estimation is a notoriously difficult task, which lessens its usefulness in real-world translation environments. Such scenarios can be improved if quality predictions are accompanied by a measure of uncertainty.…

Computation and Language · Computer Science 2016-07-01 Daniel Beck, Lucia Specia, Trevor Cohn

Can Automatic Metrics Assess High-Quality Translations?

Automatic metrics for evaluating translation quality are typically validated by measuring how well they correlate with human assessments. However, correlation methods tend to capture only the ability of metrics to differentiate between good…

Computation and Language · Computer Science 2024-10-11 Sweta Agrawal, António Farinhas, Ricardo Rei, André F. T. Martins

Conformalizing Machine Translation Evaluation

Several uncertainty estimation methods have been recently proposed for machine translation evaluation. While these methods can provide a useful indication of when not to trust model predictions, we show in this paper that the majority of…

Computation and Language · Computer Science 2023-06-13 Chrysoula Zerva, André F. T. Martins

Detecting Machine-Translated Paragraphs by Matching Similar Words

Machine-translated text plays an important role in modern life by smoothing communication from various communities using different languages. However, unnatural translation may lead to misunderstanding; a detector is thus needed to avoid…

Computation and Language · Computer Science 2019-04-25 Hoang-Quoc Nguyen-Son, Tran Phuong Thao, Seira Hidano, Shinsaku Kiyomoto

Metric for Automatic Machine Translation Evaluation based on Universal Sentence Representations

Sentence representations can capture a wide range of information that cannot be captured by local features based on character or word N-grams. This paper examines the usefulness of universal sentence representations for evaluating the…

Computation and Language · Computer Science 2018-05-22 Hiroki Shimanaka, Tomoyuki Kajiwara, Mamoru Komachi

Quality and Quantity of Machine Translation References for Automatic Metrics

Automatic machine translation metrics typically rely on human translations to determine the quality of system translations. Common wisdom in the field dictates that the human references should be of very high quality. However, there are no…

Computation and Language · Computer Science 2024-04-11 Vilém Zouhar, Ondřej Bojar

Analyzing Uncertainty in Neural Machine Translation

Machine translation is a popular test bed for research in neural sequence-to-sequence models but despite much recent research, there is still a lack of understanding of these models. Practitioners report performance degradation with large…

Computation and Language · Computer Science 2018-08-14 Myle Ott, Michael Auli, David Grangier, Marc'Aurelio Ranzato

A Measure of the System Dependence of Automated Metrics

Automated metrics for Machine Translation have made significant progress, with the goal of replacing expensive and time-consuming human evaluations. These metrics are typically assessed by their correlation with human judgments, which…

Computation and Language · Computer Science 2024-12-31 Pius von Däniken, Jan Deriu, Mark Cieliebak

Measuring Sentiment Bias in Machine Translation

Biases induced to text by generative models have become an increasingly large topic in recent years. In this paper we explore how machine translation might introduce a bias in sentiments as classified by sentiment analysis models. For this,…

Computation and Language · Computer Science 2023-06-13 Kai Hartung, Aaricia Herygers, Shubham Kurlekar, Khabbab Zakaria, Taylan Volkan, Sören Gröttrup, Munir Georges

Modeling Confidence in Sequence-to-Sequence Models

Recently, significant improvements have been achieved in various natural language processing tasks using neural sequence-to-sequence models. While aiming for the best generation quality is important, ultimately it is also necessary to…

Computation and Language · Computer Science 2019-10-07 Jan Niehues, Ngoc-Quan Pham

Intelligent Hybrid Man-Machine Translation Quality Estimation

Inferring evaluation scores based on human judgments is invaluable compared to using current evaluation metrics which are not suitable for real-time applications e.g. post-editing. However, these judgments are much more expensive to collect…

Computation and Language · Computer Science 2013-07-09 Ibrahim Sabek, Noha A. Yousri, Nagwa Elmakky, Mona Habib

Evaluating Machine Common Sense via Cloze Testing

Language models (LMs) show state of the art performance for common sense (CS) question answering, but whether this ability implies a human-level mastery of CS remains an open question. Understanding the limitations and strengths of LMs can…

Computation and Language · Computer Science 2022-01-21 Ehsan Qasemi, Lee Kezar, Jay Pujara, Pedro Szekely

Sentiment-Aware Measure (SAM) for Evaluating Sentiment Transfer by Machine Translation Systems

In translating text where sentiment is the main message, human translators give particular attention to sentiment-carrying words. The reason is that an incorrect translation of such words would miss the fundamental aspect of the source…

Computation and Language · Computer Science 2021-10-06 Hadeel Saadany, Constantin Orasan, Emad Mohamed, Ashraf Tantawy

Improving Back-Translation with Uncertainty-based Confidence Estimation

While back-translation is simple and effective in exploiting abundant monolingual corpora to improve low-resource neural machine translation (NMT), the synthetic bilingual corpora generated by NMT models trained on limited authentic…

Computation and Language · Computer Science 2019-09-04 Shuo Wang, Yang Liu, Chao Wang, Huanbo Luan, Maosong Sun

Calibrating Large Language Models with Sample Consistency

Accurately gauging the confidence level of Large Language Models' (LLMs) predictions is pivotal for their reliable application. However, LLMs are often uncalibrated inherently and elude conventional calibration techniques due to their…

Computation and Language · Computer Science 2026-02-24 Qing Lyu, Kumar Shridhar, Chaitanya Malaviya, Li Zhang, Yanai Elazar, Niket Tandon, Marianna Apidianaki, Mrinmaya Sachan, Chris Callison-Burch

Uncertainty Quantification for Evaluating Machine Translation Bias

The predictive uncertainty of machine translation (MT) models is typically used as a quality estimation proxy. In this work, we posit that apart from confidently translating when a single correct translation exists, models should also…

Computation and Language · Computer Science 2025-10-22 Ieva Raminta Staliūnaitė, Julius Cheng, Andreas Vlachos