Related papers: chi2TeX Semi-automatic translation from chiwriter …

200 papers

In mathematics, LaTeX is the de facto standard to prepare documents, e.g., scientific publications. While some formulae are still developed using pen and paper, more complicated mathematical expressions are used more and more often with…

Information Retrieval · Computer Science 2020-12-01 André Greiner-Petter

We have implemented a machine translation system, the PolyMath Translator, for LaTeX documents containing mathematical text. The current implementation translates English LaTeX to French LaTeX, attaining a BLEU score of 53.5 on a held-out…

Computation and Language · Computer Science 2020-10-13 Aditya Ohri, Tanya Schmah

Can we improve machine translation (MT) with LLMs by rewriting their inputs automatically? Users commonly rely on the intuition that well-written text is easier to translate when using off-the-shelf MT systems. LLMs can rewrite text in many…

Computation and Language · Computer Science 2025-09-03 Dayeon Ki, Marine Carpuat

Mathematical documents written in LaTeX often contain ambiguities. We can resolve some of them via semantic markup using, e.g., sTeX, which also has other potential benefits, such as interoperability with computer algebra systems, proof…

Computation and Language · Computer Science 2024-08-12 Luka Vrečar, Joe Wells, Fairouz Kamareddine

We propose Text2Math, a model for semantically parsing text into math expressions. The model can be used to solve different math related problems including arithmetic word problems and equation parsing problems. Unlike previous approaches,…

Computation and Language · Computer Science 2019-10-16 Yanyan Zou, Wei Lu

Discourse phenomena in existing document-level translation datasets are sparse, which has been a fundamental obstacle in the development of context-aware machine translation models. Moreover, most existing document-level corpora and…

Computation and Language · Computer Science 2024-07-15 Linghao Jin, Li An, Xuezhe Ma

This paper presents a framework for semi-automatic transcription of large-scale historical handwritten documents and proposes a simple user-friendly text extractor tool, TexT for transcription. The proposed approach provides a quick and…

Digital Libraries · Computer Science 2018-02-22 Anders Hast, Per Cullhed, Ekta Vats

Despite the remarkable progress of modern machine translation (MT) systems on general-domain texts, translating structured LaTeX-formatted documents remains a significant challenge. These documents typically interleave natural language with…

Computation and Language · Computer Science 2026-03-12 Ziming Zhu, Chenglong Wang, Haosong Xv, Shunjie Xing, Yifu Huo, Fengning Tian, Quan Du, Di Yang, Chunliang Zhang, Tong Xiao, Jingbo Zhu

Professional translators often dictate their translations orally and have them typed afterwards. The TransTalk project aims at automating the second part of this process. Its originality as a dictation system lies in the fact that both the…

The goal of this project is to (i) accumulate annotated informal/formal mathematical corpora suitable for training semi-automated translation between informal and formal mathematics by statistical machine-translation methods, (ii) to…

Artificial Intelligence · Computer Science 2014-05-15 Cezary Kaliszyk, Josef Urban, Jiri Vyskocil, Herman Geuvers

In this paper we share several experiments trying to automatically translate informal mathematics into formal mathematics. In our context informal mathematics refers to human-written mathematical sentences in the LaTeX format; and formal…

Logic in Computer Science · Computer Science 2019-12-16 Qingxiang Wang, Chad Brown, Cezary Kaliszyk, Josef Urban

This paper discusses digital online mathematics examinations -- a discussion ranging from high school to university level examinations. In particular, we consider the nature of mathematical writing, what is distinctive about mathematical…

History and Overview · Mathematics 2026-05-26 Laura Kobel-Keller, Chris Sangwin

We tackle the problem of neural machine translation of mathematical formulae between ambiguous presentation languages and unambiguous content languages. Compared to neural machine translation on natural language, mathematical formulae have…

Computation and Language · Computer Science 2023-05-29 Felix Petersen, Moritz Schubotz, Andre Greiner-Petter, Bela Gipp

In this paper we present a step-by-step approach to long-form text translation, drawing on established processes in translation studies. Instead of viewing machine translation as a single, monolithic task, we propose a framework that…

Computation and Language · Computer Science 2024-09-12 Eleftheria Briakou, Jiaming Luo, Colin Cherry, Markus Freitag

A method is presented for automatically augmenting the bilingual lexicon of an existing Machine Translation system, by extracting bilingual entries from aligned bilingual text. The proposed method only relies on the resources already…

cmp-lg · Computer Science 2007-05-23 Davide Turcato

Speech-to-speech translation combines machine translation with speech synthesis, introducing evaluation challenges not present in either task alone. How to automatically evaluate speech-to-speech translation is an open question which has…

Computation and Language · Computer Science 2021-10-27 Elizabeth Salesky, Julian Mäder, Severin Klinger

Mined bitexts can contain imperfect translations that yield unreliable training signals for Neural Machine Translation (NMT). While filtering such pairs out is known to improve final model quality, we argue that it is suboptimal in…

Computation and Language · Computer Science 2022-06-01 Eleftheria Briakou, Sida I. Wang, Luke Zettlemoyer, Marjan Ghazvininejad

This paper proposes a procedure to execute external source codes from a LaTeX document and include the calculation outputs in the resulting Portable Document Format (pdf) file automatically. It integrates programming tools into the LaTeX…

Software Engineering · Computer Science 2021-06-29 Haim Bar, HaiYing Wang

Synthetic translations have been used for a wide range of NLP tasks primarily as a means of data augmentation. This work explores, instead, how synthetic translations can be used to revise potentially imperfect reference translations in…

Computation and Language · Computer Science 2022-03-16 Eleftheria Briakou, Marine Carpuat

Research in machine translation has been going on for the past 60 years, and new techniques are being developed in the field every day. As a result, we have witnessed the development of many automatic machine translators. A…

Computation and Language · Computer Science 2013-07-25 Nisheeth Joshi, Hemant Darbari, Iti Mathur