Related papers: Generative Imagination Elevates Machine Translatio…

Neural Machine Translation with Phrase-Level Universal Visual Representations

Multimodal machine translation (MMT) aims to improve neural machine translation (NMT) with additional visual information, but most existing MMT methods require paired input of source sentence and image, which makes them suffer from shortage…

Computation and Language · Computer Science 2022-03-22 Qingkai Fang , Yang Feng

Learning language through pictures

We propose Imaginet, a model of learning visually grounded representations of language from coupled textual and visual input. The model consists of two Gated Recurrent Unit networks with shared word embeddings, and uses a multi-task…

Computation and Language · Computer Science 2015-06-22 Grzegorz Chrupała , Ákos Kádár , Afra Alishahi

ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation

Automatic evaluations for natural language generation (NLG) conventionally rely on token-level or embedding-level comparisons with text references. This differs from human language processing, for which visual imagination often improves…

Computation and Language · Computer Science 2023-02-16 Wanrong Zhu , Xin Eric Wang , An Yan , Miguel Eckstein , William Yang Wang

Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation

Visual information has been introduced for enhancing machine translation (MT), and its effectiveness heavily relies on the availability of large amounts of bilingual parallel sentence pairs with manual image annotations. In this paper, we…

Computation and Language · Computer Science 2025-01-07 Andong Chen , Yuchen Song , Kehai Chen , Muyun Yang , Tiejun Zhao , Min Zhang

Visualize Before You Write: Imagination-Guided Open-Ended Text Generation

Recent advances in text-to-image synthesis make it possible to visualize machine imaginations for a given context. On the other hand, when generating text, human writers are gifted at creative visualization, which enhances their writings by…

Computation and Language · Computer Science 2023-02-16 Wanrong Zhu , An Yan , Yujie Lu , Wenda Xu , Xin Eric Wang , Miguel Eckstein , William Yang Wang

Extending Multilingual Machine Translation through Imitation Learning

Despite the growing variety of languages supported by existing multilingual neural machine translation (MNMT) models, most of the world's languages are still being left behind. We aim to extend large-scale MNMT models to incorporate a new…

Computation and Language · Computer Science 2025-12-02 Wen Lai , Viktor Hangya , Yingli Shen , Alexander Fraser

Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective

Multimodal machine translation (MMT) aims to improve translation quality by equipping the source sentence with its corresponding image. Despite the promising performance, MMT models still suffer the problem of input degradation: models…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Baijun Ji , Tong Zhang , Yicheng Zou , Bojie Hu , Si Shen

Visually Grounded Word Embeddings and Richer Visual Features for Improving Multimodal Neural Machine Translation

In Multimodal Neural Machine Translation (MNMT), a neural model generates a translated sentence that describes an image, given the image itself and one source descriptions in English. This is considered as the multimodal image caption…

Computation and Language · Computer Science 2018-06-01 Jean-Benoit Delbrouck , Stéphane Dupont , Omar Seddati

Neural Machine Translation with Latent Semantic of Image and Text

Although attention-based Neural Machine Translation have achieved great success, attention-mechanism cannot capture the entire meaning of the source sentence because the attention mechanism generates a target word depending heavily on the…

Computation and Language · Computer Science 2016-11-28 Joji Toyama , Masanori Misono , Masahiro Suzuki , Kotaro Nakayama , Yutaka Matsuo

Generative Neural Machine Translation

We introduce Generative Neural Machine Translation (GNMT), a latent variable architecture which is designed to model the semantics of the source and target sentences. We modify an encoder-decoder translation model by adding a latent…

Computation and Language · Computer Science 2018-06-14 Harshil Shah , David Barber

Multimodal Neural Machine Translation with Search Engine Based Image Retrieval

Recently, numbers of works shows that the performance of neural machine translation (NMT) can be improved to a certain extent with using visual information. However, most of these conclusions are drawn from the analysis of experimental…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 ZhenHao Tang , XiaoBing Zhang , Zi Long , XiangHua Fu

Simultaneous Machine Translation with Visual Context

Simultaneous machine translation (SiMT) aims to translate a continuous input text stream into another language with the lowest latency and highest quality possible. The translation thus has to start with an incomplete source text, which is…

Computation and Language · Computer Science 2020-10-14 Ozan Caglayan , Julia Ive , Veneta Haralampieva , Pranava Madhyastha , Loïc Barrault , Lucia Specia

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

In this work, we investigate a more realistic unsupervised multimodal machine translation (UMMT) setup, inference-time image-free UMMT, where the model is trained with source-text image pairs, and tested with only source-text inputs. First,…

Computation and Language · Computer Science 2023-05-26 Hao Fei , Qian Liu , Meishan Zhang , Min Zhang , Tat-Seng Chua

Unsupervised Multi-modal Neural Machine Translation

Unsupervised neural machine translation (UNMT) has recently achieved remarkable results with only large monolingual corpora in each language. However, the uncertainty of associating target with source sentences makes UNMT theoretically an…

Computer Vision and Pattern Recognition · Computer Science 2019-05-28 Yuanhang Su , Kai Fan , Nguyen Bach , C. -C. Jay Kuo , Fei Huang

Neural Machine Translation for Code Generation

Neural machine translation (NMT) methods developed for natural language processing have been shown to be highly successful in automating translation from one natural language to another. Recently, these NMT methods have been adapted to the…

Computation and Language · Computer Science 2023-05-24 Dharma KC , Clayton T. Morrison

An empirical study on the effectiveness of images in Multimodal Neural Machine Translation

In state-of-the-art Neural Machine Translation (NMT), an attention mechanism is used during decoding to enhance the translation. At every step, the decoder uses this mechanism to focus on different parts of the source sentence to gather the…

Computation and Language · Computer Science 2018-05-31 Jean-Benoit Delbrouck , Stéphane Dupont

IMAGINE: Image Synthesis by Image-Guided Model Inversion

We introduce an inversion based method, denoted as IMAge-Guided model INvErsion (IMAGINE), to generate high-quality and diverse images from only a single training sample. We leverage the knowledge of image semantics from a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2021-04-14 Pei Wang , Yijun Li , Krishna Kumar Singh , Jingwan Lu , Nuno Vasconcelos

From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots

The neural machine translation model has suffered from the lack of large-scale parallel corpora. In contrast, we humans can learn multi-lingual translations even without parallel texts by referring our languages to the external world. To…

Computation and Language · Computer Science 2019-06-04 Shizhe Chen , Qin Jin , Jianlong Fu

Human-like machine thinking: Language guided imagination

Human thinking requires the brain to understand the meaning of language expression and to properly organize the thoughts flow using the language. However, current natural language processing models are primarily limited in the word…

Computation and Language · Computer Science 2019-06-04 Feng Qi , Wenchuan Wu

Improving Neural Machine Translation with Pre-trained Representation

Monolingual data has been demonstrated to be helpful in improving the translation quality of neural machine translation (NMT). The current methods stay at the usage of word-level knowledge, such as generating synthetic parallel data or…

Computation and Language · Computer Science 2019-08-22 Rongxiang Weng , Heng Yu , Shujian Huang , Weihua Luo , Jiajun Chen