Related papers: Cross-concordances: terminology mapping and its ef…

Building a terminology network for search: the KoMoHe project

The paper reports about results on the GESIS-IZ project "Competence Center Modeling and Treatment of Semantic Heterogeneity" (KoMoHe). KoMoHe supervised a terminology mapping effort, in which 'cross-concordances' between major controlled…

Digital Libraries · Computer Science 2019-01-15 Philipp Mayr , Vivien Petras

Unification of multi-lingual scientific terminological resources using the ISO 16642 standard. The TermSciences initiative

This paper presents the TermSciences portal, which deals with the implementation of a conceptual model that uses the recent ISO 16642 standard (Terminological Markup Framework). This standard turns out to be suitable for concept modeling…

Computation and Language · Computer Science 2009-01-20 Majid Khayari , Stéphane Schneider , Isabelle Kramer , Laurent Romary , the termsciences Collaboration

Establishing a Multi-Thesauri-Scenario based on SKOS and Cross-Concordances

This case study proposes a scenario with three topic-related thesauri, which have been connected with bilateral cross-concordances as part of a major terminology mapping initiative in the project KoMoHe (Mayr & Petras, 2008). The thesauri…

Digital Libraries · Computer Science 2010-09-28 Philipp Mayr , Benjamin Zapilko , York Sure

InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling

Cross-lingual topic models have been prevalent for cross-lingual text analysis by revealing aligned latent topics. However, most existing methods suffer from producing repetitive topics that hinder further analysis and performance decline…

Computation and Language · Computer Science 2024-03-28 Xiaobao Wu , Xinshuai Dong , Thong Nguyen , Chaoqun Liu , Liangming Pan , Anh Tuan Luu

Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach

Cross-lingual summarization aims to bridge language barriers by summarizing documents in different languages. However, ensuring semantic coherence across languages is an overlooked challenge and can be critical in several contexts. To fill…

Computation and Language · Computer Science 2024-10-02 Diogo Pernes , Gonçalo M. Correia , Afonso Mendes

Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task

In this paper, we propose a new approach to learn multimodal multilingual embeddings for matching images and their relevant captions in two languages. We combine two existing objective functions to make images and captions close in a joint…

Computation and Language · Computer Science 2020-11-02 Alireza Mohammadshahi , Remi Lebret , Karl Aberer

Cross-lingual Models of Word Embeddings: An Empirical Comparison

Despite interest in using cross-lingual knowledge to learn word embeddings for various tasks, a systematic comparison of the possible approaches is lacking in the literature. We perform an extensive evaluation of four popular approaches of…

Computation and Language · Computer Science 2016-06-09 Shyam Upadhyay , Manaal Faruqui , Chris Dyer , Dan Roth

An Inter-lingual Reference Approach For Multi-Lingual Ontology Matching

Ontologies are considered as the backbone of the Semantic Web. With the rising success of the Semantic Web, the number of participating communities from different countries is constantly increasing. The growing number of ontologies…

Computation and Language · Computer Science 2013-09-27 Haytham Al-Feel , Ralph Schafermeier , Adrian Paschke

Exploiting multilingual nomenclatures and language-independent text features as an interlingua for cross-lingual text analysis applications

We are proposing a simple, but efficient basic approach for a number of multilingual and cross-lingual language technology applications that are not limited to the usual two or three languages, but that can be applied with relatively little…

Computation and Language · Computer Science 2007-05-23 Ralf Steinberger , Bruno Pouliquen , Camelia Ignat

A Survey on Cross-Lingual Summarization

Cross-lingual summarization is the task of generating a summary in one language (e.g., English) for the given document(s) in a different language (e.g., Chinese). Under the globalization background, this task has attracted increasing…

Computation and Language · Computer Science 2022-08-31 Jiaan Wang , Fandong Meng , Duo Zheng , Yunlong Liang , Zhixu Li , Jianfeng Qu , Jie Zhou

Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word Level Perspective

NLP research on aligning lexical representation spaces to one another has so far focused on aligning language spaces in their entirety. However, cognitive science has long focused on a local perspective, investigating whether translation…

Computation and Language · Computer Science 2024-10-11 Taelin Karidi , Eitan Grossman , Omri Abend

Understanding Cross-Lingual Alignment -- A Survey

Cross-lingual alignment, the meaningful similarity of representations across languages in multilingual language models, has been an active field of research in recent years. We survey the literature of techniques to improve cross-lingual…

Computation and Language · Computer Science 2024-06-12 Katharina Hämmerl , Jindřich Libovický , Alexander Fraser

A Survey Of Cross-lingual Word Embedding Models

Cross-lingual representations of words enable us to reason about word meaning in multilingual contexts and are a key facilitator of cross-lingual transfer when developing natural language processing models for low-resource languages. In…

Computation and Language · Computer Science 2019-10-08 Sebastian Ruder , Ivan Vulić , Anders Søgaard

Graph-Community Detection for Cross-Document Topic Segment Relationship Identification

In this paper we propose a graph-community detection approach to identify cross-document relationships at the topic segment level. Given a set of related documents, we automatically find these relationships by clustering segments with…

Computation and Language · Computer Science 2016-06-14 Pedro Mota , Maxine Eskenazi , Luisa Coheur

How Do Lexical Senses Correspond Between Spoken German and German Sign Language?

Sign language lexicographers construct bilingual dictionaries by establishing word-to-sign mappings, where polysemous and homonymous words corresponding to different signs across contexts are often underrepresented. A usage-based approach…

Computation and Language · Computer Science 2026-02-17 Melis Çelikkol , Wei Zhao

German Text Embedding Clustering Benchmark

This work introduces a benchmark assessing the performance of clustering German text embeddings in different domains. This benchmark is driven by the increasing use of clustering neural text embeddings in tasks that require the grouping of…

Computation and Language · Computer Science 2024-01-08 Silvan Wehrli , Bert Arnrich , Christopher Irrgang

Cross-Media Scientific Research Achievements Retrieval Based on Deep Language Model

Science and technology big data contain a lot of cross-media information.There are images and texts in the scientific paper.The s ingle modal search method cannot well meet the needs of scientific researchers.This paper proposes a…

Information Retrieval · Computer Science 2022-03-30 Benzhi Wang , Meiyu Liang , Feifei Kou , Mingying Xu

Getting Started with Neural Models for Semantic Matching in Web Search

The vocabulary mismatch problem is a long-standing problem in information retrieval. Semantic matching holds the promise of solving the problem. Recent advances in language technology have given rise to unsupervised neural models for…

Information Retrieval · Computer Science 2016-11-11 Kezban Dilek Onal , Ismail Sengor Altingovde , Pinar Karagoz , Maarten de Rijke

Advancing the Database of Cross-Linguistic Colexifications with New Workflows and Data

Lexical resources are crucial for cross-linguistic analysis and can provide new insights into computational models for natural language learning. Here, we present an advanced database for comparative studies of words with multiple meanings,…

Computation and Language · Computer Science 2025-08-22 Annika Tjuka , Robert Forkel , Christoph Rzymski , Johann-Mattis List

Co-creating a Transdisciplinary Map of Technology-mediated Harms, Risks and Vulnerabilities: Challenges, Ambivalences and Opportunities

The phrase "online harms" has emerged in recent years out of a growing political willingness to address the ethical and social issues associated with the use of the Internet and digital technology at large. The broad landscape that…

Human-Computer Interaction · Computer Science 2023-07-21 Andrés Domínguez Hernández , Kopo M. Ramokapane , Partha Das Chowdhury , Ola Michalec , Emily Johnstone , Emily Godwin , Alicia G Cork , Awais Rashid