Related papers: Multilingual Coreference Resolution with Harmonize…

Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD

Coreference resolution, the task of identifying expressions in text that refer to the same entity, is a critical component in various natural language processing applications. This paper presents a novel end-to-end neural coreference…

Computation and Language · Computer Science 2024-12-30 Ondřej Pražák , Miloslav Konopík , Pavel Král

Investigating Multilingual Coreference Resolution by Universal Annotations

Multilingual coreference resolution (MCR) has been a long-standing and challenging task. With the newly proposed multilingual coreference dataset, CorefUD (Nedoluzhko et al., 2022), we conduct an investigation into the task by using its…

Computation and Language · Computer Science 2023-10-30 Haixia Chai , Michael Strube

Findings of the Third Shared Task on Multilingual Coreference Resolution

The paper presents an overview of the third edition of the shared task on multilingual coreference resolution, held as part of the CRAC 2024 workshop. Similarly to the previous two editions, the participants were challenged to develop…

Computation and Language · Computer Science 2025-08-07 Michal Novák , Barbora Dohnalová , Miloslav Konopík , Anna Nedoluzhko , Martin Popel , Ondřej Pražák , Jakub Sido , Milan Straka , Zdeněk Žabokrtský , Daniel Zeman

Findings of the Shared Task on Multilingual Coreference Resolution

This paper presents an overview of the shared task on multilingual coreference resolution associated with the CRAC 2022 workshop. Shared task participants were supposed to develop trainable systems capable of identifying mentions and…

Computation and Language · Computer Science 2022-09-19 Zdeněk Žabokrtský , Miloslav Konopík , Anna Nedoluzhko , Michal Novák , Maciej Ogrodniczuk , Martin Popel , Ondřej Pražák , Jakub Sido , Daniel Zeman , Yilun Zhu

Findings of the Fifth Shared Task on Multilingual Coreference Resolution: Expanding Datasets for Long-Range Entities

This paper describes the fifth edition of the Shared Task on Multilingual Coreference Resolution, held in conjunction with the CODI-CRAC 2026 workshop. Building on previous iterations, the task required participants to develop systems…

Computation and Language · Computer Science 2026-05-21 Michal Novák , Miloslav Konopík , Anna Nedoluzhko , Martin Popel , Ondřej Pražák , Jakub Sido , Milan Straka , Zdeněk Žabokrtský , Daniel Zeman

Ensemble Transfer Learning for Multilingual Coreference Resolution

Entity coreference resolution is an important research problem with many applications, including information extraction and question answering. Coreference resolution for English has been studied extensively. However, there is relatively…

Computation and Language · Computer Science 2023-01-24 Tuan Manh Lai , Heng Ji

ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution

Large-scale, high-quality corpora are critical for advancing research in coreference resolution. However, existing datasets vary in their definition of coreferences and have been collected via complex and lengthy guidelines that are curated…

Computation and Language · Computer Science 2022-10-14 Ankita Gupta , Marzena Karpinska , Wenlong Zhao , Kalpesh Krishna , Jack Merullo , Luke Yeh , Mohit Iyyer , Brendan O'Connor

Coreference Resolution through a seq2seq Transition-Based System

Most recent coreference resolution systems use search algorithms over possible spans to identify mentions and resolve coreference. We instead present a coreference resolution system that uses a text-to-text (seq2seq) paradigm to predict…

Computation and Language · Computer Science 2022-11-23 Bernd Bohnet , Chris Alberti , Michael Collins

Are Large Language Models Robust Coreference Resolvers?

Recent work on extending coreference resolution across domains and languages relies on annotated data in both the target domain and language. At the same time, pre-trained large language models (LMs) have been reported to exhibit strong…

Computation and Language · Computer Science 2023-11-16 Nghia T. Le , Alan Ritter

Multilingual Coreference Resolution in Multiparty Dialogue

Existing multiparty dialogue datasets for entity coreference resolution are nascent, and many challenges are still unaddressed. We create a large-scale dataset, Multilingual Multiparty Coref (MMC), for this task based on TV transcripts. Due…

Computation and Language · Computer Science 2023-07-11 Boyuan Zheng , Patrick Xia , Mahsa Yarmohammadi , Benjamin Van Durme

Experiments with Universal CEFR Classification

The Common European Framework of Reference (CEFR) guidelines describe language proficiency of learners on a scale of 6 levels. While the description of CEFR guidelines is generic across languages, the development of automated proficiency…

Computation and Language · Computer Science 2018-04-19 Sowmya Vajjala , Taraka Rama

Evaluating and Improving the Coreference Capabilities of Machine Translation Models

Machine translation (MT) requires a wide range of linguistic capabilities, which current end-to-end models are expected to learn implicitly by observing aligned sentences in bilingual corpora. In this work, we ask: \emph{How well do MT…

Computation and Language · Computer Science 2023-02-17 Asaf Yehudai , Arie Cattan , Omri Abend , Gabriel Stanovsky

Findings of the Fourth Shared Task on Multilingual Coreference Resolution: Can LLMs Dethrone Traditional Approaches?

The paper presents an overview of the fourth edition of the Shared Task on Multilingual Coreference Resolution, organized as part of the CODI-CRAC 2025 workshop. As in the previous editions, participants were challenged to develop systems…

Computation and Language · Computer Science 2025-11-07 Michal Novák , Miloslav Konopík , Anna Nedoluzhko , Martin Popel , Ondřej Pražák , Jakub Sido , Milan Straka , Zdeněk Žabokrtský , Daniel Zeman

On Efficiently Acquiring Annotations for Multilingual Models

When tasked with supporting multiple languages for a given problem, two approaches have arisen: training a model for each language with the annotation budget divided equally among them, and training on a high-resource language followed by…

Computation and Language · Computer Science 2022-04-05 Joel Ruben Antony Moniz , Barun Patra , Matthew R. Gormley

End-to-end Multilingual Coreference Resolution with Mention Head Prediction

This paper describes our approach to the CRAC 2022 Shared Task on Multilingual Coreference Resolution. Our model is based on a state-of-the-art end-to-end coreference resolution system. Apart from joined multilingual training, we improved…

Computation and Language · Computer Science 2022-09-27 Ondřej Pražák , Miloslav Konopík

CorefInst: Leveraging LLMs for Multilingual Coreference Resolution

Coreference Resolution (CR) is a crucial yet challenging task in natural language understanding, often constrained by task-specific architectures and encoder-based language models that demand extensive training and lack adaptability. This…

Computation and Language · Computer Science 2025-09-23 Tuğba Pamay Arslan , Emircan Erol , Gülşen Eryiğit

Parallel Data Helps Neural Entity Coreference Resolution

Coreference resolution is the task of finding expressions that refer to the same entity in a text. Coreference models are generally trained on monolingual annotated data but annotating coreference is expensive and challenging. Hardmeier et…

Computation and Language · Computer Science 2023-05-30 Gongbo Tang , Christian Hardmeier

Multilingual Coreference Resolution in Low-resource South Asian Languages

Coreference resolution involves the task of identifying text spans within a discourse that pertain to the same real-world entity. While this task has been extensively explored in the English language, there has been a notable scarcity of…

Computation and Language · Computer Science 2024-03-26 Ritwik Mishra , Pooja Desur , Rajiv Ratn Shah , Ponnurangam Kumaraguru

Light Coreference Resolution for Russian with Hierarchical Discourse Features

Coreference resolution is the task of identifying and grouping mentions referring to the same real-world entity. Previous neural models have mainly focused on learning span representations and pairwise scores for coreference decisions.…

Computation and Language · Computer Science 2024-02-07 Elena Chistova , Ivan Smirnov

CoUDA: Coherence Evaluation via Unified Data Augmentation

Coherence evaluation aims to assess the organization and structure of a discourse, which remains challenging even in the era of large language models. Due to the scarcity of annotated data, data augmentation is commonly used for training…

Computation and Language · Computer Science 2024-04-02 Dawei Zhu , Wenhao Wu , Yifan Song , Fangwei Zhu , Ziqiang Cao , Sujian Li