Related papers: On Generalization in Coreference Resolution

Moving on from OntoNotes: Coreference Resolution Model Transfer

Academic neural models for coreference resolution (coref) are typically trained on a single dataset, OntoNotes, and model improvements are benchmarked on that same dataset. However, real-world applications of coref depend on the annotation…

Computation and Language · Computer Science 2021-10-04 Patrick Xia , Benjamin Van Durme

Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective

It is increasingly common to evaluate the same coreference resolution (CR) model on multiple datasets. Do these multi-dataset evaluations allow us to draw meaningful conclusions about model generalization? Or, do they rather reflect the…

Computation and Language · Computer Science 2024-06-19 Ian Porada , Alexandra Olteanu , Kaheer Suleman , Adam Trischler , Jackie Chi Kit Cheung

Adapting Coreference Resolution Models through Active Learning

Neural coreference resolution models trained on one dataset may not transfer to new, low-resource domains. Active learning mitigates this problem by sampling a small subset of data for annotators to label. While active learning is…

Computation and Language · Computer Science 2022-03-30 Michelle Yuan , Patrick Xia , Chandler May , Benjamin Van Durme , Jordan Boyd-Graber

Parallel Data Helps Neural Entity Coreference Resolution

Coreference resolution is the task of finding expressions that refer to the same entity in a text. Coreference models are generally trained on monolingual annotated data but annotating coreference is expensive and challenging. Hardmeier et…

Computation and Language · Computer Science 2023-05-30 Gongbo Tang , Christian Hardmeier

Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

The success of monocular depth estimation relies on large and diverse training sets. Due to the challenges associated with acquiring dense ground-truth depth across different environments at scale, a number of datasets with distinct…

Computer Vision and Pattern Recognition · Computer Science 2020-08-26 René Ranftl , Katrin Lasinger , David Hafner , Konrad Schindler , Vladlen Koltun

Graph Refinement for Coreference Resolution

The state-of-the-art models for coreference resolution are based on independent mention pair-wise decisions. We propose a modelling approach that learns coreference at the document-level and takes global decisions. For this purpose, we…

Computation and Language · Computer Science 2022-04-01 Lesly Miculicich , James Henderson

Mention Annotations Alone Enable Efficient Domain Adaptation for Coreference Resolution

Although recent neural models for coreference resolution have led to substantial improvements on benchmark datasets, transferring these models to new target domains containing out-of-vocabulary spans and requiring differing annotation…

Computation and Language · Computer Science 2023-06-01 Nupoor Gandhi , Anjalie Field , Emma Strubell

Coreference Resolution in Research Papers from Multiple Domains

Coreference resolution is essential for automatic text understanding to facilitate high-level information retrieval tasks such as text summarisation or question answering. Previous work indicates that the performance of state-of-the-art…

Information Retrieval · Computer Science 2021-01-05 Arthur Brack , Daniel Uwe Müller , Anett Hoppe , Ralph Ewerth

Transfer Learning with Reconstruction Loss

In most applications of utilizing neural networks for mathematical optimization, a dedicated model is trained for each specific optimization objective. However, in many scenarios, several distinct yet correlated objectives or tasks often…

Machine Learning · Computer Science 2024-04-15 Wei Cui , Wei Yu

Cross-Dataset Generalization in Deep Learning

Deep learning has been extensively used in various fields, such as phase imaging, 3D imaging reconstruction, phase unwrapping, and laser speckle reduction, particularly for complex problems that lack analytic models. Its data-driven nature…

Machine Learning · Computer Science 2024-10-16 Xuyu Zhang , Haofan Huang , Dawei Zhang , Songlin Zhuang , Shensheng Han , Puxiang Lai , Honglin Liu

No One Representation to Rule Them All: Overlapping Features of Training Methods

Despite being able to capture a range of features of the data, high accuracy models trained with supervision tend to make similar predictions. This seemingly implies that high-performing models share similar biases regardless of training…

Machine Learning · Computer Science 2022-04-27 Raphael Gontijo-Lopes , Yann Dauphin , Ekin D. Cubuk

Semi-supervised Classification using Attention-based Regularization on Coarse-resolution Data

Many real-world phenomena are observed at multiple resolutions. Predictive models designed to predict these phenomena typically consider different resolutions separately. This approach might be limiting in applications where predictions are…

Machine Learning · Computer Science 2020-01-07 Guruprasad Nayak , Rahul Ghosh , Xiaowei Jia , Varun Mithal , Vipin Kumar

Improving Span Representation for Domain-adapted Coreference Resolution

Recent work has shown fine-tuning neural coreference models can produce strong performance when adapting to different domains. However, at the same time, this can require a large amount of annotated target examples. In this work, we focus…

Machine Learning · Computer Science 2021-09-22 Nupoor Gandhi , Anjalie Field , Yulia Tsvetkov

Incorporating Singletons and Mention-based Features in Coreference Resolution via Multi-task Learning for Better Generalization

Previous attempts to incorporate a mention detection step into end-to-end neural coreference resolution for English have been hampered by the lack of singleton mention span data as well as other entity information. This paper presents a…

Computation and Language · Computer Science 2023-09-22 Yilun Zhu , Siyao Peng , Sameer Pradhan , Amir Zeldes

Towards a General Time Series Forecasting Model with Unified Representation and Adaptive Transfer

With the growing availability of multi-domain time series data, there is an increasing demand for general forecasting models pre-trained on multi-source datasets to support diverse downstream prediction scenarios. Existing time series…

Machine Learning · Computer Science 2025-09-09 Yihang Wang , Yuying Qiu , Peng Chen , Kai Zhao , Yang Shu , Zhongwen Rao , Lujia Pan , Bin Yang , Chenjuan Guo

Ensemble Transfer Learning for Multilingual Coreference Resolution

Entity coreference resolution is an important research problem with many applications, including information extraction and question answering. Coreference resolution for English has been studied extensively. However, there is relatively…

Computation and Language · Computer Science 2023-01-24 Tuan Manh Lai , Heng Ji

One Model To Learn Them All

Deep learning yields great results across many fields, from speech recognition, image classification, to translation. But for each problem, getting a deep model to work well involves research into the architecture and a long period of…

Machine Learning · Computer Science 2017-06-19 Lukasz Kaiser , Aidan N. Gomez , Noam Shazeer , Ashish Vaswani , Niki Parmar , Llion Jones , Jakob Uszkoreit

Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Recent works have found evidence of gender bias in models of machine translation and coreference resolution using mostly synthetic diagnostic datasets. While these quantify bias in a controlled experiment, they often do so on a small scale…

Computation and Language · Computer Science 2021-09-13 Shahar Levy , Koren Lazar , Gabriel Stanovsky

A Meta-Learning Approach for Custom Model Training

Transfer-learning and meta-learning are two effective methods to apply knowledge learned from large data sources to new tasks. In few-class, few-shot target task settings (i.e. when there are only a few classes and training examples…

Machine Learning · Computer Science 2019-02-11 Amir Erfan Eshratifar , Mohammad Saeed Abrishami , David Eigen , Massoud Pedram

MetaSets: Meta-Learning on Point Sets for Generalizable Representations

Deep learning techniques for point clouds have achieved strong performance on a range of 3D vision tasks. However, it is costly to annotate large-scale point sets, making it critical to learn generalizable representations that can transfer…

Computer Vision and Pattern Recognition · Computer Science 2022-04-18 Chao Huang , Zhangjie Cao , Yunbo Wang , Jianmin Wang , Mingsheng Long