Related papers: Cross-referencing using Fine-grained Topic Modelin…

Inferring Scientific Cross-Document Coreference and Hierarchy with Definition-Augmented Relational Reasoning

We address the fundamental task of inferring cross-document coreference and hierarchy in scientific texts, which has important applications in knowledge graph construction, search, recommendation and discovery. Large Language Models (LLMs)…

Computation and Language · Computer Science 2026-02-04 Lior Forer, Tom Hope

A Query-Driven Topic Model

Topic modeling is an unsupervised method for revealing the hidden semantic structure of a corpus. It has been increasingly widely adopted as a tool in the social sciences, including political science, digital humanities and sociological…

Information Retrieval · Computer Science 2022-01-12 Zheng Fang, Yulan He, Rob Procter

Distantly Labeling Data for Large Scale Cross-Document Coreference

Cross-document coreference, the problem of resolving entity mentions across multi-document collections, is crucial to automated knowledge base construction and data mining tasks. However, the scarcity of large labeled data sets has hindered…

Artificial Intelligence · Computer Science 2015-03-17 Sameer Singh, Michael Wick, Andrew McCallum

Model Transfer for Tagging Low-resource Languages using a Bilingual Dictionary

Cross-lingual model transfer is a compelling and popular method for predicting annotations in a low-resource language, whereby parallel corpora provide a bridge to a high-resource language and its associated annotated corpora. However,…

Computation and Language · Computer Science 2017-05-02 Meng Fang, Trevor Cohn

Learning Fine-grained Fact-Article Correspondence in Legal Cases

Automatically recommending relevant law articles to a given legal case has attracted much attention as it can greatly release human labor from searching over the large database of laws. However, current researches only support…

Computation and Language · Computer Science 2021-12-07 Jidong Ge, Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Wei Hu

ABCD-LINK: Annotation Bootstrapping for Cross-Document Fine-Grained Links

Understanding fine-grained links between documents is crucial for many applications, yet progress is limited by the lack of efficient methods for data curation. To address this limitation, we introduce a domain-agnostic framework for…

Computation and Language · Computer Science 2026-01-27 Serwar Basch, Ilia Kuznetsov, Tom Hope, Iryna Gurevych

A Cross-media Retrieval System for Lecture Videos

We propose a cross-media lecture-on-demand system, in which users can selectively view specific segments of lecture videos by submitting text queries. Users can easily formulate queries by using the textbook associated with a target…

Computation and Language · Computer Science 2007-05-23 Atsushi Fujii, Katunobu Itou, Tomoyosi Akiba, Tetsuya Ishikawa

CROSS-JEM: Accurate and Efficient Cross-encoders for Short-text Ranking Tasks

Ranking a set of items based on their relevance to a given query is a core problem in search and recommendation. Transformer-based ranking models are the state-of-the-art approaches for such tasks, but they score each query-item…

Information Retrieval · Computer Science 2024-09-17 Bhawna Paliwal, Deepak Saini, Mudit Dhawan, Siddarth Asokan, Nagarajan Natarajan, Surbhi Aggarwal, Pankaj Malhotra, Jian Jiao, Manik Varma

Sequential Cross-Document Coreference Resolution

Relating entities and events in text is a key component of natural language understanding. Cross-document coreference resolution, in particular, is important for the growing interest in multi-document analysis tasks. In this work we propose…

Computation and Language · Computer Science 2021-04-20 Emily Allaway, Shuai Wang, Miguel Ballesteros

Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference

Performing event and entity coreference resolution across documents vastly increases the number of candidate mentions, making it intractable to do the full $n^2$ pairwise comparisons. Existing approaches simplify by considering coreference…

Computation and Language · Computer Science 2023-05-29 William Held, Dan Iter, Dan Jurafsky

Coarse-grained Cross-lingual Alignment of Comparable Texts with Topic Models and Encyclopedic Knowledge

We present a method for coarse-grained cross-lingual alignment of comparable texts: segments consisting of contiguous paragraphs that discuss the same theme (e.g. history, economy) are aligned based on induced multilingual topics. The…

Computation and Language · Computer Science 2014-12-01 Vivi Nastase, Angela Fahrni

Multilevel Text Alignment with Cross-Document Attention

Text alignment finds application in tasks such as citation recommendation and plagiarism detection. Existing alignment methods operate at a single, predefined level and cannot learn to align texts at, for example, sentence and document…

Computation and Language · Computer Science 2020-10-06 Xuhui Zhou, Nikolaos Pappas, Noah A. Smith

A pipeline for matching bibliographic references with incomplete metadata: experiments with Crossref and OpenCitations

While Crossref makes available more than 1.8 billion bibliographic references from publications for which it provides a DOI, more than 698 million of these references do not specify a DOI, making the creation of a formal citation link from…

Digital Libraries · Computer Science 2025-11-25 Matteo Guenci, Ivan Heibi, Chiara Parravicini, Silvio Peroni, Marta Soricetti

Streamlining Cross-Document Coreference Resolution: Evaluation and Modeling

Recent evaluation protocols for Cross-document (CD) coreference resolution have often been inconsistent or lenient, leading to incomparable results across works and overestimation of performance. To facilitate proper future research on this…

Computation and Language · Computer Science 2020-10-26 Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan

Improving Fine-grained Entity Typing with Entity Linking

Fine-grained entity typing is a challenging problem since it usually involves a relatively large tag set and may require to understand the context of the entity mention. In this paper, we use entity linking to help with the fine-grained…

Computation and Language · Computer Science 2019-09-27 Hongliang Dai, Donghong Du, Xin Li, Yangqiu Song

Context-Dependent Fine-Grained Entity Type Tagging

Entity type tagging is the task of assigning category labels to each mention of an entity in a document. While standard systems focus on a small set of types, recent work (Ling and Weld, 2012) suggests that using a large fine-grained label…

Computation and Language · Computer Science 2016-08-03 Dan Gillick, Nevena Lazic, Kuzman Ganchev, Jesse Kirchner, David Huynh

Goal-based Course Recommendation

With cross-disciplinary academic interests increasing and academic advising resources over capacity, the importance of exploring data-assisted methods to support student decision making has never been higher. We build on the findings and…

Artificial Intelligence · Computer Science 2018-12-27 Weijie Jiang, Zachary A. Pardos, Qiang Wei

Identifying Reference Spans: Topic Modeling and Word Embeddings help IR

The CL-SciSumm 2016 shared task introduced an interesting problem: given a document D and a piece of text that cites D, how do we identify the text spans of D being referenced by the piece of text? The shared task provided the first…

Computation and Language · Computer Science 2017-08-11 Luis Moraes, Shahryar Baki, Rakesh Verma, Daniel Lee

Cross-topic Argument Mining from Heterogeneous Sources Using Attention-based Neural Networks

Argument mining is a core technology for automating argument search in large document collections. Despite its usefulness for this task, most current approaches to argument mining are designed for use only with specific text types and fall…

Computation and Language · Computer Science 2018-02-19 Christian Stab, Tristan Miller, Iryna Gurevych

Benchmark for Evaluation and Analysis of Citation Recommendation Models

Citation recommendation systems have attracted much academic interest, resulting in many studies and implementations. These systems help authors automatically generate proper citations by suggesting relevant references based on the text…

Information Retrieval · Computer Science 2024-12-11 Puja Maharjan