Related papers: Reducing semantic complexity in distributed Digita…

Treatment of Semantic Heterogeneity in Information Retrieval

The first step in handling semantic heterogeneity should be an attempt to enrich the semantic information about documents, i.e. to fill the gaps in the documents' metadata automatically. Section 2 describes a set of cascading deductive…

Information Retrieval · Computer Science 2011-02-21 Heiko Hellweg, Jürgen Krause, Thomas Mandl, Jutta Marx, Matthias N. O. Müller, Peter Mutschke, Robert Strötgen

Improving Document Retrieval Coherence for Semantically Equivalent Queries

Dense Retrieval (DR) models have proven to be effective for Document Retrieval and Information Grounding tasks. Usually, these models are trained and optimized for improving the relevance of top-ranked documents for a given query. Previous…

Information Retrieval · Computer Science 2025-08-12 Stefano Campese, Alessandro Moschitti, Ivano Lauriola

Building Custom Term Suggestion Web Services with OAI-Harvested Open Data

The problem that the same information need can be expressed in a variety of ways is especially true for scientific literature. Each scientific discipline has its own domain-specific language and vocabulary. This language is coded into…

Digital Libraries · Computer Science 2012-03-27 Philipp Schaer, Thomas Lüke, Wilko van Hoek

Author Name Disambiguation via Heterogeneous Network Embedding from Structural and Semantic Perspectives

Name ambiguity is common in academic digital libraries, such as multiple authors having the same name. This creates challenges for academic data management and analysis, thus name disambiguation becomes necessary. The procedure of name…

Machine Learning · Computer Science 2024-04-02 Wenjin Xie, Siyuan Liu, Xiaomeng Wang, Tao Jia

Information Retrieval in long documents: Word clustering approach for improving Semantics

In this paper, we propose an alternative to deep neural networks for semantic information retrieval for the case of long documents. This new approach exploiting clustering techniques to take into account the meaning of words in Information…

Information Retrieval · Computer Science 2025-07-29 Paul Mbathe Mekontchou, Armel Fotsoh, Bernabe Batchakui, Eddy Ella

Re-Ranking Words to Improve Interpretability of Automatically Generated Topics

Topic models, such as LDA, are widely used in Natural Language Processing. Making their output interpretable is an important area of research with applications to areas such as the enhancement of exploratory search interfaces and the…

Computation and Language · Computer Science 2019-04-01 Areej Alokaili, Nikolaos Aletras, Mark Stevenson

Expert Finding in Heterogeneous Bibliographic Networks with Locally-trained Embeddings

Expert finding is an important task in both industry and academia. It is challenging to rank candidates with appropriate expertise for various queries. In addition, different types of objects interact with one another, which naturally forms…

Information Retrieval · Computer Science 2018-03-12 Huan Gui, Qi Zhu, Liyuan Liu, Aston Zhang, Jiawei Han

Stratified Data Integration

We propose a novel approach to the problem of semantic heterogeneity where data are organized into a set of stratified and independent representation layers, namely: conceptual (where a set of unique alinguistic identifiers are connected…

Databases · Computer Science 2021-05-21 Fausto Giunchiglia, Alessio Zamboni, Mayukh Bagchi, Simone Bocca

DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation

Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations…

Computation and Language · Computer Science 2024-09-10 Taeho Hwang, Soyeong Jeong, Sukmin Cho, SeungYoon Han, Jong C. Park

Empowering Language Model with Guided Knowledge Fusion for Biomedical Document Re-ranking

Pre-trained language models (PLMs) have proven to be effective for the document re-ranking task. However, they lack the ability to fully interpret the semantics of biomedical and health-care queries and often rely on simplistic patterns for…

Computation and Language · Computer Science 2023-05-09 Deepak Gupta, Dina Demner-Fushman

A Science Model Driven Retrieval Prototype

This paper is about a better understanding of the structure and dynamics of science and the usage of these insights for compensating the typical problems that arise in metadata-driven Digital Libraries. Three science model driven retrieval…

Information Retrieval · Computer Science 2011-01-11 Philipp Mayr, Philipp Schaer, Peter Mutschke

Demonstrating a Service-Enhanced Retrieval System

This paper is a short description of an information retrieval system enhanced by three model driven retrieval services: (1) co-word analysis based query expansion, re-ranking via (2) Bradfordizing and (3) author centrality. The different…

Information Retrieval · Computer Science 2017-05-03 Philipp Schaer, Philipp Mayr, Peter Mutschke

Concept-based indexing in text information retrieval

Traditional information retrieval systems rely on keywords to index documents and queries. In such systems, documents are retrieved based on the number of shared keywords with the query. This lexical-focused retrieval leads to inaccurate…

Information Retrieval · Computer Science 2013-03-08 Fatiha Boubekeur, Wassila Azzoug

An ontology-based approach for semantics ranking of the web search engines results

This work falls in the areas of information retrieval and semantic web, and aims to improve the evaluation of web search tools. Indeed, the huge amount of information on the web as well as the growth of new inexperienced users creates new…

Information Retrieval · Computer Science 2012-12-12 Abdelkrim Bouramoul, Mohamed-Khireddine Kholladi, Bich-Liên Doan

CoRank: LLM-Based Compact Reranking with Document Features for Scientific Retrieval

Scientific retrieval is essential for advancing scientific knowledge discovery. Within this process, document reranking plays a critical role in refining first-stage retrieval results. However, standard LLM listwise reranking faces…

Information Retrieval · Computer Science 2025-08-19 Runchu Tian, Xueqiang Xu, Bowen Jin, SeongKu Kang, Jiawei Han

A Novel Combined Term Suggestion Service for Domain-Specific Digital Libraries

Interactive query expansion can assist users during their query formulation process. We conducted a user study with over 4,000 unique visitors and four different design approaches for a search term suggestion service. As a basis for our…

Digital Libraries · Computer Science 2019-03-29 Daniel Hienert, Philipp Schaer, Johann Schaible, Philipp Mayr

REALM: Recursive Relevance Modeling for LLM-based Document Re-Ranking

Large Language Models (LLMs) have shown strong capabilities in document re-ranking, a key component in modern Information Retrieval (IR) systems. However, existing LLM-based approaches face notable limitations, including ranking…

Information Retrieval · Computer Science 2025-10-03 Pinhuan Wang, Zhiqiu Xia, Chunhua Liao, Feiyi Wang, Hang Liu

Latent Semantic Analysis Approach for Document Summarization Based on Word Embeddings

Since the amount of information on the internet is growing rapidly, it is not easy for a user to find relevant information for his/her query. To tackle this issue, much attention has been paid to Automatic Document Summarization. The key…

Computation and Language · Computer Science 2019-02-05 Kamal Al-Sabahi, Zhang Zuping, Yang Kang

An Analysis of a BERT Deep Learning Strategy on a Technology Assisted Review Task

Document screening is a central task within Evidence-Based Medicine, which is a clinical discipline that supplements scientific proof to back medical decisions. Given the recent advances in DL (Deep Learning) methods applied to Information…

Information Retrieval · Computer Science 2021-04-20 Alexandros Ioannidis

Improving Low-Resource Cross-lingual Document Retrieval by Reranking with Deep Bilingual Representations

In this paper, we propose to boost low-resource cross-lingual document retrieval performance with deep bilingual query-document representations. We match queries and documents in both source and target languages with four components, each…

Computation and Language · Computer Science 2019-06-11 Rui Zhang, Caitlin Westerfield, Sungrok Shim, Garrett Bingham, Alexander Fabbri, Neha Verma, William Hu, Dragomir Radev