English
Related papers

Related papers: Recursive Abstractive Processing for Retrieval in …

200 papers

Retrieval-augmented language models can better adapt to changes in world state and incorporate long-tail knowledge. However, most existing methods retrieve only short contiguous chunks from a retrieval corpus, limiting holistic…

Computation and Language · Computer Science 2024-02-01 Parth Sarthi , Salman Abdullah , Aditi Tuli , Shubh Khanna , Anna Goldie , Christopher D. Manning

Information retrieval is a core component of many intelligent systems as it enables conditioning of outputs on new and large-scale datasets. While effective, the standard practice of encoding data into high-dimensional representations for…

Information Retrieval · Computer Science 2026-02-16 Shubham Gupta , Zichao Li , Tianyi Chen , Cem Subakan , Siva Reddy , Perouz Taslakian , Valentina Zantedeschi

Existing summarization systems mostly generate summaries purely relying on the content of the source document. However, even for humans, we usually need some references or exemplars to help us fully understand the source document and write…

Computation and Language · Computer Science 2021-12-14 Chenxin An , Ming Zhong , Zhichao Geng , Jianqiang Yang , Xipeng Qiu

In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure. We recommend a simple variant of the standard algorithm, in which clusters are merged by…

Machine Learning · Statistics 2024-03-04 Annie Gray , Alexander Modell , Patrick Rubin-Delanchy , Nick Whiteley

Text summarization condenses a text to a shorter version while retaining the important informations. Abstractive summarization is a recent development that generates new phrases, rather than simply copying or rephrasing sentences within the…

Computation and Language · Computer Science 2018-02-06 André Cibils , Claudiu Musat , Andreea Hossman , Michael Baeriswyl

Retrieval-Augmented Generation (RAG) mitigates the hallucination problem of Large Language Models (LLMs) by incorporating external knowledge. Recursive summarization constructs a hierarchical summary tree by clustering text chunks,…

Computation and Language · Computer Science 2026-04-09 Guanran Luo , Zhongquan Jian , Wentao Qiu , Meihong Wang , Qingqiang Wu

Addressing the complexity of comprehensive information retrieval, this study introduces an innovative, iterative retrieval-augmented generation system. Our approach uniquely integrates a vector-space driven re-ranking mechanism with…

Information Theory · Computer Science 2024-01-04 Arash Shahmansoori

Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) finds meaningful patterns in spatial data by considering density and spatial proximity. As the clustering algorithm is inherently designed for static…

Databases · Computer Science 2024-12-12 Kayumov Abduaziz , Min Sik Kim , Ji Sun Shin

We propose a method to reconstruct and cluster incomplete high-dimensional data lying in a union of low-dimensional subspaces. Exploring the sparse representation model, we jointly estimate the missing data while imposing the intrinsic…

Computer Vision and Pattern Recognition · Computer Science 2017-09-06 João Carvalho , Manuel Marques , João P. Costeira

The task of automatic text summarization produces a concise and fluent text summary while preserving key information and overall meaning. Recent approaches to document-level summarization have seen significant improvements in recent years…

Computation and Language · Computer Science 2022-12-07 Gonçalo Raposo , Afonso Raposo , Ana Sofia Carmo

Navigating the vast scientific literature often starts with browsing a paper's abstract. However, when a reader seeks additional information, not present in the abstract, they face a costly cognitive chasm during their dive into the full…

Human-Computer Interaction · Computer Science 2024-04-17 Raymond Fok , Joseph Chee Chang , Tal August , Amy X. Zhang , Daniel S. Weld

We propose an abstraction-based multi-document summarization framework that can construct new sentences by exploring more fine-grained syntactic units than sentences, namely, noun/verb phrases. Different from existing abstraction-based…

Computation and Language · Computer Science 2015-06-08 Lidong Bing , Piji Li , Yi Liao , Wai Lam , Weiwei Guo , Rebecca J. Passonneau

We propose a new outline for adaptive dictionary learning methods for sparse encoding based on a hierarchical clustering of the training data. Through recursive application of a clustering method, the data is organized into a binary…

Machine Learning · Computer Science 2020-06-11 Renato Budinich , Gerlind Plonka

An approximate textual retrieval algorithm for searching sources with high levels of defects is presented. It considers splitting the words in a query into two overlapping segments and subsequently building composite regular expressions…

Information Retrieval · Computer Science 2007-05-23 Pere Constans

This paper presents a novel approach to binary classification using dynamic logistic ensemble models. The proposed method addresses the challenges posed by datasets containing inherent internal clusters that lack explicit feature-based…

Machine Learning · Computer Science 2024-12-02 Mohammad Zubair Khan , David Li

Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and the control strategy used to search the space of clusterings. Ideally, the search…

Artificial Intelligence · Computer Science 2014-11-17 D. Fisher

Recently, deep architectures, such as recurrent and recursive neural networks have been successfully applied to various natural language processing tasks. Inspired by bidirectional recurrent neural networks which use representations that…

Machine Learning · Computer Science 2013-12-03 Ozan İrsoy , Claire Cardie

In this research we address the problem of capturing recurring concepts in a data stream environment. Recurrence capture enables the re-use of previously learned classifiers without the need for re-learning while providing for better…

Machine Learning · Computer Science 2014-06-25 Sakthithasan Sripirakas , Russel Pears

Retrieving procedure-oriented evidence from materials science papers is difficult because key synthesis details are often scattered across long, context-heavy documents and are not well captured by paragraph-only dense retrieval. We present…

Signal Processing · Electrical Eng. & Systems 2026-04-14 Zhuoyu Wu , Wenhui Ou , Pei-Sze Tan , Wenqi Fang , Sailaja Rajanala , Raphaël C. -W. Phan

Inverted file structure is a common technique for accelerating dense retrieval. It clusters documents based on their embeddings; during searching, it probes nearby clusters w.r.t. an input query and only evaluates documents within them by…

Information Retrieval · Computer Science 2023-10-18 Peitian Zhang , Zheng Liu , Shitao Xiao , Zhicheng Dou , Jing Yao
‹ Prev 1 2 3 10 Next ›