Related papers: Recursive Abstractive Processing for Retrieval in …

RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Retrieval-augmented language models can better adapt to changes in world state and incorporate long-tail knowledge. However, most existing methods retrieve only short contiguous chunks from a retrieval corpus, limiting holistic…

Computation and Language · Computer Science 2024-02-01 Parth Sarthi , Salman Abdullah , Aditi Tuli , Shubh Khanna , Anna Goldie , Christopher D. Manning

Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency

Information retrieval is a core component of many intelligent systems as it enables conditioning of outputs on new and large-scale datasets. While effective, the standard practice of encoding data into high-dimensional representations for…

Information Retrieval · Computer Science 2026-02-16 Shubham Gupta , Zichao Li , Tianyi Chen , Cem Subakan , Siva Reddy , Perouz Taslakian , Valentina Zantedeschi

RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

Existing summarization systems mostly generate summaries purely relying on the content of the source document. However, even for humans, we usually need some references or exemplars to help us fully understand the source document and write…

Computation and Language · Computer Science 2021-12-14 Chenxin An , Ming Zhong , Zhichao Geng , Jianqiang Yang , Xipeng Qiu

Hierarchical clustering with dot products recovers hidden tree structure

In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure. We recommend a simple variant of the standard algorithm, in which clusters are merged by…

Machine Learning · Statistics 2024-03-04 Annie Gray , Alexander Modell , Patrick Rubin-Delanchy , Nick Whiteley

Diverse Beam Search for Increased Novelty in Abstractive Summarization

Text summarization condenses a text to a shorter version while retaining the important informations. Abstractive summarization is a recent development that generates new phrases, rather than simply copying or rephrasing sentences within the…

Computation and Language · Computer Science 2018-02-06 André Cibils , Claudiu Musat , Andreea Hossman , Michael Baeriswyl

DTCRS: Dynamic Tree Construction for Recursive Summarization

Retrieval-Augmented Generation (RAG) mitigates the hallucination problem of Large Language Models (LLMs) by incorporating external knowledge. Recursive summarization constructs a hierarchical summary tree by clustering text chunks,…

Computation and Language · Computer Science 2026-04-09 Guanran Luo , Zhongquan Jian , Wentao Qiu , Meihong Wang , Qingqiang Wu

Concurrent Brainstorming & Hypothesis Satisfying: An Iterative Framework for Enhanced Retrieval-Augmented Generation (R2CBR3H-SR)

Addressing the complexity of comprehensive information retrieval, this study introduces an innovative, iterative retrieval-augmented generation system. Our approach uniquely integrates a vector-space driven re-ranking mechanism with…

Information Theory · Computer Science 2024-01-04 Arash Shahmansoori

Dynamic data summarization for hierarchical spatial clustering

Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) finds meaningful patterns in spatial data by considering density and spatial proximity. As the clustering algorithm is inherently designed for static…

Databases · Computer Science 2024-12-12 Kayumov Abduaziz , Min Sik Kim , Ji Sun Shin

Subspace Segmentation by Successive Approximations: A Method for Low-Rank and High-Rank Data with Missing Entries

We propose a method to reconstruct and cluster incomplete high-dimensional data lying in a union of low-dimensional subspaces. Exploring the sparse representation model, we jointly estimate the missing data while imposing the intrinsic…

Computer Vision and Pattern Recognition · Computer Science 2017-09-06 João Carvalho , Manuel Marques , João P. Costeira

Document-Level Abstractive Summarization

The task of automatic text summarization produces a concise and fluent text summary while preserving key information and overall meaning. Recent approaches to document-level summarization have seen significant improvements in recent years…

Computation and Language · Computer Science 2022-12-07 Gonçalo Raposo , Afonso Raposo , Ana Sofia Carmo

Qlarify: Recursively Expandable Abstracts for Directed Information Retrieval over Scientific Papers

Navigating the vast scientific literature often starts with browsing a paper's abstract. However, when a reader seeks additional information, not present in the abstract, they face a costly cognitive chasm during their dive into the full…

Human-Computer Interaction · Computer Science 2024-04-17 Raymond Fok , Joseph Chee Chang , Tal August , Amy X. Zhang , Daniel S. Weld

Abstractive Multi-Document Summarization via Phrase Selection and Merging

We propose an abstraction-based multi-document summarization framework that can construct new sentences by exploring more fine-grained syntactic units than sentences, namely, noun/verb phrases. Different from existing abstraction-based…

Computation and Language · Computer Science 2015-06-08 Lidong Bing , Piji Li , Yi Liao , Wai Lam , Weiwei Guo , Rebecca J. Passonneau

A Tree-based Dictionary Learning Framework

We propose a new outline for adaptive dictionary learning methods for sparse encoding based on a hierarchical clustering of the training data. Through recursive application of a clustering method, the data is organized into a binary…

Machine Learning · Computer Science 2020-06-11 Renato Budinich , Gerlind Plonka

Approximate textual retrieval

An approximate textual retrieval algorithm for searching sources with high levels of defects is presented. It considers splitting the words in a query into two overlapping segments and subsequently building composite regular expressions…

Information Retrieval · Computer Science 2007-05-23 Pere Constans

Dynamic Logistic Ensembles with Recursive Probability and Automatic Subset Splitting for Enhanced Binary Classification

This paper presents a novel approach to binary classification using dynamic logistic ensemble models. The proposed method addresses the challenges posed by datasets containing inherent internal clusters that lack explicit feature-based…

Machine Learning · Computer Science 2024-12-02 Mohammad Zubair Khan , David Li

Iterative Optimization and Simplification of Hierarchical Clusterings

Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and the control strategy used to search the space of clusterings. Ideally, the search…

Artificial Intelligence · Computer Science 2014-11-17 D. Fisher

Bidirectional Recursive Neural Networks for Token-Level Labeling with Structure

Recently, deep architectures, such as recurrent and recursive neural networks have been successfully applied to various natural language processing tasks. Inspired by bidirectional recurrent neural networks which use representations that…

Machine Learning · Computer Science 2013-12-03 Ozan İrsoy , Claire Cardie

Mining Recurrent Concepts in Data Streams using the Discrete Fourier Transform

In this research we address the problem of capturing recurring concepts in a data stream environment. Recurrence capture enables the re-use of previously learned classifiers without the need for re-learning while providing for better…

Machine Learning · Computer Science 2014-06-25 Sakthithasan Sripirakas , Russel Pears

RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering

Retrieving procedure-oriented evidence from materials science papers is difficult because key synthesis details are often scattered across long, context-heavy documents and are not well captured by paragraph-only dense retrieval. We present…

Signal Processing · Electrical Eng. & Systems 2026-04-14 Zhuoyu Wu , Wenhui Ou , Pei-Sze Tan , Wenqi Fang , Sailaja Rajanala , Raphaël C. -W. Phan

Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval

Inverted file structure is a common technique for accelerating dense retrieval. It clusters documents based on their embeddings; during searching, it probes nearby clusters w.r.t. an input query and only evaluates documents within them by…

Information Retrieval · Computer Science 2023-10-18 Peitian Zhang , Zheng Liu , Shitao Xiao , Zhicheng Dou , Jing Yao