Related papers: Aggregate Estimation Over Dynamic Hidden Web Datab…

Discover Aggregates Exceptions over Hidden Web Databases

Nowadays, many web databases "hidden" behind their restrictive search interfaces (e.g., Amazon, eBay) contain rich and valuable information that is of significant interests to various third parties. Recent studies have demonstrated the…

Databases · Computer Science 2016-11-22 Saad Bin Suhaim , Weimo Liu , Nan Zhang

Optimal Algorithms for Crawling a Hidden Database in the Web

A hidden database refers to a dataset that an organization makes accessible on the web by allowing users to issue queries through a search interface. In other words, data acquisition from such a source is not by following static…

Databases · Computer Science 2012-08-02 Cheng Sheng , Nan Zhang , Yufei Tao , Xin Jin

On Constrained Open-World Probabilistic Databases

Increasing amounts of available data have led to a heightened need for representing large-scale probabilistic knowledge bases. One approach is to use a probabilistic database, a model with strong assumptions that allow for efficiently…

Artificial Intelligence · Computer Science 2019-04-04 Tal Friedman , Guy Van den Broeck

Distributed Online Aggregative Optimization for Dynamic Multi-robot Coordination

This paper focuses on an online version of the emerging distributed constrained aggregative optimization framework, which is particularly suited for applications arising in cooperative robotics. Agents in a network want to minimize the sum…

Optimization and Control · Mathematics 2023-09-13 Guido Carnevale , Andrea Camisa , Giuseppe Notarstefano

Adaptive Learning of Aggregate Analytics under Dynamic Workloads

Large organizations have seamlessly incorporated data-driven decision making in their operations. However, as data volumes increase, expensive big data infrastructures are called to rescue. In this setting, analytics tasks become very…

Databases · Computer Science 2020-03-17 Fotis Savva , Christos Anagnostopoulos , Peter Triantafillou

Adaptive Search Optimization: Dynamic Algorithm Selection and Caching for Enhanced Database Performance

Efficient search operations in databases are paramount for timely retrieval of information various applications. This research introduces a novel approach, combining dynamicalgorithm1 selection and caching2 strategies, to optimize search…

Databases · Computer Science 2023-11-15 Hakikat Singh

Aggregate Queries on Sparse Databases

We propose an algebraic framework for studying efficient algorithms for query evaluation, aggregation, enumeration, and maintenance under updates, on sparse databases. Our framework allows to treat those problems in a unified way, by…

Logic in Computer Science · Computer Science 2020-01-01 Szymon Toruńczyk

Efficient and Private Approximations of Distributed Databases Calculations

In recent years, an increasing amount of data is collected in different and often, not cooperative, databases. The problem of privacy-preserving, distributed calculations over separated databases and, a relative to it, issue of private data…

Databases · Computer Science 2016-05-23 Philip Derbeko , Shlomi Dolev , Ehud Gudes , Jeffrey D. Ullman

Incremental Consistent Updating of Incomplete Databases

Efficient consistency maintenance of incomplete and dynamic real-life databases is a quality label for further data analysis. In prior work, we tackled the generic problem of database updating in the presence of tuple generating constraints…

Databases · Computer Science 2024-05-16 Jacques Chabin , Mirian Halfeld Ferrari , Nicolas Hiot , Dominique Laurent

Online Algorithms for Information Aggregation from Distributed and Correlated Sources

There is a fundamental trade-off between the communication cost and latency in information aggregation. Aggregating multiple communication messages over time can alleviate overhead and improve energy efficiency on one hand, but inevitably…

Networking and Internet Architecture · Computer Science 2017-09-25 Chi-Kin Chau , Majid Khonji , Muhammad Aftab

Efficient Uncertainty Tracking for Complex Queries with Attribute-level Bounds (extended version)

Certain answers are a principled method for coping with the uncertainty that arises in many practical data management tasks. Unfortunately, this method is expensive and may exclude useful (if uncertain) answers. Prior work introduced…

Databases · Computer Science 2021-02-24 Su Feng , Aaron Huber , Boris Glavic , Oliver Kennedy

Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers (extended version)

Certain answers are a principled method for coping with uncertainty that arises in many practical data management tasks. Unfortunately, this method is expensive and may exclude useful (if uncertain) answers. Thus, users frequently resort to…

Databases · Computer Science 2019-04-02 Su Feng , Aaron Huber , Boris Glavic , Oliver Kennedy

Consistent query answers on numerical databases under aggregate constraints

The problem of extracting consistent information from relational databases violating integrity constraints on numerical data is addressed. In particular, aggregate constraints defined as linear inequalities on aggregate-sum queries on input…

Databases · Computer Science 2007-05-23 Sergio Flesca , Filippo Furfaro , Francesco Parisi

Discovering the Skyline of Web Databases

Many web databases are "hidden" behind proprietary search interfaces that enforce the top-$k$ output constraint, i.e., each query returns at most $k$ of all matching tuples, preferentially selected and returned according to a proprietary…

Databases · Computer Science 2017-05-10 Abolfazl Asudeh , Saravanan Thirumuruganathan , Nan Zhang , Gautam Das

Automatic Classification of Text Databases through Query Probing

Many text databases on the web are "hidden" behind search interfaces, and their documents are only accessible through querying. Search engines typically ignore the contents of such search-only databases. Recently, Yahoo-like directories…

Databases · Computer Science 2007-05-23 Panagiotis Ipeirotis , Luis Gravano , Mehran Sahami

Scalable Continual Top-k Keyword Search in Relational Databases

Keyword search in relational databases has been widely studied in recent years because it does not require users neither to master a certain structured query language nor to know the complex underlying database schemas. Most of existing…

Databases · Computer Science 2011-08-24 Yanwei XU

Dynamic Reserve Price Design with Distributed Solving Algorithm

Unexpected advertising items in sponsored search may reduce users' reliance on organic search, resulting in hidden cost for the e-commerce platform. To address this problem and promote sustainable growth, we propose a dynamic reserve price…

Computer Science and Game Theory · Computer Science 2025-08-26 Mang Li

A Survey of Distributed Data Aggregation Algorithms

Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, that can then be used to direct the execution of other applications. The resulting values result from the…

Distributed, Parallel, and Cluster Computing · Computer Science 2011-10-05 Paulo Jesus , Carlos Baquero , Paulo Sérgio Almeida

Online Algorithms for Estimating Change Rates of Web Pages

A search engine maintains local copies of different web pages to provide quick search results. This local cache is kept up-to-date by a web crawler that frequently visits these different pages to track changes in them. Ideally, the local…

Information Retrieval · Computer Science 2021-11-08 Konstantin Avrachenkov , Kishor Patil , Gugan Thoppe

Mining The Data From Distributed Database Using An Improved Mining Algorithm

Association rule mining is an active data mining research area and most ARM algorithms cater to a centralized environment. Centralized data mining to discover useful patterns in distributed databases isn't always feasible because merging…

Databases · Computer Science 2010-04-13 J. Arokia Renjit , K. L. Shunmuganathan