Related papers: Query Significance in Databases via Randomizations

Content-Based Table Retrieval for Web Queries

Understanding the connections between unstructured text and semi-structured tables is an important yet neglected problem in natural language processing. In this work, we focus on content-based table retrieval. Given a query, the task is to…

Computation and Language · Computer Science 2017-06-09 Zhao Yan, Duyu Tang, Nan Duan, Junwei Bao, Yuanhua Lv, Ming Zhou, Zhoujun Li

Building and Maintaining Halls of Fame over a Database

Halls of Fame are fascinating constructs. They represent the elite of an often very large amount of entities: persons, companies, products, countries etc. Beyond their practical use as static rankings, changes to them are particularly…

Databases · Computer Science 2015-03-20 Foteini Alvanaki, Sebastian Michel, Aleksandar Stupar

Ranked Enumeration for Database Queries

Ranked enumeration is a query-answering paradigm where the query answers are returned incrementally in order of importance (instead of returning all answers at once). Importance is defined by a ranking function that can be specific to the…

Databases · Computer Science 2025-02-07 Nikolaos Tziavelis, Wolfgang Gatterbauer, Mirek Riedewald

Scientific Table Search Using Keyword Queries

Tables are common and important in scientific documents, yet most text-based document search systems do not capture structures and semantics specific to tables. How to bridge different types of mismatch between keyword queries and…

Information Retrieval · Computer Science 2017-07-13 Kyle Yingkai Gao, Jamie Callan

Probabilistic Search for Structured Data via Probabilistic Programming and Nonparametric Bayes

Databases are widespread, yet extracting relevant data can be difficult. Without substantial domain knowledge, multivariate search queries often return sparse or uninformative results. This paper introduces an approach for searching…

Artificial Intelligence · Computer Science 2017-04-05 Feras Saad, Leonardo Casarsa, Vikash Mansinghka

Efficient Probabilistic Inference with Partial Ranking Queries

Distributions over rankings are used to model data in various settings such as preference analysis and political elections. The factorial size of the space of rankings, however, typically forces one to make structural assumptions, such as…

Machine Learning · Computer Science 2012-02-20 Jonathan Huang, Ashish Kapoor, Carlos E. Guestrin

Criteria Sliders: Learning Continuous Database Criteria via Interactive Ranking

Large databases are often organized by hand-labeled metadata, or criteria, which are expensive to collect. We can use unsupervised learning to model database variation, but these models are often high dimensional, complex to parameterize,…

Computer Vision and Pattern Recognition · Computer Science 2017-06-14 James Tompkin, Kwang In Kim, Hanspeter Pfister, Christian Theobalt

Database Theory in Action: Direct Access to Query Answers

Direct access asks for the retrieval of query answers by their ranked position, given a query and a desired order. While the time complexity of data structures supporting such accesses has been studied in depth, and efficient algorithms for…

Databases · Computer Science 2026-03-23 Jiayin Hu, Nikolaos Tziavelis

Weighing the techniques for data optimization in a database

A set of preferred records can be obtained from a large database in a multi-criteria setting using various computational methods which either depend on the concept of dominance or on the concept of utility or scoring function based on the…

Databases · Computer Science 2022-03-18 Anagha Radhakrishnan

A New Scale for Attribute Dependency in Large Database Systems

Large, data-centric applications are characterized by their different attributes. Today, the vast majority of large data-centric applications are based on the relational model. The databases are collections of tables and every table…

Information Retrieval · Computer Science 2012-06-28 Soumya Sen, Anjan Dutta, Agostino Cortesi, Nabendu Chaki

Extracting Features from Ratings: The Role of Factor Models

Performing effective preference-based data retrieval requires detailed and preferentially meaningful structurized information about the current user as well as the items under consideration. A common problem is that representations of items…

Artificial Intelligence · Computer Science 2011-01-13 Joachim Selke, Wolf-Tilo Balke

Leveraging Usage Data for Linked Data Movie Entity Summarization

Novel research in the field of Linked Data focuses on the problem of entity summarization. This field addresses the problem of ranking features according to their importance for the task of identifying a particular entity. Next to a more…

Artificial Intelligence · Computer Science 2012-04-13 Andreas Thalhammer, Ioan Toma, Antonio Roa-Valverde, Dieter Fensel

Semantic Table Retrieval using Keyword and Table Queries

Tables on the Web contain a vast amount of knowledge in a structured form. To tap into this valuable resource, we address the problem of table retrieval: answering an information need with a ranked list of tables. We investigate this…

Information Retrieval · Computer Science 2021-05-14 Shuo Zhang, Krisztian Balog

Aggregation and Ordering in Factorised Databases

A common approach to data analysis involves understanding and manipulating succinct representations of data. In earlier work, we put forward a succinct representation system for relational data called factorised databases and reported on…

Databases · Computer Science 2013-07-02 Nurzhan Bakibayev, Tomáš Kočiský, Dan Olteanu, Jakub Závodný

A semantic association page rank algorithm for web search engines

The majority of Semantic Web search engines retrieve information by focusing on the use of concepts and relations restricted to the query provided by the user. By trying to guess the implicit meaning between these concepts and relations,…

Information Retrieval · Computer Science 2012-11-28 Manuel Rojas

Discovering Latent Patterns from the Analysis of User-Curated Movie Lists

User content curation is becoming an important source of preference data, as well as providing information regarding the items being curated. One popular approach involves the creation of lists. On Twitter, these lists might contain…

Social and Information Networks · Computer Science 2013-08-26 Derek Greene, Pádraig Cunningham

Tell Me Something I Don't Know: Randomization Strategies for Iterative Data Mining

There is a wide variety of data mining methods available, and it is generally useful in exploratory data analysis to use many different methods for the same dataset. This, however, leads to the problem of whether the results found by one…

Machine Learning · Computer Science 2020-06-18 Sami Hanhijärvi, Markus Ojala, Niko Vuokko, Kai Puolamäki, Nikolaj Tatti, Heikki Mannila

Mining Rank Data

The problem of frequent pattern mining has been studied quite extensively for various types of data, including sets, sequences, and graphs. Somewhat surprisingly, another important type of data, namely rank data, has received very little…

Machine Learning · Computer Science 2018-06-18 Sascha Henzgen, Eyke Hüllermeier

Challenges for Efficient Query Evaluation on Structured Probabilistic Data

Query answering over probabilistic data is an important task but is generally intractable. However, a new approach for this problem has recently been proposed, based on structural decompositions of input databases, following, e.g., tree…

Databases · Computer Science 2019-08-28 Antoine Amarilli, Silviu Maniu, Mikaël Monet

Automatic Recommendations for Evolving Relational Databases Schema

Relational databases play a central role in many information systems. Their schema contains structural (e.g. tables and columns) and behavioral (e.g. stored procedures or views) entity descriptions. Then, just like for "normal" software,…

Software Engineering · Computer Science 2024-04-15 Anne Etien, Nicolas Anquetil