Related papers: Practical Range Aggregation, Selection and Set Mai…

Multidimensional Data Structures and Techniques for Efficient Decision Making

In this paper we present several novel efficient techniques and multidimensional data structures which can improve the decision making process in many domains. We consider online range aggregation, range selection and range weighted median…

Computational Geometry · Computer Science 2010-01-12 Madalina Ecaterina Andreica , Mugurel Ionut Andreica , Nicolae Cataniciu

Efficient Algorithmic Techniques for Several Multidimensional Geometric Data Management and Analysis Problems

In this paper I present several novel, efficient, algorithmic techniques for solving some multidimensional geometric data management and analysis problems. The techniques are based on several data structures from computational geometry…

Computational Geometry · Computer Science 2013-01-03 Mugurel Ionut Andreica

Compact Representations of Event Sequences

We introduce a new technique for the efficient management of large sequences of multidimensional data, which takes advantage of regularities that arise in real-world datasets and supports different types of aggregation queries. More…

Data Structures and Algorithms · Computer Science 2018-03-08 Nieves R. Brisaboa , Guillermo de Bernardo , Gonzalo Navarro , Tirso V. Rodeiro , Diego Seco

Aggregation of Composite Solutions: strategies, models, examples

The paper addresses aggregation issues for composite (modular) solutions. A systemic view point is suggested for various aggregation problems. Several solution structures are considered: sets, set morphologies, trees, etc. Mainly, the…

Software Engineering · Computer Science 2011-12-01 Mark Sh. Levin

Variable Selection for Clustering and Classification

As data sets continue to grow in size and complexity, effective and efficient techniques are needed to target important features in the variable space. Many of the variable selection techniques that are commonly used alongside clustering…

Computation · Statistics 2013-03-22 Jeffrey L. Andrews , Paul D. McNicholas

Aggregated 2D Range Queries on Clustered Points

Efficient processing of aggregated range queries on two-dimensional grids is a common requirement in information retrieval and data mining systems, for example in Geographic Information Systems and OLAP cubes. We introduce a technique to…

Data Structures and Algorithms · Computer Science 2016-04-01 Nieves R. Brisaboa , Guillermo De Bernardo , Roberto Konow , Gonzalo Navarro , Diego Seco

A Survey of Distributed Data Aggregation Algorithms

Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, that can then be used to direct the execution of other applications. The resulting values result from the…

Distributed, Parallel, and Cluster Computing · Computer Science 2011-10-05 Paulo Jesus , Carlos Baquero , Paulo Sérgio Almeida

Practical Algorithmic Techniques for Several String Processing Problems

The domains of data mining and knowledge discovery make use of large amounts of textual data, which need to be handled efficiently. Specific problems, like finding the maximum weight ordered common subset of a set of ordered sets or…

Data Structures and Algorithms · Computer Science 2009-12-07 Mugurel Ionut Andreica , Nicolae Tapus

Estimating Range Queries using Aggregate Data with Integrity Constraints: a Probabilistic Approach

The problem of recovering (count and sum) range queries over multidimensional data only on the basis of aggregate information on such data is addressed. This problem can be formalized as follows. Suppose that a transformation T producing a…

Databases · Computer Science 2007-05-23 Francesco Buccafurri , Filippo Furfaro , Domenico Sacca'

Set-to-Sequence Methods in Machine Learning: a Review

Machine learning on sets towards sequential output is an important and ubiquitous task, with applications ranging from language modeling and meta-learning to multi-agent strategy games and power grid optimization. Combining elements of…

Machine Learning · Computer Science 2021-09-10 Mateusz Jurewicz , Leon Strømberg-Derczynski

Automating Box Folding: Sequence Extraction and Ranking Methodologies

Box folding represents a crucial challenge for automated packaging systems. This work bridges the gap between existing methods for folding sequence extraction and approaches focused on the adaptability of automated systems to specific box…

Robotics · Computer Science 2026-01-28 Giuseppe Fabio Preziosa , Davide Ferloni , Andrea Maria Zanchettin , Marco Faroni , Paolo Rocco

Variable Selection Methods for Model-based Clustering

Model-based clustering is a popular approach for clustering multivariate data which has seen applications in numerous fields. Nowadays, high-dimensional data are more and more common and the model-based clustering approach has adapted to…

Methodology · Statistics 2018-09-25 Michael Fop , Thomas Brendan Murphy

Neural network ensembles: Evaluation of aggregation algorithms

Ensembles of artificial neural networks show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. An…

Artificial Intelligence · Computer Science 2007-05-23 P. M. Granitto , P. F. Verdes , H. A. Ceccatto

When Dynamic Data Selection Meets Data Augmentation

Dynamic data selection aims to accelerate training with lossless performance. However, reducing training data inherently limits data diversity, potentially hindering generalization. While data augmentation is widely used to enhance…

Machine Learning · Computer Science 2025-05-13 Suorong Yang , Peng Ye , Furao Shen , Dongzhan Zhou

Diffusion-Driven High-Dimensional Variable Selection

Variable selection for high-dimensional, highly correlated data has long been a challenging problem, often yielding unstable and unreliable models. We propose a resample-aggregate framework that exploits diffusion models' ability to…

Methodology · Statistics 2025-08-20 Minjie Wang , Xiaotong Shen , Wei Pan

Selections and Higher Separation Axioms

This survey presents some historical background and recent developments in the area of selections for set-valued mappings along with several open questions. It was written with the hope that the presented material may pique an interest in…

General Topology · Mathematics 2018-10-16 Valentin Gutev

Model Selection Techniques -- An Overview

In the era of big data, analysts usually explore various statistical models or machine learning methods for observed data in order to facilitate scientific discoveries or gain predictive power. Whatever data and fitting procedures are…

Machine Learning · Statistics 2018-10-24 Jie Ding , Vahid Tarokh , Yuhong Yang

Method Chunks Selection by Multicriteria Techniques: an Extension of the Assembly-based Approach

The work presented in this paper is related to the area of situational method engineering (SME). In this domain, approaches are developed accordingly to specific project specifications. We propose to adapt an existing method construction…

Software Engineering · Computer Science 2009-11-10 Elena Kornyshova , Rebecca Deneckere , Camille Salinesi

A Cross-Entropy-based Method to Perform Information-based Feature Selection

From a machine learning point of view, identifying a subset of relevant features from a real data set can be useful to improve the results achieved by classification methods and to reduce their time and space complexity. To achieve this…

Machine Learning · Computer Science 2017-05-23 Pietro Cassara , Alessandro Rozza , Mirco Nanni

Heuristic Search for Rank Aggregation with Application to Label Ranking

Rank aggregation aims to combine the preference rankings of a number of alternatives from different voters into a single consensus ranking. As a useful model for a variety of practical applications, however, it is a computationally…

Neural and Evolutionary Computing · Computer Science 2022-01-12 Yangming Zhou , Jin-Kao Hao , Zhen Li , Fred Glover