English
Related papers

Related papers: Fusing heterogeneous data sets

200 papers

As the world population gets older, the healthcare system must be adapted, among others by providing continuous health monitoring at home and in the city. The social activities have a significant role in everyone health status. Hence, this…

Computers and Society · Computer Science 2016-12-01 Loïc Sevrin , Bertrand Massot , Norbert Noury , Nacer Abouchi , Fabrice Jumel , Jacques Saraydaryan

The continuing advances of omic technologies mean that it is now more tangible to measure the numerous features collectively reflecting the molecular properties of a sample. When multiple omic methods are used, statistical and computational…

Genomics · Quantitative Biology 2023-08-14 Tim Downing , Nicos Angelopoulos

In the context of functional data analysis, we propose new two sample tests for homogeneity. Based on some well-known depth measures, we construct four different statistics in order to measure distance between the two samples. A simulation…

Methodology · Statistics 2015-07-08 Ramón Flores , Rosa Lillo , Juan Romo

Measuring inter-dataset similarity is an important task in machine learning and data mining with various use cases and applications. Existing methods for measuring inter-dataset similarity are computationally expensive, limited, or…

Machine Learning · Computer Science 2025-05-06 Muhammad Rajabinasab , Anton D. Lautrup , Arthur Zimek

High-resolution estimates of population health indicators are critical for precision public health. We propose a method for high-resolution estimation that fuses distinct data sources: an unbiased, low-resolution data source (e.g.…

Methodology · Statistics 2025-08-21 Amy Guan , Marissa Reitsma , Roshni Sahoo , Joshua Salomon , Stefan Wager

Very often for the same scientific question, there may exist different techniques or experiments that measure the same numerical quantity. Historically, various methods have been developed to exploit the information within each type of data…

Methodology · Statistics 2021-09-22 Yiwen Liu , Xiaoxiao Sun , Wenxuan Zhong , Bing Li

Research on environmental risk modeling relies on numerous indicators to quantify the magnitude and frequency of extreme climate events, their ecological, economic, and social impacts, and the coping mechanisms that can reduce or mitigate…

Information Theory · Computer Science 2026-01-29 Abdullah Konak

Characterizing the dynamic interactive patterns of complex systems helps gain in-depth understanding of how components interrelate with each other while performing certain functions as a whole. In this study, we present a novel multimodal…

Machine Learning · Computer Science 2019-01-07 Miaolin Fan , Chun-An Chou , Sheng-Che Yen , Yingzi Lin

This paper proposes a novel framework for fusing multi-temporal, multispectral satellite images and OpenStreetMap (OSM) data for the classification of local climate zones (LCZs). Feature stacking is the most commonly-used method of data…

Machine Learning · Computer Science 2019-10-23 Guichen Zhang , Pedram Ghamisi , Xiao Xiang Zhu

This paper addresses the density based multi-sensor cooperative fusion using random finite set (RFS) type multi-object densities (MODs). Existing fusion methods use scalar weights to characterize the relative information confidence among…

Information Theory · Computer Science 2021-07-21 Wei Yi , Lei Chai

We introduce a new data fusion method that utilizes multiple data sources to estimate a smooth, finite-dimensional parameter. Most existing methods only make use of fully aligned data sources that share common conditional distributions of…

Methodology · Statistics 2025-04-30 Sijia Li , Peter B. Gilbert , Rui Duan , Alex Luedtke

Multicellular systems play a key role in bioprocess and biomedical engineering. Cell ensembles encountered in these setups show phenotypic variability like size and biochemical composition. As this variability may result in undesired…

Systems and Control · Computer Science 2018-07-16 Armin Küper , Robert Dürr , Steffen Waldherr

This paper addresses patient heterogeneity associated with prediction problems in biomedical applications. We propose a systematic hypothesis testing approach to determine the existence of patient subgroup structure and the number of…

Methodology · Statistics 2021-01-08 Xu Gao , Weining Shen , Jing Ning , Ziding Feng , Jianhua Hu

We study the problem of multi-task non-smooth optimization that arises ubiquitously in statistical learning, decision-making and risk management. We develop a data fusion approach that adaptively leverages commonalities among a large number…

Machine Learning · Statistics 2022-10-25 Henry Lam , Kaizheng Wang , Yuhang Wu , Yichen Zhang

Metagenomics offers a way to analyze biotopes at the genomic level and to reach functional and taxonomical conclusions. The bio-analyzes of large metagenomic projects face critical limitations: complex metagenomes cannot be assembled and…

Genomics · Quantitative Biology 2015-11-30 Maillet Nicolas , Collet Guillaume , Vanier Thomas , Lavenier Dominique , Pierre Peterlongo

Heterogeneous data pose serious challenges to data analysis tasks, including exploration and visualization. Current techniques often utilize dimensionality reductions, aggregation, or conversion to numerical values to analyze heterogeneous…

Graphics · Computer Science 2017-10-10 Mahsa Mirzargar , Ross T. Whitaker , Robert M. Kirby

Integrating heterogeneous datasets across different measurement platforms is a fundamental challenge in many scientific applications. A common example arises in deconvolution problems, such as cell type deconvolution, where one aims to…

Methodology · Statistics 2025-09-30 Dongyue Xie , Lin Gui , Jingshu Wang

Computer-Aided Diagnosis has shown stellar performance in providing accurate medical diagnoses across multiple testing modalities (medical images, electrophysiological signals, etc.). While this field has typically focused on fully…

Applications · Statistics 2020-10-21 Claire Donnat , Nina Miolane , Freddy Bunbury , Jack Kreindler

Measurement involves the determination of quantitative estimates of physical quantities from experiment, along with estimates of their associated uncertainties. Herewith an experimental system model is the key to extracting information from…

Applications · Statistics 2008-09-01 Vladimir B. Bokov

Datasets with a mixture of numerical and categorical attributes are routinely encountered in many application domains. In this work we examine an approach to clustering such datasets using homogeneity analysis. Homogeneity analysis…

Machine Learning · Statistics 2017-10-31 Rajiv Sambasivan , Sourish Das