English
Related papers

Related papers: Fusing heterogeneous data sets

200 papers

Data is a precious resource in today's society, and is generated at an unprecedented and constantly growing pace. The need to store, analyze, and make data promptly available to a multitude of users introduces formidable challenges in…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-08 Alessandro Margara , Gianpaolo Cugola , Nicolò Felicioni , Stefano Cilloni

We propose a novel framework for combining datasets via alignment of their intrinsic geometry. This alignment can be used to fuse data originating from disparate modalities, or to correct batch effects while preserving intrinsic data…

Machine Learning · Computer Science 2020-01-31 Jay S. Stanley , Scott Gigante , Guy Wolf , Smita Krishnaswamy

We present a statistical mechanics approach to the protein folding problem. We first review some of the basic properties of proteins, and introduce some physical models to describe their thermodynamics. These models rely on a random…

Disordered Systems and Neural Networks · Physics 2008-02-03 T. Garel , H. Orland , E. Pitard

Clustering mixed data presents numerous challenges inherent to the very heterogeneous nature of the variables. A clustering algorithm should be able, despite of this heterogeneity, to extract discriminant pieces of information from the…

Machine Learning · Computer Science 2022-05-10 Robin Fuchs , Denys Pommeret , Cinzia Viroli

Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This paper reviews development in causal inference methods that combines multiple datasets…

Methodology · Statistics 2021-10-05 Xu Shi , Ziyang Pan , Wang Miao

We study the problem of parameter estimation for time-series possessing two, widely separated, characteristic time scales. The aim is to understand situations where it is desirable to fit a homogenized singlescale model to such multiscale…

Statistics Theory · Mathematics 2009-11-11 G. A. Pavliotis , A. M. Stuart

This paper proposes a heterogenous density fusion approach to scalable multisensor multitarget tracking where the inter-connected sensors run different types of random finite set (RFS) filters according to their respective capacity and…

Systems and Control · Electrical Eng. & Systems 2025-02-25 Tiancheng Li , Ruibo Yan , Kai Da , Hongqi Fan

Every design choice will have different effects on different units. However traditional A/B tests are often underpowered to identify these heterogeneous effects. This is especially true when the set of unit-level attributes is…

Artificial Intelligence · Computer Science 2016-11-09 Alexander Peysakhovich , Akos Lada

Diabetes is a worldwide health issue affecting millions of people. Machine learning methods have shown promising results in improving diabetes prediction, particularly through the analysis of diverse data types, namely gene expression data.…

Machine Learning · Computer Science 2024-04-24 Rita T. Sousa , Heiko Paulheim

Background: Understanding the relationship between the Omics and the phenotype is a central problem in precision medicine. The high dimensionality of metabolomics data challenges learning algorithms in terms of scalability and…

Multimodal single-cell technologies enable the simultaneous collection of diverse data types from individual cells, enhancing our understanding of cellular states. However, the integration of these datatypes and modeling the…

Machine Learning · Computer Science 2023-11-22 Bhavya Mehta , Nirmit Deliwala , Madhav Chandane

Data depth has been applied as a nonparametric measurement for ranking multivariate samples. In this paper, we focus on homogeneity tests to assess whether two multivariate samples are from the same distribution. There are many data…

Statistics Theory · Mathematics 2023-06-09 Yiting Chen , Wei Lin , Xiaoping Shi

Current efforts in the biomedical sciences and related interdisciplinary fields are focused on gaining a molecular understanding of health and disease, which is a problem of daunting complexity that spans many orders of magnitude in…

Quantitative Methods · Quantitative Biology 2014-01-24 Julián Candia , Jayanth R. Banavar , Wolfgang Losert

We consider fits to two or more datasets for which results from the sa me experiment share a common systematic uncertainty in addition to their individ ual statistical errors. This is important in extracting the maximum information from a…

Data Analysis, Statistics and Probability · Physics 2020-09-29 Roger John Barlow

Mathematical models come in many forms across biological applications. In the case of complex, spatial dynamics and pattern formation, stochastic models also face two main challenges: pattern data is largely qualitative, and model…

Cell Behavior · Quantitative Biology 2022-12-26 Electa Cleveland , Angela Zhu , Bjorn Sandstede , Alexandria Volkening

An applied problem facing all areas of data science is harmonizing data sources. Joining data from multiple origins with unmapped and only partially overlapping features is a prerequisite to developing and testing robust, generalizable…

1. Animal movement patterns contribute to our understanding of variation in breeding success and survival of individuals, and the implications for population dynamics. 2. Over time, sensor technology for measuring movement patterns has…

Quantitative Methods · Quantitative Biology 2018-01-11 Leah R. Johnson , Philipp H. Boersch-Supan , Richard A. Phillips , Sadie J. Ryan

Fusing probabilistic information is a fundamental task in signal and data processing with relevance to many fields of technology and science. In this work, we investigate the fusion of multiple probability density functions (pdfs) of a…

Signal Processing · Electrical Eng. & Systems 2023-01-20 Günther Koliander , Yousef El-Laham , Petar M. Djurić , Franz Hlawatsch

As researchers collect increasingly large molecular data sets to reconstruct the Tree of Life, the heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. A class of phylogenetic…

Populations and Evolution · Quantitative Biology 2015-09-11 Liang Liu , Zhenxiang Xi , Shaoyuan Wu , Charles Davis , Scott V. Edwards

The exposome recognizes that individuals are exposed simultaneously to a multitude of different environmental factors and takes a holistic approach to the discovery of etiological factors for disease. However, challenges arise when trying…