English
Related papers

Related papers: A Data Analytics Framework for Aggregate Data Anal…

200 papers

The development of deep learning techniques is a leading field applied to cases in which medical data is used, particularly in cases of image diagnosis. This type of data has privacy and legal restrictions that in many cases prevent it from…

Machine Learning · Computer Science 2025-01-28 Judith Sáinz-Pardo Díaz , Álvaro López García

The integration of data from multiple sources is increasingly used to achieve larger sample sizes and enhance population diversity. Our previous work established that, under random sampling from the same underlying population, integrating…

Methodology · Statistics 2026-01-01 Farimah Shamsi , Andriy Derkach

Databases in domains such as healthcare are routinely released to the public in aggregated form. Unfortunately, naive modeling with aggregated data may significantly diminish the accuracy of inferences at the individual level. This paper…

Machine Learning · Statistics 2016-05-17 Avradeep Bhowmik , Joydeep Ghosh , Oluwasanmi Koyejo

In video surveillance applications, person search is a challenging task consisting in detecting people and extracting features from their silhouette for re-identification (re-ID) purpose. We propose a new end-to-end model that jointly…

Computer Vision and Pattern Recognition · Computer Science 2022-01-25 Angelique Loesch , Jaonary Rabarisoa , Romaric Audigier

As modern data pipelines continue to collect, produce, and store a variety of data formats, extracting and combining value from traditional and context-rich sources such as strings, text, video, audio, and logs becomes a manual process…

Databases · Computer Science 2023-12-05 Viktor Sanca , Anastasia Ailamaki

Ensembles of artificial neural networks show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. An…

Artificial Intelligence · Computer Science 2007-05-23 P. M. Granitto , P. F. Verdes , H. A. Ceccatto

Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective,…

Machine Learning · Computer Science 2020-04-08 Emmanouil Antonios Platanios , Maruan Al-Shedivat , Eric Xing , Tom Mitchell

Fine-Grained Change Detection and Regression Analysis are essential in many applications of ArtificialIntelligence. In practice, this task is often challenging owing to the lack of reliable ground truth information andcomplexity arising…

Machine Learning · Computer Science 2022-08-12 Niall O' Mahony , Sean Campbell , Lenka Krpalkova , Joseph Walsh , Daniel Riordan

Modern approach to artificial intelligence (AI) aims to design algorithms that learn directly from data. This approach has achieved impressive results and has contributed significantly to the progress of AI, particularly in the sphere of…

Machine Learning · Computer Science 2024-03-20 Alhassan Mumuni , Fuseini Mumuni

Precision medicine tailored to individual patients has gained significant attention in recent times. Machine learning techniques are now employed to process personalized data from various sources, including images, genetics, and…

Machine Learning · Computer Science 2023-11-27 Jie Lian , Xufang Luo , Caihua Shan , Dongqi Han , Varut Vardhanabhuti , Dongsheng Li

Large-scale data analysis poses both statistical and computational problems which need to be addressed simultaneously. A solution is often straightforward if the data are homogeneous: one can use classical ideas of subsampling and mean…

Methodology · Statistics 2014-09-10 Peter Bühlmann , Nicolai Meinshausen

As edge devices become increasingly powerful, data analytics are gradually moving from a centralized to a decentralized regime where edge compute resources are exploited to process more of the data locally. This regime of analytics is…

Applications · Statistics 2023-07-04 Xubo Yue , Raed Al Kontar , Ana María Estrada Gómez

Collecting fine-grained labels usually requires expert-level domain knowledge and is prohibitive to scale up. In this paper, we propose Attribute Mix, a data augmentation strategy at attribute level to expand the fine-grained samples. The…

Computer Vision and Pattern Recognition · Computer Science 2020-07-10 Hao Li , Xiaopeng Zhang , Hongkai Xiong , Qi Tian

We propose a clustering-based iterative algorithm to solve certain optimization problems in machine learning, where we start the algorithm by aggregating the original data, solving the problem on aggregated data, and then in subsequent…

Machine Learning · Statistics 2017-01-23 Young Woong Park , Diego Klabjan

The ability of the foundation models heavily relies on large-scale, diverse, and high-quality pretraining data. In order to improve data quality, researchers and practitioners often have to manually curate datasets from difference sources…

Machine Learning · Computer Science 2024-04-24 Yiding Sun , Feng Wang , Yutao Zhu , Wayne Xin Zhao , Jiaxin Mao

Often both Aggregate Data (AD) studies and Individual Patient Data (IPD) studies are available for specific treatments. Combining these two sources of data could improve the overall meta-analytic estimates of treatment effects. Moreover,…

Methodology · Statistics 2021-11-15 Neha Agarwala , Junyong Park , Anindya Roy

A key factor in developing high performing machine learning models is the availability of sufficiently large datasets. This work is motivated by applications arising in Software as a Service (SaaS) companies where there exist numerous…

Machine Learning · Computer Science 2018-12-05 Sophia Collet , Robert Dadashi , Zahi N. Karam , Chang Liu , Parinaz Sobhani , Yevgeniy Vahlis , Ji Chao Zhang

We propose a probabilistic model for refining coarse-grained spatial data by utilizing auxiliary spatial data sets. Existing methods require that the spatial granularities of the auxiliary data sets are the same as the desired granularity…

Machine Learning · Statistics 2019-07-19 Yusuke Tanaka , Tomoharu Iwata , Toshiyuki Tanaka , Takeshi Kurashima , Maya Okawa , Hiroyuki Toda

Access to individual-level health data is essential for gaining new insights and advancing science. In particular, modern methods based on artificial intelligence rely on the availability of and access to large datasets. In the health…

Entity alignment has always had significant uses within a multitude of diverse scientific fields. In particular, the concept of matching entities across networks has grown in significance in the world of social science as communicative…

Social and Information Networks · Computer Science 2020-04-21 James Flamino , Christopher Abriola , Ben Zimmerman , Zhongheng Li , Joel Douglas
‹ Prev 1 2 3 10 Next ›