English
Related papers

Related papers: Data collaboration analysis for distributed datase…

200 papers

This paper proposes an interpretable non-model sharing collaborative data analysis method as one of the federated learning systems, which is an emerging technology to analyze distributed data. Analyzing distributed data is essential in many…

Machine Learning · Computer Science 2020-11-10 Akira Imakura , Hiroaki Inaba , Yukihiko Okada , Tetsuya Sakurai

Distributed data analysis without revealing the individual data has recently attracted significant attention in several applications. A collaborative data analysis through sharing dimensionality reduced representations of data has been…

Machine Learning · Computer Science 2021-01-28 Akira Imakura , Anna Bogdanova , Takaya Yamazoe , Kazumasa Omote , Tetsuya Sakurai

In order to provide high-quality recommendations for users, it is desirable to share and integrate multiple datasets held by different parties. However, when sharing such distributed datasets, we need to protect personal and confidential…

Information Retrieval · Computer Science 2024-06-05 Tomoya Yanagi , Shunnosuke Ikeda , Noriyoshi Sukegawa , Yuichi Takano

Multi-source data fusion, in which multiple data sources are jointly analyzed to obtain improved information, has considerable research attention. For the datasets of multiple medical institutions, data confidentiality and…

Machine Learning · Computer Science 2022-09-01 Akira Imakura , Tetsuya Sakurai , Yukihiko Okada , Tomoya Fujii , Teppei Sakamoto , Hiroyuki Abe

Large amount of data is often required to train and deploy useful machine learning models in industry. Smaller enterprises do not have the luxury of accessing enough data for machine learning, For privacy sensitive fields such as banking,…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-09-05 Felix Ongati , Eng. Lawrence Muchemi

Machine learning algorithms can perform well when trained on large datasets. While large organisations often have considerable data assets, it can be difficult for these assets to be unified in a manner that makes training possible. Data is…

Machine Learning · Computer Science 2022-03-25 Tiffany Tuor , Joshua Lockhart , Daniele Magazzeni

Federated Learning is a distributed machine learning approach that enables geographically distributed data silos to collaboratively learn a joint machine learning model without sharing data. Most of the existing work operates on…

Machine Learning · Computer Science 2023-05-17 Dimitris Stripelis , Jose Luis Ambite

The demand for artificial intelligence has grown significantly over the last decade and this growth has been fueled by advances in machine learning techniques and the ability to leverage hardware acceleration. However, in order to increase…

Machine Learning · Computer Science 2022-11-28 Joost Verbraeken , Matthijs Wolting , Jonathan Katzy , Jeroen Kloppenburg , Tim Verbelen , Jan S. Rellermeyer

Given the time and expense associated with bringing a drug to market, numerous studies have been conducted to predict the properties of compounds based on their structure using machine learning. Federated learning has been applied to…

Machine Learning · Computer Science 2023-08-02 Akihiro Mizoguchi , Anna Bogdanova , Akira Imakura , Tetsuya Sakurai

As the complexity of our neural network models grow, so too do the data and computation requirements for successful training. One proposed solution to this problem is training on a distributed network of computational devices, thus…

Machine Learning · Computer Science 2020-05-22 Kyle Crandall , Dustin Webb

In recent years, the growing need to leverage sensitive data across institutions has led to increased attention on federated learning (FL), a decentralized machine learning paradigm that enables model training without sharing raw data.…

In this paper we propose a new approach for Big Data mining and analysis. This new approach works well on distributed datasets and deals with data clustering task of the analysis. The approach consists of two main phases, the first phase…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-03-05 Malika Bendechache , Nhien-An Le-Khac , M-Tahar Kechadi

Dataset Condensation aims to condense a large dataset into a smaller one while maintaining its ability to train a well-performing model, thus reducing the storage cost and training effort in deep learning applications. However, conventional…

Machine Learning · Computer Science 2023-07-20 Ganlong Zhao , Guanbin Li , Yipeng Qin , Yizhou Yu

With the growth of machine learning techniques, privacy of data of users has become a major concern. Most of the machine learning algorithms rely heavily on large amount of data which may be collected from various sources. Collecting these…

Machine Learning · Computer Science 2023-11-17 Mahfuzur Rahman Chowdhury , Muhammad Ibrahim

Valuable training data is often owned by independent organizations and located in multiple data centers. Most deep learning approaches require to centralize the multi-datacenter data for performance purpose. In practice, however, it is…

Machine Learning · Computer Science 2018-10-17 Kele Xu , Haibo Mi , Dawei Feng , Huaimin Wang , Chuan Chen , Zibin Zheng , Xu Lan

The analysis of data stored in multiple sites has become more popular, raising new concerns about the security of data storage and communication. Federated learning, which does not require centralizing data, is a common approach to…

Machine Learning · Statistics 2026-02-10 Z. F. Wang , X. Y. Zhang , Y-c I. Chang

We propose a distributed method for simultaneous inference for datasets with sample size much larger than the number of covariates, i.e., N >> p, in the generalized linear models framework. When such datasets are too big to be analyzed…

Methodology · Statistics 2020-07-23 Lu Tang , Ling Zhou , Peter X. -K. Song

Distributed machine learning generally aims at training a global model based on distributed data without collecting all the data to a centralized location, where two different approaches have been proposed: collecting and aggregating local…

Machine Learning · Computer Science 2020-07-08 Hanlin Lu , Changchang Liu , Ting He , Shiqiang Wang , Kevin S. Chan

Dimensionality Reduction is a commonly used element in a machine learning pipeline that helps to extract important features from high-dimensional data. In this work, we explore an alternative federated learning system that enables…

Machine Learning · Computer Science 2020-11-16 Anna Bogdanova , Akie Nakai , Yukihiko Okada , Akira Imakura , Tetsuya Sakurai

This paper proposes a data privacy protection framework based on federated learning, which aims to realize effective cross-domain data collaboration under the premise of ensuring data privacy through distributed learning. Federated learning…

Machine Learning · Computer Science 2025-04-02 Yiwei Zhang , Jie Liu , Jiawei Wang , Lu Dai , Fan Guo , Guohui Cai
‹ Prev 1 2 3 10 Next ›