Related papers: Cross-Problem Solving for Network Optimization: Is…

Linear Regression with Distributed Learning: A Generalization Error Perspective

Distributed learning provides an attractive framework for scaling the learning task by sharing the computational load over multiple nodes in a network. Here, we investigate the performance of distributed learning for large-scale linear…

Machine Learning · Statistics 2021-11-03 Martin Hellkvist , Ayça Özçelikkale , Anders Ahlén

Prime-Aware Adaptive Distillation

Knowledge distillation(KD) aims to improve the performance of a student network by mimicing the knowledge from a powerful teacher network. Existing methods focus on studying what knowledge should be transferred and treat all samples equally…

Computer Vision and Pattern Recognition · Computer Science 2020-08-05 Youcai Zhang , Zhonghao Lan , Yuchen Dai , Fangao Zeng , Yan Bai , Jie Chang , Yichen Wei

Topology-aware Robust Optimization for Out-of-distribution Generalization

Out-of-distribution (OOD) generalization is a challenging machine learning problem yet highly desirable in many high-stake applications. Existing methods suffer from overly pessimistic modeling with low generalization confidence. As…

Machine Learning · Computer Science 2024-06-10 Fengchun Qiao , Xi Peng

Bandwidth-Aware Network Topology Optimization for Decentralized Learning

Network topology is critical for efficient parameter synchronization in distributed learning over networks. However, most existing studies do not account for bandwidth limitations in network topology design. In this paper, we propose a…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-09 Yipeng Shen , Zehan Zhu , Yan Huang , Changzhi Yan , Cheng Zhuo , Jinming Xu

Network-Aware Optimization of Distributed Learning for Fog Computing

Fog computing promises to enable machine learning tasks to scale to large amounts of data by distributing processing across connected devices. Two key challenges to achieving this goal are heterogeneity in devices compute resources and…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-04-23 Su Wang , Yichen Ruan , Yuwei Tu , Satyavrat Wagle , Christopher G. Brinton , Carlee Joe-Wong

Don't drop your samples! Coherence-aware training benefits Conditional diffusion

Conditional diffusion models are powerful generative models that can leverage various types of conditional information, such as class labels, segmentation masks, or text captions. However, in many real-world scenarios, conditional…

Computer Vision and Pattern Recognition · Computer Science 2025-02-19 Nicolas Dufour , Victor Besnier , Vicky Kalogeiton , David Picard

Beyond Sharpness: A Flatness Decomposition Framework for Efficient Continual Learning

Continual Learning (CL) aims to enable models to sequentially learn multiple tasks without forgetting previous knowledge. Recent studies have shown that optimizing towards flatter loss minima can improve model generalization. However,…

Machine Learning · Computer Science 2026-01-13 Yanan Chen , Tieliang Gong , Yunjiao Zhang , Wen Wen

Distributed Learning in the Presence of Disturbances

We consider a problem where multiple agents must learn an action profile that maximises the sum of their utilities in a distributed manner. The agents are assumed to have no knowledge of either the utility functions or the actions and…

Systems and Control · Computer Science 2016-03-31 Chithrupa Ramesh , Marius Schmitt , John Lygeros

FEED: Fairness-Enhanced Meta-Learning for Domain Generalization

Generalizing to out-of-distribution data while being aware of model fairness is a significant and challenging problem in meta-learning. The goal of this problem is to find a set of fairness-aware invariant parameters of classifier that is…

Machine Learning · Computer Science 2024-11-05 Kai Jiang , Chen Zhao , Haoliang Wang , Feng Chen

Scalable and Order-robust Continual Learning with Additive Parameter Decomposition

While recent continual learning methods largely alleviate the catastrophic problem on toy-sized datasets, some issues remain to be tackled to apply them to real-world problem domains. First, a continual learning model should effectively…

Machine Learning · Computer Science 2020-02-18 Jaehong Yoon , Saehoon Kim , Eunho Yang , Sung Ju Hwang

CARD: Classification and Regression Diffusion Models

Learning the distribution of a continuous or categorical response variable $\boldsymbol y$ given its covariates $\boldsymbol x$ is a fundamental problem in statistics and machine learning. Deep neural network-based supervised learning…

Machine Learning · Statistics 2022-12-07 Xizewen Han , Huangjie Zheng , Mingyuan Zhou

Topology-Aware Knowledge Propagation in Decentralized Learning

Decentralized learning enables collaborative training of models across naturally distributed data without centralized coordination or maintenance of a global model. Instead, devices are organized in arbitrary communication topologies, in…

Machine Learning · Computer Science 2025-05-20 Mansi Sakarvadia , Nathaniel Hudson , Tian Li , Ian Foster , Kyle Chard

Modular Universal Reparameterization: Deep Multi-task Learning Across Diverse Domains

As deep learning applications continue to become more diverse, an interesting question arises: Can general problem solving arise from jointly learning several such diverse tasks? To approach this question, deep multi-task learning is…

Machine Learning · Computer Science 2019-10-29 Elliot Meyerson , Risto Miikkulainen

Alignment-Aware Decoding

Alignment of large language models remains a central challenge in natural language processing. Preference optimization has emerged as a popular and effective method for improving alignment, typically through training-time or prompt-based…

Machine Learning · Computer Science 2025-10-01 Frédéric Berdoz , Luca A. Lanzendörfer , René Caky , Roger Wattenhofer

Prediction with Action: Visual Policy Learning via Joint Denoising Process

Diffusion models have demonstrated remarkable capabilities in image generation tasks, including image editing and video creation, representing a good understanding of the physical world. On the other line, diffusion models have also shown…

Robotics · Computer Science 2024-11-28 Yanjiang Guo , Yucheng Hu , Jianke Zhang , Yen-Jen Wang , Xiaoyu Chen , Chaochao Lu , Jianyu Chen

Prediction-Centric Learning of Independent Cascade Dynamics from Partial Observations

Spreading processes play an increasingly important role in modeling for diffusion networks, information propagation, marketing and opinion setting. We address the problem of learning of a spreading model such that the predictions generated…

Social and Information Networks · Computer Science 2021-07-27 Mateusz Wilinski , Andrey Y. Lokhov

Learning of networked spreading models from noisy and incomplete data

Recent years have seen a lot of progress in algorithms for learning parameters of spreading dynamics from both full and partial data. Some of the remaining challenges include model selection under the scenarios of unknown network structure,…

Social and Information Networks · Computer Science 2024-01-02 Mateusz Wilinski , Andrey Y. Lokhov

Generating Progressive Images from Pathological Transitions via Diffusion Model

Deep learning is widely applied in computer-aided pathological diagnosis, which alleviates the pathologist workload and provide timely clinical analysis. However, most models generally require large-scale annotated data for training, which…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Zeyu Liu , Tianyi Zhang , Yufang He , Yunlu Feng , Yu Zhao , Guanglei Zhang

Model-Aware Regularization For Learning Approaches To Inverse Problems

There are various inverse problems -- including reconstruction problems arising in medical imaging -- where one is often aware of the forward operator that maps variables of interest to the observations. It is therefore natural to ask…

Image and Video Processing · Electrical Eng. & Systems 2020-06-23 Jaweria Amjad , Zhaoyan Lyu , Miguel R. D. Rodrigues

End to end learning and optimization on graphs

Real-world applications often combine learning and optimization problems on graphs. For instance, our objective may be to cluster the graph in order to detect meaningful communities (or solve other common graph optimization problems such as…

Machine Learning · Computer Science 2020-01-09 Bryan Wilder , Eric Ewing , Bistra Dilkina , Milind Tambe