Related papers: SCOPE: Scalable Composite Optimization for Learnin…

A Stochastic Large-scale Machine Learning Algorithm for Distributed Features and Observations

As the size of modern data sets exceeds the disk and memory capacities of a single computer, machine learning practitioners have resorted to parallel and distributed computing. Given that optimization is one of the pillars of machine…

Machine Learning · Statistics 2019-12-10 Biyi Fang , Diego Klabjan

Sparsity-Constraint Optimization via Splicing Iteration

Sparsity-constrained optimization underlies many problems in signal processing, statistics, and machine learning. State-of-the-art hard-thresholding (HT) algorithms rely on an appropriately selected continuous step-size parameter to ensure…

Machine Learning · Statistics 2026-05-13 Jin Zhu , Junxian Zhu , Zezhi Wang , Borui Tang , Hongmei Lin , Xueqin Wang

Proximal SCOPE for Distributed Sparse Learning: Better Data Partition Implies Faster Convergence Rate

Distributed sparse learning with a cluster of multiple machines has attracted much attention in machine learning, especially for large-scale applications with high-dimensional data. One popular way to implement sparse learning is to use…

Machine Learning · Statistics 2018-10-29 Shen-Yi Zhao , Gong-Duo Zhang , Ming-Wei Li , Wu-Jun Li

Learning Distributionally Robust Models at Scale via Composite Optimization

To train machine learning models that are robust to distribution shifts in the data, distributionally robust optimization (DRO) has been proven very effective. However, the existing approaches to learning a distributionally robust model…

Machine Learning · Computer Science 2022-03-21 Farzin Haddadpour , Mohammad Mahdi Kamani , Mehrdad Mahdavi , Amin Karbasi

SCOPE: Smooth Convex Optimization for Planned Evolution of Deformable Linear Objects

We present SCOPE, a fast and efficient framework for modeling and manipulating deformable linear objects (DLOs). Unlike conventional energy-based approaches, SCOPE leverages convex approximations to significantly reduce computational cost…

Robotics · Computer Science 2026-01-28 Ali Jnadi , Hadi Salloum , Yaroslav Kholodov , Alexander Gasnikov , Karam Almaghout

Distributed Stochastic Optimization via Adaptive SGD

Stochastic convex optimization algorithms are the most popular way to train machine learning models on large-scale data. Scaling up the training process of these models is crucial, but the most popular algorithm, Stochastic Gradient Descent…

Machine Learning · Statistics 2018-10-30 Ashok Cutkosky , Robert Busa-Fekete

Distributed Algorithms for Robust Convex Optimization via the Scenario Approach

This paper proposes distributed algorithms to solve robust convex optimization (RCO) when the constraints are affected by nonlinear uncertainty. We adopt a scenario approach by randomly sampling the uncertainty set. To facilitate the…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-01-16 Keyou You , Roberto Tempo , Pei Xie

Coordinate Descent for SLOPE

The lasso is the most famous sparse regression and feature selection method. One reason for its popularity is the speed at which the underlying optimization problem can be solved. Sorted L-One Penalized Estimation (SLOPE) is a…

Optimization and Control · Mathematics 2024-05-14 Johan Larsson , Quentin Klopfenstein , Mathurin Massias , Jonas Wallin

Stochastic Constraint Optimization using Propagation on Ordered Binary Decision Diagrams

A number of problems in relational Artificial Intelligence can be viewed as Stochastic Constraint Optimization Problems (SCOPs). These are constraint optimization problems that involve objectives or constraints with a stochastic component.…

Artificial Intelligence · Computer Science 2018-07-04 Anna L. D. Latour , Behrouz Babaki , Siegfried Nijssen

Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

Model routing chooses which language model to use for each query. By sending easy queries to cheaper models and hard queries to stronger ones, it can significantly reduce inference cost while maintaining high accuracy. However, most…

Machine Learning · Computer Science 2026-02-17 Qi Cao , Shuhao Zhang , Ruizhe Zhou , Ruiyi Zhang , Peijia Qin , Pengtao Xie

SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting

On-policy reinforcement learning has become the dominant paradigm for reasoning alignment in large language models, yet its sparse, outcome-level rewards make token-level credit assignment notoriously difficult. On-Policy Distillation (OPD)…

Machine Learning · Computer Science 2026-04-14 Binbin Zheng , Xing Ma , Yiheng Liang , Jingqing Ruan , Xiaoliang Fu , Kepeng Lin , Benchang Zhu , Ke Zeng , Xunliang Cai

SCOPE: Semantic Coreset with Orthogonal Projection Embeddings for Federated learning

Scientific discovery increasingly requires learning on federated datasets, fed by streams from high-resolution instruments, that have extreme class imbalance. Current ML approaches either require impractical data aggregation or fail due to…

Machine Learning · Computer Science 2026-03-16 Md Anwar Hossen , Nathan R. Tallent , Luanzheng Guo , Ali Jannesary

Divide and Learn: Multi-Objective Combinatorial Optimization at Scale

Multi-objective combinatorial optimization seeks Pareto-optimal solutions over exponentially large discrete spaces, yet existing methods sacrifice generality, scalability, or theoretical guarantees. We reformulate it as an online learning…

Machine Learning · Computer Science 2026-02-13 Esha Singh , Dongxia Wu , Chien-Yi Yang , Tajana Rosing , Rose Yu , Yi-An Ma

Optimization for Large-Scale Machine Learning with Distributed Features and Observations

As the size of modern data sets exceeds the disk and memory capacities of a single computer, machine learning practitioners have resorted to parallel and distributed computing. Given that optimization is one of the pillars of machine…

Machine Learning · Statistics 2017-04-18 Alexandros Nathan , Diego Klabjan

Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems

Combinatorial optimization assumes that all parameters of the optimization problem, e.g. the weights in the objective function is fixed. Often, these weights are mere estimates and increasingly machine learning techniques are used to for…

Machine Learning · Computer Science 2019-11-25 Jaynta Mandi , Emir Demirović , Peter. J Stuckey , Tias Guns

Stochastic Distributed Learning with Gradient Quantization and Variance Reduction

We consider distributed optimization where the objective function is spread among different devices, each sending incremental model updates to a central server. To alleviate the communication bottleneck, recent work proposed various schemes…

Optimization and Control · Mathematics 2019-04-11 Samuel Horváth , Dmitry Kovalev , Konstantin Mishchenko , Sebastian Stich , Peter Richtárik

Stochastic, Distributed and Federated Optimization for Machine Learning

We study optimization algorithms for the finite sum problems frequently arising in machine learning applications. First, we propose novel variants of stochastic gradient descent with a variance reduction property that enables linear…

Machine Learning · Computer Science 2017-07-06 Jakub Konečný

On Scalable and Efficient Computation of Large Scale Optimal Transport

Optimal Transport (OT) naturally arises in many machine learning applications, yet the heavy computational burden limits its wide-spread uses. To address the scalability issue, we propose an implicit generative learning-based framework…

Machine Learning · Computer Science 2019-06-26 Yujia Xie , Minshuo Chen , Haoming Jiang , Tuo Zhao , Hongyuan Zha

An efficient distributed learning algorithm based on effective local functional approximations

Scalable machine learning over big data is an important problem that is receiving a lot of attention in recent years. On popular distributed environments such as Hadoop running on a cluster of commodity machines, communication costs are…

Machine Learning · Computer Science 2015-03-18 Dhruv Mahajan , Nikunj Agrawal , S. Sathiya Keerthi , S. Sundararajan , Leon Bottou

Aligning Distributionally Robust Optimization with Practical Deep Learning Needs

While traditional Deep Learning (DL) optimization methods treat all training samples equally, Distributionally Robust Optimization (DRO) adaptively assigns importance weights to different samples. However, a significant gap exists between…

Machine Learning · Computer Science 2025-09-26 Dmitrii Feoktistov , Igor Ignashin , Andrey Veprikov , Nikita Borovko , Alexander Bogdanov , Savelii Chezhegov , Aleksandr Beznosikov