Related papers: Scalable Optimal Margin Distribution Machine

Optimal Margin Distribution Machine

Support vector machine (SVM) has been one of the most popular learning algorithms, with the central idea of maximizing the minimum margin, i.e., the smallest distance from the instances to the classification boundary. Recent theoretical…

Machine Learning · Computer Science 2020-07-07 Teng Zhang , Zhi-Hua Zhou

Large Margin Distribution Machine

Support vector machine (SVM) has been one of the most popular learning algorithms, with the central idea of maximizing the minimum margin, i.e., the smallest distance from the instances to the classification boundary. Recent theoretical…

Machine Learning · Computer Science 2014-05-26 Teng Zhang , Zhi-Hua Zhou

Less but Better: Generalization Enhancement of Ordinal Embedding via Distributional Margin

In the absence of prior knowledge, ordinal embedding methods obtain new representation for items in a low-dimensional Euclidean space via a set of quadruple-wise comparisons. These ordinal comparisons often come from human annotators, and…

Machine Learning · Computer Science 2018-12-06 Ke Ma , Qianqian Xu , Zhiyong Yang , Xiaochun Cao

On the Optimization of Margin Distribution

Margin has played an important role on the design and analysis of learning algorithms during the past years, mostly working with the maximization of the minimum margin. Recent years have witnessed the increasing empirical studies on the…

Machine Learning · Computer Science 2022-06-01 Meng-Zhang Qian , Zheng Ai , Teng Zhang , Wei Gao

OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning

Large-scale deep learning models contribute to significant performance improvements on varieties of downstream tasks. Current data and model parallelism approaches utilize model replication and partition techniques to support the…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-19 Youhe Jiang , Fangcheng Fu , Xupeng Miao , Xiaonan Nie , Bin Cui

OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning

Large-scale deep learning models contribute to significant performance improvements on varieties of downstream tasks. Current data and model parallelism approaches utilize model replication and partition techniques to support the…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-22 Youhe Jiang , Fangcheng Fu , Xupeng Miao , Xiaonan Nie , Bin Cui

Order Optimal One-Shot Distributed Learning

We consider distributed statistical optimization in one-shot setting, where there are $m$ machines each observing $n$ i.i.d. samples. Based on its observed samples, each machine then sends an $O(\log(mn))$-length message to a server, at…

Machine Learning · Computer Science 2019-11-12 Arsalan Sharifnassab , Saber Salehkaleybar , S. Jamaloddin Golestani

Learning Imbalanced Datasets with Maximum Margin Loss

A learning algorithm referred to as Maximum Margin (MM) is proposed for considering the class-imbalance data learning issue: the trained model tends to predict the majority of classes rather than the minority ones. That is, underfitting for…

Machine Learning · Computer Science 2023-03-30 Haeyong Kang , Thang Vu , Chang D. Yoo

Distribution Regression with Sliced Wasserstein Kernels

The problem of learning functions over spaces of probabilities - or distribution regression - is gaining significant interest in the machine learning community. A key challenge behind this problem is to identify a suitable representation…

Machine Learning · Statistics 2022-06-20 Dimitri Meunier , Massimiliano Pontil , Carlo Ciliberto

Observation-Guided Diffusion Probabilistic Models

We propose a novel diffusion-based image generation method called the observation-guided diffusion probabilistic model (OGDM), which effectively addresses the tradeoff between quality control and fast sampling. Our approach reestablishes…

Machine Learning · Computer Science 2024-04-02 Junoh Kang , Jinyoung Choi , Sungik Choi , Bohyung Han

Unifying Masked Diffusion Models with Various Generation Orders and Beyond

Masked diffusion models (MDMs) are a potential alternative to autoregressive models (ARMs) for language generation, but generation quality depends critically on the generation order. Prior work either hard-codes an ordering (e.g., blockwise…

Machine Learning · Computer Science 2026-05-22 Chunsan Hong , Sanghyun Lee , Jong Chul Ye

Distributed Stochastic Optimization via Adaptive SGD

Stochastic convex optimization algorithms are the most popular way to train machine learning models on large-scale data. Scaling up the training process of these models is crucial, but the most popular algorithm, Stochastic Gradient Descent…

Machine Learning · Statistics 2018-10-30 Ashok Cutkosky , Robert Busa-Fekete

Enhancing Distributional Stability among Sub-populations

Enhancing the stability of machine learning algorithms under distributional shifts is at the heart of the Out-of-Distribution (OOD) Generalization problem. Derived from causal learning, recent works of invariant learning pursue strict…

Machine Learning · Computer Science 2024-02-15 Jiashuo Liu , Jiayun Wu , Jie Peng , Xiaoyu Wu , Yang Zheng , Bo Li , Peng Cui

Meta-Learned Invariant Risk Minimization

Empirical Risk Minimization (ERM) based machine learning algorithms have suffered from weak generalization performance on data obtained from out-of-distribution (OOD). To address this problem, Invariant Risk Minimization (IRM) objective was…

Machine Learning · Computer Science 2021-03-25 Jun-Hyun Bae , Inchul Choi , Minho Lee

Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD Generalization

Out-of-Distribution (OOD) generalization in machine learning is a burgeoning area of study. Its primary goal is to enhance the adaptability and resilience of machine learning models when faced with new, unseen, and potentially adversarial…

Machine Learning · Computer Science 2024-11-05 Chengtao Jian , Kai Yang , Yang Jiao

Towards Principled Unsupervised Learning

General unsupervised learning is a long-standing conceptual problem in machine learning. Supervised learning is successful because it can be solved by the minimization of the training error cost function. Unsupervised learning is not as…

Machine Learning · Computer Science 2015-12-04 Ilya Sutskever , Rafal Jozefowicz , Karol Gregor , Danilo Rezende , Tim Lillicrap , Oriol Vinyals

Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity

Distributed optimization advances centralized machine learning methods by enabling parallel and decentralized learning processes over a network of computing nodes. This work provides an accelerated consensus-based distributed algorithm for…

Systems and Control · Electrical Eng. & Systems 2025-07-01 Mohammadreza Doostmohammadian , Hamid R. Rabiee

Generative Marginalization Models

We introduce marginalization models (MAMs), a new family of generative models for high-dimensional discrete data. They offer scalable and flexible generative modeling by explicitly modeling all induced marginal distributions.…

Machine Learning · Computer Science 2024-10-08 Sulin Liu , Peter J. Ramadge , Ryan P. Adams

The peculiar statistical mechanics of Optimal Learning Machines

Optimal Learning Machines (OLM) are systems that extract maximally informative representation of the environment they are in contact with, or of the data they are presented. It has recently been suggested that these systems are…

Data Analysis, Statistics and Probability · Physics 2020-01-29 Matteo Marsili

Optimized Dimensionality Reduction for Moment-based Distributionally Robust Optimization

Moment-based distributionally robust optimization (DRO) provides an optimization framework to integrate statistical information with traditional optimization approaches. Under this framework, one assumes that the underlying joint…

Optimization and Control · Mathematics 2023-11-01 Shiyi Jiang , Jianqiang Cheng , Kai Pan , Zuo-Jun Max Shen