Related papers: Learning Direct Optimization for Scene Understandi…

End-To-End Optimization of LiDAR Beam Configuration for 3D Object Detection and Localization

Existing learning methods for LiDAR-based applications use 3D points scanned under a pre-determined beam configuration, e.g., the elevation angles of beams are often evenly distributed. Those fixed configurations are task-agnostic, so…

Robotics · Computer Science 2023-03-29 Niclas Vödisch , Ozan Unal , Ke Li , Luc Van Gool , Dengxin Dai

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

While likelihood-based generative models, particularly diffusion and autoregressive models, have achieved remarkable fidelity in visual generation, the maximum likelihood estimation (MLE) objective, which minimizes the forward KL…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Kaiwen Zheng , Yongxin Chen , Huayu Chen , Guande He , Ming-Yu Liu , Jun Zhu , Qinsheng Zhang

Make Optimization Once and for All with Fine-grained Guidance

Learning to Optimize (L2O) enhances optimization efficiency with integrated neural networks. L2O paradigms achieve great outcomes, e.g., refitting optimizer, generating unseen solutions iteratively or directly. However, conventional L2O…

Machine Learning · Computer Science 2025-03-17 Mingjia Shi , Ruihan Lin , Xuxi Chen , Yuhao Zhou , Zezhen Ding , Pingzhi Li , Tong Wang , Kai Wang , Zhangyang Wang , Jiheng Zhang , Tianlong Chen

Meta-Learning with Latent Embedding Optimization

Gradient-based meta-learning techniques are both widely applicable and proficient at solving challenging few-shot learning and fast adaptation problems. However, they have practical difficulties when operating on high-dimensional parameter…

Machine Learning · Computer Science 2019-03-27 Andrei A. Rusu , Dushyant Rao , Jakub Sygnowski , Oriol Vinyals , Razvan Pascanu , Simon Osindero , Raia Hadsell

Towards Robust Learning to Optimize with Theoretical Guarantees

Learning to optimize (L2O) is an emerging technique to solve mathematical optimization problems with learning-based methods. Although with great success in many real-world scenarios such as wireless communications, computer networks, and…

Machine Learning · Computer Science 2025-06-18 Qingyu Song , Wei Lin , Juncheng Wang , Hong Xu

3D Object Positioning Using Differentiable Multimodal Learning

This article describes a multi-modal method using simulated Lidar data via ray tracing and image pixel loss with differentiable rendering to optimize an object's position with respect to an observer or some referential objects in a computer…

Systems and Control · Electrical Eng. & Systems 2023-09-07 Sean Zanyk-McLean , Krishna Kumar , Paul Navratil

Memory-Efficient Learning of Stable Linear Dynamical Systems for Prediction and Control

Learning a stable Linear Dynamical System (LDS) from data involves creating models that both minimize reconstruction error and enforce stability of the learned representation. We propose a novel algorithm for learning stable LDSs. Using a…

Machine Learning · Computer Science 2020-11-19 Giorgos Mamakoukas , Orest Xherija , T. D. Murphey

Learned Vertex Descent: A New Direction for 3D Human Model Fitting

We propose a novel optimization-based paradigm for 3D human model fitting on images and scans. In contrast to existing approaches that directly regress the parameters of a low-dimensional statistical body model (e.g. SMPL) from input…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Enric Corona , Gerard Pons-Moll , Guillem Alenyà , Francesc Moreno-Noguer

Learning Constrained Optimization with Deep Augmented Lagrangian Methods

Learning to Optimize (LtO) is a problem setting in which a machine learning (ML) model is trained to emulate a constrained optimization solver. Learning to produce optimal and feasible solutions subject to complex constraints is a difficult…

Machine Learning · Computer Science 2024-03-18 James Kotary , Ferdinando Fioretto

Learning to optimize: A tutorial for continuous and mixed-integer optimization

Learning to Optimize (L2O) stands at the intersection of traditional optimization and machine learning, utilizing the capabilities of machine learning to enhance conventional optimization techniques. As real-world optimization problems…

Optimization and Control · Mathematics 2024-05-27 Xiaohan Chen , Jialin Liu , Wotao Yin

LAGO: Language-Guided Adaptive Object-Region Focus for Zero-Shot Visual-Text Alignment

Zero-shot recognition aims to classify an image by selecting the most compatible label description from a set of candidate classes without any task-specific supervision. In fine-grained settings, however, the relevant evidence often lies in…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Junyi Hu , Qiji Zhou , Lei Zhang , Yue Zhang

Learning to Optimize Quasi-Newton Methods

Fast gradient-based optimization algorithms have become increasingly essential for the computationally efficient training of machine learning models. One technique is to multiply the gradient by a preconditioner matrix to produce a step,…

Machine Learning · Computer Science 2023-09-12 Isaac Liao , Rumen R. Dangovski , Jakob N. Foerster , Marin Soljačić

Towards Realistic Scene Generation with LiDAR Diffusion Models

Diffusion models (DMs) excel in photo-realistic image synthesis, but their adaptation to LiDAR scene generation poses a substantial hurdle. This is primarily because DMs operating in the point space struggle to preserve the curve-like…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Haoxi Ran , Vitor Guizilini , Yue Wang

Learned Camera Gain and Exposure Control for Improved Visual Feature Detection and Matching

Successful visual navigation depends upon capturing images that contain sufficient useful information. In this letter, we explore a data-driven approach to account for environmental lighting changes, improving the quality of images for use…

Robotics · Computer Science 2022-07-12 Justin Tomasi , Brandon Wagstaff , Steven L. Waslander , Jonathan Kelly

Boosting Data-Driven Mirror Descent with Randomization, Equivariance, and Acceleration

Learning-to-optimize (L2O) is an emerging research area in large-scale optimization with applications in data science. Recently, researchers have proposed a novel L2O framework called learned mirror descent (LMD), based on the classical…

Optimization and Control · Mathematics 2024-05-13 Hong Ye Tan , Subhadip Mukherjee , Junqi Tang , Carola-Bibiane Schönlieb

Learning to Optimize: A Primer and A Benchmark

Learning to optimize (L2O) is an emerging approach that leverages machine learning to develop optimization methods, aiming at reducing the laborious iterations of hand engineering. It automates the design of an optimization method based on…

Optimization and Control · Mathematics 2021-07-05 Tianlong Chen , Xiaohan Chen , Wuyang Chen , Howard Heaton , Jialin Liu , Zhangyang Wang , Wotao Yin

End-to-End Diffusion Latent Optimization Improves Classifier Guidance

Classifier guidance -- using the gradients of an image classifier to steer the generations of a diffusion model -- has the potential to dramatically expand the creative control over image generation and editing. However, currently…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Bram Wallace , Akash Gokul , Stefano Ermon , Nikhil Naik

DRO: Deep Recurrent Optimizer for Video to Depth

There are increasing interests of studying the video-to-depth (V2D) problem with machine learning techniques. While earlier methods directly learn a mapping from images to depth maps and camera poses, more recent works enforce multi-view…

Computer Vision and Pattern Recognition · Computer Science 2023-03-08 Xiaodong Gu , Weihao Yuan , Zuozhuo Dai , Siyu Zhu , Chengzhou Tang , Zilong Dong , Ping Tan

SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization

Previous text-to-image diffusion models typically employ supervised fine-tuning (SFT) to enhance pre-trained base models. However, this approach primarily minimizes the loss of mean squared error (MSE) at the pixel level, neglecting the…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Liang Peng , Boxi Wu , Haoran Cheng , Yibo Zhao , Xiaofei He

Visual Data Augmentation through Learning

The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several…

Computer Vision and Pattern Recognition · Computer Science 2018-01-23 Grigorios G. Chrysos , Yannis Panagakis , Stefanos Zafeiriou