Related papers: Sampling Decisions

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

Discrete diffusion models have emerged as powerful tools for high-quality data generation. Despite their success in discrete spaces, such as text generation tasks, the acceleration of discrete diffusion models remains under-explored. In…

Machine Learning · Computer Science 2024-12-09 Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu

Sufficient Markov Decision Processes with Alternating Deep Neural Networks

Advances in mobile computing technologies have made it possible to monitor and apply data-driven interventions across complex systems in real time. Markov decision processes (MDPs) are the primary model for sequential decision problems with…

Methodology · Statistics 2018-03-20 Longshaokan Wang, Eric B. Laber, Katie Witkiewitz

Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization

We tackle the problem of sampling from intractable high-dimensional density functions, a fundamental task that often appears in machine learning and statistics. We extend recent sampling-based approaches that leverage controlled stochastic…

Machine Learning · Computer Science 2024-03-12 Dinghuai Zhang, Ricky T. Q. Chen, Cheng-Hao Liu, Aaron Courville, Yoshua Bengio

Efficiently Solving Discounted MDPs with Predictions on Transition Matrices

We study infinite-horizon Discounted Markov Decision Processes (DMDPs) under a generative model. Motivated by the Algorithm with Advice framework (Mitzenmacher and Vassilvitskii, 2022), we propose a novel framework to investigate how a…

Machine Learning · Computer Science 2025-02-24 Lixing Lyu, Jiashuo Jiang, Wang Chi Cheung

Active Sampling for the Quickest Detection of Markov Networks

Consider $n$ random variables forming a Markov random field (MRF). The true model of the MRF is unknown, and it is assumed to belong to a binary set. The objective is to sequentially sample the random variables (one-at-a-time) such that the…

Methodology · Statistics 2020-08-04 Javad Heydari, Ali Tajer, H. Vincent Poor

Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling

Conventional diffusion models typically rely on a fixed forward process, which implicitly defines complex marginal distributions over latent variables. This can often complicate the reverse process' task in learning generative…

Machine Learning · Statistics 2025-06-10 Grigory Bartosh, Dmitry Vetrov, Christian A. Naesseth

Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning

Diffusion models generate samples through an iterative denoising process, guided by a neural network. While training the denoiser on real-world data is computationally demanding, the sampling procedure itself is more flexible. This…

Machine Learning · Computer Science 2026-02-10 Constant Bourdrez, Alexandre Vérine, Olivier Cappé

Strategy Synthesis in Markov Decision Processes Under Limited Sampling Access

A central task in control theory, artificial intelligence, and formal methods is to synthesize reward-maximizing strategies for agents that operate in partially unknown environments. In environments modeled by gray-box Markov decision…

Machine Learning · Computer Science 2023-04-25 Christel Baier, Clemens Dubslaff, Patrick Wienhöft, Stefan J. Kiebel

Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows

Continuous normalizing flows (CNFs) learn the probability path between a reference distribution and a target distribution by modeling the vector field generating said path using neural networks. Recently, Lipman et al. (2022) introduced a…

Methodology · Statistics 2024-10-29 Alberto Cabezas, Louis Sharrock, Christopher Nemeth

Reconfigurable Digital Channelizer Design Using Factored Markov Decision Processes

In this work, a novel digital channelizer design is developed through the use of a compact, system-level modeling approach. The model efficiently captures key properties of a digital channelizer system and its time-varying operation. The…

Signal Processing · Electrical Eng. & Systems 2017-12-25 A. Sapio, L. Li, J. Wu, M. Wolf, S. S. Bhattacharyya

Finite-Horizon Markov Decision Processes with Sequentially-Observed Transitions

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (or minimize…

Optimization and Control · Mathematics 2015-07-07 Mahmoud El Chamie, Behcet Acikmese

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning,…

Robotics · Computer Science 2012-02-27 Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs). In our study, we adopt a distributed multi-agent framework where…

Systems and Control · Electrical Eng. & Systems 2024-06-14 Donghwan Lee, Han-Dong Lim, Do Wan Kim

Drift Flow Matching

Iterative generative models such as Flow Matching and Diffusion models have demonstrated strong test-time scaling behavior, where additional inference computation can improve generation quality. In contrast, Drift Models offer efficient…

Machine Learning · Computer Science 2026-05-19 Chenrui Ma, Xi Xiao, Lin Zhao, Tianyang Wang, Ferdinando Fioretto, Yanning Shen

Generative Flow Networks: a Markov Chain Perspective

While Markov chain Monte Carlo methods (MCMC) provide a general framework to sample from a probability distribution defined up to normalization, they often suffer from slow convergence to the target distribution when the latter is highly…

Machine Learning · Computer Science 2023-07-06 Tristan Deleu, Yoshua Bengio

Expert-Guided Symmetry Detection in Markov Decision Processes

Learning a Markov Decision Process (MDP) from a fixed batch of trajectories is a non-trivial task whose outcome's quality depends on both the amount and the diversity of the sampled regions of the state-action space. Yet, many MDPs are…

Machine Learning · Computer Science 2022-03-08 Giorgio Angelotti, Nicolas Drougard, Caroline P. C. Chanel

Near-Optimal Time and Sample Complexities for Solving Discounted Markov Decision Process with a Generative Model

In this paper we consider the problem of computing an $\epsilon$-optimal policy of a discounted Markov Decision Process (DMDP) provided we can only access its transition function through a generative sampling model that given any…

Optimization and Control · Mathematics 2019-06-07 Aaron Sidford, Mengdi Wang, Xian Wu, Lin F. Yang, Yinyu Ye

Decision Flow Policy Optimization

In recent years, generative models have shown remarkable capabilities across diverse fields, including images, videos, language, and decision-making. By applying powerful generative models such as flow-based models to reinforcement…

Machine Learning · Computer Science 2025-05-28 Jifeng Hu, Sili Huang, Siyuan Guo, Zhaogeng Liu, Li Shen, Lichao Sun, Hechang Chen, Yi Chang, Dacheng Tao

Discrete Probabilistic Inference as Control in Multi-path Environments

We consider the problem of sampling from a discrete and structured distribution as a sequential decision problem, where the objective is to find a stochastic policy such that objects are sampled at the end of this sequential process…

Machine Learning · Computer Science 2024-05-29 Tristan Deleu, Padideh Nouri, Nikolay Malkin, Doina Precup, Yoshua Bengio

Optimal strategies in Markov decision processes with finitely additive evaluations

We study infinite-horizon Markov decision processes (MDPs) where the decision maker evaluates each of her strategies by aggregating the infinite stream of expected stage-rewards. The crucial feature of our approach is that the aggregation…

Optimization and Control · Mathematics 2026-03-05 János Flesch, Arkadi Predtetchinski, William D Sudderth, Xavier Venel