Related papers: Some Experiments with Real-Time Decision Algorithm…

An Anytime Algorithm for Decision Making under Uncertainty

We present an anytime algorithm which computes policies for decision problems represented as multi-stage influence diagrams. Our algorithm constructs policies incrementally, starting from a policy which makes no use of the available…

Artificial Intelligence · Computer Science 2013-02-01 Michael C. Horsch , David L. Poole

Solving Limited Memory Influence Diagrams

We present a new algorithm for exactly solving decision making problems represented as influence diagrams. We do not require the usual assumptions of no forgetting and regularity; this allows us to solve problems with simultaneous decisions…

Artificial Intelligence · Computer Science 2015-03-19 Denis Deratani Mauá , Cassio Polpo de Campos , Marco Zaffalon

Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives

We present on-line policy gradient algorithms for computing the locally optimal policy of a constrained, average cost, finite state Markov Decision Process. The stochastic approximation algorithms require estimation of the gradient of the…

Optimization and Control · Mathematics 2018-12-18 Vikram Krishnamurthy , Felisa Vazquez Abad

Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options

Deliberating on large or continuous state spaces have been long standing challenges in reinforcement learning. Temporal Abstraction have somewhat made this possible, but efficiently planing using temporal abstraction still remains an issue.…

Artificial Intelligence · Computer Science 2017-03-21 Peeyush Kumar , Doina Precup

Active Inference for Stochastic Control

Active inference has emerged as an alternative approach to control problems given its intuitive (probabilistic) formalism. However, despite its theoretical utility, computational implementations have largely been restricted to…

Machine Learning · Computer Science 2022-03-01 Aswin Paul , Noor Sajid , Manoj Gopalkrishnan , Adeel Razi

Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems

With the rapid growth of Internet services, recommendation systems play a central role in delivering personalized content. Faced with massive user requests and complex model architectures, the key challenge for real-time recommendation…

Information Retrieval · Computer Science 2025-08-14 Junli Shao , Jing Dong , Dingzhou Wang , Kowei Shih , Dannier Li , Chengrui Zhou

A PAC Approach to Application-Specific Algorithm Selection

The best algorithm for a computational problem generally depends on the "relevant inputs," a concept that depends on the application domain and often defies formal articulation. While there is a large literature on empirical approaches to…

Machine Learning · Computer Science 2016-09-06 Rishi Gupta , Tim Roughgarden

Evaluating influence diagrams with decision circuits

Although a number of related algorithms have been developed to evaluate influence diagrams, exploiting the conditional independence in the diagram, the exact solution has remained intractable for many important problems. In this paper we…

Artificial Intelligence · Computer Science 2012-06-26 Debarun Bhattacharjya , Ross D. Shachter

Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach

Suppose an online platform wants to compare a treatment and control policy, e.g., two different matching algorithms in a ridesharing system, or two different inventory management algorithms in an online retail site. Standard randomized…

Methodology · Statistics 2022-12-27 Peter Glynn , Ramesh Johari , Mohammad Rasouli

Temporal-Difference estimation of dynamic discrete choice models

We study the use of Temporal-Difference learning for estimating the structural parameters in dynamic discrete choice models. Our algorithms are based on the conditional choice probability approach but use functional approximations to…

Econometrics · Economics 2022-12-23 Karun Adusumilli , Dita Eckardt

Intelligent Probabilistic Inference

The analysis of practical probabilistic models on the computer demands a convenient representation for the available knowledge and an efficient algorithm to perform inference. An appealing representation is the influence diagram, a network…

Artificial Intelligence · Computer Science 2013-04-15 Ross D. Shachter

Anytime Inference in Valuation Algebras

Anytime inference is inference performed incrementally, with the accuracy of the inference being controlled by a tunable parameter, usually time. Such anytime inference algorithms are also usually interruptible, gradually converging to the…

Artificial Intelligence · Computer Science 2016-05-16 Abhishek Dasgupta , Samson Abramsky

Reinforcement Learning with Random Time Horizons

We extend the standard reinforcement learning framework to random time horizons. While the classical setting typically assumes finite and deterministic or infinite runtimes of trajectories, we argue that multiple real-world applications…

Machine Learning · Computer Science 2025-08-15 Enric Ribera Borrell , Lorenz Richter , Christof Schütte

A model for efficient dynamical ranking in networks

We present a physics-inspired method for inferring dynamic rankings in directed temporal networks - networks in which each directed and timestamped edge reflects the outcome and timing of a pairwise interaction. The inferred ranking of each…

Physics and Society · Physics 2024-10-10 Andrea Della Vecchia , Kibidi Neocosmos , Daniel B. Larremore , Cristopher Moore , Caterina De Bacco

Deterministic Polynomial Time Algorithms for Matrix Completion Problems

We present new deterministic algorithms for several cases of the maximum rank matrix completion problem (for short matrix completion), i.e. the problem of assigning values to the variables in a given symbolic matrix as to maximize the…

Data Structures and Algorithms · Computer Science 2014-07-11 Gábor Ivanyos , Marek Karpinski , Nitin Saxena

Deterministic Trajectory Optimization through Probabilistic Optimal Control

In this article, we discuss two algorithms tailored to discrete-time deterministic finite-horizon nonlinear optimal control problems or so-called deterministic trajectory optimization problems. Both algorithms can be derived from an…

Optimization and Control · Mathematics 2024-12-10 Mohammad Mahmoudi Filabadi , Tom Lefebvre , Guillaume Crevecoeur

Probabilistic Timed Automata with Clock-Dependent Probabilities

Probabilistic timed automata are classical timed automata extended with discrete probability distributions over edges. We introduce clock-dependent probabilistic timed automata, a variant of probabilistic timed automata in which transition…

Logic in Computer Science · Computer Science 2017-07-17 Jeremy Sproston

Comparing the Finite-Time Performance of Simulation-Optimization Algorithms

We empirically evaluate the finite-time performance of several simulation-optimization algorithms on a testbed of problems with the goal of motivating further development of algorithms with strong finite-time performance. We investigate if…

Optimization and Control · Mathematics 2017-05-23 Naijia Dong , David J. Eckman , Matthias Poloczek , Xueqi Zhao , Shane G. Henderson

Decision Making with Interval Influence Diagrams

In previous work (Fertig and Breese, 1989; Fertig and Breese, 1990) we defined a mechanism for performing probabilistic reasoning in influence diagrams using interval rather than point-valued probabilities. In this paper we extend these…

Artificial Intelligence · Computer Science 2013-04-05 John S. Breese , Kenneth W. Fertig

Deep Optimal Timing Strategies for Time Series

Deciding the best future execution time is a critical task in many business activities while evolving time series forecasting, and optimal timing strategy provides such a solution, which is driven by observed data. This solution has plenty…

Artificial Intelligence · Computer Science 2023-10-10 Chen Pan , Fan Zhou , Xuanwei Hu , Xinxin Zhu , Wenxin Ning , Zi Zhuang , Siqiao Xue , James Zhang , Yunhua Hu