Related papers: Parameterized Reinforcement Learning for Optical S…

A Reinforcement learning method for Optical Thin-Film Design

Machine learning, especially deep learning, is dramatically changing the methods associated with optical thin-film inverse design. The vast majority of this research has focused on the parameter optimization (layer thickness, and structure…

Machine Learning · Computer Science 2021-02-19 Anqing Jiang, Liangyao Chen, Osamu Yoshie

A reinforced learning approach to optimal design under model uncertainty

Optimal designs are usually model-dependent and likely to be sub-optimal if the postulated model is not correctly specified. In practice, it is common that a researcher has a list of candidate models at hand and a design has to be found…

Statistics Theory · Mathematics 2023-03-29 Mingyao Ai, Holger Dette, Zhengfu Liu, Jun Yu

Optical Multilayer Thin Film Structure Inverse Design: From Optimization to Deep Learning

Optical multilayer thin film structures have been widely used in numerous photonic domains and applications. The key component to enable these applications is the inverse design. Different from other photonic structures such as metasurface…

Optics · Physics 2024-09-27 Taigao Ma, Mingqian Ma, L. Jay Guo

Automated Optical Multi-layer Design via Deep Reinforcement Learning

Optical multi-layer thin films are widely used in optical and energy applications requiring photonic designs. Engineers often design such structures based on their physical intuition. However, solely relying on human experts can be…

Signal Processing · Electrical Eng. & Systems 2020-06-23 Haozhu Wang, Zeyu Zheng, Chengang Ji, L. Jay Guo

A new multilayer optical film optimal method based on deep q-learning

Multi-layer optical film has been found to afford important applications in optical communication, optical absorbers, optical filters, etc. Different algorithms for multi-layer optical film design have been developed, such as the simplex method,…

Machine Learning · Computer Science 2018-12-10 Anqing Jiang, Osamu Yoshie, LiangYao Chen

Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits

Inverse design of photonic integrated circuits (PICs) has traditionally relied on gradient-based optimization. However, this approach is prone to end up in local minima, which results in suboptimal design functionality. As interest in PICs…

Machine Learning · Computer Science 2025-06-24 Yannik Mahlau, Maximilian Schier, Christoph Reinders, Frederik Schubert, Marco Bügling, Bodo Rosenhahn

Online Reinforcement Learning for Dynamic Multimedia Systems

In our previous work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the…

Machine Learning · Computer Science 2013-06-06 Nicholas Mastronarde, Mihaela van der Schaar

Value Iteration is Optic Composition

Dynamic programming is a class of algorithms used to compute optimal control policies for Markov decision processes. Dynamic programming is ubiquitous in control theory, and is also the foundation of reinforcement learning. In this paper,…

Category Theory · Mathematics 2023-08-01 Jules Hedges, Riu Rodríguez Sakamoto

Finding the best design parameters for optical nanostructures using reinforcement learning

Recently, a novel machine learning model has emerged in the field of reinforcement learning known as deep Q-learning. This model is capable of finding the best possible solution in systems consisting of millions of choices, without ever…

Image and Video Processing · Electrical Eng. & Systems 2018-10-26 Iman Sajedian, Trevon Badloe, Junsuk Rho

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

To overcome the curses of dimensionality and modeling of Dynamic Programming (DP) methods to solve Markov Decision Process (MDP) problems, Reinforcement Learning (RL) methods are adopted in practice. Contrary to traditional RL algorithms…

Machine Learning · Computer Science 2021-08-24 Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design

This paper presents an interpretable reward design framework for reinforcement learning based constrained optimal control problems with state and terminal constraints. The problem is formalized within a standard partially observable Markov…

Systems and Control · Electrical Eng. & Systems 2025-03-05 Jingjie Ni, Fangfei Li, Xin Jin, Xianlun Peng, Yang Tang

Reinforcement Learning with Parameterized Actions

We introduce a model-free algorithm for learning in Markov decision processes with parameterized actions: discrete actions with continuous parameters. At each step the agent must select both which action to use and which parameters to use…

Artificial Intelligence · Computer Science 2015-11-30 Warwick Masson, Pravesh Ranchod, George Konidaris

Feature Markov Decision Processes

General purpose intelligent learning agents cycle through (complex, non-MDP) sequences of observations, actions, and rewards. On the other hand, reinforcement learning is well-developed for small finite state Markov Decision Processes…

Artificial Intelligence · Computer Science 2009-12-30 Marcus Hutter

A reinforcement learning based decision support system in textile manufacturing process

This paper introduced a reinforcement learning based decision support system in textile manufacturing process. A solution optimization problem of color fading ozonation is discussed and set up as a Markov Decision Process (MDP) in terms of…

Machine Learning · Computer Science 2020-05-21 Zhenglei He, Kim Phuc Tran, Sébastien Thomassey, Xianyi Zeng, Changhai Yi

Performative Reinforcement Learning with Linear Markov Decision Process

We study the setting of performative reinforcement learning where the deployed policy affects both the reward and the transition of the underlying Markov decision process. Prior work (MTR23) has addressed this problem…

Machine Learning · Computer Science 2025-03-18 Debmalya Mandal, Goran Radanovic

Sampling-guided exploration of active feature selection policies

Determining the most appropriate features for machine learning predictive models is challenging regarding performance and feature acquisition costs. In particular, global feature choice is limited given that some features will only benefit…

Machine Learning · Computer Science 2026-03-17 Gabriel Bernardino, Anders Jonsson, Patrick Clarysse, Nicolas Duchateau

Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes

Multi-objective Markov decision processes are sequential decision-making problems that involve multiple conflicting reward functions that cannot be optimized simultaneously without a compromise. This type of problem cannot be solved by a…

Machine Learning · Computer Science 2023-08-22 Sherif Abdelfattah, Kathryn Merrick, Jiankun Hu

Automated design of compound lenses with discrete-continuous optimization

We introduce a method that automatically and jointly updates both continuous and discrete parameters of a compound lens design, to improve its performance in terms of sharpness, speed, or both. Previous methods for compound lens design use…

Graphics · Computer Science 2025-09-30 Arjun Teh, Delio Vicini, Bernd Bickel, Ioannis Gkioulekas, Matthew O'Toole

A Distributional View on Multi-Objective Policy Optimization

Many real-world problems require trading off multiple competing objectives. However, these objectives are often in different units and/or scales, which can make it challenging for practitioners to express numerical preferences over…

Machine Learning · Computer Science 2020-05-18 Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

General Inverse Design of Thin-Film Metamaterials With Convolutional Neural Networks

The design of metamaterials which support unique optical responses is the basis for most thin-film nanophotonics applications. In practice this inverse design problem can be difficult to solve systematically due to the large design…

Computational Physics · Physics 2026-05-25 Andrew Lininger, Michael Hinczewski, Giuseppe Strangi