Linear Approximations from Factored Markov Dicision Processes

Linear Approximations from Factored Markov Dicision Processes
Author: Relu-Eugen Patrascu
Publisher:
Total Pages:
Release: 2004
Genre:
ISBN:


Download Linear Approximations from Factored Markov Dicision Processes Book in PDF, Epub and Kindle

A Markov Decision Process (MDP) is a model employed to describe problems in which a decision must be made at each one of several stages, while receiving feedback from the environment. This type of model has been extensively studied in the operations research community and fundamental algorithms have been developed to solve associated problems. However, these algorithms are quite inefficient for very large problems, leading to a need for alternatives; since MDP problems are provably hard on compressed representations, one becomes content even with algorithms which may perform well at least on specific classes of problems. The class of problems we deal with in this thesis allows succinct representations for the MDP as a dynamic Bayes network, and for its solution as a weighted combination of basis functions. We develop novel algorithms for producing, improving, and calculating the error of approximate solutions for MDPs using a compressed representation. Specifically, we develop an efficient branch-and-bound algorithm for computing the Bellman error of the compact approximate solution regardless of its provenance. We introduce an efficient direct linear programming algorithm which, using incremental constraints generation, achieves run times significantly smaller than existing approximate algorithms without much loss of accuracy. We also show a novel direct linear programming algorithm which, instead of employing constraints generation, transforms the exponentially many constraints into a compact form more amenable for tractable solutions. In spite of its perceived importance, the efficient optimization of the Bellman error towards an approximate MDP solution has eluded current algorithms; to this end we propose a novel branch-and-bound approximate policy iteration algorithm which makes direct use of our branch-and-bound method for computing the Bellman error. We further investigate another procedure for obtaining an approximate solution based on the dual of the direct, approximate linear programming formulation for solving MDPs. To address both the loss of accuracy resulting from the direct, approximate linear program solution and the question of where basis functions come from we also develop a principled system able not only to produce the initial set of basis functions, but also able to augment it with new basis functions automatically generated such that the approximation error decreases according to the user's requirements and time limitations.

Handbook of Markov Decision Processes

Handbook of Markov Decision Processes
Author: Eugene A. Feinberg
Publisher: Springer Science & Business Media
Total Pages: 560
Release: 2012-12-06
Genre: Business & Economics
ISBN: 1461508053


Download Handbook of Markov Decision Processes Book in PDF, Epub and Kindle

Eugene A. Feinberg Adam Shwartz This volume deals with the theory of Markov Decision Processes (MDPs) and their applications. Each chapter was written by a leading expert in the re spective area. The papers cover major research areas and methodologies, and discuss open questions and future research directions. The papers can be read independently, with the basic notation and concepts ofSection 1.2. Most chap ters should be accessible by graduate or advanced undergraduate students in fields of operations research, electrical engineering, and computer science. 1.1 AN OVERVIEW OF MARKOV DECISION PROCESSES The theory of Markov Decision Processes-also known under several other names including sequential stochastic optimization, discrete-time stochastic control, and stochastic dynamic programming-studiessequential optimization ofdiscrete time stochastic systems. The basic object is a discrete-time stochas tic system whose transition mechanism can be controlled over time. Each control policy defines the stochastic process and values of objective functions associated with this process. The goal is to select a "good" control policy. In real life, decisions that humans and computers make on all levels usually have two types ofimpacts: (i) they cost orsavetime, money, or other resources, or they bring revenues, as well as (ii) they have an impact on the future, by influencing the dynamics. In many situations, decisions with the largest immediate profit may not be good in view offuture events. MDPs model this paradigm and provide results on the structure and existence of good policies and on methods for their calculation.

Control Techniques for Complex Networks

Control Techniques for Complex Networks
Author: Sean Meyn
Publisher: Cambridge University Press
Total Pages: 33
Release: 2008
Genre: Mathematics
ISBN: 0521884411


Download Control Techniques for Complex Networks Book in PDF, Epub and Kindle

From foundations to state-of-the-art; the tools and philosophy you need to build network models.

Constrained Markov Decision Processes

Constrained Markov Decision Processes
Author: Eitan Altman
Publisher: Routledge
Total Pages: 256
Release: 2021-12-17
Genre: Mathematics
ISBN: 1351458248


Download Constrained Markov Decision Processes Book in PDF, Epub and Kindle

This book provides a unified approach for the study of constrained Markov decision processes with a finite state space and unbounded costs. Unlike the single controller case considered in many other books, the author considers a single controller with several objectives, such as minimizing delays and loss, probabilities, and maximization of throughputs. It is desirable to design a controller that minimizes one cost objective, subject to inequality constraints on other cost objectives. This framework describes dynamic decision problems arising frequently in many engineering fields. A thorough overview of these applications is presented in the introduction. The book is then divided into three sections that build upon each other.

Learning Representation and Control in Markov Decision Processes

Learning Representation and Control in Markov Decision Processes
Author: Sridhar Mahadevan
Publisher: Now Publishers Inc
Total Pages: 185
Release: 2009
Genre: Computers
ISBN: 1601982380


Download Learning Representation and Control in Markov Decision Processes Book in PDF, Epub and Kindle

Provides a comprehensive survey of techniques to automatically construct basis functions or features for value function approximation in Markov decision processes and reinforcement learning.