Dynamic Programming

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

An introduction to the mathematical theory of multistage decision processes, this text takes a "functional equation" approach to the discovery of optimum policies. The text examines existence and uniqueness theorems, the optimal inventory equation, bottleneck problems in multistage production processes, a new formalism in the calculus of variation, multistage games, and more. 1957 edition. Includes 37 figures.

Author(s): Richard Bellman
Series: Dover Books on Computer Science
Edition: Reprint
Publisher: Dover Publications Inc.
Year: 2002

Language: English
Pages: 366
Tags: algorithms; algorithm; design; analysis; data structures; computer science; mathematics; maths; logic; math; cs; cse

DYNAMIC
PROGRAMMING
RICHARD BELLMAN
Preface
Contents
A Multi-Stage Allocation Process
§ 1. Introduction
§ 2. A multi-stage allocation process
§ 3. Discussion
§ 4. Functional equation approach
§ 5. Discussion
§ 6. A multi-dimensional maximization problem
§ 7. A “smoothing” problem
§ 8. Infinite stage approximation
§ 9. Existence and uniqueness theorems
§ 10. Successive approximations
§ 11. Approximation in policy space
§ 12. Properties of the solution—I: Convexity
§ 13. Properties of the solution—II: Concavity
§ 14. Properties of the solution—III: Concavity
§ 15. An “ornery” example
§ 16. A particular example—I
§ 18. Approximation and stability
§ 19. Time-dependent processes
§ 20. Multi-activity processes
KJ
§ 21. Multi-dimensional structure theorems
§ 22. Locating the unique maximum of a concave function
§ 23. Continuity and memory
§ 24. Stochastic allocation processes
§ 25. Functional equations
§ 26. Stieltjes integrals
Exercises and Research Problems for Chapter I
A Stochastic Multi-Stage Decision Process
§ 1. Introduction
§ 2. Stochastic gold-mining
§ 5. Infinite stage approximation
§ 6. Existence and uniqueness
§ 7. Approximation in policy space and monotone convergence
§ 8. The solution
§ 9. Discussion
§ 10. Some generalizations
§11. The form of f (x, y)
§ 12. The problem for a finite number of stages
§ 13. A three-choice problem
§ 14. A stability theorem
Exercises and Research Problems for Chapter II
The Structure of Dynamic Programming Processes
§ 1. Introduction
§ 4. Mathematical formulation—I. A discrete deterministic process
§ 7. Continuous stochastic processes
§ 8. Generalizations
§ 9. Causality and optimality
§ 10. Approximation in policy space
Exercises and Research Problems for Chapter III
Existence and Uniqueness Theorems
§ 1. Introduction
§ 2. A Fundamental inequality
§ 3. Equations of Type One
§ 4. Equations of Type Two
§ 5. Monotone convergence
§ 6. Stability theorems
§ 7. Some directions of generalization
§ 8. An equation of the third type
§ 9. An “optimal inventory” equation
Exercises and Research Problems for Chapter IV
The Optimal Inventory Equation
§ 1. Introduction
§ 2. Formulation of the general problem
A. Finite total time period
E. Unbounded time period—two period lag
§ 3. A simple observation
§ 4. Constant stock level—preliminary discussion
§ 5. Proportional cost—one-dimensional case
§ 6. Proportional cost—multi-dimensional case
§ 8. Finite time—multi-dimensional case
§ 9. Non-proportional penalty cost—red tape
§ 10. Particular cases
§ 11. The form of the general solution
§ 12. Fixed costs
§ 13. Preliminaries to a discussion of more complicated policies
§ 14. Unbounded process—one period time lag
Appendix Chapter V—The Renewal Equation
Exercises and research problems
Bottleneck Problems in Multi-Stage Production Processes
§ 1. Introduction
§ 2. A General class of multi-stage production problems
§ 3. Discussion of the preceding model
§ 4. Functional equations
§ 5. A Continuous version 5
§ 6. Notation
§ 7. Dynamic programming formulation
§ 9. The resultant nonlinear partial differential equation
§ 10. Application of the partial differential equation
§ 11. A Particular example
§ 12. A Dual problem
§ 13. Verification of the solution given in § 11
§ 14. Computational solution
§ 15. Nonlinear problems
Exercises and Research Problems for Chapter VI
Bottleneck Problems: Examples
§ 1. Introduction
§ 2. Preliminaries
§ 3. Delta-functions
§ 6. The equilibrium solution
§ 8. Description of solution and proof
Summary Initial Adjustments
A Continuous Stochastic Decision Process
§ 1. Introduction
§ 2. Continuous versions—I: A differential approach
§ 3. Continuous versions—II: An integral approach
§ 4. Preliminary discussion
§ 5. Mixing at a point
§ 6. Reformulation of the gold-mining process
§ 7. Derivation of the differential equations
§ 8. The variational procedure
§ 9. The behavior of Ki.
§ 10. The solution for T = oo
§ 11. Solution for finite total time
§ 12. The three-choice problem
§ 13. Some lemmas and preliminary results
§ 14. Mixed policies
§ 15. The solution for infinite time, D > 0
§ 16. D < 0
§ 17. The case r3 = r4
§ 18. Nonlinear utility—two-choice problem
A New Formalism in the Calculus of Variations
§ 1. Introduction
§ 2. A new approach
§ 4. Discussion
§ 5. The two dimensional case
§ 7. Max F(x,y) dt under the Constraint 0 < y < x y Jo
§ 8. Computational solution
§ 9. Discussion
§ 10. An example
§ 11. A discrete version
§ 12. A convergence proof
§ 14. Generalization and discussion
§ 15. Integral constraints
§ 16. Further remarks concerning numerical solution
§ 17. Eigenvalue problem
§ 18. The first formulation
§ 19. An approximate solution
§ 20. Second formulation
§ 21. Discrete approximations
§ 22. Successive approximation
§ 23. Monotone approximation
§ 24. Uniqueness of solution
§ 25. Minimum maximum deviation
Exercises and Research Problems for Chapter IX
Multi-Stage Games
§ 1. Introduction
§ 2. A Single-stage discrete game
§ 3. The min-max theorem
§ 4. Continuous games
§ 5. Finite resources
§ 6. Games of survival
§ 7. Pursuit games
§ 8. General formulation
§ 9. The principle of optimality and functional equations
§ 10. More general process
§ 11. A basic lemma
§ 12. Existence and uniqueness
§ 13. Proof of results
§ 14. Alternate proof of existence
§ 15. Successive approximations in general
§ 16. Effectiveness of solution
§ 17. Further results
§ 18. One-sided min-max
§ 19. Existence and uniqueness for games of survival
§ 20. An approximation
§ 21. Non-zero-sum games—games of survival
§ 22. An approximate solution
§ 23. Proof of the extended min-max theorem
§ 24. A rationale for non-zero sum games
Exercises and research problems for Chapter X
Markovian Decision Processes
§ 1. Introduction
§ 2. Markovian decision processes
§ 3. Notation
§ 4. A lemma
§ 5. Existence and uniqueness—I
§ 6. Existence and uniqueness—II
§ 7. Existence and uniqueness—III
§ 8. The Riccati equation
§ 9. Approximation in policy space
§ 10. Discrete versions
§ 11. The recurrence relation
§ 12. Min-max
§ 13. Generalization of a von Neumann result
Exercises and Research Problems for Chapter XI