Handbook of Learning and Approximate Dynamic Programming

Si, Jennie / Barto, Andrew G. / Powell, Warren B. / Wunsch, Don (Herausgeber)

IEEE Press Series on Computational Intelligence

1. Auflage August 2004
672 Seiten, Hardcover
Wiley & Sons Ltd

ISBN: 978-0-471-66054-5

John Wiley & Sons

Kurzbeschreibung

ADP or Approximate Dynamic Programming has gone by many different names including: reinforcement learning (RL), adaptive critics (AC), and neuro-dynamic programming (NDP). The dynamic programming approach to decision and control problems involving nonlinear dynamic systems provides the optimal solution in any stochastic or uncertain environment. This handbook presents general-purpose programming tools for doing optimization over time by using learning and approximation to handle problems that severely challenge conventional methods.

Beschreibung
Autoreninfo

* A complete resource to Approximate Dynamic Programming (ADP), including on-line simulation code
* Provides a tutorial that readers can use to start implementing the learning algorithms provided in the book
* Includes ideas, directions, and recent results on current research issues and addresses applications where ADP has been successfully implemented
* The contributors are leading researchers in the field

JENNIE SI is Professor of Electrical Engineering, Arizona State University, Tempe, AZ. She is director of Intelligent Systems Laboratory, which focuses on analysis and design of learning and adaptive systems. In addition to her own publications, she is the Associate Editor for IEEE Transactions on Neural Networks, and past Associate Editor for IEEE Transactions on Automatic Control and IEEE Transactions on Semiconductor Manufacturing. She was the co-chair for the 2002 NSF Workshop on Learning and Approximate Dynamic Programming.
ANDREW G. BARTO is Professor of Computer Science, University of Massachusetts, Amherst. He is co-director of the Autonomous Learning Laboratory, which carries out interdisciplinary research on machine learning and modeling of biological learning. He is a core faculty member of the Neuroscience and Behavior Program of the University of Massachusetts and was the co-chair for the 2002 NSF Workshop on Learning and Approximate Dynamic Programming. He currently serves as an associate editor of Neural Computation.
WARREN B. POWELL is Professor of Operations Research and Financial Engineering at Princeton University. He is director of CASTLE Laboratory, which focuses on real-time optimization of complex dynamic systems arising in transportation and logistics.
DONALD C. WUNSCH is the Mary K. Finley Missouri Distinguished Professor in the Electrical and Computer Engineering Department at the University of Missouri, Rolla. He heads the Applied Computational Intelligence Laboratory and also has a joint appointment in Computer Science, and is President-Elect of the International Neural Networks Society.

J. Si, Arizona State University (Tempe, AZ); A. G. Barto, University of Massachusetts (Amherst, MA); W. B. Powell, Princeton University (NJ); D. Wunsch, University of Missouri (Rolla, MO)