Special issue on approximate dynamic programming and reinforcement learning Silvia FerrariJagannathan SarangapaniFrank L. Lewis Editorial 19 July 2011 Pages: 309 - 309
Approximate policy iteration: a survey and some new methods Dimitri P. Bertsekas OriginalPaper 19 July 2011 Pages: 310 - 335
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications Warren B. PowellJun Ma OriginalPaper 19 July 2011 Pages: 336 - 352
Adaptive dynamic programming for online solution of a zero-sum differential game Draguna VrabieFrank Lewis OriginalPaper 19 July 2011 Pages: 353 - 360
Online optimal control of nonlinear discrete-time systems using approximate dynamic programming Travis DierksSarangapani Jagannathan OriginalPaper 19 July 2011 Pages: 361 - 369
Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems Jie DingS. N. Balakrishnan OriginalPaper 19 July 2011 Pages: 370 - 380
Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming Qinglai WeiDerong Liu OriginalPaper 19 July 2011 Pages: 381 - 390
A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man Greg FoderaroVikram RajuSilvia Ferrari OriginalPaper 19 July 2011 Pages: 391 - 399
Asymptotic tracking by a reinforcement learning-based adaptive critic controller Shubhendu BhasinNitin SharmaWarren Dixon OriginalPaper 19 July 2011 Pages: 400 - 409
Stable reinforcement learning with recurrent neural networks James Nate KnightCharles Anderson OriginalPaper 19 July 2011 Pages: 410 - 420
Semi-Markov adaptive critic heuristics with application to airline revenue management Ketaki KulkarniAbhijit GosaviKatie Grantham OriginalPaper 19 July 2011 Pages: 421 - 430
Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization Amanda LamptonJohn ValasekMrinal Kumar OriginalPaper 19 July 2011 Pages: 431 - 439
Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning Xueqing SunTao MaoJerald Kralik OriginalPaper 19 July 2011 Pages: 440 - 450
Moving least-squares approximations for linearly-solvable stochastic optimal control problems Mingyuan ZhongEmanuel Todorov OriginalPaper 19 July 2011 Pages: 451 - 463