TY - GEN
T1 - Reinforcement learning for real-world control applications
AU - Pendrith, Mark
AU - Ryan, Malcolm
PY - 1996
Y1 - 1996
N2 - If reinforcement learning (RL) techniques are to be used for “real world” dynamic system control, the problems of noise and plant disturbance will have to be addressed, along with various issues resulting from learning in non-Markovian settings. We present experimental results from three domains: A simulated noisy pole-and-cart system, an artificial non-Markovian decision problem, and a real six-legged walking robot. The results from each of these domains suggest that that actual return (Monte Carlo) approaches to the credit-assignment problem may be more suited than temporal difference (TD) methods for many real- world control applications. A new algorithm we call C-Trace, a variant of the P-Trace RL algorithm is introduced, and some possible advantages of using algorithms of this type are discussed.
AB - If reinforcement learning (RL) techniques are to be used for “real world” dynamic system control, the problems of noise and plant disturbance will have to be addressed, along with various issues resulting from learning in non-Markovian settings. We present experimental results from three domains: A simulated noisy pole-and-cart system, an artificial non-Markovian decision problem, and a real six-legged walking robot. The results from each of these domains suggest that that actual return (Monte Carlo) approaches to the credit-assignment problem may be more suited than temporal difference (TD) methods for many real- world control applications. A new algorithm we call C-Trace, a variant of the P-Trace RL algorithm is introduced, and some possible advantages of using algorithms of this type are discussed.
UR - http://www.scopus.com/inward/record.url?scp=84957867534&partnerID=8YFLogxK
U2 - 10.1007/3-540-61291-2_57
DO - 10.1007/3-540-61291-2_57
M3 - Conference proceeding contribution
AN - SCOPUS:84957867534
SN - 3540612912
SN - 9783540612919
VL - 1081
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 257
EP - 270
BT - Advances in Artificial Intelligence - 11th Biennial Conference of the Canadian Society for Computational Studies of Intelligence, AI 1996, Proceedings
PB - Springer, Springer Nature
CY - Berlin; Heidelberg
T2 - 11th Biennial Conference of the Canadian Society for Computational Studies of Intelligence, AI 1996
Y2 - 21 May 1996 through 24 May 1996
ER -