Navigational behavior of humans and deep reinforcement learning agents

Research output: Contribution to journalArticlepeer-review

3 Downloads (Pure)

Abstract

Rapid advances in the field of Deep Reinforcement Learning (DRL) over the past several years have led to artificial agents (AAs) capable of producing behavior that meets or exceeds human-level performance in a wide variety of tasks. However, research on DRL frequently lacks adequate discussion of the low-level dynamics of the behavior itself and instead focuses on meta-level or global-level performance metrics. In doing so, the current literature lacks perspective on the qualitative nature of AA behavior, leaving questions regarding the spatiotemporal patterning of their behavior largely unanswered. The current study explored the degree to which the navigation and route selection trajectories of DRL agents (i.e., AAs trained using DRL) through simple obstacle ridden virtual environments were equivalent (and/or different) from those produced by human agents. The second and related aim was to determine whether a task-dynamical model of human route navigation could not only be used to capture both human and DRL navigational behavior, but also to help identify whether any observed differences in the navigational trajectories of humans and DRL agents were a function of differences in the dynamical environmental couplings.
Original languageEnglish
Article number725932
Pages (from-to)1-15
Number of pages15
JournalFrontiers in Psychology
Volume12
DOIs
Publication statusPublished - 22 Sep 2021

Bibliographical note

Copyright the Author(s) 2021. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • task-dynamical model
  • dynamical perceptual motor primitives
  • reinforcement learning
  • navigational behavior
  • obstacle avoidance
  • route selection

Fingerprint

Dive into the research topics of 'Navigational behavior of humans and deep reinforcement learning agents'. Together they form a unique fingerprint.

Cite this