• 대한전기학회
Mobile QR Code QR CODE : The Transactions of the Korean Institute of Electrical Engineers
  • COPE
  • kcse
  • 한국과학기술단체총연합회
  • 한국학술지인용색인
  • Scopus
  • crossref
  • orcid
Title Policy-based Deep Reinforcement Learning for Sparse Reward Environment
Authors 김명섭(MyeongSeop Kim) ; 김정수(Jung-Su Kim)
DOI https://doi.org/10.5370/KIEE.2021.70.3.506
Page pp.506-514
ISSN 1975-8359
Keywords Reinforcement Learning; Sparse Reward Problem
Abstract Sparse reward environment is the main problems encountered by reinforcement learning. When there are many specific tasks that the agent must go through to reach the final goal, the reward signal becomes very sparse in the environment. And this situation makes reinforcement learning less effective. To overcome this, we give the agent an intrinsic reward to induce the agent to explore more.
With this reward setting, the agent can continue to search for reward signal and learn another action that is better than the best action which is currently known. In this paper, we describe the implementation of the proposed method and estimate its performance. For the learning algorithm, we use Proximal Policy Optimization(PPO) and train the agent in a distributed environment.
The agent is trained to solve the game of Tetris that is a representative sparse reward problem.