KIEE - The Transactions of the Korean Institute of Electrical Engineers

ISO Journal Title: Trans. Korean. Inst. Elect. Eng.

Title	Policy-based Deep Reinforcement Learning for Sparse Reward Environment
Authors	김명섭 (MyeongSeop Kim); 김정수 (Jung-Su Kim)
DOI	https://doi.org/10.5370/KIEE.2021.70.3.506
Page	pp.506-514
ISSN	1975-8359
Keywords	Reinforcement Learning; Sparse Reward Problem
Abstract	Sparse rewards are one of the main problems encountered in reinforcement learning. When the agent must complete many specific subtasks before reaching the final goal, the reward signal becomes very sparse, which makes reinforcement learning less effective. To overcome this, we give the agent an intrinsic reward that induces it to explore more. With this reward setting, the agent can continue to search for reward signals and learn actions that are better than the best action currently known. In this paper, we describe the implementation of the proposed method and evaluate its performance. For the learning algorithm, we use Proximal Policy Optimization (PPO) and train the agent in a distributed environment. The agent is trained to solve the game of Tetris, a representative sparse reward problem.
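
The abstract does not say which form of intrinsic reward the authors use. As a rough illustration of the general idea only, the sketch below assumes a simple count-based bonus, beta / sqrt(N(s)), added to the environment reward so that rarely visited states pay more. The class name `CountBasedBonus`, the parameter `beta`, and the bonus formula are hypothetical choices for this example, not the paper's method.

```python
from collections import defaultdict
import math


class CountBasedBonus:
    """Hypothetical count-based intrinsic reward (an assumption, not the paper's)."""

    def __init__(self, beta=0.1):
        self.beta = beta                 # scale of the exploration bonus
        self.counts = defaultdict(int)   # visit count N(s) per state

    def intrinsic_reward(self, state):
        # The state must be hashable (e.g. a tuple encoding of the board).
        self.counts[state] += 1
        # The bonus shrinks as the same state is visited again.
        return self.beta / math.sqrt(self.counts[state])

    def total_reward(self, state, extrinsic_reward):
        # Reward passed to the learner: sparse environment reward plus bonus.
        return extrinsic_reward + self.intrinsic_reward(state)


bonus = CountBasedBonus(beta=1.0)
first_visit = bonus.total_reward("s0", 0.0)
second_visit = bonus.total_reward("s0", 0.0)
novel_state = bonus.total_reward("s1", 0.0)
```

Even when the environment reward is zero, the combined reward stays nonzero and decays on repeated visits, which is the property that keeps a policy-gradient learner such as PPO exploring.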

© KIEE. All rights reserved.

This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.