Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning - Journal of Software Engineering and Applications

JSEA > Vol.5 No.12, December 2012

Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning ()

HTML

Download as PDF (Size: 459KB) PP. 128-133

DOI: 10.4236/jsea.2012.512B025 3,939 Downloads 5,777 Views Citations

Author(s)

Takashi Kuremoto, Koichi Hashiguchi, Keita Morisaki, Shun Watanabe, Kunikazu Kobayashi, Shingo Mabu, Masanao Obayashi

Affiliation(s)

Graduate School of Science and Engineering, Yamaguchi University, Ube, Yamaguchi, Japan.
School of Information Science and Technology, Aichi Prefectural University, Nagakute, Aichi, Japan.

ABSTRACT

This paper proposes how to learn and generate multiple action sequences of a humanoid robot. At first, all the basic action sequences, also called primitive behaviors, are learned by a recurrent neural network with parametric bias (RNNPB) and the value of the internal nodes which are parametric bias (PB) determining the output with different primitive behaviors are obtained. The training of the RNN uses back propagation through time (BPTT) method. After that, to generate the learned behaviors, or a more complex behavior which is the combination of the primitive behaviors, a reinforcement learning algorithm: Q-learning (QL) is adopt to determine which PB value is adaptive for the generation. Finally, using a real humanoid robot, the proposed method was confirmed its effectiveness by the results of experiment.

KEYWORDS

RNNPB; Humanoid robot; BPTT; reinforcement learning; multiple action sequences

Share and Cite:

T. Kuremoto, K. Hashiguchi, K. Morisaki, S. Watanabe, K. Kobayashi, S. Mabu and M. Obayashi, "Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning," Journal of Software Engineering and Applications, Vol. 5 No. 12B, 2012, pp. 128-133. doi: 10.4236/jsea.2012.512B025.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies