Article Citations
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., et al. (2015) Human-Level Control through Deep Reinforcement Learning. Nature, 518, 529-533.
http://dx.doi.org/10.1038/nature14236
has been cited by the following article:
TITLE:
Exploring Deep Reinforcement Learning with Multi Q-Learning
AUTHORS:
Ethan Duryea, Michael Ganger, Wei Hu
KEYWORDS:
Reinforcement Learning, Deep Learning, Multi Q-Learning
JOURNAL NAME:
Intelligent Control and Automation, Vol.7 No.4, November 15, 2016
ABSTRACT: Q-learning is a popular temporal-difference reinforcement learning algorithm which often explicitly stores state values using lookup tables. This implementation has been proven to converge to the optimal solution, but it is often beneficial to use a function-approximation system, such as deep neural networks, to estimate state values. It has been previously observed that Q-learning can be unstable when using value function approximation or when operating in a stochastic environment. This instability can adversely affect the algorithm’s ability to maximize its returns. In this paper, we present a new algorithm called Multi Q-learning to attempt to overcome the instability seen in Q-learning. We test our algorithm on a 4 × 4 grid-world with different stochastic reward functions using various deep neural networks and convolutional networks. Our results show that in most cases, Multi Q-learning outperforms Q-learning, achieving average returns up to 2.5 times higher than Q-learning and having a standard deviation of state values as low as 0.58.
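The abstract describes the standard tabular Q-learning setting that Multi Q-learning builds on. The following is a minimal illustrative sketch of that baseline, a lookup-table Q-learning agent on a tiny deterministic chain environment; it is not the authors' Multi Q-learning algorithm or their 4 × 4 grid-world code, and all environment details here are assumptions made for the example.

```python
import random

def q_learning(n_states=4, episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy chain: states 0..n_states-1,
    actions 0 (left) and 1 (right), reward 1 on reaching the terminal
    rightmost state. Illustrative only; not the cited paper's setup."""
    rng = random.Random(seed)
    # Q[s][a] is the lookup-table estimate of the action value.
    Q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                a = rng.choice([0, 1])
            else:
                a = max((0, 1), key=lambda act: Q[s][act])
            s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
            r = 1.0 if s2 == n_states - 1 else 0.0
            # Temporal-difference update toward r + gamma * max_a' Q(s', a').
            target = r + gamma * max(Q[s2])
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
    return Q
```

With enough episodes the table converges toward the optimal values for this chain (e.g. the value of moving right from the state adjacent to the goal approaches 1, and earlier states approach successive powers of gamma). The instability the abstract refers to arises when this exact lookup table is replaced by a function approximator such as a deep network, or when rewards are stochastic.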
Related Articles:
- Wei Hu, James Hu
- Ethan Duryea, Michael Ganger, Wei Hu
- Jai Dev Chandel, Nand Lal Singh
- Md Mahmudul Hasan, Ishtiaque Zaman, Miao He, Michael Giesselmann
- Daniel Schilling Weiss Nguyen, Richard Odigie