Article citations
Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., et al. (2016) Mastering the Game of Go with Deep Neural Networks and Tree Search. Nature, 529, 484-489.
http://dx.doi.org/10.1038/nature16961
has been cited by the following article:
TITLE:
Exploring Deep Reinforcement Learning with Multi Q-Learning
AUTHORS:
Ethan Duryea, Michael Ganger, Wei Hu
KEYWORDS:
Reinforcement Learning, Deep Learning, Multi Q-Learning
JOURNAL NAME:
Intelligent Control and Automation, Vol.7 No.4, November 15, 2016
ABSTRACT: Q-learning is a popular temporal-difference reinforcement learning algorithm which often explicitly stores state values using lookup tables. This implementation has been proven to converge to the optimal solution, but it is often beneficial to use a function-approximation system, such as deep neural networks, to estimate state values. It has been previously observed that Q-learning can be unstable when using value function approximation or when operating in a stochastic environment. This instability can adversely affect the algorithm’s ability to maximize its returns. In this paper, we present a new algorithm called Multi Q-learning to attempt to overcome the instability seen in Q-learning. We test our algorithm on a 4 × 4 grid-world with different stochastic reward functions using various deep neural networks and convolutional networks. Our results show that in most cases, Multi Q-learning outperforms Q-learning, achieving average returns up to 2.5 times higher than Q-learning and having a standard deviation of state values as low as 0.58.
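The abstract above contrasts the classic lookup-table implementation of Q-learning with function-approximation variants. As a point of reference, the tabular temporal-difference update it mentions can be sketched as follows; the 4 × 4 grid layout, goal placement, reward function, and hyperparameters here are illustrative assumptions, not the paper's exact experimental setup (and this is plain Q-learning, not the authors' Multi Q-learning):

```python
import random

# Tabular Q-learning on an assumed 4x4 grid-world: the agent starts at
# (0, 0) and receives reward 1 for reaching the goal at (3, 3).
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up
SIZE, GOAL = 4, (3, 3)

def step(state, action):
    """Deterministic transition: move within bounds; reward 1 at the goal."""
    r, c = state
    dr, dc = action
    nxt = (min(max(r + dr, 0), SIZE - 1), min(max(c + dc, 0), SIZE - 1))
    reward = 1.0 if nxt == GOAL else 0.0
    return nxt, reward, nxt == GOAL

def train(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    # The "lookup table": one value per (state, action) pair.
    Q = {((r, c), a): 0.0 for r in range(SIZE) for c in range(SIZE)
         for a in range(len(ACTIONS))}
    for _ in range(episodes):
        state, done = (0, 0), False
        while not done:
            # epsilon-greedy exploration
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda x: Q[(state, x)])
            nxt, reward, done = step(state, ACTIONS[a])
            # Classic Q-learning temporal-difference update:
            # Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            best_next = max(Q[(nxt, x)] for x in range(len(ACTIONS)))
            Q[(state, a)] += alpha * (reward + gamma * best_next - Q[(state, a)])
            state = nxt
    return Q

Q = train()
# Greedy rollout from the start should reach the goal.
state, path = (0, 0), [(0, 0)]
for _ in range(20):
    a = max(range(len(ACTIONS)), key=lambda x: Q[(state, x)])
    state, _, done = step(state, ACTIONS[a])
    path.append(state)
    if done:
        break
```

The instability the paper targets arises when the lookup table `Q` is replaced by a neural-network approximator, or when `step` returns stochastic rewards; Multi Q-learning addresses that case with multiple value estimates.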