Solving the playing strategy of Dou Dizhu using convolutional neural network: A residual learning approach

Tan, Guangyun; Wei, Peipei; He, Yongyi; Xu, Huahu; Shi, Xinxin

doi:10.3233/JCM-204344

Solving the playing strategy of Dou Dizhu using convolutional neural network: A residual learning approach

Article type: Research Article

Authors: Tan, Guangyun^{a; b} | Wei, Peipei^{b; *} | He, Yongyi^c | Xu, Huahu^c | Shi, Xinxin^b

Affiliations: [a] School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, China | [b] Shanghai Qiansi Network Technology Limited Liability Company, Shanghai, China | [c] School of Computer Engineering and Science, Shanghai University, Shanghai, China

Correspondence: [*] Corresponding author: Peipei Wei, Shanghai Qiansi Network Technology Limited Liability Company, 800 Naxian Road, Pudong New District, Shanghai 201210, China. Tel.: +86 18616154315; E-mail: ppw2017@163.com.

Abstract: Poker is the typical game of incomplete information, and remains a longstanding challenge problem in artificial intelligence (AI). The game of Dou Dizhu has been viewed as a thorny topic in AI since it is featured with hidden information and large branching factors, and the cooperation and competition should also be handled. In this article, deep learning is adopted to train a supervised learning playing strategy network (PSN) for Dou Dizhu directly from expert human playing. Through experiments, it was found that the sample design with the appropriate historical playing hand sequence and more features of the playing situation, can help the PSN learn more competitive and accurate playing strategies faster. In the online game platform, the strategy network-based game agent reaches an average winning rate of 52.22% against the human players. In addition, the analysis of the gameplay data against human players shows that the playing strategy network has learned the rules of playing and the characteristics of card recognition and reasonable demolition, cooperation and reasoning. Finally, we improve the performance of the PSN in the aspect of sample design. Then, the experimental results show that with proper marking of the number of remaining hands, the performance of the PSN can be enhanced.

Keywords: Supervised learning, convolutional neural networks, playing strategy, incomplete information, Dou Dizhu

DOI: 10.3233/JCM-204344

Journal: Journal of Computational Methods in Sciences and Engineering, vol. 21, no. 1, pp. 3-18, 2021

Published: 25 March 2021

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl

Share this:

North America

Europe

Asia