Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Tan, Guangyuna; b | Wei, Peipeib; * | He, Yongyic | Xu, Huahuc | Shi, Xinxinb
Affiliations: [a] School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, China | [b] Shanghai Qiansi Network Technology Limited Liability Company, Shanghai, China | [c] School of Computer Engineering and Science, Shanghai University, Shanghai, China
Correspondence: [*] Corresponding author: Peipei Wei, Shanghai Qiansi Network Technology Limited Liability Company, 800 Naxian Road, Pudong New District, Shanghai 201210, China. Tel.: +86 18616154315; E-mail: ppw2017@163.com.
Abstract: Poker is the typical game of incomplete information, and remains a longstanding challenge problem in artificial intelligence (AI). The game of Dou Dizhu has been viewed as a thorny topic in AI since it is featured with hidden information and large branching factors, and the cooperation and competition should also be handled. In this article, deep learning is adopted to train a supervised learning playing strategy network (PSN) for Dou Dizhu directly from expert human playing. Through experiments, it was found that the sample design with the appropriate historical playing hand sequence and more features of the playing situation, can help the PSN learn more competitive and accurate playing strategies faster. In the online game platform, the strategy network-based game agent reaches an average winning rate of 52.22% against the human players. In addition, the analysis of the gameplay data against human players shows that the playing strategy network has learned the rules of playing and the characteristics of card recognition and reasonable demolition, cooperation and reasoning. Finally, we improve the performance of the PSN in the aspect of sample design. Then, the experimental results show that with proper marking of the number of remaining hands, the performance of the PSN can be enhanced.
Keywords: Supervised learning, convolutional neural networks, playing strategy, incomplete information, Dou Dizhu
DOI: 10.3233/JCM-204344
Journal: Journal of Computational Methods in Sciences and Engineering, vol. 21, no. 1, pp. 3-18, 2021
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl