A fuzzy-based function approximation technique for reinforcement learning

Wu, Cheng; Song, Huichun; Yan, Changsheng; Wang, Yiming

doi:10.3233/IFS-162212

A fuzzy-based function approximation technique for reinforcement learning¹

Article type: Research Article

Authors: Wu, Cheng^a | Song, Huichun^a | Yan, Changsheng^b | Wang, Yiming^{a; *}

Affiliations: [a] School of Urban Rail Transportation, Soochow University, Suzhou, China | [b] Hewlett-Packard Laboratories, Suzhou, China

Correspondence: [*] Corresponding author. Yiming Wang, School of Urban Rail Transportation, Soochow University, Suzhou 215011, China. Tel.: +86 18001555858; Fax: +86 512 67601052; E-mail: ymwang@suda.edu.cn.

Note: [1] This work was supported in part by the National Natural Science Foundation of China (Grant No. 61471252).

Abstract: Reinforcement learning is hard to solve optimization problems in multi-agent system because of the inefficiency of function approximation. Sparse distributed memories, which is implemented using Radial Basis Functions or Kanerva Coding, can be used to improve the efficiency. But this approach still often gives poor performance when applied to large-scale multi-agent systems. In this paper, we attempt to solve four-rooms problem in the predator-prey pursuit domain and argue that the poor performance that we observe is caused by frequent prototype collisions. We show that dynamic prototype allocation and adaptation can give better results by reducing these collisions. By using our novel approach about fuzzy Kanerva-based function approximation, that uses a fine-grained fuzzy membership grade to describe a state-action pair’s adjacency with respect to each prototype, we give some results that prototype collisions are completely eliminated and learning performance is greatly improved. We further show that prototype density varies widely across the state-action space and that this variation causes prototypes’ receptive fields to be unevenly distributed. This distribution limits the ability of fuzzy Kanerva Coding to achieve better results. It can be observed that fuzzy Kanerva Coding allows prototypes to adaptively tune their receptive fields for a target application. We conclude that fuzzy Kanerva Coding with prototype tuning and adaptation can significantly improve a reinforcement learner’s ability to solve large-scale multi-agent problems.

Keywords: Reinforcement learning, function approximation, fuzzy system, pursuit problem

DOI: 10.3233/IFS-162212

Journal: Journal of Intelligent & Fuzzy Systems, vol. 32, no. 6, pp. 3909-3920, 2017

Published: 23 May 2017

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl

Share this:

North America

Europe

Asia