Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Wan, Yongquana; b | Yan, Cairongc; * | Zou, Guobinga | Zhang, Bofengd; *
Affiliations: [a] School of Computer Engineering and Science, Shanghai University, Shanghai, China | [b] School of Information Technology, Shanghai Jian Qiao University, Shanghai, China | [c] School of Computer Science and Technology, Donghua University, Shanghai, China | [d] School of Computer and Information Engineering, Shanghai Polytechnic University, Shanghai, China
Correspondence: [*] Corresponding authors: Cairong Yan, School of Computer Science and Technology, Donghua University, Shanghai, China. E-mail: cryan@dhu.edu.cn. Bofeng Zhang, School of Computer and Information Engineering, Shanghai Polytechnic University, Shanghai, China. E-mail: bfzhang@sspu.edu.cn.
Abstract: Learning the similarity between fashion items is essential for many fashion-related tasks. Most methods based on global or local image similarity cannot meet the fine-grained retrieval requirements related to attributes. We are the first to clearly distinguish the concepts of attribute name and their values and divide fashion retrieval tasks that combine images and text into: attribute-guided retrieval and attribute-manipulated retrieval. We propose a hierarchical attribute-aware embedding network (HAEN) that takes images and attributes as input, learns multiple attribute-specific embedding spaces, and measures fine-grained similarity in the corresponding spaces. It can accurately map different attributes to the corresponding areas of the image, thereby facilitating the feature fusion of two different modalities of text and image, including enhancement and replacement. Then on this basis, we propose three attribute-manipulated similarity learning methods, HAEN_Avg, HAEN_Rec, and HAEN_Cmb. With comprehensive validation on two real-world fashion datasets, we demonstrate that our methods can effectively leverage semantic knowledge to improve image retrieval performance, including attribute-guided and attribute-manipulated retrieval tasks.
Keywords: Similarity learning, image retrieval, attribute-guided, attribute-manipulated, fashion
DOI: 10.3233/IDA-226740
Journal: Intelligent Data Analysis, vol. 27, no. 3, pp. 733-751, 2023
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl