Attribute-guided and attribute-manipulated similarity learning network for fashion image retrieval

Wan, Yongquan; Yan, Cairong; Zou, Guobing; Zhang, Bofeng

doi:10.3233/IDA-226740

Attribute-guided and attribute-manipulated similarity learning network for fashion image retrieval

Article type: Research Article

Authors: Wan, Yongquan^{a; b} | Yan, Cairong^{c; *} | Zou, Guobing^a | Zhang, Bofeng^{d; *}

Affiliations: [a] School of Computer Engineering and Science, Shanghai University, Shanghai, China | [b] School of Information Technology, Shanghai Jian Qiao University, Shanghai, China | [c] School of Computer Science and Technology, Donghua University, Shanghai, China | [d] School of Computer and Information Engineering, Shanghai Polytechnic University, Shanghai, China

Correspondence: [*] Corresponding authors: Cairong Yan, School of Computer Science and Technology, Donghua University, Shanghai, China. E-mail: cryan@dhu.edu.cn. Bofeng Zhang, School of Computer and Information Engineering, Shanghai Polytechnic University, Shanghai, China. E-mail: bfzhang@sspu.edu.cn.

Abstract: Learning the similarity between fashion items is essential for many fashion-related tasks. Most methods based on global or local image similarity cannot meet the fine-grained retrieval requirements related to attributes. We are the first to clearly distinguish the concepts of attribute name and their values and divide fashion retrieval tasks that combine images and text into: attribute-guided retrieval and attribute-manipulated retrieval. We propose a hierarchical attribute-aware embedding network (HAEN) that takes images and attributes as input, learns multiple attribute-specific embedding spaces, and measures fine-grained similarity in the corresponding spaces. It can accurately map different attributes to the corresponding areas of the image, thereby facilitating the feature fusion of two different modalities of text and image, including enhancement and replacement. Then on this basis, we propose three attribute-manipulated similarity learning methods, HAEN_Avg, HAEN_Rec, and HAEN_Cmb. With comprehensive validation on two real-world fashion datasets, we demonstrate that our methods can effectively leverage semantic knowledge to improve image retrieval performance, including attribute-guided and attribute-manipulated retrieval tasks.

Keywords: Similarity learning, image retrieval, attribute-guided, attribute-manipulated, fashion

DOI: 10.3233/IDA-226740

Journal: Intelligent Data Analysis, vol. 27, no. 3, pp. 733-751, 2023

Published: 18 May 2023

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl

Share this:

North America

Europe

Asia