TY - JOUR
T1 - Delving into the representation learning of deep hashing
AU - Cui, Quan
AU - Chen, Zhao Min
AU - Yoshie, Osamu
N1 - Funding Information:
This work was supported in part by the Zhejiang Provincial Natural Science Foundation of China under Grant No.LQ22F020006.
Publisher Copyright:
© 2022 The Author(s)
PY - 2022/7/14
Y1 - 2022/7/14
N2 - Searching for the nearest neighbor is a fundamental problem in the computer vision field, and deep hashing has become one of the most representative and widely used methods, which learns to generate compact binary codes for visual data. In this paper, we first delve into the representation learning of deep hashing and surprisingly find that deep hashing could be a double-edged sword, i.e., deep hashing can accelerate the query speed and decrease the storage cost in the nearest neighbor search progress, but it greatly sacrifices the discriminability of deep representations especially with extremely short target code lengths. To solve this problem, we propose a two-step deep hashing learning framework. The first step focuses on learning deep discriminative representations with metric learning. Subsequently, the learning framework concentrates on simultaneously learning compact binary codes and preserving representations learned in the former step from being sacrificed. Extensive experiments on two general image datasets and four challenging image datasets validate the effectiveness of our proposed learning framework. Moreover, the side effect of deep hashing is successfully mitigated with our learning framework.
AB - Searching for the nearest neighbor is a fundamental problem in the computer vision field, and deep hashing has become one of the most representative and widely used methods, which learns to generate compact binary codes for visual data. In this paper, we first delve into the representation learning of deep hashing and surprisingly find that deep hashing could be a double-edged sword, i.e., deep hashing can accelerate the query speed and decrease the storage cost in the nearest neighbor search progress, but it greatly sacrifices the discriminability of deep representations especially with extremely short target code lengths. To solve this problem, we propose a two-step deep hashing learning framework. The first step focuses on learning deep discriminative representations with metric learning. Subsequently, the learning framework concentrates on simultaneously learning compact binary codes and preserving representations learned in the former step from being sacrificed. Extensive experiments on two general image datasets and four challenging image datasets validate the effectiveness of our proposed learning framework. Moreover, the side effect of deep hashing is successfully mitigated with our learning framework.
KW - Computer vision
KW - Deep hashing
KW - Metric learning
KW - Representation learning
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85129507391&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129507391&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2022.04.082
DO - 10.1016/j.neucom.2022.04.082
M3 - Article
AN - SCOPUS:85129507391
SN - 0925-2312
VL - 494
SP - 67
EP - 78
JO - Neurocomputing
JF - Neurocomputing
ER -