Delving into the representation learning of deep hashing

Quan Cui, Zhao Min Chen, Osamu Yoshie*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Nearest neighbor search is a fundamental problem in computer vision, and deep hashing, which learns to generate compact binary codes for visual data, has become one of the most representative and widely used methods. In this paper, we first delve into the representation learning of deep hashing and find, surprisingly, that deep hashing can be a double-edged sword: it accelerates queries and reduces storage cost in the nearest neighbor search process, but it greatly sacrifices the discriminability of deep representations, especially at extremely short target code lengths. To solve this problem, we propose a two-step deep hashing learning framework. The first step learns discriminative deep representations with metric learning. The second step simultaneously learns compact binary codes and keeps the representations learned in the first step from being sacrificed. Extensive experiments on two general image datasets and four challenging image datasets validate the effectiveness of the proposed framework and show that it successfully mitigates this side effect of deep hashing.
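The trade-off described in the abstract can be illustrated with a minimal sketch. It is not the authors' framework: the sign-threshold binarization, the truncation to the first few dimensions as a stand-in for a short code length, the random Gaussian vectors, and every function name below are assumptions made for illustration only. The sketch shows why binary codes are cheap to store and compare (one integer per item, XOR plus a bit count per comparison) and why very short codes lose discriminability (many distinct vectors collapse onto the same code).

```python
import random


def binarize(features):
    """Pack a real-valued vector into an int: bit i is 1 if features[i] > 0.

    Assumption: plain sign thresholding, not a learned hash layer.
    """
    code = 0
    for i, value in enumerate(features):
        if value > 0:
            code |= 1 << i
    return code


def hamming(a, b):
    """Hamming distance between two packed codes: XOR, then count set bits."""
    return bin(a ^ b).count("1")


def nearest(query_code, database_codes):
    """Index of the database code closest to the query in Hamming distance."""
    return min(
        range(len(database_codes)),
        key=lambda i: hamming(query_code, database_codes[i]),
    )


def count_collisions(vectors, code_length):
    """Count vectors whose code was already produced by an earlier vector.

    Assumption: a short code is simulated by hashing only the first
    code_length dimensions of each vector.
    """
    seen = set()
    collisions = 0
    for vector in vectors:
        code = binarize(vector[:code_length])
        if code in seen:
            collisions += 1
        seen.add(code)
    return collisions


# Hypothetical data: 200 random 64-dimensional vectors with a fixed seed.
rng = random.Random(0)
vectors = [[rng.gauss(0.0, 1.0) for _ in range(64)] for _ in range(200)]

short_collisions = count_collisions(vectors, 4)   # 4-bit codes: at most 16 distinct values
long_collisions = count_collisions(vectors, 64)   # 64-bit codes
print(short_collisions, long_collisions)
```

With 4-bit codes there are only 16 possible values, so at least 184 of the 200 vectors must share a code with an earlier one, whatever the data. This is the kind of discriminability loss at short code lengths that the abstract's second training step is meant to counteract.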

Original language: English
Pages (from-to): 67-78
Number of pages: 12
Journal: Neurocomputing
Volume: 494
Publication status: Published - 2022 Jul 14

Keywords

  • Computer vision
  • Deep hashing
  • Metric learning
  • Representation learning
  • Transfer learning

ASJC Scopus subject areas

  • Computer Science Applications
  • Cognitive Neuroscience
  • Artificial Intelligence
