K Nearest Neighbor Similarity Join Algorithm on High-Dimensional Data Using Novel Partitioning Strategy

Youzhong Ma, Qiaozhi Hua*, Zheng Wen, Ruiling Zhang, Yongxin Zhang, Haipeng Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

k nearest neighbor similarity join on high-dimensional data has broad applications in many fields; several key challenges still exist for this task such as "curse of dimensionality"and large scale of the dataset. A new dimensionality reduction scheme is proposed by using random projection technique, then we design two novel partition strategies, including equal width partition strategy and distance split tree-based partition strategy, and finally, we propose k nearest neighbor join algorithm on high-dimensional data based on the above partition strategies. We conduct comprehensive experiments to test the performance of the proposed approaches, and the experimental results show that the proposed methods have good effectiveness and performance.

Original languageEnglish
Article number1249393
JournalSecurity and Communication Networks
Volume2022
DOIs
Publication statusPublished - 2022

ASJC Scopus subject areas

  • Information Systems
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'K Nearest Neighbor Similarity Join Algorithm on High-Dimensional Data Using Novel Partitioning Strategy'. Together they form a unique fingerprint.

Cite this