Discovering similar malware samples using API call topics

Akinori Fujino, Junichi Murakami, Tatsuya Mori

Research output: Chapter in Book/Report/Conference proceedingConference contribution

23 Citations (Scopus)

Abstract

To automate malware analysis, dynamic malware analysis systems have attracted increasing attention from both the industry and research communities. Of the various logs collected by such systems, the API call is a very promising source of information for characterizing malware behavior. This work aims to extract similar malware samples automatically using the concept of 'API call topics,' which represents a set of API calls that are intrinsic to a specific group of malware samples. We first convert Win32 API calls into 'API words.' We then apply non-negative matrix factorization (NMF) clustering analysis to the corpus of the extracted API words. NMF automatically generates the API call topics from the API words. The contributions of this work can be summarized as follows. We present an unsupervised approach to extract API call topics from a large corpus of API calls. Through analysis of the API call logs collected from thousands of malware samples, we demonstrate that the extracted API call topics can detect similar malware samples. The proposed approach is expected to be useful for automating the process of analyzing a huge volume of logs collected from dynamic malware analysis systems.

Original languageEnglish
Title of host publication2015 12th Annual IEEE Consumer Communications and Networking Conference, CCNC 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages140-147
Number of pages8
ISBN (Electronic)9781479963904
DOIs
Publication statusPublished - 2015 Jul 14
Event2015 12th Annual IEEE Consumer Communications and Networking Conference, CCNC 2015 - Las Vegas, United States
Duration: 2015 Jan 92015 Jan 12

Publication series

Name2015 12th Annual IEEE Consumer Communications and Networking Conference, CCNC 2015

Other

Other2015 12th Annual IEEE Consumer Communications and Networking Conference, CCNC 2015
Country/TerritoryUnited States
CityLas Vegas
Period15/1/915/1/12

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Discovering similar malware samples using API call topics'. Together they form a unique fingerprint.

Cite this