Research Article Open Access

Concordance and Term Frequency in Analyzing API Calls for Malware Behavior Detection

Nur Hilda Amira Abd Wahab1, Masnizah Mohd1, Ravie Chandren Muniyandi1, Balaji Rajendran2 and Gopinath Palaniappan2
  • 1 Universiti Kebangsaan Malaysia, Malaysia
  • 2 Centre for Development of Advanced Computing, India

Abstract

Application Programming Interface (API) is used for the software to interact with an operating system to do certain task such as opening file, deleting file and many more. Programmers use this API to make it easier for their program to communicate with the operating system without having the knowledge of the hardware of the target system. Malware author is an attacker that may belong to an organization or work for themselves. Some malware author has the capabilities to write their own malware, uses the same kind of APIs that is used to create normal programs to create malware. There are many researches done in this field, however, most researchers used n-gram to detect the sequence of API calls and although it gave good results, it is time consuming to process through all the output. This is the reason why this paper proposed to use Concordance to search for the API call sequence of a malware because it uses KWIC (Key Word in Context), thus only displayed the output based on the queried keyword. After that, Term Frequency (TF) is used to search for the most commonly used APIs in the dataset. The results of the experiment show that concordance can be used to search for API call sequence as we manage to identify six malicious behaviors (Install Itself at Startup, Enumerate All Process, Privilege Escalation, Terminate Process, Process Hollowing and Ant debugging) using this method. And based on the TF score, the most commonly used API in the dataset is the Reg Close Key (TF: 1.388), which on its own is not a dangerous API, hence we can infer that most API is not malicious in nature, it is how they were implemented is making them dangerous.

Journal of Computer Science
Volume 15 No. 9, 2019, 1307-1319

DOI: https://doi.org/10.3844/jcssp.2019.1307.1319

Submitted On: 10 June 2019 Published On: 25 September 2019

How to Cite: Abd Wahab, N. H. A., Mohd, M., Muniyandi, R. C., Rajendran, B. & Palaniappan, G. (2019). Concordance and Term Frequency in Analyzing API Calls for Malware Behavior Detection. Journal of Computer Science, 15(9), 1307-1319. https://doi.org/10.3844/jcssp.2019.1307.1319

  • 3,266 Views
  • 1,504 Downloads
  • 0 Citations

Download

Keywords

  • Concordance
  • KWIC
  • API Call Sequence
  • Malware Behaviors
  • Dynamic Analysis