Automatic malware categorization based on K-means clustering technique

Nazifa Mosharrat, Iqbal H. Sarker*, Md Musfique Anwar, Muhammad Nazrul Islam, Paul Watters, Mohammad Hammoudeh

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

    4 Citations (Scopus)

    Abstract

    The android operating system is a popular operating system for mobile phone applications. This is also known as an open-source operating system so that the developers can easily update and add new features to it. However, it poses significant challenges related to malicious attacks or cyberattacks because of its open system design philosophy. Nowadays, the number of malware applications is increasing rapidly and proportionally with safe android applications. As a result, it has become very challenging to identify their behaviors or signatures or categorizes them to implement protection in the android system. In this research work, we propose an automated system for malware categorization using the K-means clustering method that automatically chooses the cluster number. In our method, we have categorized malware into an optimum number of different cluster families by using a real-time malware dataset. We also compare our automated model with the traditional cluster selection technique with Elbow and Silhouette method. Experimental results demonstrate that our model determines the optimal cluster number with less user intervention for malware categorization.

    Original languageEnglish
    Title of host publicationProceedings of the International Conference on Big Data, IoT, and Machine Learning
    Subtitle of host publicationBIM 2021
    EditorsMohammad Shamsul Arefin, M. Shamim Kaiser, Anirban Bandyopadhyay, Md. Atiqur Rahman Ahad, Kanad Ray
    Place of PublicationSingapore
    PublisherSpringer, Springer Nature
    Pages653-664
    Number of pages12
    ISBN (Electronic)9789811666360
    ISBN (Print)9789811666353
    DOIs
    Publication statusPublished - 2022
    EventInternational Conference on Big Data, IoT, and Machine Learning (2021) - Cox's Bazar, Bangladesh
    Duration: 23 Sept 202125 Sept 2021

    Publication series

    NameLecture Notes on Data Engineering and Communications Technologies
    Volume95
    ISSN (Print)2367-4512
    ISSN (Electronic)2367-4520

    Conference

    ConferenceInternational Conference on Big Data, IoT, and Machine Learning (2021)
    Abbreviated titleBIM 2021
    Country/TerritoryBangladesh
    CityCox's Bazar
    Period23/09/2125/09/21

    Keywords

    • Android applications
    • Clustering
    • Cybersecurity
    • K-means
    • Machine learning
    • Malware categorization
    • Malware detection

    Fingerprint

    Dive into the research topics of 'Automatic malware categorization based on K-means clustering technique'. Together they form a unique fingerprint.

    Cite this