Cybersecurity data science: an overview from machine learning perspective

Iqbal H. Sarker*, A. S. M. Kayes, Shahriar Badsha, Hamed Alqahtani, Paul Watters, Alex Ng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

274 Citations (Scopus)
184 Downloads (Pure)


In a computing context, cybersecurity is undergoing massive shifts in technology and its operations in recent days, and data science is driving the change. Extracting security incident patterns or insights from cybersecurity data and building corresponding data-driven model, is the key to make a security system automated and intelligent. To understand and analyze the actual phenomena with data, various scientific methods, machine learning techniques, processes, and systems are used, which is commonly known as data science. In this paper, we focus and briefly discuss on cybersecurity data science, where the data is being gathered from relevant cybersecurity sources, and the analytics complement the latest data-driven patterns for providing more effective security solutions. The concept of cybersecurity data science allows making the computing process more actionable and intelligent as compared to traditional ones in the domain of cybersecurity. We then discuss and summarize a number of associated research issues and future directions. Furthermore, we provide a machine learning based multi-layered framework for the purpose of cybersecurity modeling. Overall, our goal is not only to discuss cybersecurity data science and relevant methods but also to focus the applicability towards data-driven intelligent decision making for protecting the systems from cyber-attacks.

Original languageEnglish
Article number41
Pages (from-to)1-29
Number of pages29
JournalJournal of Big Data
Issue number1
Publication statusPublished - 1 Jul 2020

Bibliographical note

Copyright the Author(s) 2020. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.


  • Cyber threat intelligence
  • Cyber-attack
  • Cybersecurity
  • Data science
  • Decision making
  • Intrusion detection
  • Machine learning
  • Security modeling


Dive into the research topics of 'Cybersecurity data science: an overview from machine learning perspective'. Together they form a unique fingerprint.

Cite this