Am I rare? An intelligent summarization approach for identifying hidden anomalies

Samira Ghodratnama*, Mehrdad Zakershahrak, Fariborz Sobhanmanesh

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

6 Citations (Scopus)

Abstract

Monitoring network traffic data to detect any hidden patterns of anomalies is a challenging and time-consuming task which requires high computing resources. To this end, an appropriate summarization technique is of great importance, where it can be a substitute for the original data. However, the summarized data is under the threat of removing anomalies. Therefore, it is vital to create a summary that can reflect the same pattern as the original data. Therefore, in this paper, we propose an INtelligent Summarization approach for IDENTifying hidden anomalies, called INSIDENT. The proposed approach guarantees to keep the original data distribution in summarized data. Our approach is a clustering-based algorithm that dynamically maps original feature space to a new feature space by locally weighting features in each cluster. Therefore, in new feature space, similar samples are closer, and consequently, outliers are more detectable. Besides, selecting representatives based on cluster size keeps the same distribution as the original data in summarized data. INSIDENT can be used both as the preprocess approach before performing anomaly detection algorithms and anomaly detection algorithm. The experimental results on benchmark datasets prove a summary of the data can be a substitute for original data in the anomaly detection task.

Original languageEnglish
Title of host publicationService-Oriented Computing – ICSOC 2020 Workshops
Subtitle of host publicationAIOps, CFTIC, STRAPS, AI-PA, AI-IOTS, and Satellite Events Dubai, United Arab Emirates, December 14–17, 2020: Proceedings
EditorsHakim Hacid, Fatma Outay, Hye-young Paik, Amira Alloum, Marinella Petrocchi, Mohamed Reda Bouadjenek, Amin Beheshti, Xumin Liu, Abderrahmane Maaradji
Place of PublicationCham, Switzerland
PublisherSpringer, Springer Nature
Pages309-323
Number of pages15
ISBN (Electronic)9783030763527
ISBN (Print)9783030763510
DOIs
Publication statusPublished - 2021
EventAIOps, CFTIC, STRAPS, AI-PA, AI-IOTS, and Satellite Events held in conjunction with 18th International Conference on Service-Oriented Computing, ICSOC 2020 - Virtual, Online
Duration: 14 Dec 202017 Dec 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12632
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceAIOps, CFTIC, STRAPS, AI-PA, AI-IOTS, and Satellite Events held in conjunction with 18th International Conference on Service-Oriented Computing, ICSOC 2020
CityVirtual, Online
Period14/12/2017/12/20

Keywords

  • Anomaly detection
  • Summarization
  • Network data
  • Clustering
  • Classification

Fingerprint

Dive into the research topics of 'Am I rare? An intelligent summarization approach for identifying hidden anomalies'. Together they form a unique fingerprint.

Cite this