Projects per year
Abstract
Anomaly detection is a significant but challenging data mining task in a wide range of applications. Different domains usually use different ways to measure the characteristics of data and to define the anomaly types. As a result, it is a big challenge to develop a versatile anomaly detection framework that can be universally applied with satisfactory performance in most, if not all, applications. In this article, we propose a generic isolation forest based ensemble framework named EDBHiForest, which can be universally applied to data spaces with arbitrary distance measures. It is realized through embedding the isolation forest structure with extended distance-based hashing (EDBH), which can significantly enhance the versatility and applicability of isolation forest based anomaly detection. This framework overcomes the limitations of existing isolation forest based methods that can only be applied to datasets with a very limited range of distance measure types. Extensive experiments on various non-independent and identically distributed datasets demonstrate the effectiveness and efficiency of our approach.
Original language | English |
---|---|
Pages (from-to) | 31-40 |
Number of pages | 10 |
Journal | IEEE Intelligent Systems |
Volume | 36 |
Issue number | 3 |
DOIs | |
Publication status | Published - May 2021 |
Keywords
- anomaly detection
- data mining
- measurement
- extraterrestrial measurements
- hash functions
- intelligent systems
Fingerprint
Dive into the research topics of 'Isolation forest based anomaly detection framework on non-IID data'. Together they form a unique fingerprint.Projects
- 1 Finished
-
DE21 : Scalable and Deep Anomaly Detection from Big Data with Similarity Hashing
1/01/21 → 31/12/23
Project: Research