VM-NSP: vertical negative sequential pattern mining with loose negative element constraints

Wei Wang, Longbing Cao

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

Negative sequential patterns (NSPs) capture more informative and actionable knowledge than classic positive sequential patterns (PSPs) due to the involvement of both occurring and nonoccurring behaviors and events, which can contribute to many relevant applications. However, NSP mining is nontrivial, as it involves fundamental challenges requiring distinct theoretical foundations and is not directly addressable by PSP mining. In the very limited research reported on NSP mining, a negative element constraint (NEC) is incorporated to consider only NSPs composed of specific forms of elements (containing either positive or negative items), which results in many valuable NSPs being missed. Here, we loosen the NEC (called loose negative element constraint (LNEC)) to include partial negative elements containing both positive and negative items, which enables the discovery of more flexible patterns but introduces significant new learning challenges, such as representing and mining complete NSPs. Accordingly, we formalize the LNEC-based NSP mining problem and propose a novel vertical NSP mining framework, VM-NSP, to efficiently mine the complete set of NSPs using a vertical representation (VR) of each sequence. An efficient bitmap-based vertical NSP mining algorithm, bM-NSP, introduces a bitmap hash table-based VR and a prefix-based negative sequential candidate generation strategy to optimize the discovery performance. VM-NSP and its implementation bM-NSP form the first VR-based approach for complete NSP mining with LNEC. Theoretical analyses and experiments confirm the performance superiority of bM-NSP on synthetic and real-life datasets w.r.t. diverse data factors. This work substantially expands existing NSP mining methods toward flexible NSP discovery.
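To make the idea of a bitmap-based vertical representation more concrete, the following is a minimal illustrative sketch, not the bM-NSP algorithm itself. It assumes a toy database in which each sequence is a list of elements (sets of items), stores for each item a per-sequence bitmask of the positions where it occurs, and counts support for a simple pattern of the form <a, ¬b, c>. The containment rule used here (a occurs, c occurs later, and b does not occur strictly between them) is an assumption chosen for illustration, and the function names `build_vertical`, `positions`, `contains_a_notb_c`, and `support` are hypothetical. The sketch does not cover elements that mix positive and negative items, which is what the LNEC setting addresses.

```python
from collections import defaultdict


def build_vertical(db):
    """Map item -> {sequence id -> bitmask of positions where the item occurs}."""
    vr = defaultdict(dict)
    for sid, seq in enumerate(db):
        for pos, element in enumerate(seq):
            for item in element:
                vr[item][sid] = vr[item].get(sid, 0) | (1 << pos)
    return vr


def positions(mask):
    """Return the set bit positions of a bitmask in ascending order."""
    return [i for i in range(mask.bit_length()) if (mask >> i) & 1]


def contains_a_notb_c(vr, sid, a, b, c):
    """Assumed rule: a at position i, c at position j > i, no b strictly between."""
    ma = vr.get(a, {}).get(sid, 0)
    mb = vr.get(b, {}).get(sid, 0)
    mc = vr.get(c, {}).get(sid, 0)
    for i in positions(ma):
        for j in positions(mc):
            if j > i:
                # Bits i+1 .. j-1 mark the positions strictly between a and c.
                between = ((1 << j) - 1) & ~((1 << (i + 1)) - 1)
                if mb & between == 0:
                    return True
    return False


def support(vr, n_sequences, a, b, c):
    """Count the sequences that contain <a, not b, c> under the assumed rule."""
    return sum(contains_a_notb_c(vr, sid, a, b, c) for sid in range(n_sequences))
```

Because each item's occurrences are stored as integers, the check for an absent item reduces to a single bitwise AND against a mask of the intervening positions, which is the kind of operation that makes vertical bitmap layouts attractive for this task.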

Original language: English
Article number: 22
Pages (from-to): 1-27
Number of pages: 27
Journal: ACM Transactions on Information Systems
Volume: 39
Issue number: 2
Publication status: Published - Apr 2021
Externally published: Yes

Keywords

  • Sequence analysis
  • negative sequence analysis
  • negative sequential pattern mining
  • vertical representation
  • nonoccurring behavior analytics
  • behavior informatics
