Non-iidness learning in behavioral and social data

Longbing Cao*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

68 Citations (Scopus)

Abstract

Most of the classic theoretical systems and tools in statistics, data mining and machine learning are built on the fundamental assumption of IIDness, which assumes the independence and identical distribution of underlying objects, attributes and/or values. However, complex behavioral and social problems often exhibit strong couplings and heterogeneity between values, attributes and objects (i.e., non-IIDness). This fundamentally challenges the IIDness-based learning methodologies and techniques. This paper presents a high-level overview of the needs, challenges and opportunities of non-IIDness learning for handling complex behavioral and social problems. By reviewing the nature and issues of classic IIDness-based algorithms in frequent pattern mining, clustering and classification to complex behavioral and social applications, concepts, structures, frameworks and exemplar techniques are discussed for non-IIDness learning. Case studies, relatedwork and prospects of non-IIDness learning are presented. Non-IIDness learning is also a fundamental issue in big data analytics.

Original languageEnglish
Pages (from-to)1358-1370
Number of pages13
JournalComputer Journal
Volume57
Issue number9
DOIs
Publication statusPublished - Sept 2014
Externally publishedYes

Keywords

  • non-IIDness learning
  • IIDness
  • IID data
  • non-IID data
  • coupling
  • behavior informatics
  • social informatics

Fingerprint

Dive into the research topics of 'Non-iidness learning in behavioral and social data'. Together they form a unique fingerprint.

Cite this