TY - JOUR
T1 - Non-iidness learning in behavioral and social data
AU - Cao, Longbing
PY - 2014/9
Y1 - 2014/9
N2 - Most of the classic theoretical systems and tools in statistics, data mining and machine learning are built on the fundamental assumption of IIDness, which assumes the independence and identical distribution of underlying objects, attributes and/or values. However, complex behavioral and social problems often exhibit strong couplings and heterogeneity between values, attributes and objects (i.e., non-IIDness). This fundamentally challenges the IIDness-based learning methodologies and techniques. This paper presents a high-level overview of the needs, challenges and opportunities of non-IIDness learning for handling complex behavioral and social problems. By reviewing the nature and issues of classic IIDness-based algorithms in frequent pattern mining, clustering and classification to complex behavioral and social applications, concepts, structures, frameworks and exemplar techniques are discussed for non-IIDness learning. Case studies, relatedwork and prospects of non-IIDness learning are presented. Non-IIDness learning is also a fundamental issue in big data analytics.
AB - Most of the classic theoretical systems and tools in statistics, data mining and machine learning are built on the fundamental assumption of IIDness, which assumes the independence and identical distribution of underlying objects, attributes and/or values. However, complex behavioral and social problems often exhibit strong couplings and heterogeneity between values, attributes and objects (i.e., non-IIDness). This fundamentally challenges the IIDness-based learning methodologies and techniques. This paper presents a high-level overview of the needs, challenges and opportunities of non-IIDness learning for handling complex behavioral and social problems. By reviewing the nature and issues of classic IIDness-based algorithms in frequent pattern mining, clustering and classification to complex behavioral and social applications, concepts, structures, frameworks and exemplar techniques are discussed for non-IIDness learning. Case studies, relatedwork and prospects of non-IIDness learning are presented. Non-IIDness learning is also a fundamental issue in big data analytics.
KW - non-IIDness learning
KW - IIDness
KW - IID data
KW - non-IID data
KW - coupling
KW - behavior informatics
KW - social informatics
UR - http://www.scopus.com/inward/record.url?scp=84901672955&partnerID=8YFLogxK
UR - http://purl.org/au-research/grants/arc/DP1096218
UR - http://purl.org/au-research/grants/arc/DP130102691
UR - http://purl.org/au-research/grants/arc/LP100200774
U2 - 10.1093/comjnl/bxt084
DO - 10.1093/comjnl/bxt084
M3 - Article
SN - 0010-4620
VL - 57
SP - 1358
EP - 1370
JO - Computer Journal
JF - Computer Journal
IS - 9
ER -