Mining outlier participants: insights using directional distributions in latent models

Didi Surian, Sanjay Chawla

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

3 Citations (Scopus)

Abstract

In this paper we will propose a new probabilistic topic model to score the expertise of participants on the projects that they contribute to based on their previous experience. Based on each participant's score, we rank participants and define those who have the lowest scores as outlier participants. Since the focus of our study is on outliers, we name the model as Mining Outlier Participants from Projects (MOPP) model. MOPP is a topic model that is based on directional distributions which are particularly suitable for outlier detection in high-dimensional spaces. Extensive experiments on both synthetic and real data sets have shown that MOPP gives better results on both topic modeling and outlier detection tasks.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2013, Proceedings, Part 3
EditorsHendrik Blockeel
Place of PublicationPrague, Czech Republic
PublisherSpringer, Springer Nature
Pages337-352
Number of pages16
Volume8190 LNAI
ISBN (Electronic)9783642409943
ISBN (Print)9783642409936
DOIs
Publication statusPublished - 2013
Externally publishedYes
EventEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2013 - Prague, Czech Republic
Duration: 23 Sept 201327 Sept 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 3
Volume8190 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

OtherEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2013
Country/TerritoryCzech Republic
CityPrague
Period23/09/1327/09/13

Fingerprint

Dive into the research topics of 'Mining outlier participants: insights using directional distributions in latent models'. Together they form a unique fingerprint.

Cite this