Recentred local profiles for authorship attribution

Robert Layton*, Paul Watters, Richard Dazeley

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

23 Citations (Scopus)

Abstract

Authorship attribution methods aim to determine the author of a document, by using information gathered from a set of documents with known authors. One method of performing this task is to create profiles containing distinctive features known to be used by each author. In this paper, a new method of creating an author or document profile is presented that detects features considered distinctive, compared to normal language usage. This recentreing approach creates more accurate profiles than previous methods, as demonstrated empirically using a known corpus of authorship problems. This method, named recentred local profiles, determines authorship accurately using a simple 'best matching author' approach to classification, compared to other methods in the literature. The proposed method is shown to be more stable than related methods as parameter values change. Using a weighted voting scheme, recentred local profiles is shown to outperform other methods in authorship attribution, with an overall accuracy of 69.9% on the ad-hoc authorship attribution competition corpus, representing a significant improvement over related methods.

Original languageEnglish
Pages (from-to)293-312
Number of pages20
JournalNatural Language Engineering
Volume18
Issue number3
DOIs
Publication statusPublished - Jul 2012
Externally publishedYes

Fingerprint

Dive into the research topics of 'Recentred local profiles for authorship attribution'. Together they form a unique fingerprint.

Cite this