Abstract
Internet Relay Chat (IRC) is a useful and relativelysimple protocol for text based chat online, used in a variety ofareas online such as for discussion and technical support. IRC isalso used for cybercrime, with online rooms selling stolen creditcard details, botnet access and malware. The reasons for theuse of IRC in cybercrime include the widespread adoption andease of use, but also focus around the anonymity granted bythe protocol, allowing users to hide behind aliases that can bechanged regularly. In this research, we apply authorship analysistechniques to be able to attribute chat messages to known aliases.A preliminary experiment shows that this application is verydifficult, due to the short messages and repeated information.To improve the accuracy, we apply inverse-author-frequency(iaf) weighting, which gives higher weights to features used byfewer authors. This research is the first time that iaf has beenapplied to character n-gram models, previously being applied toword based models of authorship. We find that this improvesthe accuracy significantly for the RLP method and provides aplatform for successful applications of authorship analysis in thefuture. Overall, the method achieves accuracies of over 55% ina very difficult application domain.
Original language | English |
---|---|
Title of host publication | Proceedings - 2012 3rd Cybercrime and Trustworthy Computing Workshop, CTC 2012 |
Place of Publication | Piscataway, NJ |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 7-13 |
Number of pages | 7 |
ISBN (Electronic) | 9780769549408 |
ISBN (Print) | 9781467364607 |
DOIs | |
Publication status | Published - 2013 |
Event | 2012 3rd Cybercrime and Trustworthy Computing Workshop, CTC 2012 - Ballarat, VIC, Australia Duration: 29 Oct 2012 → 30 Oct 2012 |
Other
Other | 2012 3rd Cybercrime and Trustworthy Computing Workshop, CTC 2012 |
---|---|
Country | Australia |
City | Ballarat, VIC |
Period | 29/10/12 → 30/10/12 |