Fake news detection in multiple platforms and languages

Pedro Henrique Arruda Faustini, Thiago Ferreira Covões*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

104 Citations (Scopus)

Abstract

The debate around fake news has grown recently because of the potential harm they can have on different fields, being politics one of the most affected. Due to the amount of news being published every day, several studies in computer science have proposed models using machine learning to detect fake news. However, most of these studies focus on news from one language (mostly English) or rely on characteristics of social media-specific platforms (like Twitter or Sina Weibo). Our work proposes to detect fake news using only text features that can be generated regardless of the source platform and are the most independent of the language as possible. We carried out experiments from five datasets, comprising both texts and social media posts, in three language groups: Germanic, Latin, and Slavic, and got competitive results when compared to benchmarks. We compared the results obtained through a custom set of features and with other popular techniques when dealing with natural language processing, such as bag-of-words and Word2Vec.

Original languageEnglish
Article number113503
Pages (from-to)1-9
Number of pages9
JournalExpert Systems with Applications
Volume158
DOIs
Publication statusPublished - 15 Nov 2020
Externally publishedYes

Keywords

  • Fake news
  • Machine learning
  • Supervised learning

Fingerprint

Dive into the research topics of 'Fake news detection in multiple platforms and languages'. Together they form a unique fingerprint.

Cite this