Textual article clustering in newspaper pages

Marco Aiello*, Andrea Pegoretti

*Corresponding author for this work

Research output: Contribution to journal, Article

8 Citations (Scopus)

Abstract

In the analysis of a newspaper page, an important step is the clustering of the various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and on experiments with actual pages from the Italian newspaper L'Adige, we select one of the algorithms as the preferred choice for solving the textual clustering problem.

Original language: English
Pages (from-to): 767-796
Number of pages: 30
Journal: Applied Artificial Intelligence
Volume: 20
Issue number: 9
Publication status: Published, 1 Dec 2006
Externally published: Yes
