The implementation and evaluation of a lexicon-based stemmer

Gilberto Silva*, Claudia Oliveira

*Corresponding author for this work

Research output: Contribution to journal, Review article

2 Citations (Scopus)

Abstract

This paper describes a stemming technique that depends principally on a target language's lexicon, organised as an automaton of word strings. The clear separation between the lexicon and the procedure itself allows the stemmer to be customised for any language with few or even no changes to the program's source code. An implementation of the stemmer, with a medium-sized Portuguese lexicon, is evaluated using Paice's [16] evaluation method.
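The abstract does not give the automaton's construction or the stemming procedure, so the following is only a minimal sketch of the general idea, under stated assumptions: the lexicon is stored as a trie (a simple deterministic automaton over characters), each accepting state carries a stem, and unknown words are returned unchanged. The class names `TrieNode` and `LexiconStemmer`, the fallback behaviour, and the toy Portuguese entries are all hypothetical illustrations, not taken from the paper.

```python
from typing import Dict, Optional


class TrieNode:
    """One state of the lexicon automaton (hypothetical structure)."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        # Stem attached to an accepting state; None for non-accepting states.
        self.stem: Optional[str] = None


class LexiconStemmer:
    """Language-independent procedure; all language knowledge is in the lexicon."""

    def __init__(self, lexicon: Dict[str, str]) -> None:
        self.root = TrieNode()
        for word, stem in lexicon.items():
            self._insert(word.lower(), stem)

    def _insert(self, word: str, stem: str) -> None:
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.stem = stem

    def stem(self, word: str) -> str:
        """Walk the automaton; return the stored stem, or the word if unknown."""
        node = self.root
        for ch in word.lower():
            next_node = node.children.get(ch)
            if next_node is None:
                return word  # assumption: unknown words are left unchanged
            node = next_node
        return node.stem if node.stem is not None else word


# Toy lexicon invented for illustration only (not the paper's lexicon).
TOY_LEXICON = {
    "menino": "menin",
    "menina": "menin",
    "meninos": "menin",
    "cantar": "cant",
    "cantava": "cant",
}

stemmer = LexiconStemmer(TOY_LEXICON)
print(stemmer.stem("meninos"))  # menin
print(stemmer.stem("Cantava"))  # cant
print(stemmer.stem("livro"))    # livro (not in the toy lexicon)
```

In this sketch, switching to another language means passing a different lexicon dictionary to `LexiconStemmer`; the lookup code itself stays the same, which mirrors the separation between lexicon and procedure that the abstract emphasises.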

Original language: English
Pages (from-to): 266-276
Number of pages: 11
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2857
Publication status: Published - 2003
