The implementation and evaluation of a lexicon-based stemmer

Gilberto Silva*, Claudia Oliveira

*Corresponding author for this work

    Research output: Contribution to journalReview articlepeer-review

    2 Citations (Scopus)

    Abstract

    This paper describes a stemming technique that depends principally on a target language's lexicon, organised as an automaton of word strings. The clear distinction between the lexicon and the procedure itself allows the stemmer to be customised for any language with little or even no changes to the program's source code. An implementation of the stemmer, with a medium sized Portuguese lexicon is evaluated using Paice's [16] evaluation method.

    Original languageEnglish
    Pages (from-to)266-276
    Number of pages11
    JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume2857
    Publication statusPublished - 2003

    Fingerprint

    Dive into the research topics of 'The implementation and evaluation of a lexicon-based stemmer'. Together they form a unique fingerprint.

    Cite this