Tree edit distance as a baseline approach for paraphrase representation

Marta Vila*, Mark Dras

*Corresponding author for this work

Research output: Contribution to journalArticle

1 Citation (Scopus)


Finding an adequate paraphrase representation formalism is a challenging issue in Natural Language Processing. In this paper, we analyse the performance of Tree Edit Distance as a paraphrase representation baseline. Our experiments using Edit Distance Textual Entailment Suite show that, as Tree Edit Distance consists of a purely syntactic approach, paraphrase alternations not based on structural reorganizations do not find an adequate representation. They also show that there is much scope for better modelling of the way trees are aligned.

Original languageEnglish
Pages (from-to)89-95
Number of pages7
JournalProcesamiento de Lenguaje Natural
Publication statusPublished - Mar 2012

Fingerprint Dive into the research topics of 'Tree edit distance as a baseline approach for paraphrase representation'. Together they form a unique fingerprint.

  • Cite this