Parser features for sentence grammaticality classification

Sze-Meng Jojo Wong, Mark Dras

Research output: Contribution to journalConference paperpeer-review

Abstract

Automatically judging sentences for their grammaticality is potentially useful for several purposes — evaluating language technology systems, assessing language competence of second or foreign language learners, and so on. Previous work has examined parser ‘byproducts’, in particular parse probabilities, to distinguish grammatical sentences from ungrammatical ones. The aim of the present paper is to examine whether the primary output of a parser, which we characterise via CFG production rules embodied in a parse, contains useful information for sentence grammaticality classification; and also to examine which feature selection metrics are most useful in this task. Our results show that using gold standard production rules alone can improve over using parse probabilities alone. Combining parser-produced production rules with parse probabilities further produces an improvement of 1.6% on average in the overall classification accuracy.
Original languageEnglish
Pages (from-to)67-75
Number of pages9
JournalProceedings of the Australasian Language Technology Association Workshop 2010
Publication statusPublished - 2010
EventAustralasian Language Technology Association Workshop - Melbourne
Duration: 9 Dec 201010 Dec 2010

Fingerprint

Dive into the research topics of 'Parser features for sentence grammaticality classification'. Together they form a unique fingerprint.

Cite this