Abstract
We present a new approach to stochastic modeling of constraint-based grammars that is based on loglinear models and uses EM for estimation from unannotated data. The techniques are applied to an LFG grammar for German. Evaluation on an exact match task yields 86% precision for an ambiguity rate of 5.4, and 90% precision on a subcat frame match for an ambiguity rate of 25. Experimental comparison to training from a parsebank shows a 10% gain from EM training. Also, a new class-based grammar lexicalization is presented, showing a 10% gain over unlexicalized models.
Original language | English |
---|---|
Title of host publication | Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL'00), Hong Kong |
Place of Publication | Stroudsburg, PA |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 480-487 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 2000 |
Externally published | Yes |
Event | Annual Meeting of the Association for Computational Linguistics (38th : 2000) - Hong Kong Duration: 1 Oct 2000 → 8 Oct 2000 |
Conference
Conference | Annual Meeting of the Association for Computational Linguistics (38th : 2000) |
---|---|
City | Hong Kong |
Period | 1/10/00 → 8/10/00 |