Abstract
This paper presents a comparative study of five parameter estimation algorithms on four NLP tasks. Three of the five algorithms are well-known in the computational linguistics community: Maximum Entropy (ME) estimation with L 2 regularization, the Averaged Perceptron (AP), and Boosting. We also investigate ME estimation with L 1 regularization using a novel optimization algorithm, and BLasso, which is a version of Boosting with Lasso (L 1) regularization. We first investigate all of our estimators on two re-ranking tasks: a parse selection task and a language model (LM) adaptation task. Then we apply the best of these estimators to two additional tasks involving conditional sequence models: a Conditional Markov Model (CMM) for part of speech tagging and a Conditional Random Field (CRF) for Chinese word segmentation. Our experiments show that across tasks, three of the estimators - ME estimation with L 1 or L 2 regularization, and AP - are in a near statistical tie for first place.
Original language | English |
---|---|
Title of host publication | ACL 2007 - Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics |
Place of Publication | East Stroudsburg, PA |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 824-831 |
Number of pages | 8 |
ISBN (Print) | 9781932432862 |
Publication status | Published - 2007 |
Externally published | Yes |
Event | 45th Annual Meeting of the Association for Computational Linguistics, ACL 2007 - Prague, Czech Republic Duration: 23 Jun 2007 → 30 Jun 2007 |
Other
Other | 45th Annual Meeting of the Association for Computational Linguistics, ACL 2007 |
---|---|
Country/Territory | Czech Republic |
City | Prague |
Period | 23/06/07 → 30/06/07 |