Abstract
We describe a generative model for clustering named entities which also models named entity internal structure, clustering related words by role. The model is entirely unsupervised; it uses features from the named entity itself and its syntactic context, and coreference information from an unsupervised pronoun re-solver. The model scores 86% on the MUC-7 named-entity dataset. To our knowledge, this is the best reported score for a fully unsuper-vised model, and the best score for a generative model.
Original language | English |
---|---|
Title of host publication | Proceeding NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics |
Place of Publication | Stroudsburg, PA |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 164-172 |
Number of pages | 9 |
ISBN (Print) | 9781932432411 |
Publication status | Published - May 2009 |
Externally published | Yes |
Event | Annual Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies, NAACL HLT (10th : 2009) - Boulder, United States Duration: 31 May 2009 → 5 Jun 2009 |
Other
Other | Annual Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies, NAACL HLT (10th : 2009) |
---|---|
Country/Territory | United States |
City | Boulder |
Period | 31/05/09 → 5/06/09 |