Abstract
Quote extraction and attribution is the task of automatically extracting quotes from text and attributing each quote to its correct speaker. The present state-of-the-art system uses gold standard information from previous decisions in its features, which, when removed, results in a large drop in performance. We treat the problem as a sequence labelling task, which allows us to incorporate sequence features without using gold standard information. We present results on two new corpora and an augmented version of a third, achieving a new state-of-the-art for systems using only realistic features.
Original language | English |
---|---|
Title of host publication | EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference |
Pages | 790-799 |
Number of pages | 10 |
Publication status | Published - 2012 |
Event | 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2012 - Jeju Island, Korea, Republic of Duration: 12 Jul 2012 → 14 Jul 2012 |
Other
Other | 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2012 |
---|---|
Country/Territory | Korea, Republic of |
City | Jeju Island |
Period | 12/07/12 → 14/07/12 |