Abstract
In many text-to-text generation scenarios (for instance, summarisation), we encounter human-authored sentences that could be composed by recycling portions of related sentences to form new sentences. In this paper, we couch the generation of such sentences as a search problem. We investigate a statistical sentence generation method which recombines words to form new sentences. We propose an extension to the Viterbi algorithm designed to improve the grammaticality of generated sentences. Within a statistical framework, the extension favours those partially generated strings with a probable dependency tree structure. Our preliminary evaluations show that our approach generates less fragmented text than a bigram baseline.
Original language | English |
---|---|
Title of host publication | Proceedings of the 10th European Workshop on Natural Language Generation, ENLG-05 |
Editors | Graham Wilcock, Kristiina Jokinen, Chris Mellish, Ehud Reiter |
Place of Publication | Aberdeen, UK |
Pages | 211-216 |
Number of pages | 6 |
Publication status | Published - 2005 |
Event | 10th European Workshop on Natural Language Generation, ENLG-05 - Aberdeen, United Kingdom Duration: 8 Aug 2005 → 10 Aug 2005 |
Other
Other | 10th European Workshop on Natural Language Generation, ENLG-05 |
---|---|
Country/Territory | United Kingdom |
City | Aberdeen |
Period | 8/08/05 → 10/08/05 |