Development of a corpus for evidence based medicine summarisation

Diego Mollá, Maria Elena Santiago-Martinez

Research output: Contribution to journal, Conference paper, peer-review

26 Citations (Scopus)
71 Downloads (Pure)


In this paper we introduce some of the key NLP problems that arise in the practice of Evidence Based Medicine and propose multi-document query-focused summarisation as a key approach to solving them. We have completed a corpus for the development of this multi-document query-focused summarisation task. The process of building the corpus combined automated text extraction, manual annotation, and crowdsourcing to find the reference IDs. We perform a statistical analysis of the corpus for the particular use of single-document summarisation and show that there is still considerable room for improvement over the current baselines.
Original language: English
Pages (from-to): 86-94
Number of pages: 9
Journal: Proceedings of the Australasian Language Technology Association Workshop 2011
Publication status: Published - 2011
Event: Australasian Language Technology Association Workshop (9th : 2011) - Canberra
Duration: 1 Dec 2011 to 2 Dec 2011

Bibliographical note

Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

