Abstract
The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available online. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackled the problem of annotating sentences
for training classification systems and show that a simple annotation approach outperforms regression-based summarisation.
for training classification systems and show that a simple annotation approach outperforms regression-based summarisation.
Original language | English |
---|---|
Title of host publication | Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI) |
Subtitle of host publication | Proceedings of the Workshop |
Place of Publication | Stroudsburg |
Publisher | Association for Computational Linguistics |
Pages | 29-37 |
Number of pages | 9 |
ISBN (Electronic) | 9781948087742 |
Publication status | Published - 2018 |
Event | 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Brussels, Belgium Duration: 31 Oct 2018 → 4 Nov 2018 |
Conference
Conference | 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP) |
---|---|
Country/Territory | Belgium |
City | Brussels |
Period | 31/10/18 → 4/11/18 |