Supervised machine learning for extractive query based summarisation of biomedical data

Mandeep Kaur, Diego Molla

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionResearchpeer-review

Abstract

The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available online. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackled the problem of annotating sentences
for training classification systems and show that a simple annotation approach outperforms regression-based summarisation.
LanguageEnglish
Title of host publicationNinth International Workshop on Health Text Mining and Information Analysis (LOUHI)
Subtitle of host publicationProceedings of the Workshop
Place of PublicationStroudsburg
PublisherAssociation for Computational Linguistics
Pages29-37
Number of pages9
ISBN (Electronic)9781948087742
Publication statusPublished - 2018
Event2018 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Brussels, Belgium
Duration: 31 Oct 20184 Nov 2018

Conference

Conference2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)
CountryBelgium
CityBrussels
Period31/10/184/11/18

Fingerprint

Learning systems
Automation

Cite this

Mandeep Kaur, & Molla, D. (2018). Supervised machine learning for extractive query based summarisation of biomedical data. In Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI): Proceedings of the Workshop (pp. 29-37). Stroudsburg: Association for Computational Linguistics.
Mandeep Kaur, ; Molla, Diego. / Supervised machine learning for extractive query based summarisation of biomedical data. Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI): Proceedings of the Workshop. Stroudsburg : Association for Computational Linguistics, 2018. pp. 29-37
@inproceedings{5c7c4034066a4a8192f18e9c1d20db9c,
title = "Supervised machine learning for extractive query based summarisation of biomedical data",
abstract = "The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available online. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackled the problem of annotating sentencesfor training classification systems and show that a simple annotation approach outperforms regression-based summarisation.",
author = "{Mandeep Kaur} and Diego Molla",
year = "2018",
language = "English",
pages = "29--37",
booktitle = "Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI)",
publisher = "Association for Computational Linguistics",

}

Mandeep Kaur, & Molla, D 2018, Supervised machine learning for extractive query based summarisation of biomedical data. in Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI): Proceedings of the Workshop. Association for Computational Linguistics, Stroudsburg, pp. 29-37, 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Brussels, Belgium, 31/10/18.

Supervised machine learning for extractive query based summarisation of biomedical data. / Mandeep Kaur, ; Molla, Diego.

Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI): Proceedings of the Workshop. Stroudsburg : Association for Computational Linguistics, 2018. p. 29-37.

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionResearchpeer-review

TY - GEN

T1 - Supervised machine learning for extractive query based summarisation of biomedical data

AU - Mandeep Kaur,null

AU - Molla,Diego

PY - 2018

Y1 - 2018

N2 - The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available online. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackled the problem of annotating sentencesfor training classification systems and show that a simple annotation approach outperforms regression-based summarisation.

AB - The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available online. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackled the problem of annotating sentencesfor training classification systems and show that a simple annotation approach outperforms regression-based summarisation.

M3 - Conference proceeding contribution

SP - 29

EP - 37

BT - Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI)

PB - Association for Computational Linguistics

CY - Stroudsburg

ER -

Mandeep Kaur , Molla D. Supervised machine learning for extractive query based summarisation of biomedical data. In Ninth International Workshop on Health Text Mining and Information Analysis (LOUHI): Proceedings of the Workshop. Stroudsburg: Association for Computational Linguistics. 2018. p. 29-37