Projects per year
Abstract
Speech summarization techniques take human speech as input and then output an abridged version as text or speech. Speech summarization has applications in many domains from information technology to health care, for example improving speech archives or reducing clinical documentation burden. This scoping review maps close to 2 decades of speech summarization literature, spanning from the early machine learning works up to ensemble models, with no restrictions on the language summarized, research method, or paper type. We reviewed a total of 110 papers out of a set of 188 found through a literature search and extracted speech features used, methods, scope, and training corpora. Most studies employ one of four speech summarization architectures: (1) Sentence extraction and compaction; (2) Feature extraction and classification or rank-based sentence selection; (3) Sentence compression and compression
summarization; and (4) Language modelling. We also discuss the strengths and weaknesses of these different methods and speech features. Overall, supervised methods (e.g. Hidden Markov support vector machines, Ranking support vector machines, Conditional random fields) performed better than unsupervised methods. As supervised methods require manually annotated training data which can be costly, there was more interest in unsupervised methods. Recent
research into unsupervised methods focusses on extending language modelling, for example by combining Uni-gram modelling with deep neural networks. This review does not include recent work in deep learning.
summarization; and (4) Language modelling. We also discuss the strengths and weaknesses of these different methods and speech features. Overall, supervised methods (e.g. Hidden Markov support vector machines, Ranking support vector machines, Conditional random fields) performed better than unsupervised methods. As supervised methods require manually annotated training data which can be costly, there was more interest in unsupervised methods. Recent
research into unsupervised methods focusses on extending language modelling, for example by combining Uni-gram modelling with deep neural networks. This review does not include recent work in deep learning.
Original language | English |
---|---|
Article number | 101305 |
Pages (from-to) | 1-17 |
Number of pages | 17 |
Journal | Computer Speech and Language |
Volume | 72 |
Early online date | 29 Sept 2021 |
DOIs | |
Publication status | Published - Mar 2022 |
Keywords
- Speech summarization
- Spontaneous speech
- Automatic speech recognition
- Extractive summarization
- Abstractive summarization
Fingerprint
Dive into the research topics of 'Symbolic and statistical learning approaches to speech summarization: a scoping review'. Together they form a unique fingerprint.-
Creating safe, effective systems of care: the translational challenge
Braithwaite, J., Westbrook, J. & Coiera, E.
1/11/14 → …
Project: Research
-
Centre of Research Excellence in Digital Health (CREDiH)
Coiera, E., Glasziou, P., Hansen, D., Magrabi, F., Sintchenko, V., Verspoor, K., Gallego-Luxan, B., Lau, A., Dunn, A., Longhurst, C., Tsafnat, G., Cutler, H., Makeham, M., Shaw, T., Shah, N., Runciman, W. & Liaw, S. T.
1/01/18 → 31/12/22
Project: Research