Symbolic and statistical learning approaches to speech summarization: a scoping review

Research output: Contribution to journalReview articlepeer-review

2 Citations (Scopus)

Abstract

Speech summarization techniques take human speech as input and then output an abridged version as text or speech. Speech summarization has applications in many domains from information technology to health care, for example improving speech archives or reducing clinical documentation burden. This scoping review maps close to 2 decades of speech summarization literature, spanning from the early machine learning works up to ensemble models, with no restrictions on the language summarized, research method, or paper type. We reviewed a total of 110 papers out of a set of 188 found through a literature search and extracted speech features used, methods, scope, and training corpora. Most studies employ one of four speech summarization architectures: (1) Sentence extraction and compaction; (2) Feature extraction and classification or rank-based sentence selection; (3) Sentence compression and compression
summarization; and (4) Language modelling. We also discuss the strengths and weaknesses of these different methods and speech features. Overall, supervised methods (e.g. Hidden Markov support vector machines, Ranking support vector machines, Conditional random fields) performed better than unsupervised methods. As supervised methods require manually annotated training data which can be costly, there was more interest in unsupervised methods. Recent
research into unsupervised methods focusses on extending language modelling, for example by combining Uni-gram modelling with deep neural networks. This review does not include recent work in deep learning.
Original languageEnglish
Article number101305
Pages (from-to)1-17
Number of pages17
JournalComputer Speech and Language
Volume72
Early online date29 Sept 2021
DOIs
Publication statusPublished - Mar 2022

Keywords

  • Speech summarization
  • Spontaneous speech
  • Automatic speech recognition
  • Extractive summarization
  • Abstractive summarization

Fingerprint

Dive into the research topics of 'Symbolic and statistical learning approaches to speech summarization: a scoping review'. Together they form a unique fingerprint.

Cite this