Text-to-speech rule and dictionary development

R. Mannell*, J. E. Clark

*Corresponding author for this work

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

This paper describes the development and evaluation of the grapheme-to-phoneme sub-system of a complete real-time synthesis system under development at Macquarie University. It has been developed around a lexicon knowledge base which contains the 4000-5000 most common English words and which has been augmented by a suffix stripper and a set of grapheme to phoneme rules. Evaluation and development of this system has been facilitated by using weighted statistics which reflect the frequency of occurence of each word in the LOB and Brown corpora of English. These statistics are derived from a test word database which includes all acceptable Australian pronunciations (as defined by the Macquarie Dictionary) of each word, as well as their LOB and Brown frequency counts. The pronunciation derived by the system is compared to the Macquarie Dictionary pronunciations and given a score proportional to its frequency in the two corpora. These scores facilitate decisions to be made about which alterations to the rules or lexicon will have the greatest effect on total system accuracy in ordinary running text (as reflected by the corpora frequencies).

Original languageEnglish
Pages (from-to)317-324
Number of pages8
JournalSpeech Communication
Volume6
Issue number4
DOIs
Publication statusPublished - 1987

Keywords

  • corpus
  • letter-to-sound rules
  • lexicon
  • suffix
  • Text-to-speech

Fingerprint Dive into the research topics of 'Text-to-speech rule and dictionary development'. Together they form a unique fingerprint.

Cite this