Text-to-speech rule and dictionary development

R. Mannell*, J. E. Clark

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    6 Citations (Scopus)

    Abstract

    This paper describes the development and evaluation of the grapheme-to-phoneme sub-system of a complete real-time synthesis system under development at Macquarie University. It has been developed around a lexicon knowledge base which contains the 4000-5000 most common English words and which has been augmented by a suffix stripper and a set of grapheme to phoneme rules. Evaluation and development of this system has been facilitated by using weighted statistics which reflect the frequency of occurence of each word in the LOB and Brown corpora of English. These statistics are derived from a test word database which includes all acceptable Australian pronunciations (as defined by the Macquarie Dictionary) of each word, as well as their LOB and Brown frequency counts. The pronunciation derived by the system is compared to the Macquarie Dictionary pronunciations and given a score proportional to its frequency in the two corpora. These scores facilitate decisions to be made about which alterations to the rules or lexicon will have the greatest effect on total system accuracy in ordinary running text (as reflected by the corpora frequencies).

    Original languageEnglish
    Pages (from-to)317-324
    Number of pages8
    JournalSpeech Communication
    Volume6
    Issue number4
    DOIs
    Publication statusPublished - 1987

    Keywords

    • corpus
    • letter-to-sound rules
    • lexicon
    • suffix
    • Text-to-speech

    Fingerprint

    Dive into the research topics of 'Text-to-speech rule and dictionary development'. Together they form a unique fingerprint.

    Cite this