Exploring the role of stress in Bayesian word segmentation using adaptor grammars

Benjamin Börschinger, Mark Johnson

Research output: Contribution to journalArticlepeer-review

24 Downloads (Pure)

Abstract

Stress has long been established as a major cue in word segmentation for English infants. We show that enabling a current state-of-the-art Bayesian word segmentation model to take advantage of stress cues noticeably improves its performance. We find that the improvements range from 10 to 4%, depending on both the use of phonotactic cues and, to a lesser extent, the amount of evidence available to the learner. We also find that in particular early on, stress cues are much more useful for our model than phonotactic cues by themselves, consistent with the finding that children do seem to use stress cues before they use phonotactic cues. Finally, we study how the model's knowledge about stress patterns evolves over time. We not only find that our model correctly acquires the most frequent patterns relatively quickly but also that the Unique Stress Constraint that is at the heart of a previously proposed model does not need to be built in but can be acquired jointly with word segmentation.
Original languageEnglish
Pages (from-to)93-104
Number of pages12
JournalTransactions of the Association for Computational Linguistics
Volume2
Issue number1
Publication statusPublished - 2014

Bibliographical note

Copyright the Publisher 2014. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Fingerprint

Dive into the research topics of 'Exploring the role of stress in Bayesian word segmentation using adaptor grammars'. Together they form a unique fingerprint.

Cite this