Abstract
This paper presents Bayesian non-parametric models that simultaneously learn to segment words from phoneme strings and learn the referents of some of those words, and shows that there is a synergistic interaction in the acquisition of these two kinds of linguistic information. The models themselves are novel kinds of Adaptor Grammars that extend an embedding of topic models into PCFGs. These models simultaneously segment phoneme sequences into words and learn the relationship between non-linguistic objects and the words that refer to them. We show (i) that modelling inter-word dependencies improves the accuracy not only of word segmentation but also of the learned word-object relationships, and (ii) that a model that simultaneously learns word-object relationships and word segmentation segments more accurately than one that learns word segmentation on its own. We argue that these results support an interactive view of language acquisition that can take advantage of synergies such as these.
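For readers unfamiliar with what distinguishes an Adaptor Grammar from an ordinary PCFG, the sketch below illustrates its core caching mechanism in Python: whole previously generated words are stored and reused with probability proportional to their past frequency, rather than being re-derived from the base grammar each time. This is a minimal illustrative sketch, not the authors' implementation; the class name `CRPAdaptor`, the geometric word-length base distribution, and the phoneme inventory are all assumptions made for the example, and the paper's actual models additionally associate words with topics (objects) inside the grammar.

```python
from collections import Counter

class CRPAdaptor:
    """Chinese Restaurant Process cache over whole words.

    Core mechanism of an Adaptor Grammar: cached words are reused with
    probability proportional to how often they have been generated,
    with new words drawn from the base distribution P0.
    (Illustrative sketch only.)
    """

    def __init__(self, base_prob, concentration=1.0):
        self.base_prob = base_prob            # P0: base-grammar probability of a word
        self.concentration = concentration    # CRP concentration parameter (alpha)
        self.counts = Counter()               # cached word -> number of past uses
        self.total = 0                        # total number of cached uses

    def prob(self, word):
        # CRP predictive probability: mix the cache count with the
        # base distribution, normalised by total count plus alpha.
        alpha = self.concentration
        return (self.counts[word] + alpha * self.base_prob(word)) / (self.total + alpha)

    def observe(self, word):
        # Record one more use of this word in the cache.
        self.counts[word] += 1
        self.total += 1

# Base distribution for the sketch: uniform phonemes with a geometric
# word-length prior (an assumption for this example, not from the paper).
PHONEMES = list("aiudgpk")
def base_prob(word):
    stop = 0.5
    return ((1 - stop) ** (len(word) - 1)) * stop * (1.0 / len(PHONEMES)) ** len(word)

adaptor = CRPAdaptor(base_prob)
for w in ["pig", "pig", "dag", "pig"]:
    adaptor.observe(w)

print(adaptor.prob("pig"))   # high: cached three times
print(adaptor.prob("guk"))   # low: supported only by the base distribution
```

The rich-get-richer dynamic of the cache is what lets such a model treat recurring phoneme subsequences as word units, which is the property the paper's joint segmentation-and-referent models build on.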
Original language | English
---|---
Title of host publication | Advances in Neural Information Processing Systems 23
Subtitle of host publication | 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010
Editors | J. Lafferty, C. K. Williams, J. Shawe-Taylor, R. S. Zemel, A. Culotta
Place of Publication | United States
Publisher | Neural Information Processing Systems (NIPS) Foundation
Pages | 1018-1026
Number of pages | 9
ISBN (Print) | 9781617823800
Publication status | Published - 2010
Event | 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010 - Vancouver, BC, Canada. Duration: 6 Dec 2010 → 9 Dec 2010
Other
Other | 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010
---|---
Country/Territory | Canada
City | Vancouver, BC
Period | 6/12/10 → 9/12/10