Unsupervised syntactic chunking with acoustic cues

computational models for prosodic bootstrapping

John K. Pate, Sharon Goldwater

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

Abstract

Learning to group words into phrases without supervision is a hard task for NLP systems, but infants routinely accomplish it. We hypothesize that infants use acoustic cues to prosody, which NLP systems typically ignore. To evaluate the utility of prosodic information for phrase discovery, we present an HMM-based unsupervised chunker that learns from only transcribed words and raw acoustic correlates to prosody. Unlike previous work on unsupervised parsing and chunking, we use neither gold standard part-of-speech tags nor punctuation in the input. Evaluated on the Switchboard corpus, our model outperforms several baselines that exploit either lexical or prosodic information alone, and, despite producing a flat structure, performs competitively with a state-of-the-art unsupervised lexicalized parser, with a substantial advantage in precision. Our results support the hypothesis that acoustic-prosodic cues provide useful evidence about syntactic phrases for language-learning infants.
Original languageEnglish
Title of host publicationProceedings of the 2nd workshop on cognitive modeling and computational linguistics
EditorsFrank Keller, David Reitter
Place of PublicationMadison, WI, USA
PublisherAssociation for Computational Linguistics
Pages20-29
Number of pages10
ISBN (Print)9781932432954
Publication statusPublished - 2011
Externally publishedYes
EventWorkshop on cognitive modeling and computational linguistics - Portland, Oregon, USA
Duration: 23 Jun 201123 Jun 2011

Workshop

WorkshopWorkshop on cognitive modeling and computational linguistics
CityPortland, Oregon, USA
Period23/06/1123/06/11

    Fingerprint

Cite this

Pate, J. K., & Goldwater, S. (2011). Unsupervised syntactic chunking with acoustic cues: computational models for prosodic bootstrapping. In F. Keller, & D. Reitter (Eds.), Proceedings of the 2nd workshop on cognitive modeling and computational linguistics (pp. 20-29). Madison, WI, USA: Association for Computational Linguistics.