Parsing and disfluency placement

Donald Engel, Eugene Charniak, Mark Johnson

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

Abstract

It has been suggested that some forms of speech disfluencies, most notable interjections and parentheticals, tend to occur disproportionally at major clause boundaries [6] and thus might serve to aid parsers in establishing these boundaries. We have tested a current statistical parser [1] on Switchboard text with and without interjections and parentheticals and found that the parser performed better when not faced with these extra phenomena. This suggest that for current parsers, at least, interjection and parenthetical placement does not help in the parsing process.
Original languageEnglish
Title of host publicationProceedings of the 2002 Conference on Empirical Methods in Natural Language Processing
Place of PublicationStroudsburg, PA
PublisherAssociation for Computational Linguistics (ACL)
Pages49-54
Number of pages6
Volume10
DOIs
Publication statusPublished - 2002
Externally publishedYes
EventConference on Empirical Methods in Natural Language Processing (2002) - Philadelphia, United States
Duration: 6 Jul 20027 Jul 2002

Conference

ConferenceConference on Empirical Methods in Natural Language Processing (2002)
Abbreviated titleEMNLP 2002
CountryUnited States
CityPhiladelphia
Period6/07/027/07/02

Bibliographical note

Copyright the Publisher 2002. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Fingerprint Dive into the research topics of 'Parsing and disfluency placement'. Together they form a unique fingerprint.

  • Cite this

    Engel, D., Charniak, E., & Johnson, M. (2002). Parsing and disfluency placement. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (Vol. 10, pp. 49-54). Stroudsburg, PA: Association for Computational Linguistics (ACL). https://doi.org/10.3115/1118693.1118700