Choosing the right translation: A syntactically informed classification approach

Simon Zwarts*, Mark Dras

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

15 Citations (Scopus)

Abstract

One style of Multi-Engine Machine Translation architecture involves choosing the best of a set of outputs from different systems. Choosing the best translation from an arbitrary set, even in the presence of human references, is a difficult problem; it may prove better to look at mechanisms for making such choices in more restricted contexts. In this paper we take a classification-based approach to choosing between candidates from syntactically informed translations. The idea is that using multiple parsers as part of a classifier could help detect syntactic problems in this context that lead to bad translations; these problems could be detected on either the source side - perhaps sentences with difficult or incorrect parses could lead to bad translations - or on the target side - perhaps the output quality could be measured in a more syntactically informed way, looking for syntactic abnormalities. We show that there is no evidence that the source side information is useful. However, a target-side classifier, when used to identify particularly bad translation candidates, can lead to significant improvements in BLEU score. Improvements are even greater when combined with existing language and alignment model approaches.

Original languageEnglish
Title of host publicationColing 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference
Place of PublicationStroudsburg
PublisherAssociation for Computational Linguistics (ACL)
Pages1153-1160
Number of pages8
Volume1
ISBN (Print)9781905593446
Publication statusPublished - 2008
Event22nd International Conference on Computational Linguistics, Coling 2008 - Manchester, United Kingdom
Duration: 18 Aug 200822 Aug 2008

Other

Other22nd International Conference on Computational Linguistics, Coling 2008
Country/TerritoryUnited Kingdom
CityManchester
Period18/08/0822/08/08

Fingerprint

Dive into the research topics of 'Choosing the right translation: A syntactically informed classification approach'. Together they form a unique fingerprint.

Cite this