Expected dependency pair match: predicting translation quality with expected syntactic structure

Kahn, Jeremy G.; Snover, Matthew; Ostendorf, Mari

doi:10.1007/s10590-009-9057-6

Expected dependency pair match: predicting translation quality with expected syntactic structure

Published: 31 October 2009

Volume 23, pages 169–179, (2009)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Machine Translation

Expected dependency pair match: predicting translation quality with expected syntactic structure

Download PDF

Jeremy G. Kahn¹,
Matthew Snover² &
Mari Ostendorf¹

111 Accesses
3 Citations
Explore all metrics

Abstract

Recent efforts to develop new machine translation evaluation methods have tried to account for allowable wording differences either in terms of syntactic structure or synonyms/paraphrases. This paper primarily considers syntactic structure, combining scores from partial syntactic dependency matches with standard local n-gram matches using a statistical parser, and taking advantage of N-best parse probabilities. The new scoring metric, expected dependency pair match (EDPM), is shown to outperform BLEU and TER in terms of correlation to human judgments and as a predictor of HTER. Further, we combine the syntactic features of EDPM with the alternative wording features of TERp, showing a benefit to accounting for syntactic structure on top of semantic equivalency features.

Article PDF

Metrics of Syntactic Equivalence to Assess Translation Difficulty

Machine Translation Quality Estimation: Applications and Future Perspectives

Neutralizing the Effect of Translation Shifts on Automatic Machine Translation Evaluation

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Banerjee S, Lavie A (2005) METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for MT and/or summarization, pp 65–72
Cahill A, Burke M, O’Donovan R, Van Genabith J, Way A (2004) Long-distance dependency resolution in automatically acquired wide-coverage PCFG-based LFG approximations. In: Proceedings of the ACL, pp 319–326
Callison-Burch C (2006) Re-evaluating the role of BLEU in machine translation research. In: Proceedings of the EACL, pp 249–256
Charniak E, Johnson M (2005) Coarse-to-fine n-best parsing and MaxEnt discriminative reranking. In: Proceedings of the ACL, pp 173–180
Charniak E, Knight K, Yamada K (2003) Syntax-based language models for statistical machine translation. In: Proceedings of the MT Summit IX
Kahn JG, Roark B, Ostendorf M (2008) Automatic syntactic MT evaluation with expected dependency pair match. In: MetricsMATR: NIST metrics for machine translation challenge
LDC (2003) Multiple translation Chinese corpus, part 2. Catalog number LDC2003T17
LDC (2006) Multiple translation Chinese corpus, part 4. Catalog number LDC2006T04
LDC (2008) GALE phase 2 + retest evaluation references. Catalog number LDC2008E11
Liu D, Gildea D (2005) Syntactic features for evaluation of machine translation. In Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for MT and/or summarization, pp 25–32
Magerman DM (1995) Statistical decision-tree models for parsing. In: Proceedings of the ACL, pp 276–283
Owczarzak K, van Genabith J, Way A (2007a) Evaluating machine translation with LFG dependencies. Mach Transl 21(2): 95–119
Article Google Scholar
Owczarzak K, van Genabith J, Way A (2007b) Labelled dependencies in machine translation evaluation. In: Proceedings of the second workshop on statistical machine translation, pp 104–111
Papineni K, Roukos S, Ward T, Zhu WJ (2002) BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the ACL, pp 311–318
Roark B, Harper M, Charniak E, Dorr B, Johnson M, Kahn JG, Liu Y, Ostendorf M, Hale J, Krasnyanskaya A, Lease M, Shafran I, Snover M, Stewart R, Yung L (2006) SParseval: evaluation metrics for parsing speech. In: Proceedings of the LREC
Snover M, Dorr B, Schwartz R, Micciulla L, Makhoul J (2006) A study of translation edit rate with targeted human annotation. In: Proceedings of the AMTA
Snover M, Madnani N, Dorr B, Schwartz R (2009) Fluency, adequacy, or HTER? Exploring different human judgments with a tunable MT metric. In: Proceedings of the workshop on statistical machine translation at EACL

Download references

Author information

Authors and Affiliations

University of Washington, Seattle, WA, USA
Jeremy G. Kahn & Mari Ostendorf
University of Maryland, College Park, MD, USA
Matthew Snover

Authors

Jeremy G. Kahn
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Snover
View author publications
You can also search for this author in PubMed Google Scholar
Mari Ostendorf
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jeremy G. Kahn.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kahn, J.G., Snover, M. & Ostendorf, M. Expected dependency pair match: predicting translation quality with expected syntactic structure. Machine Translation 23, 169–179 (2009). https://doi.org/10.1007/s10590-009-9057-6

Download citation

Received: 15 May 2009
Accepted: 10 October 2009
Published: 31 October 2009
Issue Date: September 2009
DOI: https://doi.org/10.1007/s10590-009-9057-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Expected dependency pair match: predicting translation quality with expected syntactic structure

Abstract

Article PDF

Similar content being viewed by others

Metrics of Syntactic Equivalence to Assess Translation Difficulty

Machine Translation Quality Estimation: Applications and Future Perspectives

Neutralizing the Effect of Translation Shifts on Automatic Machine Translation Evaluation

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Expected dependency pair match: predicting translation quality with expected syntactic structure

Abstract

Article PDF

Similar content being viewed by others

Metrics of Syntactic Equivalence to Assess Translation Difficulty

Machine Translation Quality Estimation: Applications and Future Perspectives

Neutralizing the Effect of Translation Shifts on Automatic Machine Translation Evaluation

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation