Abstract
In this paper, we introduce our ongoing research on a Chinese-English Contrastive Language Knowledge Base, including its architecture, the selection of its entries and the XML-based annotation schemes used. We also report on the progress of annotation. The knowledge base is linguistically motivated, focusing on a wide range of sub-sentential contrasts between Chinese and English. It will offer a new form of bilingual resources for NLP tasks, for use in contrastive linguistic research and translation studies, amongst others. Currently, joint efforts are being made to develop tools for Computer-Assisted Translation and Second Language Acquisition using this knowledge base.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
Yu, S., Duan, H., Zhu, X., Zhang, H.: The Construction and Utilization of a Comprehensive Language Knowledge-base. Journal of Chinese Information Processing 18(5), 1–10 (2004)
Bai, X., Chang, B., Zhan, W.: The Construction of a Large-scale Chinese-English Parallel Corpus. In: Huang, H. (ed.) Recent Development in Machine Translation Studies - Proceedings of the National Conference on Machine Translation 2002, pp. 124–131. Publishing House of Electronics Industry, Beijing (2002)
Benito, D.: Future Trends in Translation Memory. Tradumàtica 7 (2009), http://www.fti.uab.cat/tradumatica/revista/hemeroteca.htm , ISSN 1578-7559
Simard, M., Langlais, P.: Sub-sentential Exploitation of Translation Memories. In: Proceedings of Machine Translation Summit VIII: Machine Translation in the Information Age, pp. 335–339 (2001)
Bowker, L., Barlow, M.: Bilingual Concordancers and Translation Memories: A Comparative Evaluation. In: Proceedings of the Second International Workshop on Language Resources for Translation Work, Research and Training, pp. 70–79 (2004)
Macken, L.: Sub-Sentential Alignment of Translational Correspondences. University of Antwerp (2010)
Véronis, J. (ed.): Parallel Text Processing – Alignment and Use of Translation Corpora. Kluwer Academic Publishers, The Netherlands (2003)
Melamed, D.: Annotation Style Guide for the Blinker Project – Version 1.0.4 (2008), http://arxiv.org/pdf/cmp-lg/9805004v1.pdf
Véronis, J.: ARCADE: Tagging Guidelines for Word Alignment – Version 1.0 (1998), http://aune.lpl.univ-aix.fr/projects/arcade/2nd/word/guide/
Merkel, M.: Annotation Style Guide for the PLUG Link Annotator. Linköping University (1999)
Linguistic Data Consortium. Guidelines for Chinese-English Word Alignment – Version 4.0 (2009), http://catalog.ldc.upenn.edu/docs/LDC2012T24/GALE_Chinese_alignment_guidelines_v4.0.pdf
Zhao, H., Liu, Q., Zhang, R., Lv, Y., Sumita, E.: Guidelines for Chinese-English Word Alignment. Journal of Chinese Information Processing 23(3), 65–87 (2009)
Bond, F., Wang, S.: Issues in building English-Chinese parallel corpora with WordNets. In: Orav, H., Fellbaum, C., Vossen, P. (eds.) Proceedings of the Seventh Global WordNet Conference (GWC-7), Tartu, Estonia, pp. 391–399 (2014)
Johansson, S.: Seeing through Multilingual Corpora: On the Use of Corpora in Contrastive Studies. John Benjamins Publishing, Amsterdam (2007)
Zan, H., Zhang, K., Zhu, X., Yu, S.: Research on the Chinese Function Word Usage Knowledge Base. International Journal on Asian Language Processing 21(4), 185–198 (2011)
Lu, J., Ma, Z.: Some Comments on the Function Words in Contemporary Chinese. Language & Culture Press, Beijing (1999)
Lu, J.: On Semantic Link Analysis. Essays on Linguistics, 1. Beijing Language and Culture University Press, Beijing (1997)
Santorini, B.: Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd revision) (1990), http://repository.upenn.edu/cis_reports/570/
Xia, F.: The Part-Of-Speech Tagging Guidelines for the Penn Chinese Treebank (3.0) (2000), http://repository.upenn.edu/ircs_reports/38/
Institute of Linguistics, Chinese Academy of Social Science. The Contemporary Chinese Dictionary [Chinese-English Edition]. Foreign Language Teaching and Research Press, Beijing (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Bai, X., Zähner, C., Zan, H., Yu, S. (2014). The Chinese-English Contrastive Language Knowledge Base and Its Applications. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-12277-9_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12276-2
Online ISBN: 978-3-319-12277-9
eBook Packages: Computer ScienceComputer Science (R0)