Abstract
In this paper we present two components of an automatic text simplification system for Spanish, aimed at making news articles more accessible to readers with cognitive disabilities. Our system in its current state consists of a rule-based lexical transformation component and a module for syntactic simplification. We evaluate the two components separately and as a whole, with a view to determining the level of simplification and the preservation of meaning and grammaticality. In order to test the readability level pre- and post-simplification, we apply seven readability measures for Spanish to three sets of randomly chosen news articles: the original texts, the output obtained after lexical transformations, the syntactic simplification output, and the output of both system components. To test whether the simplification output is grammatically correct and semantically adequate, we ask human annotators to grade pairs of original and simplified sentences according to these two criteria. Our results suggest that both components of our system produce simpler output when compared to the original, and that grammaticality and meaning preservation are positively rated by the annotators.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Medero, J., Ostendorf, M.: Identifying Targets for Syntactic Simplification. In: Proceedings of Speech and Language Technology in Education Workshop (2011)
Aluísio, S.M., Specia, L., Pardo, T.A.S., Maziero, E., De Mattor Fortes, R.P.: Towards Brazilian Portuguese Automatic Text Simplification systems. In: Proceedings of the ACM Symposium on Document Engineering (2008)
Inui, K., Fujita, A., Takahashi, T., Iida, R., Iwakura, T.: Text Simplification for Reading Assistance: A Project Note. In: Proceedings of the 2nd International Workshop on Paraphrasing: Paraphrase Acquisition and Applications, IWP (2003)
Aranzabe, M.J., Díaz De Ilarraza, A., González, I.: First Approach to Automatic Text Simplification in Basque. In: Proceedings of the First Natural Language Processing for Improving Textual Accessibility Workshop, NLP4ITA (2012)
Rybing, J., Smithr, C., Silvervarg, A.: Towards a Rule Based System for Automatic Simplification of Texts. In: The Third Swedish Language Technology Conference (2010)
Saggion, H., Gómez Martínez, E., Etayo, E., Anula, A., Bourg, L.: Text Simplification in Simplext: Making Text More Accessible. In: Revista de la Sociedad Española Para el Procesamiento del Lenguaje Natural (2011)
Freyhoff, G., Hess, G., Kerr, L., Menzel, E., Tronbacke, B., Van Der Veken, K.: Make it Simple, European Guidelines for the Production of Easy-to-Read Information for People with Learning Disability; for authors, editors, information providers, translators and other interested persons (1998)
Cooper, M., Reid, L., Vanderheiden, G., Caldwell, B.: Understanding WCAG 2.0. A guide to understanding and implementing Web Content Accessibility Guidelines 2.0. In: World Wide Web Consortium, W3C (2010)
Bott, S., Rello, L., Drndarević, B., Saggion, H.: Can Spanish Be Simpler? LexSiS: Lexical Simplification for Spanish. In: Proceedings of Coling 2012: The 24th International Conference on Computational Linguistics (2012)
Chandrasekar, R., Doran, D., Srinivas, B.: Motivations and Methods for Text Simplification. In: COLING, pp. 1041–1044 (1996)
Burstein, J., Shore, J., Sabatini, J., Lee, Y.W., Ventura, M.: The Automated Text Adaptation Tool. In: HLT-NAACL (Demonstrations), pp. 3–4 (2007)
Devlin, S., Unthank, G.: Helping aphasic people process online information. In: Proceedings of the 8th International ACM SIGACCESS Conference on Computers and Accessibility (2006)
Specia, L.: Translating from complex to simplified sentences. In: Proceedings of the 9th International Conference on Computational Processing of the Portuguese Language, Berlin, Heidelberg, pp. 30–39 (2010)
Siddharthan, A.: An Architecture for a Text Simplification System. In: Proceedings of the Language Engineering Conference (LEC 2002), 64–71 (2002)
Carroll, J., Minnen, G., Canning, Y., Devlin, S., Tait, J.: Practical Simplification of English Newspaper Text to Assist Aphasic Readers. In: Proc. of AAAI 1998 Workshop on Integrating Artificial Intelligence and Assistive Technology (1998)
Quinlan, P.: The Oxford Psycholinguistic Database. Oxford University Press (1992)
Bautista, S., Gervás, P., Madrid, R.: Feasibility Analysis for SemiAutomatic Conversion of Text to Improve Readability. In: Proceedings of the Second International Conference on Information and Communication Technologies and Accessibility (2009)
De Belder, J., Deschacht, K., Moens, M.F.: Lexical simplification. In: Proceedings of the 1st International Conference on Interdisciplinary Research on Technology, Education and Communication (2010)
Biran, O., Brody, S., Elhadad, N.: Putting it Simply: a Context-Aware Approach to Lexical Simplificaion. In: Proceedings of the ACL (2011)
Zhu, A., Bernhard, D., Gurevych, I.: A Monolingual Tree-based Translation Model for Sentence Simplification. In: Proceedings of The 23rd International Conference on Computational Linguistics, Beijing, China, pp. 1353–1361 (2010)
Coster, W., Kauchak, D.: Simple English Wikipedia: a New Text Simplification Task. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (2011)
Anula, A.: Tipos de Textos, Complejidad Lingüística y Facilicitación Lectora. In: Actas del Sexto Congreso de Hispanistas de Asia, pp. 45–61 (2007)
Bott, S., Saggion, H.: Automatic Simplification of Spanish Text for e-Accessibility. In: Proceedings of the 13th International Conference on Computers Helping People with Special Needs, pp. 54–56 (2012)
Bautista, S., Drndarević, B., Hervás, R., Saggion, H., Gervás, P.: Análisis de la Simplificación de Expresiones Numéricas en Español mediante un Estudio Empírico. Linguamática 4 (2012)
Maynard, D., Tablan, V., Cunningham, H., Ursu, C., Saggion, H., Bontcheva, K., Wilks, Y.: Architectural Elements of Language Engineering Robustness. Journal of Natural Language Engineering – Special Issue on Robust Methods in Analysis of Natural Language Data 8, 257–274 (2002)
Spaulding, S.: A Spanish Readability Formula. Modern Language Journal (1956)
Drndarević, B., Saggion, H.: Reducing Text Complexity through Automatic Lexical Simplification: an Empirical Study for Spanish. SEPLN Journal, 13–20 (2012)
Boone Jr., H., Boone, D.: Analizing Likert Data. Journal of Extension (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Drndarević, B., Štajner, S., Bott, S., Bautista, S., Saggion, H. (2013). Automatic Text Simplification in Spanish: A Comparative Evaluation of Complementing Modules. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37256-8_40
Download citation
DOI: https://doi.org/10.1007/978-3-642-37256-8_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37255-1
Online ISBN: 978-3-642-37256-8
eBook Packages: Computer ScienceComputer Science (R0)