Comparing Semantic Models for Evaluating Automatic Document Summarization

Campr, Michal; Ježek, Karel

doi:10.1007/978-3-319-24033-6_29

Michal Campr¹⁵ &
Karel Ježek¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9302))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

2005 Accesses
24 Citations

Abstract

The main focus of this paper is the examination of semantic modelling in the context of automatic document summarization and its evaluation. The main area of our research is extractive summarization, more specifically, contrastive opinion summarization. And as it is with all summarization tasks, the evaluation of their performance is a challenging problem on its own. Nowadays, the most commonly used evaluation technique is ROUGE (Recall-Oriented Understudy for Gisting Evaluation). It includes measures (such as the count of overlapping n-grams or word sequences) for automatically determining the quality of summaries by comparing them to ideal human-made summaries. However, these measures do not take into account the semantics of words and thus, for example, synonyms are not treated as equal. We explore this issue by experimenting with various language models, examining their performance in the task of computing document similarity. In particular, we chose four semantic models (LSA, LDA, Word2Vec and Doc2Vec) and one frequency-based model (TfIdf), for extracting document features. The experiments were then performed on our custom dataset and the results of each model are then compared to the similarity values assessed by human annotators. We also compare these values with the ROUGE scores and observe the correlations between them. The aim of our experiments is to find a model, which can best imitate a human estimate of document similarity.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Performance of Evaluation Methods Without Human References for Multi-document Text Summarization

Robust Single-Document Summarizations and a Semantic Measurement of Quality

Benchmarking Semantic, Centroid, and Graph-Based Approaches for Multi-document Summarization

Keywords

References

Steinberger, J., Ježek, K.: Evaluation measures for text summarization. Computing and Informatics 25, 1001–1025 (2012)
MATH Google Scholar
Lin, C.: Rouge: A package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL 2004 Workshop, vol. (1), pp. 74–81 (2004)
Google Scholar
Nenkova, A., Passonneau, R.: Evaluating content selection in summarization: The pyramid method. In: HLT-NAACL, pp. 145–152 (2004)
Google Scholar
Eneko, A., Mona, D., Daniel, C., Gonzalez-Agirre, A.: Semeval-2012 task 6: A pilot on semantic textual similarity. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, Number 3, pp. 385–393 (2012)
Google Scholar
Pala, K., Čapek, T., Zajíčková, B., Bartůšková, D., Kulková, K., Hoffmannová, P., Bejček, E., Straňák, P., Hajič, J.: Czech WordNet 1.9 PDT (2011)
Google Scholar
Deerwester, S., Dumais, S., Landauer, T.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990)
Article Google Scholar
Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)
MATH Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of Workshop at ICLR, January 2013
Google Scholar
Mikolov, T., View, M., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS (2013)
Google Scholar
Mikolov, T., Yih, W.T., Zweig, G.: Linguistic regularities in continuous space word representations. In: Proceedings of NAACL HLT (2013)
Google Scholar
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, Valletta, Malta, pp. 45–50. ELRA, May 2010. http://is.muni.cz/publication/884893/en
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of The 31st International Conference on Machine Learning, vol. 32, pp. 1188–1196 (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, FAS, University of West Bohemia, Univerzitn 8, 306 14, Pilsen, Czech Republic
Michal Campr & Karel Ježek

Authors

Michal Campr
View author publications
You can also search for this author in PubMed Google Scholar
Karel Ježek
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michal Campr .

Editor information

Editors and Affiliations

University of West Bohemia, Pilsen, Czech Republic
Pavel Král
University of West Bohemia, Pilsen, Czech Republic
Václav Matoušek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Campr, M., Ježek, K. (2015). Comparing Semantic Models for Evaluating Automatic Document Summarization. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_29

Download citation

DOI: https://doi.org/10.1007/978-3-319-24033-6_29
Published: 11 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24032-9
Online ISBN: 978-3-319-24033-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Comparing Semantic Models for Evaluating Automatic Document Summarization

Abstract

Chapter PDF

Similar content being viewed by others

Performance of Evaluation Methods Without Human References for Multi-document Text Summarization

Robust Single-Document Summarizations and a Semantic Measurement of Quality

Benchmarking Semantic, Centroid, and Graph-Based Approaches for Multi-document Summarization

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Comparing Semantic Models for Evaluating Automatic Document Summarization

Abstract

Chapter PDF

Similar content being viewed by others

Performance of Evaluation Methods Without Human References for Multi-document Text Summarization

Robust Single-Document Summarizations and a Semantic Measurement of Quality

Benchmarking Semantic, Centroid, and Graph-Based Approaches for Multi-document Summarization

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation