Abstract
As the number of publications is increasing rapidly, it becomes increasingly difficult for researchers to find existing scientific papers most relevant for their work, even when the domain is limited. To overcome this, it is common to use paper summarization techniques in specific domains. In difference to approaches that exploit the paper content itself, in this paper we perform summarization of the citation context of a paper. For this, we adjust and apply existing summarization techniques and we come up with a hybrid method, based on clustering and latent semantic analysis. We apply this on medical informatics publications and compare performance of methods that outscore other techniques on a standard database. Summarization of the citation context can be complementary to full text summarization, particularly to find candidate papers. The reached performance seems good for routine use even though it was only tested on a small database.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Aliguliyev, R.: A new sentence similarity measure and sentence based extractive technique for automatic text summarization. Expert Systems with Applications 36(4), 7764–7772 (2009)
Bergmark, D.: Automatic extraction of reference linking information from online documents. Tech. rep., Cornell University, Ithaca, NY, USA (2000)
Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the Coling/ACL on Interactive Presentation Sessions, pp. 69–72, Stroudsburg, PA, USA (2006)
Bradshaw, S.: Reference directed indexing: redeeming relevance for subject search in citation indexes. In: Koch, T., Sølvberg, I.T. (eds.) ECDL 2003. LNCS, vol. 2769, pp. 499–510. Springer, Heidelberg (2003)
Conroy, J., O’Leary, D.: Text summarization via Hidden Markov models. In: Proceedings of the 24th Annual International ACM SIGIR Conference, pp. 406–407, New York, NY, USA (2001)
Elkiss, A., Shen, S., Fader, A., Erkan, G., States, D., Radev, D.: Blind men and elephants: What do citation summaries tell us about a research article? Journal of the American Society Information Science and Technology 59(1), 51–62 (2008)
Frey, B., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)
Jing, H., Barzilay, R., McKeown, K., Elhadad, M.: Summarization evaluation methods: experiments and analysis. In: AAAI Symposium on Intelligent Summarization, pp. 51–59 (1998)
Jurafsky, D., Martin, J.: Speech & Language Processing. Pearson Education India (2000)
Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pp. 74–81 (2004)
Miller, G.: Wordnet: A lexical database for english. Communications of the ACM 38(11), 39–41 (1995)
Haddou ou Moussa, K., Mayr, P.: Automatische referenzextraktion mit parscit. In: Social Media and Web Science - Das Web als Lebensraum, DGI, pp. 425–428 (2012)
Nenkova, A., Passonneau, R.: Evaluating content selection in summarization: the pyramid method. In: Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 145–152 (2004)
Niwa, Y., Nitta, Y.: Co-occurrence vectors from corpora vs. distance vectors from dictionaries. In: Proceedings of the 15th Conference on Computational Linguistics, COLING ’94, vol. 1, pp. 304–309 (1994)
Ozsoy, M.G., Cicekli, I., Alpaslan, F.N.: Text summarization of turkish texts using latent semantic analysis. In: Huang, C.R., Jurafsky, D. (eds.) Proceedings of the 23rd International Conference on Computational Linguistics, pp. 869–876. Tsinghua University Press (2010)
Pei-Ying, Z., Cun-He, L.: Automatic text summarization based on sentences clustering and extraction. In: Proceedings of 2nd IEEE International Conference on the Computer Science and Information Technology, pp. 167–170. IEEE (2009)
Qazvinian, V., Radev, D.: Scientific paper summarization using citation summarynetworks. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 689–696 (2008)
Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, vol. 1, pp. 448–453 (1995)
Ritchie, A., Robertson, S., Teufel, S.: Comparing citation contexts for information retrieval. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management. CIKM ’08, pp. 213–222. ACM, New York (2008)
Saad, S.M., Kamarudin, S.S.: Comparative analysis of similarity measures for sentence level semantic measurement of text. In: IEEE International Conference on Control System, Computing and Engineering, pp. 90–94. IEEE (2013)
Steinberger, J., Ježek, K.: Using latent semantic analysis in text summarization and summary evaluation. In: Proceedings of Industrial Management, ISIM ’04, pp. 93–100 (2004)
Svore, K.M., Vanderwende, L., Burges, C.: Enhancing single-document summarization by combining ranknet and third-party sources. In: Proceedings of Conference on Empirical Methods on Natural Language Processing and Computational Natural Language Learning, pp. 448–457 (2007)
Torres-Moreno, J.M., Saggion, H., da Cunha, I., SanJuan, E.: Summary Evaluation With and Without References. Polibits: Research Journal on Computer Science and Computer Engineering with Applications 42, 13–19 (2010)
Zechner, K.: Fast generation of abstracts from general domain text corpora by extracting relevant sentences. In: Proceedings of the 16th Conference on Computational Linguistics, vol. 2, pp. 986–989 (1996)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Mitrović, S., Müller, H. (2015). Summarizing Citation Contexts of Scientific Publications. In: Mothe, J., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2015. Lecture Notes in Computer Science(), vol 9283. Springer, Cham. https://doi.org/10.1007/978-3-319-24027-5_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-24027-5_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24026-8
Online ISBN: 978-3-319-24027-5
eBook Packages: Computer ScienceComputer Science (R0)