Abstract
Graph-based ranking algorithms have recently been proposed for single document summarizations and such algorithms evaluate the importance of a sentence by making use of the relationships between sentences in the document in a recursive way. In this paper, we investigate using other related or relevant documents to improve summarization of one single document based on the graph-based ranking algorithm. In addition to the within-document relationships between sentences in the specified document, the cross-document relationships between sentences in different documents are also taken into account in the proposed approach. We evaluate the performance of the proposed approach on DUC 2002 data with the ROUGE metric and results demonstrate that the cross-document relationships between sentences in different but related documents can significantly improve the performance of single document summarization.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Amini, M.R., Gallinari, P.: The Use of Unlabeled Data to Improve Supervised Learning for Text Summarization. In: Proceedings of SIGIR 2002, pp. 105–112 (2002)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30, 1–7 (1998)
Carbonell, J., Goldstein, J.: The Use of MMR, Diversity-based Reranking for Reordering Documents and Producing Summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 335–336 (1998)
Conroy, J.M., O’Leary, D.P.: Text Summarization via Hidden Markov Models. In: Proceedings of SIGIR 2001, pp. 406–407 (2001)
Edmundson, H.P.: New Methods in Automatic Abstracting. Journal of the Association for computing Machinery 16(2), 264–285 (1969)
ErKan, G., Radev, D.: LexPageRank: Prestige in Multi-Document Text Summarization. In: Proceedings of EMNLP 2004 (2004)
Gong, Y.H., Liu, X.: Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis. In: Proceedings of SIGIR 2001, pp. 19–25 (2001)
Hovy, E., Lin, C.Y.: Automated Text Summarization in SUMMARIST. In: Proceeding of ACL 1997/EACL 1997 Worshop on Intelligent Scalable Text Summarization (1997)
Jing, H.: Sentence Reduction for Automatic Text Summarization. In: Proceedings of ANLP 2000 (2000)
Jing, H., McKeown, K.R.: Cut and Paste Based Text Summarization. In: Proceedings of NAACL 2000, pp. 178–185 (2000)
Jones, W.P., Furnas, G.W.: Pictures of relevance: a geometric analysis of similarity measure. Journal of the American Society for Information Science 38(6), 420–442 (1987)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. Journal of the ACM 46(5), 604–632 (1999)
Knight, K., Marcu, D.: Summarization beyond Sentence Extraction: A Probabilistic Approach to Sentence Compression. Artificial Intelligence 139(1), 91–107 (2002)
Kupiec, J., Pedersen, J., Chen, F.: A Trainable Document Summarizer. In: Proceedings of SIGIR 1995, pp. 68–73 (1995)
Lin, C.Y., Hovy, E.: Automatic Evaluation of Summaries Using N-Gram Co-Occurrence Statistics. In: Proceedings of HLT-NAACL 2003 (2003)
Lin, C.Y., Hovy, E.: The Automated Acquisition of Topic Signatures for Text Summarization. In: Proceedings of the 17th Conference on Computational Linguistics, pp. 495–501 (2000)
Luhn, H.P.: The Automatic Creation of literature Abstracts. IBM Journal of Research and Development 2(2) (1969)
McDonald, D., Chen, H.: Using Sentence-Selection Heuristics to Rank Text Segment in TXTRACTOR. In: Proceedings of JCDL 2002, pp. 28–35 (2002)
Mihalcea, R., Tarau, P.: TextRank: Bringing Order into Texts. In: Proceedings of EMNLP 2004 (2004)
Mihalcea, R., Tarau, P.: A language independent algorithm for single and multiple document summarization. In: Proceedings of IJCNLP 2005 (2005)
Nomoto, T., Matsumoto, Y.: A New Approach to Unsupervised Text Summarization. In: Proceedings of SIGIR 2001, pp. 26–34 (2001)
Silber, H.G., McCoy, K.: Efficient Text Summarization Using Lexical Chains. In: Proceedings of the 5th International Conference on Intelligent User Interfaces, pp. 252–255 (2000)
Zha, H.Y.: Generic Summarization and Keyphrase Extraction Using Mutual Reinforcement Principle and Sentence Clustering. In: Proceedings of SIGIR 2002, pp. 113–120 (2002)
Zhang, B., Li, H., Liu, Y., Ji, L., Xi, W., Fan, W., Chen, Z., Ma, W.-Y.: Improving web search results using affinity graph. In: Proceedings of SIGIR 2005 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wan, X., Yang, J., Xiao, J. (2006). Incorporating Cross-Document Relationships Between Sentences for Single Document Summarizations. In: Gonzalo, J., Thanos, C., Verdejo, M.F., Carrasco, R.C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2006. Lecture Notes in Computer Science, vol 4172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11863878_34
Download citation
DOI: https://doi.org/10.1007/11863878_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44636-1
Online ISBN: 978-3-540-44638-5
eBook Packages: Computer ScienceComputer Science (R0)