Incorporating Cross-Document Relationships Between Sentences for Single Document Summarizations

Wan, Xiaojun; Yang, Jianwu; Xiao, Jianguo

doi:10.1007/11863878_34

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4172)

Included in the following conference series:

International Conference on Theory and Practice of Digital Libraries

Abstract

Graph-based ranking algorithms have recently been proposed for single-document summarization. Such algorithms evaluate the importance of a sentence recursively, using the relationships between sentences in the document. In this paper, we investigate using other related or relevant documents to improve the summarization of a single document with a graph-based ranking algorithm. In addition to the within-document relationships between sentences in the specified document, the proposed approach also takes into account the cross-document relationships between sentences in different documents. We evaluate the proposed approach on DUC 2002 data with the ROUGE metric, and the results demonstrate that cross-document relationships between sentences in different but related documents can significantly improve the performance of single-document summarization.
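
The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the similarity measure, the edge weighting, or the ranking formula, so the bag-of-words cosine similarity, the PageRank-style update, and the names `rank_sentences`, `within_weight`, `cross_weight`, and `damping` are all assumptions chosen for illustration. The sketch builds one graph over the sentences of the target document and of the related documents, scales within-document and cross-document edges separately, and returns the ranked sentences of the target document only.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank_sentences(target, related, within_weight=1.0, cross_weight=0.5,
                   damping=0.85, iterations=50):
    """Rank the sentences of `target` with help from `related` documents.

    `target` is a list of sentences; `related` is a list of such lists.
    All parameter names and default values are illustrative assumptions.
    Returns (score, sentence) pairs for the target document, best first.
    """
    docs = [target] + list(related)
    # Each node is (document index, sentence); document 0 is the target.
    nodes = [(d, s) for d, doc in enumerate(docs) for s in doc]
    vectors = [Counter(tokenize(s)) for _, s in nodes]
    n = len(nodes)

    # Edge weight = similarity, scaled by whether the edge stays inside
    # one document or crosses between two documents.
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            same_doc = nodes[i][0] == nodes[j][0]
            scale = within_weight if same_doc else cross_weight
            weights[i][j] = scale * cosine(vectors[i], vectors[j])

    out_sums = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    # PageRank-style power iteration; a node with no similar neighbours
    # passes on no score and keeps only the (1 - damping) / n base term.
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(
                scores[i] * weights[i][j] / out_sums[i]
                for i in range(n)
                if out_sums[i] > 0
            )
            for j in range(n)
        ]

    ranked = [(scores[i], nodes[i][1]) for i in range(n) if nodes[i][0] == 0]
    return sorted(ranked, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    target = [
        "The earthquake struck the coastal city at dawn.",
        "Rescue teams searched the damaged buildings.",
        "Mayor likes tea.",
    ]
    related = [
        ["An earthquake struck the city and damaged many buildings."],
        ["Rescue teams arrived in the coastal city after the earthquake."],
    ]
    for score, sentence in rank_sentences(target, related):
        print(f"{score:.4f}  {sentence}")
```

Setting `cross_weight=0.0` in this sketch removes every cross-document edge, so the ranking depends only on within-document relationships; comparing the two settings is one simple way to see what the related documents contribute.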

Author information

Authors and Affiliations

Institute of Computer Science and Technology, Peking University, Beijing, 100871, China
Xiaojun Wan, Jianwu Yang & Jianguo Xiao

Editor information

Editors and Affiliations

No Affiliations
Julio Gonzalo
Istituto di Scienza e Tecnologie dell’Informazione, Consiglio Nazionale delle Ricerche, Via Moruzzi, 1, 56124, Pisa, Italy
Costantino Thanos
Dpto. Lenguajes y Sistemas Informáticos, UNED
M. Felisa Verdejo
Dep. de Lenguajes y Sistemas Informáticos, Universidad de Alicante, E-03071, Alicante, Spain
Rafael C. Carrasco

About this paper

Cite this paper

Wan, X., Yang, J., Xiao, J. (2006). Incorporating Cross-Document Relationships Between Sentences for Single Document Summarizations. In: Gonzalo, J., Thanos, C., Verdejo, M.F., Carrasco, R.C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2006. Lecture Notes in Computer Science, vol 4172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11863878_34

DOI: https://doi.org/10.1007/11863878_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44636-1
Online ISBN: 978-3-540-44638-5
eBook Packages: Computer Science, Computer Science (R0)
