Abstract
Since the early days of artificial intelligence, associative or semantic networks have been proposed as representations that store language units and the relationships that interconnect them, allowing for a variety of inference and reasoning processes and simulating some of the functionalities of the human mind. The symbolic structures that emerge from these representations correspond naturally to graphs: relational structures capable of encoding the meaning and structure of a cohesive text, closely following the associative or semantic memory representations. The activation or ranking of nodes in such graph structures mimics to some extent the functioning of human memory, and can be turned into a rich source of knowledge useful for several language processing applications. In this paper, we suggest a framework for applying graph-based ranking algorithms to natural language processing, and illustrate it on two traditionally difficult text processing tasks: word sense disambiguation and text summarization.
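As a rough illustration of what ranking nodes in a text graph can look like, the sketch below builds a small sentence graph and scores its nodes with a PageRank-style power iteration. It is not the paper's implementation: the Jaccard word-overlap similarity, the damping factor of 0.85, the iteration count, the sample sentences, and all function names are assumptions chosen for brevity.

```python
def jaccard(a, b):
    """Word-overlap similarity between two token lists (an assumed measure)."""
    wa, wb = set(a), set(b)
    union = wa | wb
    return len(wa & wb) / len(union) if union else 0.0


def build_graph(sentences):
    """Return a symmetric weight matrix; nodes are sentences, edges are similarities."""
    tokens = [s.lower().split() for s in sentences]
    n = len(tokens)
    return [
        [jaccard(tokens[i], tokens[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]


def rank_nodes(weights, damping=0.85, iterations=50):
    """Weighted PageRank-style scores computed by power iteration.

    Nodes with no edges pass no score on, so the scores sum to 1 only
    when every node has at least one edge (true for the example below).
    """
    n = len(weights)
    out_sum = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1.0 - damping) / n
            + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            for i in range(n)
        ]
    return scores


# Hypothetical input, not taken from the paper.
sentences = [
    "graphs encode the structure of a text",
    "ranking nodes in graphs finds central text units",
    "the weather was pleasant yesterday",
]
scores = rank_nodes(build_graph(sentences))
# Sentence indices ordered from highest to lowest score.
ranking = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
print(ranking)
```

In this toy graph the first sentence shares words with both of the others, so it accumulates the highest score. An extractive summarizer built along these lines would keep the top-ranked sentences; a word sense disambiguation variant would rank candidate senses instead of sentences.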
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mihalcea, R. (2006). Random Walks on Text Structures. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_27
DOI: https://doi.org/10.1007/11671299_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32205-4
Online ISBN: 978-3-540-32206-1
eBook Packages: Computer Science (R0)