Efficient Ranking Framework for Information Retrieval Using Similarity Measure

Irfan, Shadab; Ghosh, Subhajit

doi:10.1007/978-3-030-37218-7_141

Shadab Irfan¹⁸ &
Subhajit Ghosh¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1108))

Included in the following conference series:

International Conference On Computational Vision and Bio Inspired Computing

1941 Accesses
1 Citations

Abstract

The information on the web is increasing day by day and to manage such vast amount of information is really a difficult task. The user finds it really hard to capture the desired information as per their need and maximum amount of time is spent in framing proper query and filtering the resultant web pages. The search engine plays a major role in filtering the information and ranking the desired result. The quest for accurate information is still a dream and in this regard this paper presents an approach that tries to optimize the ranking algorithm by employing document clustering and similarity measures. In this paper we present an outline of different ranking algorithms and proposed an approach where PageRank algorithm is optimized by using document clustering. It also employs content mining along with structural mining that help to reduce the computational complexity of the algorithm and thereby diminish the time in performing the ranking of the web pages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A novel approach for ranking web documents based on query-optimized personalized pagerank

Article 18 August 2020

A Comprehensive Study of Page-Rank Algorithm

Query-Optimized PageRank: A Novel Approach

References

Alam, M., Sadaf, K.: A review on clustering of web search result. In: Advances in Computing & Information Technology, AISC, vol. 177, pp. 153–159. Springer, Heidelberg (2013)
Google Scholar
Steinbach, M., Karypis, G., Kumar, V.: A comparison of document clustering techniques (2000)
Google Scholar
Leuski, A., Allan, J.: Improving interactive retrieval by combining ranked lists and clustering (2000)
Google Scholar
Sheshasaayee, A., Thailambal, G.: Comparison of classification algorithms in text mining. Int. J. Pure Appl. Math. 116(22), 425–433 (2017)
Google Scholar
Jain, R., Purohit, G.N.: page ranking algorithms for web mining. Int. J. Comput. Appl. 13(5), 0975–8887 (2011)
Google Scholar
Srivastava, J., Cooley, R., Deshpande, M., Tan, P.-N.: Web usage mining: discovery and applications of usage patterns from web data. In: ACM SIGKDD, January 2000
Article Google Scholar
Wang, Z.: Improved link-based algorithms for ranking web pages. In: WAIM. LNCS, vol. 3129, pp. 291–302. Springer, Heidelberg (2004)
Chapter Google Scholar
Yates, R.B., Hurtado, C., Mendoza, M.: Query clustering for boosting web page ranking. In: AWIC 2004. LNAI, vol. 3034, pp. 164–175. Springer, Heidelberg (2004)
Google Scholar
Irfan, S., Ghosh, S.: A review on different ranking algorithms. In: International Conference on Advances in Computing, Communication Control and Networking IEEE ICACCCN (2018)
Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the Seventh International World Wide Web Conference (1998)
Google Scholar
Masterton, G., Olsson, E.J.: From impact to importance: the current state of the wisdom-of-crowds justification of link-based ranking algorithms. Philos. Technol. 31, 593–609 (2018)
Article Google Scholar
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46, 604–632 (1999)
Article MathSciNet Google Scholar
Xing, W., Ghorbani, A.: Weighted pagerank algorithm. In: Proceedings of the Second Annual Conference on Communication Networks and Services Research. IEEE (2004)
Google Scholar
Fujimura, K., Inoue, T., Sugisaki, M.: The eigenrumor algorithm for ranking blogs. In: WWW (2005)
Google Scholar
Bidoki, A.M.Z., Yazdani, N.: DistanceRank: an intelligent ranking algorithm for web pages. Inf. Process. Manag. 44, 877–892 (2007)
Article Google Scholar
Jiang, H.: TIMERANK: a method of improving ranking scores by visited time. In: Proceedings of the Seventh International Conference Machine Learning and Cybernetics, Kunming, 12–15 July 2008 (2008)
Google Scholar
LaTorre, A., Pena, J.M., Robles, V., Perez, M.S.: A survey in web page clustering techniques (2019)
Google Scholar
Sandhya, N., Govardhan, A.: Analysis of similarity measures with wordnet based text document clustering. In: Proceedings of the InConINDIA. AISC, vol. 132, pp. 703–714. Springer, Heidelberg (2012)
Google Scholar
Rafi, M., Shaikh, M.S.: An improved semantic similarity measure for document clustering based on topic maps (2013)
Google Scholar
Markov, Z., Larose, D.T.: Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage. Wiley, Hoboken (2007)
Google Scholar
Manning, C.D., Raghavan, P.: An Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008). Preliminary draft© 2008
Book Google Scholar

Download references

Author information

Authors and Affiliations

Galgotias University, Greater Noida, Uttar Pradesh, India
Shadab Irfan & Subhajit Ghosh

Authors

Shadab Irfan
View author publications
You can also search for this author in PubMed Google Scholar
Subhajit Ghosh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shadab Irfan .

Editor information

Editors and Affiliations

Department of CSE, RVS Technical Campus, Coimbatore, India
S. Smys
Faculty of Engineering, Faculdade de Engenharia da Universidade do Porto, Porto, Portugal
João Manuel R. S. Tavares
Faculty of Engineering, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
School of Computing, Tokyo Institute of Technology, Tokyo, Japan
Abdullah M. Iliyasu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Irfan, S., Ghosh, S. (2020). Efficient Ranking Framework for Information Retrieval Using Similarity Measure. In: Smys, S., Tavares, J., Balas, V., Iliyasu, A. (eds) Computational Vision and Bio-Inspired Computing. ICCVBIC 2019. Advances in Intelligent Systems and Computing, vol 1108. Springer, Cham. https://doi.org/10.1007/978-3-030-37218-7_141

Download citation

DOI: https://doi.org/10.1007/978-3-030-37218-7_141
Published: 07 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37217-0
Online ISBN: 978-3-030-37218-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Efficient Ranking Framework for Information Retrieval Using Similarity Measure

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A novel approach for ranking web documents based on query-optimized personalized pagerank

A Comprehensive Study of Page-Rank Algorithm

Query-Optimized PageRank: A Novel Approach

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Efficient Ranking Framework for Information Retrieval Using Similarity Measure

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A novel approach for ranking web documents based on query-optimized personalized pagerank

A Comprehensive Study of Page-Rank Algorithm

Query-Optimized PageRank: A Novel Approach

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation