Abstract
Most Web search engines use the content of the Web documents and their link structures to assess the relevance of the document to the user’s query. With the growth of the information available on the web, it becomes difficult for such Web search engines to satisfy the user information need expressed by few keywords. First, personalized information retrieval is a promising way to resolve this problem by modeling the user profile by his general interests and then integrating it in a personalized document ranking model. In this paper, we present a personalized search approach that involves a graph-based representation of the user profile. The user profile refers to the user interest in a specific search session defined as a sequence of related queries. It is built by means of score propagation that allows activating a set of semantically related concepts of reference ontology, namely the ODP. The user profile is maintained across related search activities using a graph-based merging strategy. For the purpose of detecting related search activities, we define a session boundary recognition mechanism based on the Kendall rank correlation measure that tracks changes in the dominant concepts held by the user profile relatively to a new submitted query. Personalization is performed by re-ranking the search results of related queries using the user profile. Our experimental evaluation is carried out using the HARD 2003 TREC collection and showed that our session boundary recognition mechanism based on the Kendall measure provides a significant precision comparatively to other non-ranking based measures like the cosine and the WebJaccard similarity measures. Moreover, results proved that the graph-based search personalization is effective for improving the search accuracy.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Alexandru CP, Wolfgang N, Raluca P, Christian K (2005) Using odp metadata to personalize search. In: SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval, pp 178–185
Allan J (2003) Hard track overview in trec 2003: high accuracy retrieval from documents. In: TREC, pp 24–37
Allan J et al (2003) Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002. SIGIR Forum 37(1): 31–47
Begg IM, Gnocato J, Moore WE (1993) A prototype intelligent user interface for real-time supervisory control systems. In: IUI ’93: Proceedings of the 1st international conference on Intelligent user interfaces. ACM Press, New York, pp 211–214
Boughanem M, Chrisment C, Mothe J, Soul-Dupuy C, Tamine L (2000) Connectionist and genetic approaches to achieve IR. In: Crestani F, Gabriella P (eds) Soft computing in information retrieval techniques and applications. Springer, Berlin, pp 173–198
Boughanem M, Tamine L (2002) A study on genetic niching for query optimisation in document retrieval. In: Europeen colloquium on information retrieval, Glasgow, 25–27 March 2002, pp 135–149
Chen M-S, Park JS, Yu PS (1998) Efficient data mining for path traversal patterns. IEEE Trans Knowl Data Eng 10(2): 209–221
Cooley R, Mobasher B, Srivastava J (1999) Data preparation for mining world wide web browsing patterns. Knowl Inf Syst 1: 5–32
Daoud M, Tamine L, Boughanem M (2008) Learning user interests for session-based personalized search. In: ACM Information Interaction in context (IIiX). ACM Press, London, pp 57–64
Daoud M, Tamine L, Boughanem M, Chebaro B (2009) A session based personalized search using an ontological user profile. In: ACM symposium on applied computing (SAC), Haiwai (USA). ACM Press, London, pp 1031–1035
Ding C, Patra JC, Peng FC (2005) Personalized web search with self-organizing map. In: EEE ’05: Proceedings of the 2005 IEEE international conference on e-Technology, e-Commerce and e-Service (EEE’05) on e-Technology, e-Commerce and e-Service. IEEE Computer Society, Washington, DC, pp 144–147
Dumais S, Cadiz ECJJ, Jancke G, Sarin R, Daniel CR (2003) Stuff i’ve seen: a system for personal information retrieval and re-use. In: SIGIR’03: Proceedings of the 26th annual international ACM SIGIR conference on research and development in informaion retrieval (SIGIR ’03). ACM Press, London, pp 72–79
Feng Q, Junghoo C (2006) Automatic identification of user interest for personalized search. In: WWW ’06: Proceedings of the 15th international conference on world wide web, pp 727–736
Foss A, Wang W, Zaane OR (2001) A non-parametric approach to web log analysis. In: Workshop on webmining in first international SIAM conference on data mining, pp 41–50
Gauch S, Chaffee J, Pretschner A (2003) Ontology-based personalized search and browsing. Web Intell Agent Syst 1(3–4): 219–234
Gowan J (2003) A multiple model approach to personalised information access. Master thesis in computer science, Faculty of science, Universitt de College Dublin
Haveliwala TH, Gionis A, Klein D, Indyk P (2002) Evaluating strategies for similarity search on the web. In: WWW’02: Proceedings of the eleventh international world wide web conference, pp 432–442
He D (2000) Detecting session boundaries from web user logs. In: Proceedings of the BCS-IRSG 22nd annual colloquium on information retrieval research, pp 57–66
Huang X, Peng F, An A, Schuurmans D (2004) Dynamic web log session identification with statistical language models. J Am Soc Inf Sci Technol 55(14): 1290–1303
Huang X, Yao Q, An A (2006) Applying language modeling to session identification from database trace logs. Knowl Inf Syst 10(4): 473–504
Jaime T, T., DS, Eric H (2005) Personalizing search via automated analysis of interests and activities. In: SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval, pp 449–456
Jansen BJ, Spink A, Kathuria V (2006) How to define searching sessions on web search engines. In: Advances in web mining and web usage analysis, 8th international workshop on knowledge discovery on the web, WebKDD 2006, Philadelphia, pp 92–109
Jeh G, Widom J (2003) Scaling personalized web search. In: WWW ’03: Proceedings of the 12th international conference on world wide web. ACM Press, New York, pp 271–279
John RI, Mooney GJ (2001) Fuzzy user modeling for information retrieval on the world wide web. Knowl Inf Syst 3(1): 81–95
Kelly D, Teevan J (2003) Implicit feedback for inferring user preference: a bibliography. SIGIR Forum 37(2): 18–28
Kim HR, Chan PK (2003) Learning implicit user interest hierarchy for context in personalization. In: IUI ’03: Proceedings of the 8th international conference on Intelligent user interfaces. ACM Press, New York, pp 101–108
Koutrika G, Ioannidis Y (2005) A unified user profile framework for query disambiguation and personalization. In: Proceedings of workshop on new technologies for personalized information access
Leung CW, Chan SC, Chung F (2006) A collaborative filtering framework based on fuzzy association rules and multiple-level similarity. Knowl Inf Syst 10(3): 357–381
Lieberman H (1995) Letizia: an agent that assists web browsing. In: IJCAI 95: Proceedings of international joint proceedings of the fourteenth international joint conference on artificial intelligence, pp 924–929
Lieberman H (1997) Autonomous interface agents. In: CHI, pp 67–74
Lin C, Xue G, Zeng H, YU Y (2005) Using probabilistic latent semantic analysis for personalised web search. In: Proceedings of the APWeb conference, pp 707–711
Liu F, Yu C, Meng W (2004) Personalized web search for improving retrieval effectiveness. IEEE Trans Knowl Data Eng 16(1): 28–40
Ma Z, Pant G, Sheng ORL (2007) Interest-based personalized search. ACM Trans Inf Syst 25(1): 5
Maguitman AG, Menczer F, Roinestad H, Vespignani A (2005) Algorithmic detection of semantic similarity. In: WWW ’05: Proceedings of the 14th international conference on World Wide Web. ACM Press, New York, pp 107–116
Menczer F, Pant G, Srinivasan P (2004) Topical web crawlers: evaluating adaptive algorithms. ACM Trans Interet Technol 4(4): 378–419
Micarelli A, Sciarrone F (2004) Anatomy and empirical evaluation of an adaptive web-based information filtering system. User Model User-Adapt Interact 14(2–3): 159–200
Mobasher B (2007) Data mining for web personalization. In: Brusilovsky P, Kobsa A, Nejdl W (eds) Lecture notes in computer science. Springer, New York
Rocchio JJ (1971) Relevance feedback in information retrieval. In: Salton G. (ed) The SMART retrieval system—experiments in automatic document processing. Prentice-Hall, Englewood Cliffs
Shahabi C, Chen Y-S (2003) Web information personalization: Challenges and approaches. In: Bianchi-Berthouze N (ed) DNIS. Lecture notes in computer science, vol 2822. Springer, Berlin, pp 5–15
Shen D, Chen Z, Yang Q, Zeng H-J, Zhang B, Lu Y, Ma W-Y (2004) Web-page classification through summarization. In: SIGIR ’04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. ACM Press, New York, pp 242–249
Shen X, Tan B, Zhai C (2005a) Context-sensitive information retrieval using implicit feedback. In: SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM Press, New York, pp 43–50
Shen X, Tan B, Zhai C (2005b) Implicit user modeling for personalized search. In: CIKM ’05: Proceedings of the 14th ACM international conference on Information and knowledge management. ACM Press, New York, pp 824–831
Sieg A, Mobasher B, Burke R (2007) Web search personalization with ontological user profiles. In: CIKM’07: Proceedings of the sixteenth ACM conference on conference on information and knowledge management. ACM Press, New York, pp 525–534
Sieg A, Mobasher B, Burke R, Prabu G, Lytinen S (2004) Using concept hierarchies to enhance user queries in web-based information retrieval. In: The IASTED international conference on artificial intelligence and applications. Innsbruck, Austria
Spink A, Ozmutlu S, Ozmutlu HC, Jansen BJ (2002) U.S. versus European web searching trends. SIGIR Forum 36(2): 32–38
Srinivasan P, Menczer F, Pant G (2005) A general evaluation framework for topical crawlers. Inf Retr 8(3): 417–447
Sriram S, Shen X, Zhai C (2004) A session-based search engine. In: SIGIR’04: Proceedings of the international ACM SIGIR conference
Tamine L, Boughanem M, Daoud M (2009) Evaluation of contextual information retrieval: overview of issues and research. Knowl Inf Syst (Kais) (to appear)
Tamine L, Boughanem M, Zemirli WN (2008) Personalized document ranking: exploiting evidence from multiple user interests for profiling and retrieval. J Digit Inf Manage 6(5): 354–365
Tan A-H, Ong H-L, Pan H, Ng J, Li Q-X (2004) Towards personalised web intelligence. Knowl Inf Syst 6(5): 595–616
Tan B, Shen X, Zhai C (2006) Mining long-term search history to improve search accuracy. In: KDD ’06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM Press, New York, pp 718–723
Wang W, Zanane OR (2002) Clustering web sessions by sequence alignment. In: Proceedings of the 13th international workshop on database and expert systems applications (DEXA 2002). Springer, Aix-en-Provence, pp 394–398
Webb G, Pazzani M, Billsus D (2001) Machine learning for user modeling. User Model User-Adapt Interact 11(1–2): 19–29
Wu X, Kumar V, Ross Quinlan J, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Yu PS, Zhou Z-H, Steinbach M, Hand DJ, Steinberg D (2007) Top 10 algorithms in data mining. Knowl Inf Syst 14(1): 1–37
Zhicheng D, Ruihua S, Ji-Rong W (2007) A large-scale evaluation and analysis of personalized search strategies. In: WWW ’07: Proceedings of the 16th international conference on World Wide Web, pp 581–590
Zhou Y, Croft WB (2008) Measuring ranked list robustness for query performance prediction. Knowl. Inf. Syst. 16(2): 155–171
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Daoud, M., Lechani, LT. & Boughanem, M. Towards a graph-based user profile modeling for a session-based personalized search. Knowl Inf Syst 21, 365–398 (2009). https://doi.org/10.1007/s10115-009-0232-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-009-0232-0