Abstract
With the explosive growth of information sources available on the World Wide Web, how to combine the results of multiple search engines has become a valuable problem. In this paper, a search strategy based on genetic simulated annealing for search engines in Web mining is proposed. According to the proposed strategy, there exists some important relationship among Web statistical studies, search engines and optimization techniques. We have proven experimentally the relevance of our approach to the presented queries by comparing the qualities of output pages with those of the original downloaded pages, as the number of iterations increases better results are obtained with reasonable execution time.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Crestani F, Pasi G. Soft Computing in Information Retrieval: Techniques and Application[M]. Heidelberg: Springer-Verlag, 2000: 154–164.
Kim S, Zhang B T. Web Document Retrieval by Genetic Learning of Importance Factors for Html Tags[C]// Int’l Workshop on Text and Web Mining. New York: Springer-Verlag, 2000: 13–23.
Boughanem M, Chrisment C, Mothe J, et al. Connectionist and Genetic Approaches for Information Retrieval[C]// Soft Computing in Information Retrieval:Techniques and Applications. Heidelberg: Springer-Verlag, 2000:102–121.
Loia V, Luongo P. An Evolutionary Approach to Automatic Web Page Categorization and Updating[M]. Singapore: Springer-Verlag, 2001: 292–302.
Martino V, Mililotti M. Sub-Optimal Scheduling in a Grid Using Genetic Algorithm[J]. Parallel Computing, 2004, 30(5): 553–565.
Etzioni O, Perkowitz M. Adaptive Web Sites:An AI Challenge[C]// Proc of the 15th Int’l Joint Conf Artificial Intelligence. San Francisco: Morgan Kaufmann Publishers, 1997: 16–21.
Shu Wanneng, Zheng Shijue. A Parallel Genetic Simulated Annealing Hybrid Algorithm for Task Scheduling[J]. Wuhan University Journal of Natural Sciences, 2006, 12(5): 1378–1382.
Abraham A, Buyya R. Nature’s Heuristics for Scheduling Jobs on Computational Grids[EB/OL]. [2007-12-21]. http://www.softcomputing.net/adcom.pdf .
Zhang Jiangshe, Xu Zongben, Liang Yi. Global Annealing Genetic Algorithm and Its Convergence Well Necessary Condition[J]. Science in China, 1997, 27(2): 154–164.
Lu Shan, Chen Tong, Xu Shijie. Optimal Lambert Transfer Based on Adaptive Simulated Annealing Genetic Algorithm[J]. Journal of Beijing University of Aeronautics and Astronautics, 2007, 33(10): 1191–1195(Ch).
Wang Xia, Zhou Guobiao. Strong Convergence of Global Annealing Genetic Algorithm[J]. Mathematica Applicata, 2003, 16(3): 1–7.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, H., Zou, B. & Bian, N. Optimization of web search engine and its application to web mining. Wuhan Univ. J. Nat. Sci. 14, 115–118 (2009). https://doi.org/10.1007/s11859-009-0204-y
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11859-009-0204-y