Abstract
This paper presents the dense areas based algorithm for generating fuzzy rules for classification WWW documents. Description document clusters in the form of fuzzy rules (FR) make possible the presentation of information in the form fuzzy granules. Moreover, each cluster might be described by several fuzzy rules. These fuzzy rules can be used as the knowledge base for searching new information from WWW resources with regard to specific topics and users’ requirements.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Baldi, P., Frasconi, P., Smyth, P.: Modeling the Internet and the Web, Probabilistic Methods and Algorithms. Wiley, Chichester (2003)
Berry, M.W.: Survey of Text Mining, Clustering, Classification, and Retrieval. Springer, New York (2004)
Bo-Yeong, K., Dae-Won, K., Sang-Jo, L.: Exploiting concept clusters for content-based information retrieval. Information Sciences 170, 443–462 (2005)
Chakrabarti, S., Van den Berg, M., Dom, B.: Focused crawling: a new approach to topic-specific Web resource discovery. Computer Networks 31, 1623–1640 (1999)
Chiu, S.L.: Fuzzy model identification based on cluster estimation. J. Intell. Fuzzy Systems 2(3), 267–278 (1994)
Cho, J., Garcia-Molina, H., Page, L.: Efficient crawling through URL ordering. Computer Networks and ISDN Systems 30, 161–172 (1998)
Cortes, C., Vapnik, V.N.: Support vector networks. Machine Learning 20, 1–25 (1995)
Euntai, K., Minkee, P., Seunghwan, J., Mignon, P.: A new approach to fuzzy modeling. IEEE Transaction on Fuzzy Systems 5(3), 328–337 (1997)
Gulli, A., Signorini, A.: The Indexable Web is More than 11.5 bilion pages, http://www.cs.uiowa.edu/~asignori/pubs/web-size
Hastie, T., Tibshirani, R., Friedman, J.: Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, Heidelberg (2001)
Hersovici, M., Jacovi, M., Maarek, Y.S., Pelleg, D., Shtalhaim, M., Ur, S.: The shark-search algorithm - an application: tailored web site mapping. In: Proceedings of the Seventh International World Wide Web Conference, Brisbane, Australia, pp. 317–326 (1998)
Jones, S.K., Willet, P.: Readings in Information Retrieval. Morgan Kaufmann, San Francisco (1997)
Kłopotek, A.M.: Intelligent Search Engines (in Polish). EXIT, Warszawa (2001)
Kraft, D.H., Martin-Bautista, M.J., Chen, J., Sanchez, D.: Rules and fuzzy rules in text concept, extraction and usage. International Journal of Approximate Reasoning 34, 145–161 (2003)
Lam, W., Ho, C.Y.: Using a generalized instance set for automatic text categorization. In: Proc. SIGIR 1998, 21stACM Int. Conf. on Research and Development in Information Retrieval, pp. 81–89 (1998)
Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1978)
Rungsawang, A., Angkawattanawit, N.: Learnable topic-specific web crawler. Journal of Network and Computer Application 28, 97–114 (2005)
Rutkowska, D., Piliński, M., Rutkowski, L.: Neural Networks, Genetic Algorithms and Fuzzy Systems (in Polish). PWN, Warszawa (1999)
Rutkowska, D.: Neuro–Fuzzy Architectures and Hybrid Learning. Springer, Heidelberg (2002)
Rutkowski, L.: Artificial Inteligence Methods and Techniques (in Polish). PWN, Warszawa (2005)
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys, 1–47 (2002)
Tao, C.W.: Unsupervised fuzzy clustering with multi-center clusters. Fuzzy Sets and Systems 128, 305–322 (2002)
Zadeh, L.A.: Fuzzy logic = Computing with words. IEEE Transactions of Fuzzy Systems 4(2) (1996)
Zadeh, L.A.: Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets and Systems 90, 111–127 (1997)
Source test data: http://www.cs.rochester.edu/trs
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dziwiñski, P., Rutkowska, D. (2006). Algorithm for Generating Fuzzy Rules for WWW Document Classification. In: Rutkowski, L., Tadeusiewicz, R., Zadeh, L.A., Żurada, J.M. (eds) Artificial Intelligence and Soft Computing – ICAISC 2006. ICAISC 2006. Lecture Notes in Computer Science(), vol 4029. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11785231_116
Download citation
DOI: https://doi.org/10.1007/11785231_116
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35748-3
Online ISBN: 978-3-540-35750-6
eBook Packages: Computer ScienceComputer Science (R0)