Abstract
This paper presents a new methodology for the automatic development of a multilingual Web portal for multilingual knowledge discovery and management. It aims to provide an efficient and effective framework for selecting and organizing knowledge from voluminous, linguistically diverse Web content. To achieve this, a concept-based approach that incorporates text mining and Web content mining using neural network and fuzzy techniques is proposed. First, a concept-based taxonomy of themes, which will act as the hierarchical backbone of the Web portal, is automatically generated. Second, a concept-based multilingual Web crawler is developed to intelligently harvest relevant multilingual documents from the Web. Finally, a concept-based multilingual text categorization technique is proposed to organize multilingual documents by concepts. As such, correlated multilingual Web documents can be gathered, filtered, and organized based on their semantic content to facilitate high-performance multilingual information access.
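The abstract does not spell out the categorization step, so the following is only an illustrative sketch of how a fuzzy technique could assign a document to concepts with graded membership. It uses a fuzzy k-nearest-neighbor rule with inverse-distance weighting and assumes documents have already been mapped to language-independent concept vectors. The function name `fuzzy_knn`, the sample vectors, and the labels `finance` and `sports` are hypothetical and not taken from the paper.

```python
import math

def fuzzy_knn(query, samples, k=3, m=2.0):
    """Return a dict mapping each label to a fuzzy membership in [0, 1].

    samples: list of (concept_vector, label) pairs (hypothetical training data).
    m:       fuzzifier (> 1); larger values flatten the distance weighting.
    """
    eps = 1e-9  # avoids division by zero when query equals a sample

    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    # Keep the k samples closest to the query in concept space.
    nearest = sorted(samples, key=lambda s: dist(query, s[0]))[:k]

    # Inverse-distance weights: closer neighbors contribute more.
    weights = [1.0 / (dist(query, vec) ** (2.0 / (m - 1.0)) + eps)
               for vec, _ in nearest]
    total = sum(weights)

    memberships = {}
    for w, (_, label) in zip(weights, nearest):
        memberships[label] = memberships.get(label, 0.0) + w / total
    return memberships

# Hypothetical concept vectors; the source language of each document is
# irrelevant once it is expressed in the shared concept space.
samples = [
    ([1.0, 0.0], "finance"),
    ([0.9, 0.1], "finance"),
    ([0.0, 1.0], "sports"),
    ([0.1, 0.9], "sports"),
]
memberships = fuzzy_knn([0.8, 0.2], samples, k=3)
print(memberships)
```

Because the memberships sum to one, a document that sits between two themes receives a partial degree of membership in each rather than a single hard label, which is the property that makes a fuzzy rule attractive for organizing documents by concept.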
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chau, R., Yeh, CH., Smith-Miles, K. (2006). Fuzzy-neuro Web-Based Multilingual Knowledge Management. In: Wang, L., Jiao, L., Shi, G., Li, X., Liu, J. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2006. Lecture Notes in Computer Science, vol 4223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881599_154
DOI: https://doi.org/10.1007/11881599_154
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45916-3
Online ISBN: 978-3-540-45917-0
eBook Packages: Computer Science, Computer Science (R0)