Abstract
This paper presents a new way to extract concept that can be used to improve text classification performance (precision and recall). The computational measure will be divided into two layers. The bottom layer called document layer is concerned with extracting the concepts of particular document and the upper layer called category layer is with finding the description and subject concepts of particular category. The relevant implementation algorithm that dramatically decreases the search space is discussed in detail. The experiment based on real-world data collected from Infor-Bank shows that the approach is superior to the traditional ones.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Tan A H (2001) Predictive self-organizing networks for text categorization. The 5th Pacific-Asia Conference on Knowiedge Discovery and Data Mining, Hong Kong.
Sebastiani F (2003) Machine learning in automated text categorization. ACM Computing Surveys. http://www.cvc.uab. es/shared/teach/a20368/AC-MCS00. pdf.
Lewis D D (1992) Feature selection and feature extraction for text categorization. Speech and Natural Language Workshop, San Francsico.
Han J W, Kamber M (2001) Data mining: concepts and techniques. California: Morgan Kaufmann.
Li C, Luc Z S, Li Y H (2002) Research on automatic classification of documents based on concept attributes. 2002 IEEE international Conference on Systems, Man and Cybernetics.
Bakus J, Kamel M, Carey T (2002) Extraction of text phrases using hierarchical grammar. The Fifteenth Canadian Conference on Artificial Intelligence (Al'2002), Ottawa.
Author information
Authors and Affiliations
Corresponding author
Additional information
Project supported by the National Natural Science Foundation of China (No. 60082003) and the National High Technology Research and Development Program of China (No. 863-306-ZD03-04-1).
About this article
Cite this article
Yuntao, Z., Ling, G., Yongcheng, W. et al. An effective concept extraction method for improving text classification performance. Geo-spat. Inf. Sci. 6, 66–72 (2003). https://doi.org/10.1007/BF02826953
Issue Date:
DOI: https://doi.org/10.1007/BF02826953