Abstract
KNN algorithm which is one of the best methods of text classifying in the vector space model (VSM) is a simple, example based and none-parameter method. But in the KNN algorithm, the fixed K value ignores the influence of the category and the document number of training text. So, selecting the correct K value can achieve better classification results. This paper proposes a kind of dynamic obtain k-valued for KNN classification algorithm, experimental results show that the dynamic obtain k-valued KNN classification algorithm with high performance.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Pang, J., Bu, D., Bai, S.: Based on vector space model the automatic Text Classification System and implementation. Computer Applications 18(9), 23–26 (2001)
Han, E.H., George, K., Vipin, K.: Text categorization using weight adjusted k-nearest neighbor classification: Technical Report. University of Minnesota (2000)
Deerwester, S., Dumas, S., Furnas, G., Landauer, T., Harsrtian, R.: Indexing by Laent Semantic Analysis. Journal of the American Society for Information Science 41(6), 391–417 (1994)
Zhang, X., Li, Y., Wang, H.: Using Characteristics of polymerization to improve the KNN algorithm for Chinese Text Classification. Northeastern University (Natural Science) (3), 229–232 (2003)
Liu, B., Yang, L., Yuan, F.: Improved KNN method and its application in Chinese text categorization. West China University of Technology (Natural Science) 27(2), 33 (2008)
Song, F., Gao, L.: The evaluation about text classification performance. Computer Engineering 30(13), 107–109 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gong, A., Liu, Y. (2011). Improved KNN Classification Algorithm by Dynamic Obtaining K. In: Shen, G., Huang, X. (eds) Advanced Research on Electronic Commerce, Web Application, and Communication. ECWAC 2011. Communications in Computer and Information Science, vol 143. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20367-1_51
Download citation
DOI: https://doi.org/10.1007/978-3-642-20367-1_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20366-4
Online ISBN: 978-3-642-20367-1
eBook Packages: Computer ScienceComputer Science (R0)