Abstract
The bloom of Internet has made fast text categorization very essential. Generally, the popular methods have good classification accuracy but slow speed, and vice versa. This paper proposes a novel approach for fast text categorization, in which a collaborative work framework based on a linear classifier and an extreme learning machine (ELM) is constructed. The linear classifier, obtained by a modified non-negative matrix factorization algorithm, maps all documents from the original term space into the class space directly such that it performs classification very fast. The ELM, with good classification accuracy via some nonlinear and linear transformations, classifies a few of documents according to some given criteria to improve the classification quality of the total system. Experimental results show that the proposed method not only achieves good accuracy but also performs classification very fast, which improves the averaged speed by 180 % compared with its corresponding method.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Cao, J., Lin, Z., Huang, G.B., Liu, N.: Voting based extreme learning machine. Inf. Sci. 185(1), 66–77 (2012)
De Souza, A., Pedroni, F., Oliveira, E., Ciarelli, P., Henrique, W., Veronese, L., Badue, C.: Automated multi-label text categorization with vg-ram weightless neural networks. Neurocomputing 72(10–12), 2209–2217 (2009)
Gabrilovich, E., Markovitch, S.: Text categorization with many redundant features: using aggressive feature selection to make svms competitive with c4. 5. In: International Conference on Machine Learning, pp. 321–328 (2004)
Henzinger, M., Chang, B.W., Milch, B., Brin, S.: Query-free news search. World Wide Web 8(2), 101–126 (2005)
Hmeidi, I., Hawashin, B., El-Qawasmeh, E.: Performance of knn and svm classifiers on full word arabic articles. Adv. Eng. Inform. 22(1), 106–111 (2008)
Huang, G., Chen, L.: Convex incremental extreme learning machine. Neurocomputing 70(16–18), 3056–3062 (2007)
Huang, G., Ding, X., Zhou, H.: Optimization method based extreme learning machine for classification. Neurocomputing 74, 155–163 (2010)
Huang, G., Wang, D., Lan, Y.: Extreme learning machines: a survey. Int. J. Mach. Learn. Cybern. 2(2), 107–122 (2011)
Huang, G., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multi-class classification. IEEE Trans. Syst. Man Cybern. Part B 42(2), 513–529 (2011)
Huang, G., Zhu, Q., Siew, C.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of the International Joint Conference on Neural Networks, vol. 2, pp. 985–990 (2004)
Huang, G., Zhu, Q., Siew, C.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2006)
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: 10th European Conference on Machine Learning, pp. 137–142 (1998)
Kumar, M.A., Gopal, M.: A comparison study on multiple binary-class svm methods for unilabel text categorization. Pattern Recogn. Lett. 31(11), 1437–1444 (2010)
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, pp. 556–562 (2001)
Lewis, D., Ringuette, M.: A comparison of two learning algorithms for text categorization. In: Third Annual Symposium on Document Analysis and Information Retrieval, vol. 33, pp. 81–93 (1994)
Li, W., Zhong, N., Yao, Y., Liu, J.: An operable email based intelligent personal assistant. World Wide Web 12(2), 125–147 (2009)
Liu, Y., Loh, H., Tor, S.: Comparison of extreme learning machine with support vector machine for text classification. Innov. Appl. Artif. Intell. 3533, 390–399 (2005)
Man, Z., Lee, K., Wang, D., Cao, Z., Khoo, S.: Robust single-hidden layer feedforward network-based pattern classifier. IEEE Trans. Neural Netw. Learning Syst. 23(12), 1974–1986 (2012)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)
Savitha, R., Suresh, S., Sundararajan, N.: Fast learning circular complex-valued extreme learning machine (cc-elm) for real-valued classification problems. Inf. Sci. 187(1), 277–290 (2012)
Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)
Silva, C., Ribeiro, B.: Knowledge extraction with non-negative matrix factorization for text classification. In: Proceedings of Intelligent Data Engineering and Automated Learning, vol. 5788, pp. 300–308 (2009)
Soucy, P., Mineau, G.: A simple knn algorithm for text categorization. In: International Conference on Data Mining, pp. 647–648 (2001)
Wang, X., Chen, A., Feng, H.: Upper integral network with extreme learning mechanism. Neurocomputing 74(16), 2520–2525 (2011)
Xing, H.J., Wang, X.M.: Training extreme learning machine via regularized correntropy criterion. Neural Comput. Appl. (2012). doi:10.1007/s00521-012-1184-y
Yang, Y., Chute, C.: An example-based mapping method for text categorization and retrieval. ACM Trans. Inf. Syst. 12(3), 252–277 (1994)
Yeung, D.S., Ng, W.W., Wang, D., Tsang, E.C., Wang, X.Z.: Localized generalization error model and its application to architecture selection for radial basis function neural network. IEEE Trans. Neural Netw. 18(5), 1294–1305 (2007)
Zakos, J., Verma, B.: A novel context-based technique for web information retrieval. World Wide Web 9(4), 485–503 (2006)
Zhang, R., Huang, G., Sundararajan, N., Saratchandran, P.: Multicategory classification using an extreme learning machine for microarray gene expression cancer diagnosis. IEEE/ACM Trans. Comput. Biol. Bioinf. 4(3), 485–495 (2007)
Zhang, T., Oles, F.: Text categorization based on regularized linear classification methods. Inf. Retr. 4, 5–31 (2001)
Zheng, W., Qian, Y., Lu, H.: Text categorization based on regularization extreme learning machine. Neural Comput. Appl. 22(3), 447–456 (2013)
Zheng, W., Zhang, H., Qian, Y.: Fast text categorization based on collaborative work in the semantic and class spaces. In: International Conference on Machine Learning and Cybernetics, vol. 4, pp. 1497–1502 (2011)
Zhu, Q., Qin, A., Suganthan, P., Huang, G.: Evolutionary extreme learning machine. Pattern Recogn. 38(10), 1759–1763 (2005)
Zong, W., Huang, G.B.: Face recognition based on extreme learning machine. Neurocomputing 74(16), 2541–2551 (2011)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported by the 973 Program (No. 2012CB316400), and the National Natural Science Foundation of China (No. 11202202, No. 61272315 and No. 61171151), and the Natural Science Foundation of Zhejiang Province (No. Y6110147).
Rights and permissions
About this article
Cite this article
Zheng, W., Tang, H. & Qian, Y. Collaborative work with linear classifier and extreme learning machine for fast text categorization. World Wide Web 18, 235–252 (2015). https://doi.org/10.1007/s11280-013-0225-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-013-0225-5