Abstract
In text classification feature selection is used to reduce the feature space to improve the classification accuracy. In this paper, we propose a method max-min document frequency-based feature selection and we applied Extreme Learning Machine (ELM) model to improvise the text classification performance. For this text classification, we used the multilabel Reuters dataset which consists of 10788 number of documents. In this experiment, the ELM model performs better using max-min document frequency-based feature selection in terms of precision, recall, and F-measure as is compared to the ELM model using full feature space without using any feature selection technique.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Roul, R. K., Gugnani, S., & Kalpeshbhai, S. M. : Clustering based feature selection using extreme learning machines for text classification. In 2015 Annual IEEE India Conference (INDICON) (pp. 1-6). IEEE (2015)
Guzella, T.S., Caminhas, W.M.: A review of machine learning approaches to spam filtering. Expert Systems with Applications 36(7), 10206–10222 (2009)
Idris, I., Selamat, A.: Improved email spam detection model with negative selection algorithm and particle swarm optimization. Applied Soft Computing 22, 11–27 (2014)
Zeng, J., Zhang, S.: Variable space hidden Markov model for topic detection and analysis. Knowledge-Based Systems 20(7), 607–613 (2007)
Jiang, L., Li, C., Wang, S., Zhang, L.: Deep feature weighting for naive Bayes and its application to text classification. Engineering Applications of Artificial Intelligence 52, 26–39 (2016)
Saeys, Y., Inza, I., & Larraaga, P. : A review of feature selection techniques in bioinformatics. bioinformatics, 23(19), 2507-2517 (2007)
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: A survey. Ain Shams engineering journal 5(4), 1093–1113 (2014)
Roul, R. K., & Sahay, S. K. : K-means and wordnet based feature selection combined with extreme learning machines for text classification. In International Conference on Distributed Computing and Internet Technology (pp. 103-112). Springer, Cham (2016)
Yin, Y., Zhao, Y., Zhang, B., Li, C., Guo, S.: Enhancing ELM by Markov Boundary based feature selection. Neurocomputing 261, 57–69 (2017)
Rehman, A., Javed, K., Babri, H.A.: Feature selection based on a normalized difference measure for text classification. Information Processing & Management 53(2), 473–489 (2017)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2017)
Asuncion, A., & Newman, D. : UCI machine learning repository (2007)
Dash, R., Dash, R., Mishra, D.: A hybridized rough-PCA approach of attribute reduction for high dimensional data set. European Journal of Scientific Research 44(1), 29–38 (2010)
Zheng, W., Qian, Y., Lu, H.: Text categorization based on regularization extreme learning machine. Neural Computing and Applications 22(3–4), 447–456 (2013)
Li, M., Xiao, P., & Zhang, J. : Text classification based on ensemble extreme learning machine. arXiv preprint arXiv:1805.06525 (2018)
Thaoma, M. :The Reuters Dataset.https://martin-thoma.com/nlp-reuters/. (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Behera, S.K., Dash, R. (2021). Performance of ELM Using Max-Min Document Frequency-Based Feature Selection in Multilabeled Text Classification. In: Mishra, D., Buyya, R., Mohapatra, P., Patnaik, S. (eds) Intelligent and Cloud Computing. Smart Innovation, Systems and Technologies, vol 194. Springer, Singapore. https://doi.org/10.1007/978-981-15-5971-6_46
Download citation
DOI: https://doi.org/10.1007/978-981-15-5971-6_46
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5970-9
Online ISBN: 978-981-15-5971-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)