Performance of ELM Using Max-Min Document Frequency-Based Feature Selection in Multilabeled Text Classification

Behera, Santosh Kumar; Dash, Rajashree

doi:10.1007/978-981-15-5971-6_46

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 194)

Abstract

In text classification, feature selection is used to reduce the feature space and thereby improve classification accuracy. In this paper, we propose a max-min document frequency-based feature selection method and apply an Extreme Learning Machine (ELM) model to improve text classification performance. We use the multilabel Reuters dataset, which consists of 10,788 documents. In our experiments, the ELM model using max-min document frequency-based feature selection performs better in terms of precision, recall, and F-measure than the ELM model using the full feature space without any feature selection technique.
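The pipeline outlined in the abstract can be illustrated with a minimal sketch. The abstract does not define the max-min criterion, so the code below assumes it means keeping terms whose document frequency falls between a lower and an upper threshold; the function `df_filter`, the thresholds `min_df` and `max_df`, the `ELM` class, the micro-averaged metric helper, and the toy data are all hypothetical and are not taken from the chapter. The ELM part follows the standard formulation (random hidden layer, output weights solved with the Moore-Penrose pseudo-inverse).

```python
import numpy as np

def df_filter(X, min_df, max_df):
    """Return indices of terms whose document frequency is in [min_df, max_df].
    Assumption: this thresholding stands in for the chapter's max-min criterion."""
    df = (X > 0).sum(axis=0)  # number of documents containing each term
    return np.where((df >= min_df) & (df <= max_df))[0]

class ELM:
    """Basic single-hidden-layer ELM with sigmoid activation."""
    def __init__(self, n_hidden, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, Y):
        # Input weights and biases are random and never trained.
        self.W = self.rng.standard_normal((X.shape[1], self.n_hidden))
        self.b = self.rng.standard_normal(self.n_hidden)
        H = self._hidden(X)
        # Output weights: least-squares solution via the pseudo-inverse.
        self.beta = np.linalg.pinv(H) @ Y
        return self

    def predict(self, X, threshold=0.5):
        # One independent 0/1 decision per label (multilabel output).
        return (self._hidden(X) @ self.beta >= threshold).astype(int)

def micro_prf(Y_true, Y_pred):
    """Micro-averaged precision, recall, and F-measure over all labels."""
    tp = np.sum((Y_true == 1) & (Y_pred == 1))
    fp = np.sum((Y_true == 0) & (Y_pred == 1))
    fn = np.sum((Y_true == 1) & (Y_pred == 0))
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f

# Toy document-term count matrix (4 documents, 5 terms) and 2 labels.
X = np.array([[1, 0, 2, 0, 1],
              [1, 1, 0, 0, 1],
              [1, 0, 0, 3, 1],
              [1, 1, 1, 0, 0]], dtype=float)
Y = np.array([[1, 0], [0, 1], [1, 1], [0, 1]])

keep = df_filter(X, min_df=2, max_df=3)  # drops the ubiquitous and the rare term
model = ELM(n_hidden=10).fit(X[:, keep], Y)
pred = model.predict(X[:, keep])
precision, recall, f_measure = micro_prf(Y, pred)
```

In this toy matrix the first term appears in every document and the fourth in only one, so both are discarded and the ELM is trained on the remaining three columns. Comparing `micro_prf` for a model trained on `X[:, keep]` against one trained on the full `X` mirrors the comparison the abstract reports, though on real data the split into training and test documents matters and is omitted here.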

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Siksha ‘O’ Anusandhan Deemed to be University, Bhubaneswar, Odisha, India
Santosh Kumar Behera & Rajashree Dash

Corresponding author

Correspondence to Santosh Kumar Behera.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Siksha ‘O’ Anusandhan Deemed to be University, Bhubaneswar, Odisha, India
Debahuti Mishra
University of Melbourne, Melbourne, VIC, Australia
Rajkumar Buyya
Department of Computer Science, University of California, Davis, CA, USA
Prasant Mohapatra
Department of Computer Science and Engineering, Siksha ‘O’ Anusandhan Deemed to be University, Bhubaneswar, Odisha, India
Srikanta Patnaik

About this paper

Cite this paper

Behera, S.K., Dash, R. (2021). Performance of ELM Using Max-Min Document Frequency-Based Feature Selection in Multilabeled Text Classification. In: Mishra, D., Buyya, R., Mohapatra, P., Patnaik, S. (eds) Intelligent and Cloud Computing. Smart Innovation, Systems and Technologies, vol 194. Springer, Singapore. https://doi.org/10.1007/978-981-15-5971-6_46

DOI: https://doi.org/10.1007/978-981-15-5971-6_46
Published: 31 October 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5970-9
Online ISBN: 978-981-15-5971-6
eBook Packages: Intelligent Technologies and Robotics (R0)
