Skip to main content

A Comparative Study of Classifiers for Extractive Text Summarization

  • Conference paper
  • First Online:
Machine Learning and Information Processing

Abstract

Automatic text summarization (ATS) is a widely used approach. Through the years, various techniques have been implemented to produce the summary. An extractive summary is a traditional mechanism for information extraction, where important sentences are selected which refers to the basic concepts of the article. In this paper, extractive summarization has been considered as a classification problem. Machine learning techniques have been implemented for classification problems in various domains. To solve the summarization problem in this paper, machine learning is taken into consideration, and KNN, random forest, support vector machine, multilayer perceptron, decision tree and logistic regression algorithm have been implemented on Newsroom dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Luhn, H.P. 1958. The automatic creation of literature abstracts. IBM Journal of Research and Development 2 (2): 159–165.

    Article  MathSciNet  Google Scholar 

  2. Gambhir, M., and V. Gupta. 2017. Recent automatic text summarization techniques: A survey. Artificial Intelligence Review 47 (1): 1–66.

    Article  Google Scholar 

  3. Meena, Y.K., and D. Gopalani. 2014. Analysis of sentence scoring methods for extractive automatic text summarization. In Proceedings of the 2014 international conference on information and communication technology for competitive strategies, November 2014, 53. ACM.

    Google Scholar 

  4. Pattanaik, A., S. Sagnika, M. Das, and B.S.P. Mishra. 2019. Extractive summary: An optimization approach using bat algorithm. Ambient communications and computer systems, 175–186. Singapore: Springer.

    Chapter  Google Scholar 

  5. Joachims, T. 1998. Text categorization with support vector machines: Learning with many relevant features. In European conference on machine learning, April 1998, 137–142. Springer, Berlin, Heidelberg.

    Google Scholar 

  6. Nobata, C., S. Sekine, M. Murata, K. Uchimoto, M. Utiyama, H., and Isahara. 2001. Sentence extraction system assembling multiple evidence. In NTCIR.

    Google Scholar 

  7. Jafari, M., J. Wang, Y. Qin, M. Gheisari, A.S. Shahabi, and X. Tao. 2016. Automatic text summarization using fuzzy inference. In 22nd International conference on automation and computing (ICAC), September 2016, 256–260. IEEE.

    Google Scholar 

  8. Matsuo, Y., and M. Ishizuka. 2004. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13 (01): 157–169.

    Article  Google Scholar 

  9. NewsRoom Dataset Available (2017) Cornell Newsroom. https://summari.es. 2017.

  10. Powers, D.M. 2011. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation.

    Google Scholar 

  11. Davis, J., and M. Goadrich. 2006. The relationship between precision-recall and ROC curves. In Proceedings of the 23rd international conference on machine learning, June 2006, 233–240. ACM.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anshuman Pattanaik .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pattanaik, A., Mishra, S.S., Das, M. (2020). A Comparative Study of Classifiers for Extractive Text Summarization. In: Swain, D., Pattnaik, P., Gupta, P. (eds) Machine Learning and Information Processing. Advances in Intelligent Systems and Computing, vol 1101. Springer, Singapore. https://doi.org/10.1007/978-981-15-1884-3_16

Download citation

Publish with us

Policies and ethics