Abstract
Automatic text summarization (ATS) is a widely used approach. Through the years, various techniques have been implemented to produce the summary. An extractive summary is a traditional mechanism for information extraction, where important sentences are selected which refers to the basic concepts of the article. In this paper, extractive summarization has been considered as a classification problem. Machine learning techniques have been implemented for classification problems in various domains. To solve the summarization problem in this paper, machine learning is taken into consideration, and KNN, random forest, support vector machine, multilayer perceptron, decision tree and logistic regression algorithm have been implemented on Newsroom dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Luhn, H.P. 1958. The automatic creation of literature abstracts. IBM Journal of Research and Development 2 (2): 159–165.
Gambhir, M., and V. Gupta. 2017. Recent automatic text summarization techniques: A survey. Artificial Intelligence Review 47 (1): 1–66.
Meena, Y.K., and D. Gopalani. 2014. Analysis of sentence scoring methods for extractive automatic text summarization. In Proceedings of the 2014 international conference on information and communication technology for competitive strategies, November 2014, 53. ACM.
Pattanaik, A., S. Sagnika, M. Das, and B.S.P. Mishra. 2019. Extractive summary: An optimization approach using bat algorithm. Ambient communications and computer systems, 175–186. Singapore: Springer.
Joachims, T. 1998. Text categorization with support vector machines: Learning with many relevant features. In European conference on machine learning, April 1998, 137–142. Springer, Berlin, Heidelberg.
Nobata, C., S. Sekine, M. Murata, K. Uchimoto, M. Utiyama, H., and Isahara. 2001. Sentence extraction system assembling multiple evidence. In NTCIR.
Jafari, M., J. Wang, Y. Qin, M. Gheisari, A.S. Shahabi, and X. Tao. 2016. Automatic text summarization using fuzzy inference. In 22nd International conference on automation and computing (ICAC), September 2016, 256–260. IEEE.
Matsuo, Y., and M. Ishizuka. 2004. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13 (01): 157–169.
NewsRoom Dataset Available (2017) Cornell Newsroom. https://summari.es. 2017.
Powers, D.M. 2011. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation.
Davis, J., and M. Goadrich. 2006. The relationship between precision-recall and ROC curves. In Proceedings of the 23rd international conference on machine learning, June 2006, 233–240. ACM.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Pattanaik, A., Mishra, S.S., Das, M. (2020). A Comparative Study of Classifiers for Extractive Text Summarization. In: Swain, D., Pattnaik, P., Gupta, P. (eds) Machine Learning and Information Processing. Advances in Intelligent Systems and Computing, vol 1101. Springer, Singapore. https://doi.org/10.1007/978-981-15-1884-3_16
Download citation
DOI: https://doi.org/10.1007/978-981-15-1884-3_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1883-6
Online ISBN: 978-981-15-1884-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)