Inclusive Review on Extractive and Abstractive Text Summarization: Taxonomy, Datasets, Techniques and Challenges

Mishra, Gitanjali; Sethi, Nilambar; Agilandeeswari, L.

doi:10.1007/978-3-031-35501-1_7

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 716)

Included in the following conference series:

International Conference on Intelligent Systems Design and Applications

Abstract

Summarization is the task of condensing a lengthy text to a manageable length while preserving its essential information and meaning. Manual summarization is time-consuming and generally arduous, yet the need for it keeps growing, which is a major driving force behind academic research on automating the task. Automatic Text Summarization (ATS) has significant uses in a variety of Natural Language Processing (NLP) tasks, including text classification, question answering, legal text summarization, news summarization, and headline generation. It is an emerging research field that draws many researchers from major companies such as Google, Microsoft, and Facebook. This motivates us to present an inclusive review of extractive and abstractive summarization techniques for various inputs. In this paper, we present a comparative study of different models, classified by the techniques they use. In some places we also classify them by the dataset used, for better understanding, and we present the parametric evaluation of these techniques along with their challenges. The study thus gives a clear view of the current state of text summarization techniques and provides a roadmap for new researchers in this field.

Author information

Authors and Affiliations

GIET University, Gunupur, 765022, Odisha, India
Gitanjali Mishra & Nilambar Sethi
School of Information Technology and Engineering, VIT Vellore, Vellore, 632014, TN, India
L. Agilandeeswari

Corresponding authors

Correspondence to Gitanjali Mishra or L. Agilandeeswari.

Editor information

Editors and Affiliations

Faculty of Computing and Data Science, FLAME University, Pune, Maharashtra, India
Ajith Abraham
Center for Smart Computing Continuum, Burgenland, Austria
Sabri Pllana
University of Bari, Bari, Italy
Gabriella Casalino
University of Jinan, Jinan, Shandong, China
Kun Ma
Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, Punjab, India
Anu Bajaj

Cite this paper

Mishra, G., Sethi, N., Agilandeeswari, L. (2023). Inclusive Review on Extractive and Abstractive Text Summarization: Taxonomy, Datasets, Techniques and Challenges. In: Abraham, A., Pllana, S., Casalino, G., Ma, K., Bajaj, A. (eds) Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 716. Springer, Cham. https://doi.org/10.1007/978-3-031-35501-1_7

DOI: https://doi.org/10.1007/978-3-031-35501-1_7
Published: 03 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35500-4
Online ISBN: 978-3-031-35501-1
eBook Packages: Intelligent Technologies and Robotics (R0)
