Single Document Extractive Text Summarization Using Neural Networks and Genetic Algorithm

Chatterjee, Niladri; Jain, Gautam; Bajwa, Gurkirat Singh

doi:10.1007/978-3-030-01174-1_26

Niladri Chatterjee¹⁷,
Gautam Jain¹⁷ &
Gurkirat Singh Bajwa¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 858))

Included in the following conference series:

Science and Information Conference

1353 Accesses
3 Citations

Abstract

The presented paper proposes an extractive text summarization technique for single documents using Neural Networks and Genetic Algorithms. The Neural Network helps to define a fitness function to express mathematically the quality of the generated summary through six desired properties which are theme similarity, cohesion, sentiment, readability, aggregate similarity and sentence position. Genetic Algorithm maximizes the above-mentioned fitness function, and extracts the most important sentences to create the extractive summary. The results are compiled using DUC2002 data as a benchmark and calculated using the precision-recall technique. They are compared with techniques using Genetic Algorithm, Neural Network and a summarizer made by Microsoft. The comparison between the results clearly demonstrates the superiority of the technique and is very encouraging for future work in this area.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Single Extractive Text Summarization Based on a Genetic Algorithm

Extractive Single Document Summarization Using NSGA-II

Multi-document Text Summarization Based on Genetic Algorithm and the Relevance of Sentence Features

References

Li, S., Karatzoglou, A., Gentile, C.: Collaborative filtering bandits. In: The 39th ACM SIGIR, pp. 539–548 (2016)
Google Scholar
Gentile, C., Li, S., Kar, P., Karatzoglou, A., Etrue, E., Zappella, G.: On context-dependent clustering of bandits. In: The 34th ICML, pp. 1253–1262 (2017)
Google Scholar
Korda, N., Szorenyi, B., Li, S.: Distributed clustering of linear bandits in peer to peer networks. In: The 33rd ICML, pp. 1301–1309 (2016)
Google Scholar
Mani, I.: Automatic Summarization. John Benjamins Publishing Company (2001)
Google Scholar
Luhn, H.P.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)
Article MathSciNet Google Scholar
Edmundson, H.P.: New methods in automatic extracting. J. ACM 16(2), 264–285 (1969)
Article Google Scholar
Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Mani, I., Maybury, M.T. (eds.) Advances in Automatic Text Summarization, pp. 111–121. MIT Press (1999)
Google Scholar
Fattah, M.A., Ren, F.: GA, MR, FFNN, PNN and GMM based models for automatic text summarization. Comput. Speech Lang. 23(1), 126–144 (2009)
Article Google Scholar
Chatterjee, N., Bhardwaj, A.: Single document text summarization using random indexing and neural networks. In: KEOD 2010, pp. 171–176 (2010)
Google Scholar
Chatterjee, N., Mittal, A., Goyal, S.: Single document extractive text summarization using genetic algorithms. In: Third International Conference on Emerging Applications of Information Technology (EAIT) (2012)
Google Scholar
Qazvinian, V., Hasaanabadi, L.S., Halavati, R.: Summarising text with a genetic algorithm-based sentence extraction. Int. J. Knowl. Manag. Stud. 2(4), 426–444 (2008)
Article Google Scholar
Yates, R.B., Neto, B.R.: Modern Information Retrieval. Addison Wesley (1999)
Google Scholar
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media (2009)
Google Scholar
Mitra, M., Singhal, A., Buckley, C.: Automatic text summarization by paragraph extraction. In: ACL Workshop on Intelligent and Scalable Text Summarization, Madrid Spain, pp. 39–46 (1997)
Google Scholar
Cormen, T.H., Leiserson, C.E., Rivest, R.L.: Introduction to Algorithms, 2nd edn. The MIT Press, Cambridge (2009)
MATH Google Scholar
Goldberg, D.E.: Genetic Algorithms. Addison Wiley Longman Inc. (1999)
Google Scholar
Spears, W.M., Anand, V.: A study of crossover operators in genetic programming. In: Proceedings of the 6th International Symposium on Methodologies for Intelligent Systems, ser. ISMIS 1991, Springer, London, pp. 409–418 (1991)
Google Scholar
Jizba, R.: Measuring Search Effectiveness, Precision and Recall. Creighton University (2007)
Google Scholar
Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)
Article Google Scholar
Perumal, K., Chaudhuri, B.B.: Language independent sentence extraction based text summarization. In: ICON-2011: 9th International Conference on Natural Language Processing, pp. 213–217. Macmillan, India (2011)
Google Scholar
Kupiec, J., Pedersen, J., Chen, F.: A trainable document summarizer. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ser. SIGIR 1995, pp. 68–73. ACM, New York (1995)
Google Scholar
Nielsen, F.Å.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs. In: Proceedings of the ESWC2011 Workshop on ‘Making Sense of Microposts’: Big Things Come in Small Packages 718 in CEUR Workshop Proceedings, pp. 93–98, May 2011
Google Scholar
Ramanujam, N., Kaliappan, M.: An automatic multidocument text summarization approach based on Naïve Bayesian classifier using timestamp strategy. Sci. World J. 2016 (2016). Article ID 1784827
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, Indian Institute of Technology, New Delhi, 110016, India
Niladri Chatterjee, Gautam Jain & Gurkirat Singh Bajwa

Authors

Niladri Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar
Gautam Jain
View author publications
You can also search for this author in PubMed Google Scholar
Gurkirat Singh Bajwa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Niladri Chatterjee .

Editor information

Editors and Affiliations

Faculty of Science and Engineering, Department of Information Science, Saga University, Honjo, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Supriya Kapoor
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Rahul Bhatia

A Appendix

Document No. 17 of the DUC 2002 data set containing 28 sentences.

Ideal Summary for Document No.17, containing sentences 1, 2, 3, 4, 5, 6, 7, 8, 9, 12, 14, 15, 17, 18, 23, 24.

Summary for Document No.17 generated by our algorithm, containing sentences 1, 2, 3, 4, 5, 6, 7, 8, 10, 11, 12, 16, 17, 18, 25, 26. Precision = 0.6875.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chatterjee, N., Jain, G., Bajwa, G.S. (2019). Single Document Extractive Text Summarization Using Neural Networks and Genetic Algorithm. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Computing. SAI 2018. Advances in Intelligent Systems and Computing, vol 858. Springer, Cham. https://doi.org/10.1007/978-3-030-01174-1_26

Download citation

DOI: https://doi.org/10.1007/978-3-030-01174-1_26
Published: 02 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01173-4
Online ISBN: 978-3-030-01174-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics