Skip to main content

Extractive Summarization of Text Using Weighted Average of Feature Scores

  • Conference paper
  • First Online:
Machine Intelligence and Smart Systems

Part of the book series: Algorithms for Intelligent Systems ((AIS))

  • 550 Accesses

Abstract

In the era of information overload, need for applications to comb through huge number of documents to extract important information is increasing. This information is helpful in assessing whether or not a document is relevant. Automatic text summarization is one of the solutions to the problem of extracting useful information from huge collection of textual data. A summarizer converts a lengthy document into a short summary by extracting important sentences from it without losing the crucial information. A summarizer can be either abstractive or extractive. An extractive summarizer relies on the statistical features of the input text to create a summary by merely copying the important sentences, whereas an abstractive summarizer tries to understand the context of the document and generates a summary which may contain new sentences not part of the original document. This paper focuses on extractive summarization technique. An approach for generating short and precise summary from a single document using weighted average of feature scores has been proposed. Sentences are ranked based on their scores, and top 40% sentences are selected to form the summary. Experiments were carried out on 250 documents from BBC News summary dataset. The results were compared with existing online summarizers and the proposed summarizer gave better average recall, precision and F-measure values.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Luhn HP (1958) The automatic creation of literature abstracts. IBM J Res Develop

    Google Scholar 

  2. Baxendale P (1958) Machine-made index for technical literature—an experiment. IBM J Res Develop 2(4):354–361

    Article  Google Scholar 

  3. Edmundson HP (1969) New methods in automatic extracting. J ACM 16(2):264–285

    Article  Google Scholar 

  4. Jones KS (1972) A statistical interpretation of term specificity and its application in retrieval. J Document 28(1):11–21

    Article  Google Scholar 

  5. Lee GG, Seo J, Lee S, Jung H (2001) SiteQ: engineering high performance QA system using Lexico-semantic pattern matching and shallow NLP. In: Text retrieval conference (TREC)

    Google Scholar 

  6. Hu M, Lim EP, Sun A (2007) Comment-oriented blog summarization by sentence extraction. In: Proceedings of the 16th ACM conference on information and knowledge management, CIKM

    Google Scholar 

  7. Seki Y (2003) Sentence extraction by TF-IDF and position weighting from newspaper articles. In: Proceedings of the third NTCIR workshop

    Google Scholar 

  8. Babar SA, Patil PD (2014) Improving performance of text summarization. In: International conference on information and communication technologies

    Google Scholar 

  9. Natural Language Toolkit. https://www.nltk.org/

  10. BBC Datasets. https://mlg.ucd.ie/datasets/bbc.html

  11. Lin C-Y (2004) ROUGE: a package for automatic evaluation of summaries. In: Proceedings of the ACL workshop: text summarization braches out 2004

    Google Scholar 

  12. Josef Steinberger KJ (2009) Evaluation measures for text summarization. Comput Inform 28

    Google Scholar 

  13. TextTeaser. https://pypi.org/project/textteaser/

  14. Sumy: simple library and command line utility for extracting summary. https://pypi.org/project/sumy/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shatajbegum Nadaf .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nadaf, S., Hemadri, V.B. (2021). Extractive Summarization of Text Using Weighted Average of Feature Scores. In: Agrawal, S., Kumar Gupta, K., H. Chan, J., Agrawal, J., Gupta, M. (eds) Machine Intelligence and Smart Systems . Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-33-4893-6_20

Download citation

Publish with us

Policies and ethics