Abstract
Summarization techniques are becoming an essential part of everyday life, basically because summaries allow users to spend less time making effective access to the desired information. In this paper, we present a general framework for retrieving relevant information from news articles and a novel summarization algorithm based on a deep semantic analysis of texts. In particular, a set of triples (subject, predicate, object) is extracted from each document and it is then used to build a summary through an unsupervised clustering algorithm exploiting the notion of semantic similarity. Finally, we leverage the centroids of clusters to determine the most significant summary sentences using some heuristics. Several experiments are carried out using the standard DUC methodology and ROUGE software and show how the proposed method outperforms several summarizer systems in terms of recall and readability.
Access provided by CONRICYT-eBooks. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Semantic Similarity
- Resource Description Framework
- Name Entity Recognition
- Heterogeneous Source
- Terror Attack
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Leonard Barolli and Fatos Xhafa. Jxta-overlay: A p2p platform for distributed, collaborative, and ubiquitous computing. Industrial Electronics, IEEE Transactions on, 58(6):2163–2172, 2011.
Fatos Xhafa, Raul Fernandez, Thanasis Daradoumis, Leonard Barolli, and Santi Caballé. Improvement of jxta protocols for supporting reliable distributed applications in p2p systems. In Network-Based Information Systems, pages 345–354. Springer, 2007.
Leonard Barolli, Fatos Xhafa, Arjan Durresi, and Giuseppe De Marco. M3ps: a jxtabased multi-platform p2p system and its web application tools. International Journal of Web Information Systems, 2(3/4):187–196, 2007.
Mario Sicuranza, Angelo Esposito, and Mario Ciampi. An access control model to minimize the data exchange in the information retrieval. Journal of Ambient Intelligence and Humanized Computing, pages 1–12, 2015.
Aniello Minutolo, Massimo Esposito, and Giuseppe De Pietro. A fuzzy framework for encoding uncertainty in clinical decision-making. Knowledge-Based Systems, 98:95–116, 2016.
Aniello Minutolo, Massimo Esposito, and Giuseppe De Pietro. Design and validation of a light-weight reasoning system to support remote health monitoring applications. Engineering Applications of Artificial Intelligence, 41:232–248, 2015.
F. Amato, A.R. Fasolino, A. Mazzeo, V. Moscato, A. Picariello, S. Romano, and P. Tramontana. Ensuring semantic interoperability for e-health applications. pages 315–320, 2011.
F. Amato, A. Mazzeo, V. Moscato, and A. Picariello. A framework for semantic interoperability over the cloud. pages 1259–1264, 2013.
Tim French, Nik Bessis, Fatos Xhafa, and Carsten Maple. Towards a corporate governance trust agent scoring model for collaborative virtual organisations. International Journal of Grid and Utility Computing, 2(2):98–108, 2011.
Valentin Cristea, F. Pop, C. Stratan, A. Costan, C. Leordeanu, and E. Tirsa. A dependability layer for large-scale distributed systems. International Journal of Grid and Utility Computing, 2(2):109–118, 2011.
Soichi Sawamura, Admir Barolli, Ailixier Aikebaier, Makoto Takizawa, and Tomoya Enokido. Design and evaluation of algorithms for obtaining objective trustworthiness on acquaintances in p2p overlay networks. International Journal of Grid and Utility Computing, 2(3):196–203, 2011.
Evjola Spaho, Gjergji Mino, Leonard Barolli, and Fatos Xhafa. Goodput and pdr analysis of aodv, olsr and dymo protocols for vehicular networks using cavenet. International Journal of Grid and Utility Computing, 2(2):130–138, 2011.
H. Takamura and M. Okumura. Text summarization model based on maximum coverage problem and its variant. In Proceedings of the 12th Conference of the European Chapter of the AC, pages 781–789, 2009.
Dan Gillick and Benoit Favre. A scalable global model for summarization. In Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing, ILP ’09, pages 10–18. Association for Computational Linguistics, 2009.
Salvatore Cuomo, Pasquale De Michele, Ardelio Galletti, and Giovanni Ponti. Intelligent Interactive Multimedia Systems and Services 2016, volume 55 of Smart Innovation, Systems and Technologies, chapter Influence of Some Parameters on Visiting Style Classification in a Cultural Heritage Case Study, pages 567–576. Springer International Publishing, 2016.
Salvatore Cuomo, Pasquale De Michele, Ardelio Galletti, and Giovanni Ponti. Data Management Technologies and Applications: 4th International Conference, DATA 2015, Colmar, France, July 20-22, 2015, Revised Selected Papers, volume 584 of Communications in Computer and Information Science, chapter Classify Visitor Behaviours in a Cultural Heritage Exhibition, pages 17–28. Springer International Publishing, 2016.
Oren Etzioni, Anthony Fader, Janara Christensen, Stephen Soderland, and Mausam Mausam. Open information extraction: The second generation. In IJCAI, volume 11, pages 3–10, 2011.
ZhibiaoWu and Martha Palmer. Verb semantics and lexical selection. In 32nd. Annual Meeting of the Association for Computational Linguistics, pages 133 –138, 1994.
A. D’Acierno, V. Moscato, F. Persia, A. Picariello, and A. Penta. iwin: A summarizer system based on a semantic analysis of web documents. pages 162–169, 2012.
G. Sannino, I. De Falco, and G. De Pietro. An automatic rules extraction approach to support osa events detection in an mhealth system. IEEE Journal of Biomedical and Health Informatics, 18(5):1518–1524, 2014.
Angelo Chianese, Fiammetta Marulli, Francesco Piccialli, Paolo Benedusi, and Jai E Jung. An associative engines based approach supporting collaborative analytics in the internet of cultural things. Future Generation Computer Systems, 2016.
A. Chianese, F. Piccialli, and I. Valente. Smart environments and cultural heritage: a novel approach to create intelligent cultural spaces. Journal of Location Based Services, 9:209–234, 2015.
Giuseppe Caggianese, Luigi Gallo, and Giuseppe De Pietro. Design and preliminary evaluation of a touchless interface for manipulating virtual heritage artefacts. In Signal-Image Technology and Internet-Based Systems (SITIS), 2014 Tenth International Conference on, pages 493–500. IEEE, 2014.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Amato, F., d’Acierno, A., Colace, F., Moscato, V., Penta, A., Picariello, A. (2017). Semantic Summarization of News from Heterogeneous Sources. In: Xhafa, F., Barolli, L., Amato, F. (eds) Advances on P2P, Parallel, Grid, Cloud and Internet Computing. 3PGCIC 2016. Lecture Notes on Data Engineering and Communications Technologies, vol 1. Springer, Cham. https://doi.org/10.1007/978-3-319-49109-7_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-49109-7_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49108-0
Online ISBN: 978-3-319-49109-7
eBook Packages: EngineeringEngineering (R0)