A Software Architecture for Narratives

Meghini, Carlo; Bartalesi, Valentina; Metilli, Daniele; Benedetti, Filippo

doi:10.1007/978-3-319-73165-0_3

Carlo Meghini¹¹,
Valentina Bartalesi¹¹,
Daniele Metilli¹¹ &
…
Filippo Benedetti¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 806))

Included in the following conference series:

Italian Research Conference on Digital Libraries

761 Accesses
2 Citations

Abstract

The current Digital Libraries (DLs) usually return as answer of a user’s query a ranked list of the resources included in the DLs but no semantic relation among the resources are reported. Using the Semantic Web technologies it is possible to improve these search functionalities introducing narratives as new search method. As narratives we intend semantic networks of events that are linked to the objects of the DLs and are endowed with a set of semantic relations that connect an event to another. These semantic networks may help the users to obtain a more complete knowledge on the subject of their searches. In this paper, we present a software architecture for building narratives in order to introduce them in DLs. Our architecture is composed of several tools (automatic and semi-automatic tools) for creating, storing and visualizing narratives. When possible, we reused open source components already available on-line, and for the software we developed, we freely distribute it for research aims.

Access provided by CONRICYT-eBooks. Download conference paper PDF

Formal Components of Narratives

Using Formal Narratives in Digital Libraries

Representation and Management of Complex ‘Narrative’ Information

Keywords

1 Introduction

The search functionalities of the current Digital Libraries (DLs) are usually basic systems that answer to a user’s query, expressed in natural language, with a ranked list of the resources included in the DLs but no semantic relations among the returned objects are reported. The Semantic Web [3], and the Linked Open Data [11] paradigm, can overcome the limitations of these search functionalities. The long-term aim of our research is to develop and integrate in DLs a new search method, using the semantic Web technologies: the narrative. We intend narratives as semantic networks composed of events that are linked to the objects of the DL and are endowed with a set of semantic relations connecting these events, i.e. actions or occurrences taking place at a certain time at a specific location. In our vision, instead of list of objects, DLs should provide the narratives as answers of the queries, which could be useful for users in order to obtain a more complete knowledge on the subject of their searches. To reach this aim, we developed a software architecture that allows to create narratives using the Semantic Web technologies. This architecture is composed of a set of tools (automatic and semi-automatic tools) for creating, storing, querying, and visualizing narratives. The stored knowledge is formally represented following an OWL ontology for representing narratives [1] we developed, encoded in the OWL 2 DL profile [5]. In order to maximize its interoperability, our ontology was developed as an extension of the CIDOC CRM standard ontology [4].

We have created some narratives using this architecture. In particular, two biographical narratives were produced by a Digital Humanities researcher at the Italian National Research Council (CNR): (i) on the life of Dante Alighieri^{Footnote 1}, the major Italian poet of the Middle Age; (ii) on the life of the Austrian painter Gustav Klimt^{Footnote 2}. The third narrative was developed by a researcher in Computational Biology at the CNR to narrate the discoveries related to the giant squid^{Footnote 3}.

Several components of our architecture are already developed and open source, thus we reused them. For what regards the software we developed, we distribute it freely for research aims.

2 Architecture

This section describes our current architecture for the representation of narratives. Figure 1 shows the architecture, whose main components are the following:

1.
a narrative-building tool. It is used for creating, modifying or visualizing a narrative, possibly representing knowledge that has been derived by reading some texts. The user operates through the Graphical User Interface (GUI) of the narrative-building tool, by manually inserting the narrative data and, at the same time, importing resources from Wikidata^{Footnote 4}. The created narrative is stored as an intermediate JSON representation^{Footnote 5};
2.
an OWL triplifier. Once the narrative is complete, the corresponding JSON representation is given as input to the Java Triplifier. The triplifier transforms the JSON file into an OWL (Web Ontology Language) ontology encoded as an RDF graph, using the OWL API library [6]. The organization of the knowledge in the graph follows the structure defined in the ontology for narratives we developed [1];
3.
a semantic reasoner. It is used by the triplifier to infer new knowledge. The triplifier takes as input also a file with SWRL rules [7] that are used by the reasoner to support the temporal reasoning on the narrative;
4.
a triple store. The triplifier stores the resulting graph, expanded with inferences produced by both the reasoner and the SWRL rules, into a Blazegraph triple store^{Footnote 6};
5.
a visualization interface. Finally, the user can access the knowledge stored in the triple store through a Web interface. The knowledge is extracted using SPARQL queries [10] and shown using graphic libraries.

A review of approaches relevant to our study is reported in [8].

2.1 Narrative-Building Tool

In order to facilitate the creation of a narrative and its semantic representation by the narrator, we built a web-based narrative-building tool. The tool is built with HTML5^{Footnote 7} and JavaScript (ECMAScript6^{Footnote 8}), using the jQuery^{Footnote 9}, jQuery UI^{Footnote 10}, Bootstrap^{Footnote 11}, and Typeahead.js^{Footnote 12} libraries.

The main interface of the tool is based on simple drag-and-drop metaphors, allowing the user to create events and drag the appropriate entities that compose them from a list of entities (e.g. location, person, object) automatically extracted from the Wikidata knowledge base through its SPARQL endpoint. The entities are color-coded according to their class, i.e. person, organization, place, object, concept, or work. These are linked to the corresponding CRM classes by using a specific mapping we defined. The interface also allows the user to link together events by using two different semantic relations: (i) the mereological (part-of) relation, or (ii) the causal dependency relation. A view of the NBVT interface is reported in Fig. 2.

The tool is available online^{Footnote 13}, it is open-source and released under the GPLv3 license^{Footnote 14}. As the user inserts knowledge in the interface of the tool, the data is stored into a CouchDB^{Footnote 15} database, using the PouchDB^{Footnote 16} library to interface with it. This allows automatic saving of the data, revisioning, and synchronization between a local and remote database. Finally, the resulting timeline of events is exported in the JSON format.

2.2 OWL Triplifier

The knowledge exported from the tool in JSON format is subsequently imported into a Java-based triplifier. The triplifier makes use of the OWL API library to define an ontology model. Then, it loads the JSON file and converts it to an intermediate Java representation. Then the OWL API library takes as input this representation that is used for populating the model. The triplifier also imports some SWRL rules. The Semantic Web Rule Language (SWRL) is a proposed language for the Semantic Web that can be used to express rules as well as logic, combining OWL DL or OWL Lite with a subset of the Rule Markup Language. We added SWRL rules to overcome the limitations of the OWL 2 DL about: (i) the definition of a relation as simultaneously transitive and irreflexive, as in the case of the part-of and causality relations, (ii) the definition of a relation as simultaneously transitive and disjoint [9], as in the case of the temporal relation, (iii) the implication between part-of and temporal relations and between causality and temporal relations. For this reason, we use SWRL rules and OWL 2 DL axioms simultaneously. The SWRL rules were produced using a software developed by Batsakis et al. [2] for what concerns the temporal relations. The implications between part-of and temporal relations and between causality and temporal relations were defined by writing the SWRL rules manually. The SWRL rules are taken as input by the triplifier as an OWL file. The result of the process of triplifier is an OWL graph that represents the narrative, exportable in RDF/XML^{Footnote 17}, Turtle^{Footnote 18}, or several other syntaxes^{Footnote 19}.

2.3 Reasoner

At this point, a reasoning is performed on the knowledge in order to perform consistency checks and make inferences. The reasoner we adopted is Openllet^{Footnote 20} version 2.6. The main reasons for this choice are the following: (i) Openllet supports all the features of OWL 2 DL; (ii) it fully supports SWRL rules; (iii) it is Java-based and easily integrated with OWL API^{Footnote 21}; (iv) it is an open source software actively maintained. Openllet provides functionality to check consistency of ontologies, compute the classification hierarchy, explain inferences, the graph is stored into a triple store.

2.4 Triple Store

The knowledge is exported to a triple store. The triple store we chose is Blazegraph^{Footnote 22}. Blazegraph is a standards-based, high-performance, scalable, open-source graph database. Written entirely in Java, the platform supports the RDF data model and the SPARQL 1.1 family of specifications, including Query, Update, Basic Federated Query, and Service Description. The knowledge stored in Blazegraph is shown to the user through a visualization interface. We implemented SPARQL queries to retrieve this knowledge from the triple store.

2.5 Visualization Interface

First of all, in order to give a complete overview of the narrative, the events were placed on a timeline. We used TimelineJS library^{Footnote 23} for the implementation. For each event on the timeline, the more meaningful information is reported, i.e. title, date, primary sources, related digital objects, related images. Events occurred at the same time are allowed and visualised on a timeline. Figure 3 shows an event of the timeline of Dante Alighieri’s life.

Another requirement for the tool is the visualization of the entities that compose each event. To this aim, a SPARQL query to get this information from the knowledge base was implemented. This query retrieves, for each event title, the names and IRIs of the corresponding entities. The vis.js^{Footnote 24} JavaScript library was used to implement the visualization. One of the most important requirements for a scholar who studies historical events, is the knowledge of their primary sources. For each event of the narrative, the tool allows to visualize the primary sources and in particular the title and the author of a primary source, the textual fragment of the primary source that describes the event, the reference of the textual fragment. This information is visualized in tabular format. Finally, the user has the possibility to visualize all events that occurred in a specified period of time. Upon specifying the desired period, the user can freely insert the dates using a widget to select a full date or the year only. The results of the query are shown in form of table, where for each event its dates are shown.

It is possible to explore the visualization interface on-line^{Footnote 25}, browsing the three narratives that are available on our Web site^{Footnote 26}.

3 Conclusions and Future Work

In this paper we have presented a software architecture for building narratives using the Semantic Web technologies. In this context, we intend narratives as semantic networks of events linked to each other and to digital objects by semantic relations. In order to represent the knowledge we have developed an OWL ontology for narratives as an extension of the CIDOC CRM standard vocabulary.

Where possible, to develop this architecture, we have reused software open source already available. For what regards the software we developed, it is freely distributed, released under the GPLv3 license.

The long-term goal of our study is introducing the narrative as new first-class search functionality of digital libraries. As output of a query, this new search functionality should not only return a list of objects but it should also present one or more narratives on the topic of the search. The architecture presented in this paper would be used to create narratives that later could be imported and shown in the DL interfaces.

Notes

1.
https://dlnarratives.eu/timeline/dante.html.
2.
https://dlnarratives.eu/timeline/klimt.html.
3.
https://dlnarratives.eu/timeline/squid.html.
4.
https://www.wikidata.org.
5.
http://json.org/.
6.
https://www.blazegraph.com/.
7.
https://www.w3.org/TR/html5/.
8.
http://www.ecma-international.org/ecma-262/6.0/.
9.
https://jquery.com.
10.
https://jqueryui.com.
11.
https://getbootstrap.com.
12.
https://typeahead.js.org.
13.
https://dlnarratives.eu/tool.html.
14.
https://www.gnu.org/licenses/gpl-3.0.en.html.
15.
http://couchdb.apache.org.
16.
https://pouchdb.com.
17.
https://jena.apache.org/documentation/io/rdf-output.html.
18.
https://www.w3.org/TR/turtle/.
19.
https://jena.apache.org/documentation/io/rdf-output.html.
20.
https://github.com/Galigator/openllet.
21.
https://owlcs.github.io/owlapi/.
22.
https://www.blazegraph.com/.
23.
https://timeline.knightlab.com/.
24.
http://visjs.org.
25.
https://dlnarratives.eu/narratives.html.
26.
https://dlnarratives.eu.

References

Bartalesi, V., Meghini, C., Metilli, D.: Steps towards a formal ontology of narratives based on narratology. In: OASIcs-OpenAccess Series in Informatics, vol. 53. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik (2016)
Google Scholar
Batsakis, S., Petrakis, E., Tachmazidis, I., Antoniou, G.: Temporal representation and reasoning in OWL 2. Semant. Web 8(6), 981–1000 (2016)
Article Google Scholar
Berners-Lee, T., Hendler, J., Lassila, O., et al.: The semantic Web. Sci. Am. 284(5), 28–37 (2001)
Article Google Scholar
Doerr, M.: The CIDOC conceptual reference module: an ontological approach to semantic interoperability of metadata. AI Mag. 24(3), 75 (2003)
Google Scholar
Hitzler, P., Krötzsch, M., Parsia, B., Patel-Schneider, P.F., Rudolph, S.: OWL 2 Web ontology language primer. W3C Recomm. 27(1), 123 (2009)
Google Scholar
Horridge, M., Bechhofer, S.: The OWL API: a Java API for working with OWL 2 ontologies. In: Proceedings of the 6th International Conference on OWL: Experiences and Directions, Aachen, Germany, OWLED 2009, vol. 529, pp. 49–58. CEUR-WS.org (2009). http://dl.acm.org/citation.cfm?id=2890046.2890052
Horrocks, I., Patel-Schneider, P.F., Boley, H., Tabet, S., Grosof, B., Dean, M., et al.: SWRL: a semantic Web rule language combining OWL and RuleML (2004)
Google Scholar
Meghini, C., Bartalesi, V., Metilli, D.: Using formal narratives in digital libraries. In: Grana, C., Baraldi, L. (eds.) IRCDL 2017. CCIS, vol. 733, pp. 83–94. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68130-6_7
Chapter Google Scholar
Motik, B., Patel-Schneider, P.F., Parsia, B., Bock, C., Fokoue, A., Haase, P., Hoekstra, R., Horrocks, I., Ruttenberg, A., Sattler, U., et al.: OWL 2 Web ontology language: structural specification and functional-style syntax. W3C Recomm. 27(65), 159 (2009)
Google Scholar
Prud’hommeaux, E., Seaborne, A.: SPARQL Query Language for RDF. W3C Recomm. (2008). https://www.w3.org/TR/rdf-sparql-query/
Yu, L.: Linked open data. In: Yu, L. (ed.) A Developers Guide to the Semantic Web, pp. 409–466. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-15970-1_11
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Istituto di Scienza e Tecnologie dell’Informazione “Alessandro Faedo” – CNR, Pisa, Italy
Carlo Meghini, Valentina Bartalesi, Daniele Metilli & Filippo Benedetti

Authors

Carlo Meghini
View author publications
You can also search for this author in PubMed Google Scholar
Valentina Bartalesi
View author publications
You can also search for this author in PubMed Google Scholar
Daniele Metilli
View author publications
You can also search for this author in PubMed Google Scholar
Filippo Benedetti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Valentina Bartalesi .

Editor information

Editors and Affiliations

University of Udine, Udine, Italy
Giuseppe Serra
University of Udine, Udine, Italy
Carlo Tasso

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Meghini, C., Bartalesi, V., Metilli, D., Benedetti, F. (2018). A Software Architecture for Narratives. In: Serra, G., Tasso, C. (eds) Digital Libraries and Multimedia Archives. IRCDL 2018. Communications in Computer and Information Science, vol 806. Springer, Cham. https://doi.org/10.1007/978-3-319-73165-0_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-73165-0_3
Published: 21 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73164-3
Online ISBN: 978-3-319-73165-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Software Architecture for Narratives

Abstract