Keywords

1 Parliamentary FAIR Linked Data for Problem Solving

ParliamentSampo – Parliament of Finland on the Semantic Web [8] is a new Linked Open Data (LOD) serviceFootnote 1 and an in-use semantic portalFootnote 2 on top of its SPARQL endpoint. Its main knowledge graph (KG) includes nearly million speeches of the plenary sessions of the Parliament of Finland (PoF) since its foundation in 1907 [7, 13], interlinked with another KG and ontology about the 2800 Members of Parliament (MP) and other speakers, parties, and other organizations in PoF [12]. The LOD has been enriched with data from several external sources and by reasoning. Both KGs are published and are available on the Linked Data Finland [9] platformFootnote 3, including a SPARQL endpoint and other LOD services, such as content negotiation and RDF browsing. The RDF data is also openly available (CC BY 4.0) at Zenodo.org, and in Parla-CLARIN and CSV form [7]. The CSV data is updated on a daily basis.

The minutes of the plenary speeches of PoF have been available as scanned books (1907–1999), HTML pages (2000–2014), or as XML documents (2015–), but not as Findable, Accessible, Interoperable, and Re-usable FAIR dataFootnote 4. If the user knows during which parliament a speech was given, he could download the corresponding document for close-reading. But if one wants, for example, to find out the answers to the following questions, this kind of web service and research method is not a viable solution:

  1. 1.

    Question: Who has given most regular speeches and when? Answer: Mr. Veikko Vennamo, SMP Party, over 12 600 speeches in 1945–1987 in total. Their distribution can be visualized on a timeline.

  2. 2.

    Question: Who was the first to speak about “NATO” in the PoF plenary sessions? Answer: Mr. Yrjö Enne, SKDL communist party, May 27, 1959

  3. 3.

    Question: What places are most often mentioned by the current MPs of the Swedish People’s Party of Finland”? Answer: Can be visualized as a table or on heat maps and markers on maps.

  4. 4.

    How are the MPs of the current Parliament referring to each other in their speeches : Answer: Can be studied using network analysis.

ParliamentSampo makes it possible to study parliamentary discussions and networks of politicians easily using methods Digital Humanities (DH), and find answers to questions, such as those above. Furthermore, applications can be created using LOD services, such as the ParliamentSampo portal. It is based on the Sampo model [5] and was created using a new declarative version of the Sampo-UI framework [10] for user interface (UI) design.

2 Using the ParliamentSampo Portal

Based on the Sampo-UI framework, the landing page of the portal contains application perspectives through which instances of the major classes of the underlying KG can be searched [10]. In this case, there are perspectives for speeches and people. Their instances can be searched and browsed using faceted search, and after filtering out a subset of interest 1) its individual instances can be studied by looking at their instance pages by browsing or 2) the whole result set can be analyzed as a whole. In both cases, a set of tabs are available for visualizing or analyzing the instance (a speech or person) or a set of them (say the speeches of MPs belonging to a party during a certain time period).

Fig. 1.
figure 1

Using faceted search to filter out and analyze speeches about NATO.

For example, in Fig. 1, the user has selected the Speeches perspective with facets Content, Speaker, Party, (Speech) Type, and others on the left. The search result, i.e., the speeches found, be visualized on the right in five ways by selecting tabs: 1) in tabular form (by default), 2) on a timeline (this tab is selected in the figure), 3) using histograms and pie charts, 4) on a map based on places mentioned in the speeches, or 5) using a heatmap. In the figure, the user has written a query “NATO*” in the Content text facet, the speech type is set to regular speeches, and then 3622 regular speeches that mention the word “NATO” in its various inflectional forms have been filtered into the search result starting from 1959. By clicking on the pie chart visualization button on the Party facet, the distribution of NATO speeches in terms of parties is shown: the most active party with 722 speeches has been the right-wing National Coalition Party Kokoomus.

In Fig. 2 the People perspective is seen, and the user has found all 507 members of the Centre Party, the active supporter of farming in the countryside, by a selection on the Party facet. For analyzing and visualizing the result, seven tabs (e.g., table, pie chart, ages, etc.) can be used. Here the Life chart visualization is chosen where the MPs are shown as arcs from their places of birth (blue end) to places of death (red end). The width of the arc illustrates the number of MPs related to the arc. By clicking on an arc, links to the instance pages of the corresponding MPs are shown. The graph shows strong movement from the countryside to the capital area of Helsinki.

Fig. 2.
figure 2

Using the People perspective of ParliamentSampo to study the movement of 507 MPs of the Centre Party from their place of birth to place of death. (Color figure online)

3 Easy Declarative Implementation Using Sampo-UI

The UI was implemented using a new declarative version of the Sampo-UI frameworkFootnote 5. Here the UI with its components can be created with little programming by using a set of configuration files in JSON format in three main directories: 1) configs. JSON files configuring the portal and its perspectives. 2) sparql. SPARQL queries referred to in the configs files. 3) translations. Translations of things like menu items and labels for different locales. When creating a new UI configuration, existing UI components, such as those for the facets and visualization tabs, can be re-used, and the system can also be extended with new components. Sampo-UI has been found handy to use in practise, and it has been used to create some 15 portals in the Sampo seriesFootnote 6 [5] of LOD systems.

4 Related Works and Contributions

Parliamentary debate datasets have been created from the both historical and contemporary parliaments [11]. The data is typically modelled as documents using XML-based formats, such as parla-CLARIN [4] and ParlaMint [3]. Linked data has been used before for modelling and publishing speeches of the European Parliament (LinkedEP) [14], the Latvian parliament [2], the Italian ParliamentFootnote 7, and in the historical Imperial Diet of Regensburg of 1576 project [1]. The contribution of our paper is to demonstrate novel ideas in the UI design for parliamentary DH research: 1) seamless integration of faceted search with data analytic tools for DH problem solving and 2) easy implementation of the UI based on declarative configurations and re-use of existing components. Published on February 14, 2023, the portal was used by some 3000 users in ten days. More information, links, videos, and publications related to ParliamentSampo data services and the portal can be found on the project homepageFootnote 8. ParliamentSampo is based on and is part of the Finnish Linked Open Data infrastructure LODI4DHFootnote 9 [6].