Keywords

1 Introduction

The usage of tag clouds to visualize textual data is a relatively novel technique, which was rarely applied in the past century. In 1976, Stanley Milgram was one of the first scholars who generated a tag cloud to illustrate a mental map of Paris, for which he conducted a psychological study with inhabitants of Paris, aiming to analyze their mental representation of the city [1]. In 1992, a German edition of “Mille Plateaux”, written by the French philosopher Gilles Deleuze, was published with a tag cloud printed on the cover to summarize the book’s content [2]. This idea to present a visual summary of textual data can be seen as the primary purpose of tag clouds [3]. But the popularity of tag clouds nowadays is attributable to a frequent usage in the social web community in the 2000s as overviews of website contents. Although there are known theoretical problems concerning the design of tag clouds [4], they are generally seen as a popular social component perceived as being fun [5]. With the simple idea to encode the frequency of terms to a given topic, tag clouds are intuitive, comprehensible visualizations, which are widely used metaphors (1) to display summaries of textual data, (2) to support analytical tasks such as the examination of text collections, or even (3) to be used as interfaces for navigation purposes on databases.

In the recent years, various algorithms that compute effective tag cloud layouts in an informative and readable manner have been developed. One of the most popular techniques is Wordle [6], which computes compact, intuitive tag clouds and can be generated on the fly using a web-based interface.Footnote 1 Although the produced results are very aesthetic, the different used colors do not transfer information and the final arrangement of tags depends only on the scale, and not on the content of tags or potential relationships among them. Some approaches attend to the matter of visualizing more information than the frequency of terms with tag clouds – most often to compare textual summaries of different data facets.

In this paper, we present the tag cloud design TagSpheres, which endeavors to effectively visualize hierarchies in textual summaries. The motivation arose from research on philology. Humanities scholars wanted to analyze the clause functions of an ancient term’s co-occurrences. Querying the large database, the scholars often face numerous results in the form of text passages. When only plain lists are provided to interact with the results, the discovery of significant text passages and the analysis of the contexts in which the chosen term was used becomes laborious. To support this task, we provide summaries of text passages in the form of interactive tag clouds that group terms in accordance to their distance to the search term. So, the humanities scholar gets an overview, and she is able to retrieve text passages of interest on demand.

We designed TagSpheres in a way that various types of text hierarchies can be visualized in an intuitive, comprehensible manner. To emphasize the wide applicability of TagSpheres, we list several examples from the digital humanities, sports, aviation and natural disaster management. That TagSpheres can be further used to generate layouts for tree structures is outlined in Sect. 5.

2 Related Work

Although tag clouds rather became popular in the social media, research in visualization attended to the matter of developing various layout techniques in the last years. A basic tag cloud layout is a simple list of words placed on multiple lines [7]. In such a list, tags are typically ordered by their importance to the observed issue, which is encoded by font size [8]. An alphabetical order is also often used, but a study revealed that this order is not obvious for the observer [5]. Later, more sophisticated tag cloud layout approaches that rather emphasize aesthetics than meaningful orderings were developed. A representative technique is Wordle [6, 9], which produces compact aesthetic layouts with tags in different colors and orientations, but both features do not transfer any additional information. A Wordle showing the most important terms in Edgar Allan Poe’s The Raven is given in Fig. 1(a).

Fig. 1.
figure 1

Tag cloud visualizations.

Various approaches highlight relationships among tags by forming visual groups. In thematically clustered or semantic tag clouds, the detection of tags belonging to the same topic is supported by placing these tags closely [10]. Traditional, semantic word lists place clustered tags successively [11]. More sophisticated layout methods often use force directed approaches with semantically close terms attracting each other [12,13,14]. After force directed tag placement, tag cloud layouts can be compressed by removing occurring whitespaces [15].

Some methods generate individual tag clouds for each group of related tags, and combine the resultant multiple tag clouds to a single visual unity afterwards. An example is the Star Forest method [16], which applies a force directed method to pack multiple tag clouds. Other approaches use predefined tag cloud containers, e.g., user-defined polygonal spaces in the plane [17], polygonal shapes of countries [18], or Voronoi tesselations [19]. Newsmap uses a treemap layout [20] to group newspaper headlines of the same category in blocks [21]. Morphable Word Clouds morph the shapes of tag cloud containers in order to visualize temporal variance in text summaries [22]. For the comparison of the tags of various text documents, a ConcentriCloud divides an elliptical plane into sectors that list shared tags of several subsets of the underlying texts [23]. Due to the rather independent computation of individual tag clouds – which often leads to large whitespaces in the final composition step – the above mentioned methods can be seen as sophisticated small multiples. A rather traditional small multiples approach is Words Storms [24] that supports the visual comparison of textual summaries of documents.

Tag clouds also have been used to visualize trends. Parallel Tag Clouds generate alphabetically ordered tag lists as columns for a number of time slices and highlight the temporal evolution of a tag placed in various columns on mouse interaction [25]. In contrast, SparkClouds attach a graph showing the tag’s evolution over time [26]. Other approaches overlay time graphs with tags characteristic for certain time ranges [27].

Only few approaches generate multifaceted tag cloud layouts in a single, continuous flow that includes the positioning of all tags belonging to various groups. RadCloud visualizes tags belonging to various groups within a shared elliptical area [28]. In Compare Clouds, tags of two media frames (MSM, Blogs) are comparatively visualized in a single cloud [29]. To support the comparative analysis of multiple tag groups, TagPies are arranged in a pie chart manner [30]. An example showing the comparative visualization of the co-occurrences of Latin terms is shown in Fig. 1(b).

Table 1. Characteristics of usage scenarios for TagSpheres.

Although techniques like TagPies or Parallel Tag Clouds are capable of visualizing sequences of tag groups, none of the mentioned approaches endeavors to visually encode generic hierarchical information intuitively in a single, compact, aesthetic tag cloud. TagSpheres – presented in this paper – are designed to fill this gap.

3 Designing TagSpheres

The central idea of TagSpheres is the visualization of textual summaries that comprise hierarchical information. This paper provides various usage scenarios that exemplify the existence of hierarchies in textual data (see Sect. 4). An overview of the characteristics of these examples is given in Table 1.

Given n hierarchy levels \(H_1,\dots ,H_n\), the top hierarchy level \(H_1\) contains tags representing the focus of interest of a usage scenario. All other tags are divided into \(n-1\) groups in dependency on their hierarchical distance according to the observed topic, or to the tags on \(H_1\). Each tag t in TagSpheres has a weight w(t) reflecting its importance, and an optional predecessor tag p(t) representing a relationship to another tag that was placed before t and usually belongs to a higher hierarchy level. In dependency on the observed topic, it might be necessary to place the same tag on several hierarchy levels to encode the change of a tag’s importance among hierarchies. In such cases, predecessor tags help to visually link these tags.

3.1 Design Decisions

When designing TagSpheres, we use the following, well-established design features for tag clouds:

  • Font Size: Evaluated as the most powerful property [31], font size encodes the weight w(t) of a tag.

  • Orientation: As rotated tags are perceived as “unstructured, unattractive, and hardly readable” [32], we do not rotate tags to keep the layout easily explorable.

  • Color: Being the best choice to distinguish categories [32], various colors are assigned to tags belonging to different hierarchy levels. As TagSpheres encode the distance to a given topic, the usage of a categorial color map is inappropriate. Unfortunately, suitable sequential color maps as provided by the ColorBrewer [33] produce less distinctive colors even for a small number of hierarchy levels, so that adjacent tags belonging to different hierarchy levels are hard to classify. Following the suggestions given by Ware [34], we defined a divergent cold-hot color map using red for the first hierarchy level and blue for tags belonging to the last hierarchy level n. To avoid uneven visual attraction of tags, we only chose saturated colors that are in contrast to the white background. Example color maps for up to eight hierarchy levels are shown in Fig. 2(a).

Fig. 2.
figure 2

TagSpheres layout algorithm details.

Fig. 3.
figure 3

Determining tag positions using an Archimedean spiral.

3.2 Layout Algorithm

In preparation, the tags are sorted by increasing hierarchy level, so that all tags within the same hierarchical distance to \(H_1\) are placed successively. The tags of each hierarchy level are ordered by decreasing weight to ensure that important tags are circularly well distributed.

To avoid large whitespaces, a problem addressed by Seifert et al. [35], our method follows the idea of the Wordle algorithm [6] – permitting overlapping tag bounding boxes if the tags’ letters do not occlude – to determine the positions of tags. So, we obtain compact, uniformly looking tag clouds for the underlying hierarchical, textual data. To ensure well readable tag clouds, we use a minimal padding between letters of different tags.

As shown in Fig. 2(b), we aim to visually compose tags of the same hierarchy level in the form of spheres around the tag cloud origin at (0, 0). Initially, we iteratively determine positions for the tags of \(H_1\) in the central sphere using an Archimedean spiral originating from (0, 0). An example is given in Fig. 3(a). For each tag t of the remaining hierarchy levels \(H_2,\dots ,H_n\), we also use (0, 0) as spiral origin, if p(t) is not provided (see Fig. 3(b)). If p(t) is defined, we use the predecessor’s position as spiral origin (see Fig. 3(c)). As a consequence, hierarchically related tags are placed closely and visually compose in the form of rays originating from (0, 0) as shown in Fig. 6(a). In contrast to other spiral based tag cloud algorithms, we avoid to cover whitespaces with tags of hierarchy level \(H_i\) within spheres of already processed hierarchy levels \(H_1,\dots ,H_{i-1}\). Dependent on the quadrant in the plane, in which a tag shall be placed, we search for already placed tags intersecting two vectors originating from the dedicated position as illustrated in Fig. 2(c). If no intersections are found, we place the tag. This approach coheres all tags of a hierarchy level as a visual unity outside the inner bounds of the previously processed hierarchy levels’ spheres.

3.3 Interactive Design

Implemented as an open source JavaScript library,Footnote 2 TagSpheres can be dynamically embedded into web-based applications. With mouse interaction, we enable the user to detect hierarchically related tags quickly. Thereby, we distinguish between strongly and weakly related tags, which are defined in dependency on the underlying usage scenario (see Table 1). Related tags are shown on mouseover (see Fig. 4). For strongly related tags we use a black font on transparent backgrounds having the hierarchy level’s assigned color. In contrast, weakly related tags retain their saturated font color, but gray, transparent backgrounds indicate relationships.

TagSpheres provide a configurable tooltip displayed when hovering or clicking a tag to be used, e.g., to list all related tags and their weights. The mouse click function can be used for displaying additional information. e.g., to link to external sources, or to show text passages containing the chosen tag.

3.4 Limitations

The main objective of the presented layout algorithm is to combine a hierarchical information of textual data with the aesthetics of tag clouds. In contrast to the usual approach to always initialize an Archimedean spiral at the tag cloud origin (0, 0) when determining the position of a tag, the usage of predecessor tags as spiral origins slightly affects the uniform appearance of the result in some cases (e.g., see Fig. 7). Occasionally, little holes occur, and – at the expense of visualizing the hierarchical structure of the underlying data – the tag cloud boundaries get distorted.

The proposed hot-cold color map used to visually convey hierarchical distance generates well distinguishable colors when the number of hierarchy levels is small. For a larger number of hierarchies as displayed in Fig. 6(c) or Fig. 10, closely positioned tags of different levels may become visually indistinct, especially when only few tags belong to a certain level.

The current TagSpheres design does not take the distribution of tags throughout different hierarchy levels into account. In use cases with a steadily increasing or decreasing number of tags per hierarchy level it gets possible that a considerable proportion of the color maps’ bandwidth is used for a comparatively small portion of tags. An assignment of colors taking the density distribution of the tags’ weights into account could overcome this issue.

4 Use Cases

TagSpheres are applicable whenever statistics of unstructured text shall be visualized in the form of a tag cloud and a decent hierarchy among the tags exists. This section illustrates usage scenarios of TagSpheres for text-based data from four different domains: digital humanities, sports, aviation and natural disaster management.

4.1 Digital Humanities Scenario

Within the digital humanities project eXChange,Footnote 3 historians and classical philologists work with a database containing a large amount of digitized historical texts in Latin. Usually, humanities scholars pose keyword based search queries and often receive numerous results, which are hard to revise individually. As a consequence, the generation of valuable hypotheses is a laborious, time-consuming process. To facilitate the humanities scholars’ workflows, we develop visual interfaces that attempt to steer the analysis of search results into promising directions.

TagPies – also developed within the eXChange project – are tag clouds arranged in a pie chart manner that support the comparison of multiple search query results [30]. Using a TagPie, humanities scholars analyze contextual similarities and differences of the observed terms – an example is given in Fig. 1(b). Whereas the tags of the same groups are placed in the same circular sectors in TagPies to support their comparative analysis, the intention of TagSpheres is the visualization of hierarchical information. This supports approaching a further research interest of the humanities scholars: the analysis and classification of a term’s co-occurrences according to their clause functions. For this purpose, the scholars require four-level TagSpheres displaying the following tags:

figure a

The font size of T on level \(H_1\) encodes how frequent the search term occurs in the underlying text corpus; the font sizes of all other terms reflect their number of co-occurrences with T in dependency on the corresponding distance. On \(H_4\), font sizes are normalized in relation to the distance range \(m-2\). A tag on hierarchy level \(H_i\) receives a predecessor tag if the corresponding term occurs on one of the previous layers \(H_{i-1},\dots ,H_1\).

Two use cases provided by the humanities scholars involved in the eXChange project shall illustrate the utility of TagSpheres to support the classification of a term’s co-occurrences by their clause functions.

Fig. 4.
figure 4

The analysis of co-occurrences of the Latin term morbo (disease) on word distance.

The first use case (see Fig. 4) outlines the analysis of the co-occurrences of the Latin term morbo (disease). The humanities scholar discovered and classified terms in similar relationships to the given topic. In large distances, the humanities scholar found objects in form of affected parts of the body, e.g., head (caput), soul (animo) and limbs (membrorum), affected persons, e.g., son (filius), woman (mulier) and king (rex), and related places, e.g., Rome (romam), church (ecclesia) and villa. Closer to morbo (most often with distance 1 or 2), typical attributes and predicates can be found. Whereas attributes describe the type or intensity of the disease, e.g., pestilential (pestifero), heavy (gravi), deadly (exitiali) and acute (acuto), the occurring predicates illustrate the disease’s progress, e.g., seize (correptus), dissappear (periit) and worsening (ingravescente). Adjacent to morbo, specific terms for “moral” diseases, e.g., greediness (avaritiae), arrogance (superbiae) and lust (concupiscentiae), and actual diseases like jaundice ([morbo] regio), leprosy (leprae) and two common names for epilepsy ([morbo] comitiali, [morbo] sacro) occur.

Fig. 5.
figure 5

The analysis of co-occurrences of the Latin term vino (vine) on word distance.

The second use case (see Fig. 5) illustrates the exploration of the co-occurrences of the Latin term vino (vine). Like in the previous example, attributes of vine like precious (pretioso), sweet (dulci), new (novo), good (bono), white (albo) or “the best” (optimo) co-occur next to vino. Also closely positioned, usually with distance 1 or 2, are verbs describing (1) what people do with vine, e.g., drink (postati, bibitur), mix (miscetur) or swill (lavabit), and (2) what vine does to people, e.g., inebriate (inebriatus, crapulatus), rave (furere) or degenerate (degenerantes). In larger distances, subjects associated with vine can be found, e.g., people (homines, populus), saints (sancti), lord (dominus) or drunks (ebrii). Rather unexpected was the dominant usage of vino in Christian texts – visible through co-occurring terms like bread (panem), blood (sanguis), Body of Christ (corpus, christi) or sacrifice (sacrificium) – in contrast to a less frequent usage in classical texts. But, the humanities scholar stated that the visualization vividly reflects the classical tricolon “vinofrumento (grain) – oleo (oil)” as a list of important groceries in antiquity for soldiers to survive.

In this usage scenario, the interaction capabilities of TagSpheres are tailored according to the needs of the humanities scholars. Hovering a tag opens a tooltip showing the term’s number of occurrences on all hierarchy levels as strongly related tags. Additionally, variant spellings or cases of the term are listed with their corresponding frequencies as weakly related tags to support the analysis process. An important requirement for the humanities scholars was the discovery of potentially interesting text passages, but they desired a straightforward access to the underlying texts in general. This so-called close reading is often reported as an important component when designing visualizations for humanities scholars [36]. TagSpheres support close reading by clicking a tag, which displays the corresponding text passages containing the search term and the clicked term with the chosen distance. For the first use case (Fig. 4, bottom right), text passages containing the terms morbo and comitiali are shown. In the second use case (Fig. 4, bottom right), we see text passages containing vino and frumento.

4.2 Championship Performances

This scenario illustrates how TagSpheres can be used to comparatively visualize performances in championships. Therefore, we processed a dataset containing the results of all national teams ever qualified for the FIFA World Cup. We receive the following six-level hierarchy:

figure b

The nations’ names are used as tags and font size encodes how often a national team partook a championship round without reaching the next level. Therefore, most nations occur on various hierarchy levels. If a tag t for a nation to be placed on \(H_i\) was already placed at a higher hierarchy level \(H_{i-1},\dots ,H_1\), we use the corresponding tag as predecessor p(t).

Figure 6(a) shows the resultant TagSpheres. Especially this scenario illustrates the benefit of using the positions of predecessor tags as spiral origins for successor tags. In most cases, the various tags of a nation are closely positioned. Hovering a tag displays the all-time performance of the corresponding national team for all championship rounds in a tooltip. Expectedly, Brazil and Germany achieved very good results, especially in the last championship rounds. In contrast, Italy was often knocked out in the first round, but in case of reaching the semifinal (8x), Italy often became FIFA World Champion (4x). Few nations have a 100% success rate in the group stage. Qualified three times for the FIFA World Cup, Senegal always reached the quarterfinals. Most nations, e.g., Sweden, show the expected pattern “the higher the championship round, the smaller the number of appearances”.

Fig. 6.
figure 6

Visualizing championship performances with TagSpheres.

Analogously to the FIFA World Cup results, Fig. 6(b) illustrates the performances of all national teams ever participated the UEFA European Championship – pointing out Germany and Spain as most successful nations. Another example is given in Fig. 6(c) that illustrates the success of football clubs ever played in England’s first league. Here, we use the average rank at the end of the seasons to cluster 68 clubs into eight hierarchy levels, and font size encodes the number of appearances.

Fig. 7.
figure 7

Direct flight connections from airports in Sydney, Rome, Frankfurt and Cagliari.

4.3 Airport Connectivity

To analyze the federal, continental and worldwide connectivity of airports, we derived a dataset from the OpenFlights database,Footnote 4 which provides a list of direct flight connections between around 3,200 airports worldwide. With the selected departure airport d (or city) on \(H_1\), all other airports (or cities) reachable with a non-stop flight cluster into three further hierarchy levels:

figure c

As tags we chose either airport names, the provided IATA codes,Footnote 5 or the corresponding city names. In this scenario, font size encodes the inverse geographical distance between the departure airport \(d=\{lat_d,lon_d\}\) and an arrival airport \(a=\{lat_a,lon_a\}\). To keep the deviation to the actual distance as small as possible, we apply the great circle distance G [37], defined as

$$G = 6378 \cdot \arccos \Big ( \sin (lat_d) \cdot \sin (lat_a) + \cos (lat_d) \cdot \cos (lat_a) \cdot \cos (lon_d - lon_a) \Big ).$$

Predecessor tags are used to place airports or cities of the same country or continent closely. For a tag t to be placed on \(H_3\), we choose the first placed tag with the same associated country as predecessor, if existent; for \(H_4\), we choose the first placed tag with the same associated continent. Thus, a predecessor tag p(t) in this scenario always belongs to the same hierarchy level as t.

Figure 7 shows TagSpheres for non-stop flights from various airports or cities. All examples show that airports/cities of the same countries/continents are placed closely in clusters. For Sydney, no tags are placed on \(H_3\), and for Cagliari, no flight connections to airports outside Europe exist. When the user hovers a tag, the corresponding connection and the travel distance are shown in a tooltip. Clicking a tag redirects to Google FlightsFootnote 6 listing possible flight connections.

Fig. 8.
figure 8

World Risk Index 2015 visualized with TagSpheres.

4.4 World Risk Index 2015

The World Risk ReportFootnote 7 analyzes disaster risks of countries. Thereby, the exposure of a country towards natural hazards (e.g., earthquakes, tsunamis, cyclones or floods) is compared to the country’s vulnerability, which depends on living conditions and economic circumstances. Each country in the database receives a disaster risk percentage – Vanatu being the country with the highest risk (36.72%) and Qatar the country with the lowest risk (0.08%). All countries are clustered into five classes from very high to very low disaster risk, which are used to generate a thematic mapFootnote 8 with countries colored according to these classes. The World Risk Index 2015 visualized with TagSpheres (see Fig. 8) uses the disaster risk classes as hierarchy levels:

figure d

In contrast to a thematic map, we highlight the actual, individual disaster risk percentage of each country with font size. To approximate geographical relations, we use predecessor tags to place country names belonging to the same continent closely.

5 Visualizing Tree Structures with TagSpheres

Numerous algorithms have been developed to visualize large tree structures [38]. Usually, explicit tree representations in the form of node-link diagrams focus on highlighting branching patterns, e.g., [39, 40]; the visualization of values associated with individual nodes plays only a minor role. On the other hand, in implicit tree representations, e.g., tree maps [20], bubble charts [41] or pie chart variants [42], links are not drawn but hierarchical relationships between nodes are illustrated with nesting techniques. But, only few implicit tree layout algorithms communicate the actual values of nodes [43, 44].

Applying the TagSpheres algorithm to tree structures yields an implicit node-link diagram that visualizes the values of nodes without explicitly displaying links. But, TagSpheres indicate structural relationships by using the parent of a node in the tree as predecessor tag, by applying variable font size to illustrate the number of a node’s children, and by using the interaction functionality to highlight individual paths in the tree. This way, we gain a novel tree layout that rather favors the representation of nodes than links. Two examples presenting tree layouts generated with TagSpheres are outlined below.

5.1 Airport Connectivity

Using the OpenFlights database, we can construct a (minimum spanning) tree that reflects all possible flight connections from a selected departure airport d. As in Sect. 4.3, d is the only tag on hierarchy level \(H_1\). All other hierarchy levels compose in dependency on the number of stops it takes to reach another airport. So, \(H_2\) contains all airports reachable with a non-stop flight, \(H_3\) contains all airports reachable with one stop, and so on. As the maximum number of stops is six, we get eight hierarchy levels. In case of multiple possible flight connections having the same number of stops when traveling between two airports, we keep the connection with the shortest geographical distance. Thus, each airport has a clearly defined predecessor. The resultant TagSpheres with Rome-Fiumicino (FCO) as departure airport is shown in Fig. 9. As the underlying tree is well balanced and the average number of children (outdegree) is relatively high (around 5.2 children per inner node), structural relationships are only faintly visible in the outer spheres. Paths are shown on mouse selection indicating the stops between d and the selected airport as well as available connecting flights to other airports. In contrast to other node-link diagrams, the values of all 3.228 nodes and their distances to the root node are easily recognizable with TagSpheres. Thereby, the font size of a tag reflects the number of connecting flights of the corresponding airport.

Fig. 9.
figure 9

Flight connections from Rome-Fiumicino (FCO).

Fig. 10.
figure 10

Bible family tree visualized with TagSpheres.

5.2 Bible Family Tree

More than 600 verses of the Bible describe familial relationships, e.g., between husbands and wives or between fathers and children. Tying all these information together results in the Bible family tree.Footnote 9 It contains 416 nodes (persons), the maximum depth of the tree is 74, and the average number of children of inner nodes is 1.7. Using a vertical dendrogram layoutFootnote 10 supports the analysis of global structural relationships, but the values of nodes are only locally visible. With TagSpheres, the values of all nodes are readable in the global view. In contrast to the previous example, the sparseness of the tree and scaling the font size according to the outdegree of a node fairly indicate present relationships, which can be further explored with mouse interaction.

6 Conclusion

We introduced TagSpheres that arrange tags on several hierarchy levels to transmit the notion of hierarchical distance in tag clouds. We accentuate relationships between different hierarchy levels by placing hierarchically related tags closely. The original motivation to design TagSpheres was to support humanities scholars in analyzing the clause functions of a search term’s co-occurrences (see Sect. 4.1). Aspects of a positive evaluation of the TagSpheres design during the corresponding eXChange project are outlined in the previous version of this paper [45]. Further usage scenarios in sports, aviation and natural disaster management outline the inherence of hierarchical textual information in various domains and the usefulness of TagSpheres as they provide an interesting view on this type of data. In addition, we pointed out that the TagSpheres also serves as a novel tree layout algorithm. Although the value of this approach is yet to be evaluated, two use cases in aviation and theology indicate it’s potential.

Despite few listed limitations, TagSpheres might be applicable to a multitude of further research questions from other areas. Also imaginable is the combination of TagSpheres and TagPies to support the comparative analysis of different textual summaries with hierarchical information.