Abstract
In recent years, the increasing availability of data describing the dynamics of real-world systems led to a surge of interest in the complex networks of interactions that emerge from such systems. Several measures have been introduced to analyse these networks, and among them one of the most fundamental ones is vertex centrality, which quantifies the importance of a vertex within a graph. In this paper, we propose a novel vertex centrality measure based on the quantum information theoretical concept of Holevo quantity. More specifically, we measure the importance of a vertex in terms of the variation in graph entropy before and after its removal from the graph. More specifically, we find that the centrality of a vertex v can be broken down in two parts: (1) one which is negatively correlated with the degree centrality of v, and (2) one which depends on the emergence of non-trivial structures in the graph when v is disconnected from the rest of the graph. Finally, we evaluate our centrality measure on a number of real-world as well as synthetic networks, and we compare it against a set of commonly used alternative measures.
Access provided by CONRICYT-eBooks. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
A large number of real-world systems can be modelled and analysed by looking at the structure that emerges from the interaction between their components [5]. The resulting graph is called a complex network, and provides a powerful way to study the static and dynamic aspects of the underlying system. Typical examples of systems that are studied in network science include metabolic pathways [9], protein interactions [8], brain regions interactions [20] and scientific collaborations [13]. Complex networks often display non-trivial structural properties that distinguish them from Erdös-Rényi random graphs [4], such as small-worldness and a power-law distribution of vertex degrees [5].
In these large networks one of the key problems is that of identifying the set of most relevant nodes, also called central nodes. A number of centrality measure have been introduced in the literature [3, 5,6,7, 12, 17], each of them capturing different but equally significant aspects of vertex importance. Commonly encountered examples include the degree centrality [7], the closeness centrality [21], and the betweenness centrality [7]. Let G be a graph with n nodes. In the degree centrality [7] the normalised (degree) is taken as the centrality value of a vertex, i.e.,
where \(d_v\) denotes the degree of the vertex v. In other words, the number of edges incident on a vertex is interpreted as a measure of its “popularity”, or, alternatively, as the risk of a node being infected in an epidemiological scenario. The closeness centrality [21] links the importance of a vertex to its proximity to the remaining vertices of the graph. More specifically, the closeness centrality is defined as the inverse of the sum of the distance of a vertex v to the remaining nodes of the graph, i.e.,
where sp(u, v) denotes the shortest path distance between nodes u and v. Finally, the betweenness centrality [7] measures the extent to which a given vertex lies on the (shortest) paths between the remaining vertices, i.e.,
where V is the set of nodes, \(\sigma (s,t)\) and \(\sigma (s,t|v)\) denote the number of shortest paths between s and t and the number of shortest paths between s and t that pass through v.
Recently, there has been an increasing interest in using concepts from quantum mechanics and quantum information theory to probe the structure of graphs [11, 18, 19]. In [11], Lockhart et al. introduced an edge centrality index based on quantum information theory, where the importance of an edge is measured in terms of its contribution to the Von Neumann entropy of the network [16]. This in turn relies on decomposing the edge set of a graph as follows. Given an edge u, the original graph is decomposed into two graphs over the same vertex set, but with different number of edges: (1) a graph where only the edge e is present, and (2) a graph where all the original edges except e are present. With this decomposition to hand, the centrality of e is measured as the Holevo quantity of the associated decomposition [11].
In this paper, we show that a similar approach can be taken to measure the centrality of a vertex. Given a vertex v, we propose to decompose the graph into two graphs over the same vertex set but with edge sets as follows: (1) one graph where only the edges incident to v are present, and (2) one graph where all the original edges except those incident to v are present. Then, the centrality of v is the Holevo quantity associated to the resulting graph ensemble. We show that the centrality of a vertex v can be broken down in two parts: (1) one part that is negatively correlated with the degree centrality of v, and (2) one part that depends on the emergence of non-trivial structures in the graph when v is disconnected from the rest of the graph by removing all edges incident to it. Finally, we perform a series of experiments to evaluate the proposed edge centrality measure on real-world as well as synthetic graphs, and we compare it against a number of commonly used alternative measures.
The remainder of this paper is organised as follows: Sect. 2 reviews the necessary quantum mechanical background and the quantum information theoretical concepts that underpin our approach. Section 3 introduces the proposed vertex centrality measure, which is then analysed and compared to alternative measures in Sect. 4. Finally, Sect. 5 concludes the paper.
2 Quantum Information Theoretical Background
2.1 Quantum States and von Neumann Entropy
In quantum mechanics, a system can be either in a pure state or a mixed state. Using the Dirac notation, a pure state is represented as a column vector . A mixed state, on the other hand, is an ensemble of pure quantum states , each with probability \(p_i\). The density operator of such a system is a positive unit-trace matrix defined as
The von Neumann entropy [14] S of a mixed state is defined in terms of the trace and logarithm of the density operator \(\rho \)
where \(\lambda _1,\ldots ,\lambda _n\) are the eigenvalues of \(\rho \). If , i.e., the quantum system is a pure state with probability \(p_i=1\), then the Von Neumann entropy \(S(\rho ) = -{{\mathrm{Tr}}}(\rho \ln {\rho })\) is zero. On other hand, a mixed state always has a non-zero Von Neumann entropy associated with it.
2.2 A Mixed State from the Graph Laplacian
Let \(G=(V,E)\) be a simple graph with n vertices and m edges. We assign the vertices of G to the elements of the standard basis of an Hilbert space \(\mathcal {H}_{G}\), . Here denotes a column vector where 1 is at the i-th position. The graph Laplacian of G is the matrix \(L=D-A\), where A is the adjacency matrix of G and D is the diagonal matrix with elements \(d(u) = \sum _{v=1}^n A(u,v)\). For each edge \(e_{i,j}\), we define a pure state
Then we can define the mixed state with density matrix
Let us define the Hilbert spaces \(\mathcal {H}_{V}\cong \mathbb {C}^{V}\), with orthonormal basis \(\mathbf {a}_{v}\), where \(v\in V\), and \(\mathcal {H}_{E}\cong \mathbb {C}^{E}\), with orthonormal basis \(\mathbf {b}_{u,v}\), where \(\{u,v\}\in E\). It can be shown that the graph Laplacian corresponds to the partial trace of a rank-1 operator on \(\mathcal {H}_{V}\otimes \mathcal {H}_{E}\) which is determined by the graph structure [2]. As a consequence, the Von Neumann entropy of \(\rho (G)\) can be interpreted as a measure of the amount of entanglement between a system corresponding to the vertices and a system corresponding to the edges of the graph [2].
2.3 Holevo Quantity of a Graph Decomposition
Given a graph G, we can define an ensemble in terms of its subgraphs. Recall that a decomposition of a graph G is a set of subgraphs \(H_1,H_2,...,H_k\) that partition the edges of G, i.e., for all i, j, \(\bigcup _{i=1}^k H_i=G\) and \(E(H_i)\cap E(H_j)=\emptyset \), where E(G) denotes the edge set of G. Notice that isolated vertices do not contribute to a decomposition, so each \(H_i\) can always be seen a subgraph that contains all the vertices. If we let \(\rho (H_1),\rho (H_2),...,\rho (H_k)\) be the mixed states of the subgraphs, the probability of \(H_i\) in the mixture \(\rho (G)\) is given by \(|E(H_i)|/|E(G)|\). Thus, we can generalise Eq. 4 and write
Consider a graph G and its decomposition \(H_1,H_2,...,H_k\) with corresponding states \(\rho (H_1),\rho (H_2),...,\rho (H_k)\). Let us assign \(\rho (H_1),\rho (H_2),...,\rho (H_k)\) to the elements of an alphabet \(\{a_1,a_2,...,a_k\}\). In quantum information theory, the classical concepts of uncertainty and entropy are extended to deal with quantum states, where uncertainty about the state of a quantum system can be expressed using the density matrix formalism. Assume a source emits letters from the alphabet and that the letter \(a_i\) is emitted with probability \(p_i = |E(H_i)|/|E(G)|\). An upper bound to the accessible information is given by the Holevo quantity of the ensemble \(\{p_i,\rho (H_i)\}\):
3 The Holevo Vertex Centrality
Given a graph \(G=(V,E)\) and a vertex \(v \in V\), we propose to measure the centrality of v as follows. Let \(G_v = (V,E_v)\) denote the subgraph with vertex set V and edge set \(E_v = \{ (u,v) \in E | u \in V \}\), and \(G_{\overline{v}} = (V,E_{\overline{v}})\) be the subgraph with vertex set V and edge set \(E_v = \{ (u,v) \in E | (u,v) \not \in E_v\}\). In other words, \(E = E_{v} \cup E_{\overline{v}}\) and \(E_{v} \cap E_{\overline{v}} = \emptyset \). Hence, from Eq. 5, we can show that
With this decomposition to hand, we define the Holevo vertex centrality of v as
Given a graph \(G=(V,E)\) and a vertex \(v \in V\), the first term in Eq. 8 (i.e., \(S\left( \rho (G)\right) \)) does not depend on the choice of v, and thus can be ignored when ranking the nodes of G according to their Holevo centrality. Moreover, note that we only need to compute the spectrum of \(\rho (G_{\overline{v}})\), as the spectrum of \(\rho (G_v)\) can be easily determined analytically. Recall that the star graph on n vertices \(K_{1,n-1}\) has Laplacian spectrum
i.e., it has three eigenvalues n, 1, and 0 with multiplicity 1, \(n-2\), and 1, respectively. This in turn implies that the spectrum of the density matrix \(\rho (K_{1,n-1})\) is
as shown in [10]. Since adding disconnected vertices to a graph does not change its Von Neumann entropy [16], we have that the entropy of \(\rho (G_v)\) is
where \(d_v\) denotes the degree of v. In other words, the entropy of \(\rho (G_v)\) is completely determined by the degree of v. As a result, the computational complexity of computing the Holevo centrality of v is dominated by the cost of computing the eigendecomposition of \(\rho (G_{\overline{v}})\).
Finally note that the Von Neumann entropy of a star graph is 0 when \(d_v = 1\), and it grows logarithmically as a function of \(d_v\). This in turn suggests that the Holevo centrality given by Eq. 8 could be negatively correlated with the degree centrality, however proving this would require finding general analytical form of the spectrum of \(\rho (G_{\overline{v}})\). Moreover, it should be noted that the Von Neumann entropy of \(\rho (G_{\overline{v}})\) depends on the presence of several non-trivial structural patterns, including paths, cliques, and connected components. Therefore the Holevo vertex centrality measures the importance of a vertex as a combination of its degree as well as the structural patterns that emerge after its removal.
4 Experimental Evaluation
We perform our experiments on two well known real-world networks, the Florentine families graph [15] and the Karate club network [22], as well as a number of synthetic graphs. We compare the proposed similarity measure the three commonly used alternative measures: (1) the degree centrality [7], (2) the closeness centrality [21], and (3) the betweenness centrality [7].
4.1 Synthetic Networks
Wheel Graph. The Wheel graph \(W_n\) on n nodes is the graph obtained by taking a cycle \(C_{n-1}\) on \(n-1\) nodes and connecting each of the nodes of \(C_{n-1}\) to another node, i.e., the hub. Figure 1 shows 3 wheel graphs of increasing number of nodes n and the corresponding value of the Holevo centrality. Note that for small values of n the hub is the least central node. However as n grows the centrality of the hub remains constant while the centrality of the other nodes decreases, until the hub becomes the most central node.
Indeed, our centrality measure seems to capture the increasing redundancy of the nodes along the cycle \(C_{n-1}\) as n grows. Note that this implies that our measure is negatively correlated with the degree centrality for small values of n, but positively correlated for large values of n. While this may seem surprising given the negative correlation highlighted in Eq. 9, the observed behaviour is likely due to the other component in Eq. 8, i.e., \(S(\rho (G_{\overline{v}}))\), as well as their respective weights.
Lollipop Graph. The (m, n)-lollipop graph on \(m+n\) nodes is the graph obtained by joining the clique \(K_m\) on m nodes with the path graph \(P_n\) on n nodes. Figure 2 shows the value of the Holevo centrality for increasing size of the clique, while keeping the size of the path fixed. For small clique sizes, the most central node is the central node on the path. However, as the size of the clique increases, the centrality of the path node decreases, while the clique nodes become increasingly important. We observe a similar behaviour when the length of the path is increased, with the nodes belonging to it being the most central ones for small values of n. However, as we increase n, the most central node in the graph becomes the node the shared node between the clique and the path.
Similarly to the Wheel graph, the Holevo centrality measure seems to capture the importance of the nodes belonging to the path when the total number of nodes in the graph is small. However, as \(m+n\) grows, our measure places most of the importance on the tightly connected nodes of the clique, while the centrality of the path is “diluted” as its length increases.
Barbell Graph. The Barbell graph is the graph obtained by joining two cliques \(K_m\) through a path \(P_n\) (i.e., a bridge between the two cliques). Note that when \(m=2\) the corresponding Barbell graph is the path graph \(P_{n+2m}\). Figure 3 shows three Barbell graphs with n constant (i.e., the length of the bridge is 3 in all the graphs) and m equal to 2, 3, and 4, respectively. When \(m=2\) the graph is a path over 7 nodes. In this case, our centrality measures assigns the largest weight to the two nodes that connect the two ends of the path to the rest of the nodes. As the m increases, the weight of the cliques shifts the importance from the bridge to the cliques, with the node connecting the cliques to the path becoming then most central ones. If we increase the path length, we observe that the junctions between the cliques and the path remain the most central nodes. However we start to discriminate between the nodes along the bridge, with the nodes closer to the center of the bridge being assigned a higher centrality. Note that this is contrast with the degree centrality, which would assign the same weight to all the nodes on the bridge (since they all have the same degree).
Scale-Free Graph. Finally, we consider a set of scale-free graphs generated by the Barabási-Albert preferential attachment model [1]. Starting from an empty graph, we iteratively add nodes to it until a user-defined size n is reached. At each iteration, the new node is connected to at most m nodes chosen according to their degree, i.e., nodes with a higher degree are more likely to be selected. We let m and n vary between 1 and 3, and 10 and 20, respectively, and for each pair of parameters we generated 100 graphs.
We are interested in measuring the correlation between the Holevo vertex centrality and the degree centrality. In the previous subsections we have observed that the correlation with the degree centrality seems to increase as the size of the graph increases. Figure 4 confirms that this is the case. The figure also shows that the correlation increases as we increase the number of connections added per iteration. Note that when \(m=1\) the resulting graph is guaranteed to be a tree. Indeed, Fig. 4 seems to suggest that the correlation is particularly high on trees, however further investigation is needed to understand if the observed effect is instead due to the particular degree distribution of scale-free graphs or to the presence of high degree nodes.
4.2 Real-World Networks
We conclude our experimental evaluation by measuring the centrality of two well-known networks, the Florentine families graph [15] and the Karate club network [22]. Figures 5(a) and (b) show the two networks, where the radius of the nodes is proportional to their Holevo centrality. We also compute the degree (DC), closeness (CC), and betweenness (BC) centralities over these networks and we show the corresponding correlation matrices in Figs. 5(c) and (d).
In these networks, the Holevo centrality measure shows a large positive correlation with all the other measure, in particular the degree centrality. However, if we look at the ranking induced by our measure there are some important differences. Table 1 shows the ranking given by the Holevo centrality, as well as the degree of each family in the network. Clearly, the degree centrality cannot distinguish between those families that have the same degree, while the Holevo centrality allows to define a more fine-grained ranking. For example, the Salviati family (node 10) is ranked higher than the Barbadori family (node 14), although both families have degree two. However the Salviati family is the only node connecting the Pazzi family (node 6) to the Medici family (node 2) and the rest of the graph, and therefore its importance is higher.
5 Conclusion
In this paper we have proposed a novel vertex centrality measure based on the quantum information theoretical notion of Holevo quantity. The idea underpinning our approach is that the importance of a vertex is proportional to the variation in the information content of the network before and after its removal. We have shown that the centrality of a vertex v can be broken down in two parts, one which is negatively correlated with the degree centrality of v, and one which depends on the emergence of non-trivial structures in the graph when v is disconnected from the rest of the graph. Finally, we have evaluated the Holevo centrality measure on a number of synthetic as well as real-world networks, and we have compared it against commonly used alternative measures. Future work will be aimed at investigating further the structural pattern that influence this centrality measure.
References
Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999)
de Beaudrap, N., Giovannetti, V., Severini, S., Wilson, R.: Interpreting the von Neumann entropy of graph Laplacians, and coentropic graphs. Panorama Math. Pure Appl. 658, 227 (2016)
Bonacich, P.: Power and centrality: a family of measures. Am. J. Sociol. 92, 1170–1182 (1987)
Erdös, P., Rényi, A.: On random graphs. Publ. Math. Debrecen 6, 290–297 (1959)
Estrada, E.: The Structure of Complex Networks. Oxford University Press, New York (2011)
Freeman, L.C.: A set of measures of centrality based on betweenness. Sociometry 40(1), 35–41 (1977)
Freeman, L.C.: Centrality in social networks conceptual clarification. Soc. Netw. 1(3), 215–239 (1979)
Ito, T., Chiba, T., Ozawa, R., Yoshida, M., Hattori, M., Sakaki, Y.: A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Nat. Acad. Sci. 98(8), 4569 (2001)
Jeong, H., Tombor, B., Albert, R., Oltvai, Z., Barabási, A.: The large-scale organization of metabolic networks. Nature 407(6804), 651–654 (2000)
Li, J.Q., Chen, X.B., Yang, Y.X.: Quantum state representation based on combinatorial Laplacian matrix of star-relevant graph. Quantum Inf. Process. 14(12), 4691–4713 (2015)
Lockhart, J., Minello, G., Rossi, L., Severini, S., Torsello, A.: Edge centrality via the Holevo quantity. In: Robles-Kelly, A., Loog, M., Biggio, B., Escolano, F., Wilson, R. (eds.) S+SSPR 2016. LNCS, vol. 10029, pp. 143–152. Springer, Cham (2016). doi:10.1007/978-3-319-49055-7_13
Newman, M.E.: A measure of betweenness centrality based on random walks. Social Netw. 27(1), 39–54 (2005)
Newman, M.: Scientific collaboration networks. i. network construction and fundamental results. Phys. Rev. E 64(1), 016131 (2001)
Nielsen, M.A., Chuang, I.L.: Quantum Computation and Quantum Information. Cambridge University Press, New York (2010)
Padgett, J.F., Ansell, C.K.: Robust action and the rise of the medici, 1400–1434. Am. J. Sociol. 98(6), 1259–1319 (1993)
Passerini, F., Severini, S.: Quantifying complexity in networks: the von Neumann entropy. Int. J. Agent Technol. Syst. (IJATS) 1(4), 58–67 (2009)
Rossi, L., Torsello, A., Hancock, E.R.: Node centrality for continuous-time quantum walks. In: Fränti, P., Brown, G., Loog, M., Escolano, F., Pelillo, M. (eds.) S+SSPR 2014. LNCS, vol. 8621, pp. 103–112. Springer, Heidelberg (2014). doi:10.1007/978-3-662-44415-3_11
Rossi, L., Torsello, A., Hancock, E.R.: Measuring graph similarity through continuous-time quantum walks and the quantum Jensen-Shannon divergence. Phys. Rev. E 91(2), 022815 (2015)
Rossi, L., Torsello, A., Hancock, E.R., Wilson, R.C.: Characterizing graph symmetries through quantum Jensen-Shannon divergence. Phys. Rev. E 88(3), 032806 (2013)
Sporns, O.: Network analysis, complexity, and brain function. Complexity 8(1), 56–60 (2002)
Stanley, W., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University, Cambridge (1994)
Zachary, W.W.: An information flow model for conflict and fission in small groups. J. Anthropol. Res. 33(4), 452–473 (1977)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Rossi, L., Torsello, A. (2017). Measuring Vertex Centrality Using the Holevo Quantity. In: Foggia, P., Liu, CL., Vento, M. (eds) Graph-Based Representations in Pattern Recognition. GbRPR 2017. Lecture Notes in Computer Science(), vol 10310. Springer, Cham. https://doi.org/10.1007/978-3-319-58961-9_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-58961-9_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58960-2
Online ISBN: 978-3-319-58961-9
eBook Packages: Computer ScienceComputer Science (R0)