Introduction

From biological organisms to social groups, both natural and artificial systems demand sophisticated modeling tools to accurately capture their fundamental properties. Understanding how to represent interactions in such complex systems is crucial for unraveling their intricate architecture and emergent functionality. Networks have long offered a common language for studying these systems, representing units as nodes and interactions as dyadic links (Boccaletti et al. 2006; Cimini 2019). However, this approach overlooks group interactions involving three or more nodes, which are essential in systems where higher-order interactions are prevalent (Battiston 2020, 2021; Torres et al. 2021; Battiston and Petri 2022). Examples include collaboration networks (Patania et al. 2017), human face-to-face interactions (Cencetti et al. 2021, folksonomies (Ghoshal et al. 2009), species interactions within complex ecosystems (Grilli et al. 2017), brain networks (Petri 2014), and cognitive associations (Citraro 2023).

Hypergraphs (Berge 1973), able to explicitly encode group interactions as hyperedges, have emerged as a popular framework to represent higher-order networks (Battiston 2020, 2021). Recently, significant effort has been dedicated to the characterization of hypergraphs, from centrality (Benson 2019; Tudisco and Higham 2021) and clustering (Benson et al. 2018) measures, to backboning (Musciotto et al. 2021, 2022) and reconstruction (Young et al. 2021). Investigations into higher-order networks cover both micro- (Lotito et al. 2022, 2023; Lee et al. 2020) and mesoscale levels (Wolf et al. 2016; Vazquez 2009; Carletti et al. 2021; Eriksson et al. 2021; Chodrow et al. 2021; Contisciani et al. 2022; Chodrow et al. 2023; Ruggeri et al. 2023; Lotito et al. 2024), unveiling structural principles essential to understanding group interactions in real-world systems. These higher-order interactions are known to impact the dynamic and collective phenomena within networked systems (Battiston 2021), affecting processes such as synchronization (Skardal and Arenas 2020; Millán et al. 2020; Lucas et al. 2020; Gambuzza 2021; Zhang et al. 2023), diffusion (Schaub et al. 2020; Carletti et al. 2020), spreading (Iacopini et al. 2019; Chowdhary et al. 2021; Neuhäuser et al. 2020) and evolution (Alvarez-Rodriguez 2021).

However, not all interactions in complex systems are alike; they may differ in nature, type, and scope. This observation led researchers to introduce the concept of multilayer and multiplex networks (Boccaletti 2014; Kivelä 2014), where links are encoded into different interaction layers, each representing a distinct type of relationship (De Domenico 2013; Battiston et al. 2014). Multilayer and multiplex networks can successfully describe systems such as trade networks (Barigozzi et al. 2010), transportations networks (Aleta et al. 2017), collaboration networks (Battiston et al. 2016), and the brain (De Domenico 2017). Multiplex hypergraphs, where layers encoding hyperedges of different type, could offer a robust tool for describing complex systems that involve group interactions of varying types. Despite a few exceptions (Vasilyeva 2021) and a significant potential, however, multiplex hypergraphs remain relatively unexplored, and a general set of tools for their analysis is still missing.

In this paper, we introduce some basic measures to characterize multiplex networks with higher-order interactions, spanning from the node/hyperedge level to the system’s mesoscale. We propose measures for the activity of nodes in different layers and orders, as well as for node degree correlation. Moreover, we partition nodes in generalists or specialists based on how their higher-order degree is spread across layers. We characterize hyperedges by examining hyperedge order distributions and overlap in multiple layers. Additionally, we introduce measures to quantify the layer-dependent ability to connect either generalist or specialist nodes. Finally, we study the correlation of community and core-periphery structures across layers.

We apply our proposals to three different real-world datasets: scientific co-authorship in physics, collaborations in movies and face-to-face interactions in a high school. Our measures are able to highlight patterns of actors’ collaborations across various film genres, discern co-authorship trends within and across physics subfields, and offer insights into the daily interaction dynamics of students.

Multiplex hypergraphs

Multiplex hypergraphs model systems where interactions among units (i) may belong to multiple types and (ii) are not necessarily dyadic, i.e. they may involve more than two units. A multiplex hypergraph \({{\textbf {H}}}\) is defined as:

$$\begin{aligned} {{\textbf {H}}} = \{ H_1(V, E_1), \ldots , H_M(V, E_M)\} \end{aligned}$$

where each layer \(\alpha\) is a hypergraph \(H_{\alpha }(V, E_\alpha )\). Each hypergraph \(H_{\alpha }(V, E_\alpha )\) share the same set of entities V. \(E_\alpha \subseteq {\mathcal {P}}(V)\) is the set of all interactions of a specific type \(\alpha\). Moreover, we require \(|e| \ge 2\) for all \(e \in E_\alpha\) for any \(\alpha\). In other words, each layer in our framework shares the same set of nodes and represents a distinct set of interactions of the same nature. In Fig. 1, we show a simple multiplex hypergraph with 7 nodes and hyperedges spread across 3 layers.

Fig. 1
figure 1

Multiplex hypergraphs represent systems of units that display interactions of different orders and different types. Each type of interaction is encoded into a single layer of the hypergraph. All the layers share the same set of nodes

We remark that our proposed framework is different from a multiplex representation of higher-order interactions where layers are defined by interactions of different order (Lucas et al. 2020; Sun and Bianconi 2021).

In this work, we introduce a set of general tools to investigate multiplexity across different system scales in networks with higher-order interactions. We validate our measures and discuss relevant findings for three distinct real-world datasets:

  • aps (Co-authorship network), where nodes are authors, and hyperedges represent groups of authors who have co-authored a paper. Each layer collects papers from the same subfield of physics, identified by a PACS code (Physics and Astronomy Classification Scheme) (Dataset 2021).

  • imdb (Co-starring network), where nodes represent actors, and hyperedges represent the cast of a specific movie. Each layer corresponds to a movie genre.

  • high school (Social network), where nodes are students, and hyperedges represent groups of students interacting face-to-face, with each layer grouping interactions from the same day of the week (Mastrandrea et al. 2015).

Detailed statistics about datasets are provided in Table 1.

Table 1 Statistics about three real-world multiplex hypergraphs

Node properties

We begin by investigating multiplex properties at the node level. The first basic measure we consider is node activity (Nicosia and Latora 2015). A node i is active at layer \(\alpha\) if i participates in at least one interaction at layer \(\alpha\). Figure 2 shows statistics on nodes’ simultaneous activity across multiple layers. Specifically, the y-axis plots the proportion of nodes (from the total node count) active in at least x layers. By definition, these curves exhibit a decreasing trend, with variations in the negative slopes reflecting the datasets’ diversity. While it is uncommon for scientists and actors to be active across more than 2 or 3 layers, students tend to be active in all layers. In fact, the inactivity of a student in a specific layer implies their absence from school on that day.

Fig. 2
figure 2

Proportion of nodes active in at least x layers across three different datasets. Colored dashed lines indicate the number of layers in each respective dataset

Fig. 3
figure 3

a Each dataset is a graph in which vertices represent the layers of the multiplex hypergraphs and the thickness of an edge \((\alpha , \beta )\) quantifies the pairwise cosine similarity of layer activity matrices \({{\textbf {B}}}_\alpha , {{\textbf {B}}}_\beta\) associated with layers \(\alpha\) and \(\beta\). Vertex size is proportional to the number of nodes active in that layer. b Matrix L associated with each dataset. Rows are normalized by the number of nodes active in each layer. Interaction orders are binned exponentially

So far, we have grouped all interactions together, regardless of their order. To obtain more detailed insights about higher-order interactions, we can examine node activity for each specific interaction order d. To this scope, we introduce a list \({{\textbf {A}}}\) of node activity matrices, one for each node i:

$$\begin{aligned} {{\textbf {A}}}_i = \{a_{\alpha d}\} = {\left\{ \begin{array}{ll} 1 &{} \text {if } i\text { is active} \text { at order } d \text { in } \alpha \\ 0 &{} \text {otherwise}. \end{array}\right. } \end{aligned}$$

Similarly, one can define activity from a layer perspective and consider a list \({{\textbf {B}}}\) of layer activity matrices, one for each layer \(\alpha\):

$$\begin{aligned} {{\textbf {B}}}_\alpha = \{b_{i d}\} = {\left\{ \begin{array}{ll} 1 &{} \text {if } i\text { is active} \text { at order } d \text { in } \alpha \\ 0 &{} \text {otherwise}. \end{array}\right. } \end{aligned}$$

It can be useful to aggregate information about nodes and define an aggregated layer activity matrix L as:

$$\begin{aligned} L = \{l_{\alpha d}\} = |\{ \text {nodes active at order } d \text { in } \alpha \}| \end{aligned}$$

In Fig. 3a, each dataset is represented as a graph where vertices are the layers of the multiplex hypergraph and links measure the similarity in activity patterns of two layers \(\alpha\) and \(\beta\), quantified as the cosine similarity of their node activity matrices \({{\textbf {B}}}_\alpha\) and \({{\textbf {B}}}_\beta\). The thicker the link, the higher the similarity. This figure emphasizes layers that not only share common active nodes, but also exhibit similar patterns of participation across different hyperedge orders. Particularly, a consistent higher-order similarity is observed across school days, reflecting recurring interaction patterns throughout the week. Other datasets show more heterogeneous behaviour, with documentary casts differing significantly from other layers, while drama and comedy casts exhibit similar patterns.

Figure 3b shows the aggregated layer activity matrices L for the three datasets. To account for variations in layer size, we normalize each row by the total number of active nodes in the respective layer. Distinct collaboration patterns emerge across the subfields of physics and movie genres. For instance, scientists in General or Electromagnetic Physics usually contribute to papers with a smaller number of co-authors, whereas co-author groups in fields like Elementary Particles and Nuclear Physics exhibit more variation in size. In movie collaborations, actor activity is concentrated in medium-size groups, typically between ten and twenty members. However, documentaries often feature smaller casts, while family and comedy movies tend to have larger ones. The figure once again highlights how students at school maintain a consistent group size in their interactions throughout the week.

Fig. 4
figure 4

a The heatmap shows the pairwise correlation between the degrees of nodes across different layers. The color scale indicates the strength of the correlation, with blue representing low correlation and red representing high correlation. b A system unit \(i\) is represented as a point on a Cartesian plane, with the overlapping degree \(o_i\) on the \(y\)-axis, the participation coefficient \(P_i\) on the \(x\)-axis, and the average order of the interactions in which the unit is involved indicated by color intensity

Similar to node activity, node degree (defined as the number of interactions in which a node participates) is another property that can be used to measure the activity across the different layers and interaction orders. We define a list \({{\textbf {K}}}\) of node degree matrices, one for each node i:

$$\begin{aligned} {{\textbf {K}}}_i&= \begin{aligned} \{k_{\alpha d}\}&= |\{ \text {hyperedges of order } d \\&\quad \text {involving } i \text { at } \alpha \}| \end{aligned} \end{aligned}$$

We use \(k_{i \alpha }\) to denote the total number of interactions involving i in layer \(\alpha\), irrespective of their order:

$$\begin{aligned} k_{i \alpha } = \sum _{d=1}^{D}{{{\textbf {K}}}_{i \alpha d}}, \end{aligned}$$

where D is the order of the largest interaction in the dataset.

In Fig. 4a, we analyze the correlation of node degree across layers, exploring the extent to which a node with a high or low degree in one layer similarly exhibits a high or low degree in another layer. The correlation matrix for physics collaborations uncovers a hierarchical structure, with strong correlations among specific subfields sharing commonalities and notable scientists, such as in Nuclear and Elementary Particles Physics. In contrast, the degree correlations among actors are generally weak, though certain genres, like thriller and horror, show similarities. A significant correlation in node degrees across consecutive days in high school suggests stable and structured daily interaction patterns, implying that individuals with numerous interactions on one day tend to maintain similar levels of interactions on subsequent days, and vice versa.

We now define the overlapping degree \(o_i\) for a node i as the total number of interactions involving i, irrespective of both layers or orders:

$$\begin{aligned} o_i = \sum _{\alpha =1}^M k_{i \alpha } \end{aligned}$$

It can be interesting to measure (i) how the overlapping degree of a node i is spread across the layers, i.e., if the degree is concentrated in certain layers or if it is uniformly distributed; (ii) how interactions involving node i are spread across orders. We measure (i) by defining the participation coefficient \(P_i\) of a node i of the degree with respect to the layers:

$$\begin{aligned} P_i = \frac{M}{M-1} \left[ 1 - \sum _{\alpha =1}^{M} \left( \frac{k_{i \alpha }}{o_i}\right) ^2\right] \end{aligned}$$

where \(k_{i\alpha }\) is the degree of node i at layer \(\alpha\), \(o_i\) is the overlapping degree of node i and M is the total number of layers. We measure (ii) by considering the average order of the interactions node i participates in.

In Fig. 4b, we represent each unit i of the different systems on a Cartesian plane, characterizing them across three distinct dimensions: their overlapping degree \(o_i\) (on the y-axis), their participation coefficient \(P_i\) (on the x-axis), and the average order of the interactions in which they are involved (indicated by color intensity). In general, such three dimensions provide different information about connectivity patterns and are only weakly correlated, or even uncorrelated. In aps, scientists are spread across the plane in terms of degrees and average interaction order, displaying an overall tendency towards specialization in a selected number of physics subfields, yet the behavior remains heterogeneous. imdb displays isolated outliers with a very high degree, low dispersion around the average interaction order, and a general tendency towards uniform participation across multiple genres. high school shows students covering the entire spectrum of node degrees and average group orders, with an expected tendency to interact uniformly across school days.

Hyperedge properties

We now turn our attention to the properties of the hyperedges encoding interactions in the same three real-world systems.

Fig. 5
figure 5

Distribution of hyperedge orders disaggregated by layers in each dataset. Colors distinguish between different layers, with interaction orders binned exponentially

We begin by considering the simplest measure for characterizing higher-order interactions, namely, the order of the groups. In Fig. 5, we plot the hyperedge order distribution disaggregated by layers. aps and imdb reveal heterogeneity across layers, suggesting that different physics subfields and movie genres exhibit distinct patterns of collaboration in terms of the number of people involved in a paper or a movie cast. For example, genres such as documentaries and animated movies typically feature fewer actors compared to other genres. Conversely, papers in Elementary Particles and Nuclear Physics often include a larger number of authors compared to those in other areas of physics. The distributions in high school are stable across layers, indicating that patterns of face-to-face interactions tend not to change over the days, with a general preference for smaller groups over larger ones.

Another property frequently studied in the context of multiplex networks is edge overlap, which measures the extent to which interactions among the same nodes tend to repeat across multiple layers. We define hyperedge overlap as the maximum number of layers in which an interaction repeats exactly. In Fig. 6, we present the distribution of hyperedge overlap, including information about the order of the interactions. As expected, aps displays a high degree of hyperedge overlap, indicating that the same set of scientific authors consistently interact across multiple areas of physics. Conversely, for actors, hyperedge overlap decays very rapidly. Small interactions typically exhibit a higher degree of overlap than large interactions. Historically, edge overlap in multiplex networks with higher-order interactions has often been investigated by projecting hyperedges at different layers into cliques, frequently resulting in extremely high values of edge overlap. Our analysis suggests that patterns of hyperedge overlap are more complex and that projections of hyperedges can account for the high amount of overlap previously observed (Battiston et al. 2014).

Fig. 6
figure 6

Number of hyperedges as a function of their overlap, i.e., the maximal number of layers in which an interaction repeats. Markers are scaled proportionally to the average order of hyperedges. Colored dashed lines indicate the corresponding number of layers in each dataset

Fig. 7
figure 7

Boxplots showing the distribution of hyperedge participation coefficient \(P_e\) across layers in each dataset

Finally, we assign a score \(P_e\) to each hyperedge e, defined in terms of the participation coefficient of the nodes involved in the interaction:

$$\begin{aligned} P_e = \frac{1}{|e|} \sum _{i \in e} P_{i} \end{aligned}$$

where |e| represents the number of nodes participating in hyperedge e and \(P_{i}\) is the participation coefficient of node i, as defined in the previous section.

This measure captures the tendency of hyperedges to connect nodes that either specialize in a few layers or act as generalists across multiple layers. In Fig. 7, we use boxplots to show distributions of P for hyperedges in various layers. It is noteworthy that layers can display heterogeneous mean values for the participation coefficient of their hyperedges. For example, casts in documentaries and co-authors in Nuclear and Elementary Particles Physics tend to include specialists. On the other hand, family and thriller movies are more likely to feature generalist actors. In high school, layers exhibit a consistent maximum mean value for the participation coefficient of hyperedges, attributed to students’ regular attendance at school each day.

Mesoscale properties

We finally shift our focus towards mesoscale structures, examining the emergence of communities and core-periphery structures within different layers of real-world hypergraphs.

Fig. 8
figure 8

a Heatmaps illustrating the similarity of community structures across layers, measured by Normalized Mutual Information (NMI), for the three datasets. In these heatmaps, high NMI values are represented by light colors, while low NMI values are represented by dark colors. b Core-periphery score \(c_i\) across layers for each node in the datasets, visualized as heatmaps. Rows are layers and columns represent nodes. Lighter colors are higher coreness values. Nodes are consistently sorted across layers based on their coreness in the aggregated hypergraphs, i.e., the hypergraph obtained by collapsing all layers into a single layer

Communities are groups of nodes that display a higher degree of connectivity among themselves than with the rest of the nodes in the system. In hypergraphs, a community is defined as a subset of nodes that tend to form cohesive units by participating in common hyperedges. When analyzing multiplex systems, it is typical to examine the similarities in community structures observed across various layers. In this direction, we employ a method for hard clustering, applied independently to each layer, which is an extension of the well-established Infomap algorithm to the case of hypergraphs (Eriksson et al. 2021). In general, Infomap minimizes the map equation, which quantifies the description length required to represent the random walker’s movements on the network (Rosvall et al. 2009). This optimization effectively partitions the network into communities that best capture the inherent modular structure. The method is publicly available (Eriksson et al. 2021) and we have used default parameters.

To assess the similarity and consistency of community structure across different hypergraph layers, we use Normalized Mutual Information (NMI), taking into account the set of nodes active in both layers. NMI is defined as:

$$\begin{aligned} \text {NMI}(C_1, C_2) = \frac{I(C_1, C_2)}{\sqrt{H(C_1) H(C_2)}} \end{aligned}$$

where \(I(C_1, C_2)\) denotes the mutual information between partitions \(C_1\) and \(C_2\). Mutual information quantifies the amount of information shared between the two partitions, i.e., measures how much knowing the community structure in one partition informs about the structure in the other. \(H(C_1)\) and \(H(C_2)\) represent the entropies of the partitions \(C_1\) and \(C_2\), respectively. By normalizing the mutual information \(I(C_1, C_2)\) with the geometric mean of these entropies, NMI adjusts for the variability in partition sizes and the number of communities, allowing for a fair comparison of community structures across different partitions. NMI ranges from \(0\) (indicating no mutual information) to \(1\) (indicating perfect agreement).

In Fig. 8a, we present the outcomes of this analysis through heatmaps, highlighting the strength of community structure similarities across layers. For example, community structures in animation and fantasy movie casts are closely related, as are those in comedy and family movie genres, whereas documentary casts show a completely uncorrelated structure. The community structure within high school interactions remains consistent across days. Physics collaborations reveal a significant degree of similarity across fields, though some layers exhibit more pronounced similarities than others. Overall, communities tend to be preserved within physics subfields and school days, while movie genres often demonstrate predominantly uncorrelated communities.

We then direct our attention to core-periphery structures. Core-periphery structures delineate the existence of a group of central and tightly connected nodes, the core, surrounded by less densely connected peripheral nodes, forming a distinctive organizational pattern often crucial for system functionality (Borgatti and Everett 2000).

We detect core-periphery structure for each layer independently using a method tailored for hypergraphs (Tudisco and Higham 2023). Such a method ranks nodes assigning to each node i a value \(c_i\), where \(c_i\) is a real number within the range of 0 to 1. This value delineates the extent to which a node participates in the core (value closer to 1) or the periphery (value closer to 0) structure of the system. Vector \({\textbf{c}}\) is selected for each layer \(\alpha\) independently. Following the work by Tudisco and Higham (Tudisco and Higham 2023), for each layer \(\alpha\) we select the vector \({\textbf{c}}\) that optimizes the following function:

$$\begin{aligned} \max _{{\textbf{c}}} \, \sum _{e \in E_\alpha } \frac{1}{|e|} \cdot \left( \sum _{i \in e} c_i \right) \quad \text {subject to} \quad \Vert {\textbf{c}} \Vert _2 = 1 \text { and } c_i \ge 0 \text { for all } i \end{aligned}$$

This continuous scale allows for a nuanced characterization of each node’s role within the core-periphery framework. Additionally, by comparing the coreness score \(c_i\) for each node \(i\) across the different layers of a multiplex hypergraph, we can analyze the variation or consistency of a node centrality across the layers. This method is publicly available (Tudisco and Higham 2023). We use the implementation from Hypergraphx (Lotito 2023).

To provide a visually appealing way of highlighting correlations of core-periphery structures and node behaviour across layers, Fig. 8b shows heatmaps in which rows are layers and columns are nodes, and each entry is coloured depending on node coreness. To compare the coreness value of single nodes across layers and visualize to which extent it keeps its core value, we maintain a consistent sorting of the nodes on the x-axis. For each dataset, nodes are sorted according to their core values in the aggregated hypergraphs (i.e., the hypergraph obtained by dropping information about layers and collapsing every hyperedge to a single layer). We observe that coreness values of nodes are maintained across layers exhibiting patterns similar to those seen in community structure correlations.

Conclusions

Networks have established themselves as a fundamental tool in a variety of disciplines to encode and study systems of interacting units. With the idea of capturing richer information about interactions, novel and more comprehensive network models have emerged: (i) multiplex networks, describing links of different types, and (ii) hypergraphs, encoding non-dyadic ties. Bridging these two notions, in this work we have introduced a general set of measures to characterize the structure of multiplex hypergraphs at multiple scales. We introduced a description of nodes in terms of higher-order activity patterns and degrees, to quantify the extent and magnitude of node participation in interactions of different orders across layers. Nodes have been also characterized in terms of how their degree is correlated and spread in the different layers, and by their preferred order of group interactions. For hyperedges, we have studied their order distribution disaggregated by layers, highlighting different patterns of group interactions depending on the hyperedge type. We have quantified the extent to which hyperedge tends to repeat exactly in multiple layers and we have analyzed the layer-dependant property of hyperedges of linking nodes with low or high participation coefficients. Finally, we have analyzed hypergraphs at their mesoscale, quantifying similarities of communities and core-periphery participation across layers. We have validated our proposed measures on three datasets from different domains, describing collaboration patterns across physics subfields, movie genres and daily interactions among students.

In summary, we believe that our measures can be useful in describing the structure of various empirical datasets characterized by both multiplex and higher-order interactions. We also hope that this initial characterization of multiplex hypergraphs will spark interest from a methodological perspective, such as proposing frameworks for extracting multiplex communities in hypergraphs. Further characterizations could be enhanced by considering the complex patterns of temporality in hyperedges, a common feature of higher-order systems (Cencetti et al. 2021; Ceria and Wang 2023; Gallo et al. 2023; Iacopini et al. 2023; Di Gaetano et al. 2024; Arregui-García et al. 2024; Mancastroppa et al. 2024; Chodrow 2020).