1 Introduction

Healthcare campaigns play a crucial role in increasing public awareness of health issues, promoting disease prevention, improving health literacy, and fostering overall well-being. However, evaluation of the efficacy of these campaigns and discerning which among multiple alternatives yields the most significant impact represents an important challenge for decision-making. Specifically in biomedical sciences, quantifying campaign effectiveness holds particular significance, as fundraising campaigns often aim to solicit public donations to support biomedical research. One critical facet in designing such campaigns hinges on emotion elicitation, as their emotional resonance may closely influence the public’s decision to contribute. Within this context, communication strategies can be categorized into two paradigms: the ’thinking ad’ approach, emphasizing the delivery of factual and rational content, and the ’feeling ad’ paradigm, which harnesses emotions and empathetic responses to encourage approach behaviors.

With the growing prominence of social media as an information source, videos have emerged as a potent medium for disseminating health-related content, incorporating verbal, vocal, and visual elements (Kostygina et al. 2020). Additionally, research in affective neuroscience has shown that videos surpass other stimuli, such as static images. This is due to their dynamic nature and the synchronous presentation of visual and auditory information. The latter can immerse participants in more realistic scenarios (Gross et al. 2013). In the study of emotion elicitation, identifying emotions expressed following the presentation of visual stimuli often involves the analysis of participants’ psychophysiological responses, both subjectively and objectively (Mustafizur et al. 2021).

In this research, campaign effectiveness is assessed by leveraging electroencephalography (EEG) signals, which represent a valuable tool in the study of brain activity and function (Michel and Murray 2012). Notably, the brain operates as a network of functionally interconnected regions (Pessoa 2014), and EEG allows us to characterize these functional networks by utilizing various connectivity metrics (Chiarion et al. 2023; Lai et al. 2018). The connectivity within brain networks has been linked to behavioral functioning ( Anzolin et al. 2017; Bosl et al. 2011). For instance, alterations in network connectivity have been associated with conditions such as epilepsy (Centeno and Carmichael 2014), reduced network connectivity has been observed in cases of mild cognitive impairment (Tóth et al. 2014) and aging (Hou et al. 2018; Javaid et al. 2022). In contrast, increased connectivity has been correlated with reduced math anxiety (Klados et al. 2019) and freezing of gait in Parkinson’s disease patients (Asher et al. 2021).

The EEG signals considered in this study were obtained from an experimental research project designed to explore the objective physiological and subjective emotional responses to four videos from healthcare campaigns. The campaigns aimed to raise funds from the public to support research on rare diseases, and they were implemented with diverse communication strategies, including variations in narrators (internal or external) and subject matter. To assess the impact of each campaign, we use EEG signals associated with attention to estimate brain networks and subsequently investigate their connectivity as a proxy measure for effectiveness. According to the literature, attention, as a cognitive domain preserved throughout evolution, plays a fundamental role in the human cognitive system. We concentrate on attention since it enables the selection of behaviorally relevant information, prioritizing sensory processing and decision-making (Sreenivasan and Sridharan 2019). In this study, we assume that more effective campaigns will elicit increased attention and more interconnected networks (Contò et al. 2021; Domakonda et al. 2019; Weber et al. 2018). Thus, connectivity differences in the networks may reflect attention differences across campaigns. The dataset comprises recordings over time from electrodes spatially distributed across the participants’ scalp, capturing local electrical oscillations in the brain during the video.

From a statistical perspective, time series methods represent an appealing computational tool to model EEG data (Gasser and Molinari 1996; Pardey et al. 1996). For instance, the (multivariate) autoregressive model has been used to formulate Granger causality tests (Granger 1969) for computing effective connectivity from partial direct temporal effects between electrodes or regions of interest ( Goyal and Garg 2020; Khan et al. 2023; Phang et al. 2019). In more complex settings, advanced regression techniques such as the (generalized) linear mixed model (LMM, Pinheiro and Bates 2000) have been used to model simultaneously EEG recordings from multiple subjects (Frömer et al. 2018; Meinardi et al. 2023; Mir et al. 2022; Spinnato et al. 2015).

In designs where the linearity assumption is not satisfied, generalized additive models (GAMs, Wood 2017) have been proposed as a tool to model nonlinear characteristics in EEG data using smoothing splines (Eliseyev and Aksenova 2014; Khare and Acharya 2023; Kryuchkova et al. 2012; Zhang et al. 2022). Typically, for modelling EEG networks multivariate approaches in the class of probabilistic graphical models (Lauritzen 1996; Whittaker 1990) are exploited (Schelter et al. 2006; Qiao et al. 2019; Li and Solea 2018). For modelling undirected EEG networks, Markov Random Fields (MRFs) are regularly used (Liu et al. 2016; Salazar et al. 2021). When the data fulfill the Gaussian assumption, the Gaussian graphical model (GGM) and its extensions have been applied to model undirected networks from EEG data ( Gao et al. 2019; Min et al. 2022; Paz-Linares et al. 2023; Solea and Li 2022).

In this paper, we explore differences in brain connectivity induced by campaign exposures from a multivariate statistical perspective. The proposed approach integrates additive mixed models with GGMs to infer EEG networks from hierarchically structured multivariate time series data. Following this, to assess campaign efficacy, we examine the structural characteristics of the inferred EEG networks through pairwise comparisons in post-hoc analysis, employing resampling techniques with adjustments for multiple testing.

The paper is organized as follows. In Sect. 2 we discuss the EEG data collection process together with the experimental design. In the same section, we describe the psychometric tools used to investigate the subjective emotional experience, as well as the psychological scales administered to account for internalizing symptoms and personality traits of the participants in the analysis. The proposed statistical framework for modelling the experimental EEG data and the post-hoc tests for comparing the estimated EEG networks are discussed in detail in Sect. 3. The analysis results are presented in Sect. 4. Findings are discussed in Sect. 5.

2 Materials

2.1 Experimental design and data collection

The modelling approach is based on data generated by a collaborative effort between interdisciplinary groupsFootnote 1. The study protocol was approved by the Ethics Committee of Politecnico di Milano (#31/2019).

Four videos, denoted as video 1, video 2, video 3, and video 4, from the campaign “Adotta il futuro” by the Telethon Foundation, an Italian non-profit organization recognized by the Italian Ministry of Education, Universities, and Research for its active involvement in supporting research on rare genetic diseases, were analyzed in this study. We chose Telethon campaign videos as they aim to raise awareness about health issues, particularly rare diseases, and encourage participation. Telethon campaigns exhibit variability in communication style and execution. Video pairs 1 and 2, and video pairs 3 and 4, each featured identical images and background music. Videos 2 and 4 utilized internal narrators, specifically a father or mother addressing the audience directly and sharing real-life stories of other parents with children afflicted by rare diseases. In contrast, videos 1 and 3 presented similar information on rare diseases and patients’ quality of life through a disembodied “voice” external to the story world.

To compare the videos in terms of elicited emotional and physiological responses, serving as a proxy measure of video effectiveness, we evaluate EEG signals recorded during video viewing. Additionally, within the same protocol, other parts of autonomic nervous system (ANS) activity, such as heart rate variability skin electrical conductivity, and facial expression data, were collected throughout the video administration process. To further investigate the emotional experience provoked by the advertisements and assess anxiety and alexithymia constructs, self-report questionnaires were administered. Furthermore, health literacy regarding rare diseases and Telethon brand awareness were investigated, with preferences for the presented videos.

Experimental sessions were conducted at PHEEL on the Bovisa campus in Milan, from December 6–18, 2019 to February 3–7, 2020. During these sessions, 74 healthy adults aged between 19 and 64 participated in the experimental setup. Participants were randomly assigned to one of four groups, ensuring balance in age and gender distribution. Each group was tasked with watching two video campaigns, distinguished by the images used and the type of narrator. To mitigate habituation and learning effects, participants viewed either the sequence of videos 1 and 4 or 2 and 3, with the presentation order randomized within each sequence. However, for this study, we focused solely on measurements collected during the administration of the first video to prevent potential carry-over learning effects associated with evaluating the second video. Consequently, the analysis considered four independent groups of participants, corresponding to the four videos. Specifically, 14 participants viewed video 1 (\(18.9\%\)), 21 viewed video 2 (\(28.4\%\)), 16 viewed video 3 (\(21.6\%\)), and 23 viewed video 4 (\(31.1\%\)). Participant characteristics are summarized in Table 1.

Table 1 Demographic information for the Telethon data

2.2 Measuring subjective emotional experience

Emotional experiences were assessed using different psychometric scales, based on either discrete (Ekman 1992; Panksepp 1982, 2004) or dimensional (Russell 1980; Russell and Barrett 1999; Russell and Bullock 1985) theories of emotions. Following the video presentations, participants were asked to rate the intensity of six basic emotions–happiness, anger, fear, sadness, disgust, and surprise—on a 7-point Likert scale. Afterward, participants completed the Italian version of the differential emotion scale (DES, Battacchi et al. 1993; Izard et al. 1974; Schaefer et al. 2010), consisting of 16 items, each comprising a triplet of adjectives (e.g., “interested, concentrated, alert”). For each item, participants rated on a 7-point scale (ranging from 1=“not at all” to 7=“very intense”) the extent to which they experienced the described emotional state while watching the advertisements. Additionally, the Self-Assessment Manikin (SAM; Bradley and Lang, 1994) was utilized to gauge the valence and arousal induced by the videos, employing a 9-point pictorial scale.

Concerning the DES items, nearly all videos were considered engaging, with the item “interested, concentrated, alert” receiving the highest ratings. Participants also reported feeling “moved”, “sad, downhearted, blue”, and “loving, affectionate, friendly”. Notably, only participants who watched video 1 rated sadness lower than love. Nevertheless, there were no significant differences among the four videos in the scores obtained from the DES and SAM scales, indicating that, subjectively, the advertisements evoked similar emotional responses (Kruskal–Wallis test). The most prevalent emotions represented across all groups were interested, moved, sad, and loving, as illustrated in Fig. 1.

Fig. 1
figure 1

Percentage distribution of the ratings for the four most relevant DES items within each video. For each bar of the Likert plots, on the left side the cumulative percentage of ratings below or equal to 3 is reported, in the middle the percentage of ratings for the neutral category 4 “Neither little nor intensely” is displayed, while on the right the cumulative percentage of ratings higher than or equal to 5 is shown

2.3 EEG: measuring emotional activation

EEG signals measure small electrical oscillations of just a few microvolts spontaneously generated in the brain cortex, reflecting various brain conditions. Oscillatory activity within different frequency ranges has been associated with diverse processes in the brain. Commonly utilized frequency ranges include \(\varvec{\delta }\) (0.1–4 Hz), \(\varvec{\theta }\) (4–8 Hz), \(\varvec{\alpha }\) (8–13 Hz), \(\varvec{\beta }\) (13–36 Hz), and \(\varvec{\gamma }\) (above 36 Hz). In this study, we specifically focused on \(\varvec{\alpha }\) (8–13 Hz) brain waves, as they have been correlated with affective and cognitive attention. The brain activity of each participant was recorded using the BE Plus LTM system, comprising 25 electrodes spatially distributed across the participant’s scalp according to the standard 10–20 configuration ( Klem et al. 1999).

The EEG activity was collected at a sampling rate of 128 Hz, and the impedance level at the skin interface was maintained below 5 \(\text{k}\Omega\) for all electrodes. EEG traces were automatically segmented to separate EEG activity recorded during the baseline period from that acquired during the observation of the videos. Pre-processing of the recorded EEG data included band-pass filtering (1–35 Hz), manual rejection of excessively corrupted epochs, and the execution of independent component analysis (ICA) to detect and remove components related to eye and muscular artifacts.

Furthermore, the EEG signal, referenced to a specific electrode positioned on the pre-cabled cap between CPz and Pz positions, was re-referenced using the common average reference (CAR) method. This re-referencing technique aims to minimize uncorrelated sources of signal and noise through averaging while eliminating noise sources common to all sites.

The signal was decomposed into typical frequency bands. The individual alpha frequency (IAF) was identified for each subject during the baseline period, and the alpha rhythm was extracted using an FIR digital filter centered on the IAF with a bandwidth of 4 Hz. This filtering process was applied with zero phase, Kaiser windowing, and an order of 800. To quantify alpha power, the squared single samples of the filtered signal were computed, and the resulting series was averaged over non-overlapping time windows of 0.5 s. This approach allowed for characterizing alpha power dynamics across the EEG recordings.

2.4 Psychological scales

Validated self-report questionnaires were administered to assess anxiety, which has been linked with EEG activity (Blackhart et al. 2006), and alexithymia, characterized by difficulty in identifying and describing emotions. At the outset of the experiment, participants completed the State-Trait Anxiety Inventory (STAY-Y Pedrabissi and Santinello 1989; Speilberger et al. 1983), which comprises 40 items in total, with 20 items measuring trait anxiety (STAI-1) and 20 measuring state anxiety (STAI-2). Responses were rated on a 4-point Likert-type scale ranging from 1=“almost never” to 4=“almost always”. Each scale yielded a total score ranging from 20 to 80, with higher scores indicating elevated anxiety levels. A cut-off point of 40 is commonly employed to indicate anxiety symptoms‘(Barisone et al. 2004).

Additionally, participants completed the Toronto Alexithymia Scale (TAS-20 Bagby et al. 1994; Bressi et al. 1996), consisting of 20 items rated on a 5-point Likert scale ranging from 1=“strongly disagree” to 5=“strongly agree”. The TAS-20 measures three factors: difficulty identifying feelings, describing feelings to others, and externally oriented thinking.

Median scores on the TAS, STAI, and STAI-Y scales for each group are displayed in Table 1. TAS-20 scores were below the cut-off value (TAS\(\ge 61\)) for the presence of alexithymia, and STAI scores were below the threshold (STAI\(\ge 41\)) for anxiety symptoms, except for the trait anxiety score of participants who viewed videos 2 and 3. For these participants, the median STAI score was 42, indicating mild anxiety.

Participants randomized to watch the four videos obtained significantly different scores in total TAS (Kruskal–Wallis \(\chi ^2=8.6227\), \(p= 0.03475\)). Specifically, post-hoc analysis revealed a significant difference in the scores obtained by participants assigned to watch videos 1 and 2 (Dunn’s post-hoc comparison test with a Bonferroni correction for multiple testing, \(p_{Bonf}=0.0213\)).

3 Methods

This study proposes to model EEG networks with undirected edges by assuming symmetric relationships between the electrodes. Undirected graphical models, also known as Markov random fields (MRFs), represent families of distributions that adhere to a set of conditional independence relationships among the random variables, depicted by an undirected graph \(\mathcal {G}\). A graph is defined as a pair \(\mathcal {G}=\left( \mathcal {V}, \mathcal {E} \right)\), where \(\mathcal {V}=\{1,\dots ,p\}\) denotes the set of vertices or nodes, and \(\mathcal {E}\subseteq \mathcal {V}\times \mathcal {V}\) represents the set of edges. In this context, we adopt a Gaussian MRF, called Gaussian graphical model (GGM).

The GGM is a multivariate approach that entails a Gaussian random vector with additional conditional independencies, or Markov properties. Let \(\mathbf {Y_\mathcal {V}} = \left( Y_1,\dots , Y_p\right) ^\intercal\) denote a vector of random variables indexed by the set of vertices associated with electrodes in the study, where each \(Y_j, j\in \mathcal {V}\), represents a vector with n observations

Moreover, assume that the random vector \(\mathbf {Y_\mathcal {V}}\) is generated from a p-variate Gaussian distribution,

$$\begin{aligned} \mathbf {Y_\mathcal {V}}\sim \mathcal {N}_p \left( \varvec{\mu }, \varvec{\Sigma }\right) , \end{aligned}$$
(1)

with a \(p\times 1\) mean vector \(\varvec{\mu }= (\mu _1, \dots , \mu _p)^\intercal\) and a symmetric \(p \times p\) covariance matrix \(\varvec{\Sigma }\) with inverse \(\varvec{\Theta }=\varvec{\Sigma }^{-1}\), which is known as precision or concentration matrix. The data \(\mathbf {Y_\mathcal {V}}\) are typically centred before the analysis, and the mean vector \(\varvec{\mu }\) is fixed to zero.

In a GGM, the sparsity pattern in the precision matrix \(\varvec{\Theta }\) corresponds to conditional independence statements represented by the graph \(\mathcal {G}\). This arises from the pairwise Markov property, which states that two distinct vertices \(i,j \in \mathcal {V}\) in the graph with \(i\ne j\), are conditionally independent given the rest,

$$Y_i \perp \!\!\!\!\perp Y_j \mid \mathbf {Y_{\mathcal {V}\backslash \left\{ i,j\right\} }} \ \Leftrightarrow \ \omega _{ij}=-\frac{\theta _{ij}}{\sqrt{\theta _{ii} \theta _{jj}}} =0,$$

where \(\theta _{ij}\) denotes the \(\left( i,j\right)\)-th element of the precision matrix \(\varvec{\Theta }\). Thus, estimating the structure of a GGM involves a covariance selection task (Dempster 1972), where one estimates the sparse inverse covariance matrix of a multivariate Gaussian distribution, with non-zero elements associated with edges in \(\mathcal {G}\). Estimating \(\varvec{\Theta }\) under any framework that induces sparsity in the parameter space such as the graphical LASSO (gLASSO, Friedman et al. 2007), or any other regularized neighborhood selection method (Meinshausen and Bühlmann 2006) yields a GGM representation.

The traditional GGM parametrization in Eq. (1) assumes that all observations are generated independently from a common multivariate Gaussian distribution. However, in more complex settings, such as hierarchical longitudinal data, the traditional GGM may not adequately capture the various sources of heterogeneity and multiple dependencies inherent in real data. To address this limitation and accommodate the heterogeneity present in multivariate time series data from individuals nested within different groups, we propose a hierarchical GGM approach (HGGM). This approach allows for the specification of a distinct network structure for each group of participants, thereby enabling more flexible data modelling.

3.1 Hierarchical Gaussian graphical model

Consider an EEG study involving n participants indexed by \(i\in \{1,\dots ,n\}\), who are randomized into V different groups, denoted by \(v\in \{1,\dots ,V\}\), where \(n=\sum _{v=1}^V n_v\). Each participant from group v is assigned to watch a distinct fundraising video campaign. During the video presentation, the brain activity of each participant i from group v is measured simultaneously at each time point \(t\in \{1,\dots , T\}\) from p electrodes spatially distributed across the scalp, indexed by \(j\in \mathcal {V}\), where \(\mathcal {V}=\{1,\dots ,p\}\) represents the vertex set of a graph \(\mathcal {G}\).

Let \(\textbf{y}=\{y_{vijt}\}\) represent the data from the study, with the EEG activity measured on person i in group v, electrode j, and time t. The vector of measurements \(\textbf{y}_{vit}=(y_{vi1t},\dots ,y_{vipt})^\intercal\) for person i in group v at time t is p-dimensional and is modelled as,

$$\begin{aligned} \begin{aligned} \textbf{y}_{vit}\mid \textbf{Q}_i&\sim \mathcal {N}_p\left( \varvec{\mu }_{vit}, \mathbf {\Sigma }_v \right) \\ \varvec{\mu }_{vit}&= \textbf{a}_v +\textbf{B}\textbf{x}_{it} + \textbf{f}\left( \textbf{w}_{it}\right) + \textbf{Q}_i\textbf{z}_{it} \\ \textbf{Q}_i&\sim \mathcal{M}\mathcal{N}_{p\times l}\left( \textbf{0}_{p\times l},\textbf{I}_{p\times p},\mathbf {\Sigma }_Q\right) \\ \varvec{\epsilon }_{vit}&\sim \mathcal {N}_{p}\left( \textbf{0},\mathbf {\Sigma }_v\right) \end{aligned} \end{aligned}$$
(2)

where \(\varvec{\mu }_{vit}=(\mu _{vi1t},\dots ,\mu _{vipt})^\intercal\) is a p-variate vector of person and video-specific means, and \(\varvec{\Sigma }_v\) is a \(p\times p\) video specific-covariance matrix with inverse \(\varvec{\Theta }_v=\varvec{\Sigma }_v^{-1}\). The vectors \(\textbf{x}_{it}\), \(\textbf{w}_{it}\), \(\textbf{z}_{it}\) are of dimensions m, d, and l respectively corresponding to the t-th row of known model matrices for the fixed, smooth, and random covariates from person i. Moreover, \(\textbf{a}_v\) is a vector of video-specific intercepts, \(\textbf{B}\) is a \(p\times m\) matrix for the fixed effects, \(\textbf{f}(\cdot )=(f_1(\cdot ),\dots ,f_p(\cdot ))^\intercal\) is a p-dimensional vector of arbitrary smooth functions, and \(\textbf{Q}_i\) is a \(p\times l\) matrix with the person—specific random effects. The random effects follow a matrix normal distribution, with independent rows and column covariance \(\mathbf {\Sigma }_Q=\text {bdiag} \left( \mathbf {\Sigma }_1(Q), \dots , \mathbf {\Sigma }_p(Q)\right)\) a block diagonal matrix with \(l\times l\) dimensional blocks \(\mathbf {\Sigma }_j(Q)\) on its diagonal. By including only random intercepts in the model, then \(\textbf{z}_{it}={z}_{it}=1\), \(l=1\), \(\textbf{Q}_i=\textbf{q}_i=({q}_{i1},\dots ,{q}_{ip})^\intercal\) becomes a \(p\times 1\) vector of random effects, and \(\mathbf {\Sigma }_Q\) is reduced to a large diagonal matrix parametrized by a p-dimensional vector of variance parameters, i.e., \(\mathbf {\Sigma }_Q=\text {diag}\left( {\sigma }_1^2(Q), \dots , {\sigma }^2_p(Q)\right)\).

3.2 Estimation procedure

The parameters of the hierarchical GGM described in Eq. (2) are estimated by a two-stage procedure. In the first step the p-dimensional person specific means \(\varvec{\mu }_{vit}=(\mu _{vi1t}, \dots , \mu _{vipt})^\intercal\) are estimated from their univariate counterparts. Then, the residuals from the first step model are modelled in a second step to estimate the video-specific inverse covariance matrix \(\varvec{\Theta }_v\), which encodes the graph structure.

3.2.1 First step: univariate estimation of the person-specific means

The person-specific mean vector \(\mathbf {\mu }_{vit}\) is estimated through a univariate electrode-by-electrode estimation procedure. This involves independently estimating each row of the matrices \(\textbf{B}\) and \(\textbf{Q}_i\), along with each component of the smooth functional vector \(\textbf{f}(\cdot )\). To accomplish this, we assume that the matrix \(\mathbf {\Sigma }\) from Eq. (2) is a diagonal matrix with p variances \(\sigma ^2_{j}\) on its diagonal. Thus, in the first step, the mean from each person i, campaign v, and electrode j, is modelled by a univariate additive mixed model with random intercepts. That is,

$$\begin{aligned} \begin{aligned} y_{vijt}\mid {q}_{ij}&\sim {N}\left( \mu _{vijt}, \sigma ^2_{j} \right) \\ \mu _{vijt}&= a_{vj} + \textbf{x}_{it}^\intercal \varvec{\beta }_j + {z}_{it}^\intercal {q}_{ij} + f_j\left( \textbf{w}_{it}^\intercal \right) \\ {q}_{ij}&\sim {N}\left( {0},\sigma _j^2(Q)\right) \\ {\epsilon }_{vijt}&\sim {N}\left( {0},\sigma _j^2\right) \end{aligned} \end{aligned}$$
(3)

where \({\textbf{x}^\intercal _{it}}\), \(\textbf{z}^\intercal _{it}\), and \({\textbf{w}^\intercal _{it}}\) are the same as in (2), \(\varvec{\beta }_j\) is an \(m\times 1\) vector of fixed effects corresponding to the \(j^{th}\) row of \(\textbf{B}\), and \(q_{ij}\) is the j-th element of the \(p\times 1\) vector \(\textbf{q}_i\). Estimating the group and time-varying person specific mean for electrode j, \(\hat{\mu }_{vijt}\) thus implies estimating the fixed effect parameters \({\varvec{\beta }}_j\), the thin plate regression spline function \(f_j(\cdot )\) (Wood 2003), the random effect \({q}_{ij}\) together with its variance \(\sigma ^2_j(Q)\), and the error variance \(\sigma ^2_j\). In practice, the parameters of the additive mixed model described in (3) can be estimated using a Bayesian framework implemented in the function gam(), which is available in the R package mgcv (Wood 2017).

3.2.2 Second step: regularized neighborhood selection

In the second step, we consider the residuals from the first-step model in Eq.  (3). Let \(\epsilon _{vijt}=y_{vijt} - \hat{\mu }_{vijt}\) be the residual from person i in group v, electrode j, and time t. Let \(\varvec{\epsilon }_v=\left( {\varvec{\epsilon }_{v1}},\dots , {\varvec{\epsilon }_{vp}}\right) ^\intercal\) collect the residual observations from all participants in group v. The \(\varvec{\epsilon }_v\) is assumed to follow a p-variate normal distribution,

$$\begin{aligned} \varvec{\epsilon }_v \sim \mathcal {N}_p\left( \textbf{0},\mathbf {\Theta }_v^{-1}\right) \end{aligned}$$
(4)

where the video-specific mean vector common to all individuals in the group v, \(\varvec{\mu }_v\), is fixed to zero, and \(\varvec{\Theta }_v\) is the inverse of the covariance matrix \(\varvec{\Sigma }_v\). The precision matrix \(\varvec{\Theta }_v\) associated with the underlying graph \(\mathcal {G}\) is assumed to be sparse encoding all the conditional independencies between the electrodes.

For estimating the precision matrix \(\varvec{\Theta }_v\) we employ the gLASSO (Friedman et al. 2007) algorithm, which solves the following minimization problem:

$$\begin{aligned} \min _{\varvec{\Theta }_v \in \mathbb {S}^p_{++}} - \log \det (\varvec{\Theta }_v) + \textrm{trace}(\textbf{S}_v\varvec{\Theta }_v) + \lambda \Vert \varvec{\Theta }_v\Vert _{1}, \end{aligned}$$
(5)

where \(\mathbb {S}^p_{++}\) is the cone of symmetric positive definite matrices of size \(p\times p\), \(\textrm{trace}(\cdot )\) corresponds to the trace operator of a matrix, \(\textbf{S}_v\) is the \(p\times p\) sample covariance matrix of \(\varvec{\epsilon }_v\), \(\lambda\) is a positive regularization parameter that shrinks the off-diagonal elements of \(\varvec{\Theta }_v\) towards zero, and \(\Vert \cdot \Vert _{1}\) is the sum of absolute values of the off-diagonal elements of \(\varvec{\Theta }_v\).

The optimal regularization parameter \(\lambda\) is determined by minimizing information criteria. In this work, we consider the extended Bayesian Information Criterion (EBIC, Foygel and Drton 2010) for GGMs, which is defined as,

$$\begin{aligned} EBIC(\lambda ) = -2\ell _{N_v}\left( \widehat{\mathbf {\Theta }}_v(\lambda )\right) +\widehat{\mathcal {E}}_v(\lambda ) \log (N_v) + 2\widehat{\mathcal {E}}_v(\lambda ) \log (p) \end{aligned}$$
(6)

where \({\mathcal {E}}_v(\lambda )\) is equal to the edge set associated with the estimate of \(\varvec{\Theta }_v\) for a particular \(\lambda\), \(N_v=n_vT\) is the number of observations, \(\ell _{N_v}\left( \widehat{\varvec{\Theta }}_v(\lambda )\right)\) denotes the value of the log-likelihood function for the model associated with the parameter \(\lambda\), and p is the cardinality of the vertex set.

3.3 Post-hoc network comparison

The ultimate goal of this study is to compare networks estimated by the hierarchical GGM presented in Sect. 3.1. Since the normality assumption underlying standard statistical tests cannot be justified in penalized regression settings (Pötscher and Leeb 2009), Van Borkulo et al. (2022) proposed a nonparametric permutation-based test called the network comparison test (NCT). The NCT is implemented in the R package NetworkComparisonTest (van Borkulo 2015). NCT is designed for comparing undirected graphs on three structural aspects, and its algorithm consists of three steps.

  1. 1.

    Network estimation In the first step, it estimates the network structure for each group and subsequently calculates the test statistic. Network estimation can be performed using various graph estimation algorithms (here we use the gLASSO).

  2. 2.

    Reference distribution In the second step, the algorithm creates the reference distribution of the test statistic by repeatedly sampling the observed data based on the original sample size, re-estimating the network structure in each repetition, and recalculating the test statistic.

  3. 3.

    Significance evaluation In the last step, NCT evaluates the significance of the test statistic from the first step by comparing it to the reference distribution created in the second step.

The NCT approach can test the difference between networks both at the global and local levels. Specifically, the tests that can be performed involve hypotheses regarding:

  1. 1.

    Invariant network structure (global level).

  2. 2.

    Invariant connectivity strength (global level).

  3. 3.

    Invariant edge strength (local level).

In this study, we focus on global network differences, which means that we restrict ourselves to connectivity tests that involve the hypothesis of invariant connectivity strength (2.).

Let \(\mathcal {G}_1\) and \(\mathcal {G}_2\) be the undirected graphs from two GGMs, each consisting of the same p variables. These graphs (networks) are characterized by the \(p\times p\) symmetric precision matrices \(\varvec{\Theta }_1\) and \(\varvec{\Theta }_2\) respectively. These matrices consist of elements \(\theta ^k_{ij}\), \(k\in \{1,2\}\) with the regression weight between nodes i and j, where \(i,j\in \mathcal {V}\) and \(i\ne j\). When there is no edge connecting two nodes i and j, then these nodes are conditionally independent and \(\theta ^k_{ij}=0\).

The test on invariant connectivity strength tests the following hypothesis:

  • \(H_0:\) \(\sum _{i=1}^p\sum _{j>i} \left|\theta _{ij}^1 \right|= \sum _{i=1}^p\sum _{j>i} \left|\theta _{ij}^2 \right|\) (the global connectivity strength is equal),

  • \(H_1:\) \(\sum _{i=1}^p\sum _{j>i} \left|\theta _{ij}^1 \right|\ne \sum _{i=1}^p\sum _{j>i} \left|\theta _{ij}^2 \right|\) (the global connectivity strength is not equal),

using the test statistic:

$$\begin{aligned} S\left( \theta _{ij}^1-\theta _{ij}^2\right) =\left|\sum _{i=1}^p\sum _{j>i} \left|\theta _{ij}^1\right|-\left|\theta _{ij}^2\right|\right|. \end{aligned}$$

The resulting p-value of the test is the proportion of samples that have a test statistic larger than that of the observed data.

4 Results

Initially, participant-specific means are modelled using fixed effects such as a first-order autoregressive effect and baseline electrode activity. These fixed effects also include participants’ demographic and psychological characteristics, such as age, gender, TAS, and STAI scores. The functional part of the model captures any time variation in average electrode activity at the population level. Finally, the random part of the model accounts for person-specific variability with a normally distributed random intercept. In the second step, we fit a GGM to the residuals obtained from the first-level mean model, specifically focusing on video-specific residuals.

4.1 Estimated networks and network descriptive statistics

Figure 2 shows the estimated networks for each video. The networks are densely connected, and thus hard to describe by visual inspection. However, overall network connectivity may be summarized by computing network density, a global network statistic indicating the proportion of present over possible edges in the network. It takes values between 0 and 1 and the higher the density the more connected is the network. The most densely connected network corresponds to video 2 (\(\hbox {density}_2 = 0.563\)) followed by video 3 (\(\hbox {density}_3 = 0.55\)). The networks that correspond to videos 1 and 4 are less dense, with \(\hbox {density}_1 = 0.547\) and \(\hbox {density}_4 = 0.467\) respectively.

Fig. 2
figure 2

Undirected networks for each group in the study, with their corresponding network density on the bottom left corner of each figure. Grey edges connect electrodes from different groups, while colored edges connect electrodes of the same group. For the sake of interpretability edges with absolute weight lower than 0.05 are not shown in the graphs (colour figure online)

Local network descriptive statistics at the electrode (node) level were utilized to characterize the estimated network structures. Specifically, degree, closeness, and betweenness centralities were examined. Each centrality index captures different aspects of network connectivity. Degree centrality, the simplest measure to compute, indicates the number of edges connected to a node. Closeness (Bavelas 1950) and betweenness (Freeman 1977) centralities are based on the concept of shortest path and have been widely used to characterize EEG networks (van Straaten and Stam 2013; Ferri et al. 2007). The standardized centrality indices are visualized in Fig. 3. The dark orange circle represents the center of the centrality distribution. Centrality values higher or lower than the dark orange circle indicate more or less central electrodes in the network. Electrode names highlighted in blue are situated on the midline of the scalp, those in green correspond to electrodes in the right hemisphere, and those in red are on the left hemisphere of the scalp.

Fig. 3
figure 3

Standardized node centrality indices for the estimated networks. The orange circle corresponds to zero. Electrodes located at the right hemisphere of the scalp are colored green, electrodes located at the left hemisphere of the scalp are colored red, and electrodes located at the center of the scalp are colored blue. Degree centrality is shown in black, closeness centrality in yellow, and betweenness centrality in blue (colour figure online)

Figure 3 reveals similar patterns across all centrality indices. Examining the degree centrality, we observe that in the video 1 network, 10 electrodes exhibit centrality higher than the average. The most central electrode is F4, followed by F8, T3, and T5 with the same degree. Among these 10 central electrodes, three (\(30\%\)) are situated in the right hemisphere of the brain, one (\(10\%\)) is at the midline, and six (\(60\%\)) are in the left hemisphere. Additionally, six central electrodes (\(60\%\)) are positioned in the frontal part of the brain.

In the video 2 EEG network, 13 electrodes demonstrate centrality above the average. The most central electrode is F6, followed by F8 and T5. Among these 13 central electrodes, six (\(46.15\%\)) are located in the right hemisphere, one (\(7.69\%\)) is at the midline, and six (\(46.15\%\)) are in the left hemisphere. Moreover, six out of the 13 central electrodes (\(46.15\%\)) are situated in the frontal part of the brain.

In the video 3 EEG network, 11 electrodes exhibit higher than average degree centrality. The most central electrodes are F7 and T6. Among these 11 central electrodes, four (\(36.36\%\)) are positioned in the right hemisphere, two (\(18.18\%\)) are at the midline, and five (\(45.45\%\)) are in the left hemisphere. Additionally, seven electrodes (\(63.63\%\)) out of the 11 central electrodes are located in the frontal part of the brain.

In the video 4 network, 11 electrodes are central higher than average. The most central electrodes in this network are F8, followed by T3 and T4. Among these 11 central electrodes, five (\(45.45\%\)) are situated in the right hemisphere, two (\(18.18\%\)) are at the midline, and four (\(36.36\%\)) are in the left hemisphere. Additionally, six of these central electrodes (\(54.54\%\)) are located in the frontal part.

Fig. 4
figure 4

Topographic map of an EEG field as a 2-D circular view (looking down at the top of the head) that visualizes the standardized electrode betweenness centrality for each estimated network. Red color corresponds to high betweenness centrality and blue corresponds to low betweenness centrality. The topoplots linked with videos 1 and 2 are showcased in the upper left and right panels, respectively, while those associated with videos 3 and 4 are displayed in the bottom left and right panels (colour figure online)

Topographic maps generated from EEG data, known as topoplots, offer valuable insights into brain activity distribution across different regions. These maps visualize the activation of various brain areas, allowing researchers to identify patterns associated with distinct cognitive functions. Given the study’s objective of identifying more engaging campaigns, attention is typically directed towards the frontal part of the brain, which plays a crucial role in attentional processes.

By plotting topoplots focusing on the frontal region, researchers can observe the distribution of centrality indices or other relevant metrics across electrodes. This visualization helps in understanding how different stimuli or conditions affect attention-related brain activity. Areas with higher centrality or activity levels may indicate increased engagement or processing of information related to the stimulus, suggesting stronger attentional involvement.

Fig. 5
figure 5

Topographic map of an EEG field as a 2-D circular view (looking down at the top of the head) that visualizes the standardized electrode degree centrality for each estimated network. Red colour corresponds to a high degree centrality and blue corresponds to a low degree centrality. Topoplots associated with video 1 and 2 are shown respectively in the upper left and right panels, while maps associated with video 3 and 4 are displayed in the bottom left and right panels (colour figure online)

Figures 4 and 5 provide insightful topoplots illustrating centrality patterns across different brain regions for each video campaign. These topoplots are constructed based on betweenness and degree centralities, offering a comprehensive view of how different videos elicit distinct neural responses.

In both figures, regions with high centrality values, indicating greater activation, are depicted in red, while regions with lower activation are shown in blue. Consistent centrality patterns are observed for each video, highlighting the unique neural signatures associated with them.

Looking at video 1, (in the upper left part of both figures), it’s evident that this video triggers widespread activation across the brain, particularly in the frontal region associated with attentional processes. Video 2, depicted in the upper right of the figures, exhibits significant activation in the frontal region, albeit in a more localized area than video 1.

On the other hand, videos 3 and 4, represented in the bottom left and right panels respectively, show distinct centrality patterns. While both videos elicit high activation, especially in the brain hemispheres, they demonstrate less pronounced activation in the frontal region compared to videos 1 and 2.

These findings indicate that different videos result in networks with different structural characteristics, brain activation intensity, and spatial distribution across regions. This enhances our understanding of how diverse stimuli impact cognitive processes, especially attention, crucial for video advertising effectiveness.

4.2 Network comparisons

The NCT approach was used to compare the estimated networks, focusing primarily on identifying differences in overall connectivity strength. By conducting such analyses, we can discern whether certain videos elicit distinct neural responses compared to others, shedding light on the effectiveness of video content in engaging specific brain regions and cognitive processes.

Figure 6 displays the NCT results on network connectivity strength, providing insights into the pairwise comparisons between the estimated networks for each video campaign. The plots in the panel illustrate the p-values obtained from the pairwise network comparisons. Notably, comparisons involving the network from the second video campaign exhibit p-values below the significance threshold (\(\alpha =0.05\)). Specifically, significant connectivity differences were observed between:

  1. 1.

    The first and second campaign (\(S_{12} = 0.9204204\), \(p = 0.0338\)).

  2. 2.

    The second and third campaign (\(S_{23} = 1.400741\), \(p = 0.0046\)).

  3. 3.

    The second and fourth campaign (\(S_{24} = 1.761043\), \(p = 0.0192\)).

These findings suggest that the second video campaign induces distinct neural connectivity patterns compared to the other campaigns. However, no significant connectivity differences were detected between the other campaign pairs.

Fig. 6
figure 6

Results of the test on network connectivity. Each plot corresponds to a pairwise video comparison and shows the distribution of the test statistic. The red diamond-shaped point corresponds to the value of the observed test statistic. The resulting p-value of the test is shown in blue (colour figure online)

5 Discussion

This study introduces a comprehensive methodological framework for comparing alternative fundraising campaigns and assessing the impact of corresponding videos on cognitive functions. Focusing on four fundraising campaigns, EEG data were modelled using undirected graphical models. Specifically, hierarchical GGMs were employed to capture the heterogeneity in multivariate time series. The structures of the resulting networks were compared using nonparametric testing procedures, under the assumption that more effective campaigns would lead to higher attention and denser networks (Orquin and Wedel 2020; Venkatraman et al. 2015).

The results revealed that the second fundraising campaign yielded the most densely connected network. Moreover, the network comparison indicated significantly higher connectivity for the second fundraising campaign compared to the others. Thus, under the assumption that more effective campaigns would result in more connected EEG networks, the second fundraising campaign emerged as the most effective among the four. Analysis of participants’ preferences suggested that videos featuring an internal narrator (second and fourth videos) were more likely (\(66.2\%\)) to enhance donation compared to others. This finding aligns with the network analysis results from EEG data, suggesting that characters or actors speaking directly to the audience convey messages more effectively.

This study introduces a novel data-driven approach to evaluating fundraising campaign effectiveness, which can be extended to analyze other affective stimuli. Integrating both self-reported emotional experiences and objective physiological activation within analogous study structures can yield valuable insights. Such an approach allows for a comprehensive analysis, enabling the identification of stylistic elements that contribute to the effectiveness and engagement of campaigns. By combining subjective perceptions with objective physiological responses, researchers can gain a deeper understanding of the emotional experiences and reactions elicited by fundraising campaigns. It is worth noting that the nature of this study primarily leans towards an exploratory perspective. This exploratory approach is instrumental in fostering a holistic understanding of emotional experiences and reactions to fundraising campaigns.

Importantly, the proposed approach eliminates the need to rely on composite indicators, as conclusions are based on the entire connectivity structure derived directly from raw EEG data. Furthermore, this method allows for a straightforward graphical representation of the conditional dependence structure, providing valuable insights into brain activation in specific areas.

Studies encompassing a larger sample size and addressing possible sources of confounding could improve the generalizability of our findings. For example, examining socio-demographic factors can reveal how variables like age, gender, education, and socioeconomic status influence emotional processing and subsequent behavior. Understanding the interplay between psychological factors and emotional processing is crucial, as it sheds light on how individuals perceive and respond to stimuli.

Other possible educational features that affect the perception of health-related topics should be considered. In particular, differences in health literacy levels can impact how individuals interpret and act upon emotional messages. Additionally, familiarity with both the advertisement and the topic can influence habituation and learning effects, thus affecting emotional responses. Analyzing the consistency between self-reported intentions and actual behavior is vital for understanding the efficacy of emotional stimuli. Considering potential biases like social desirability can provide valuable insights into the reliability of self-reported data. Moreover, when starting the experimental session, an assessment was conducted on alexithymia and anxiety traits, both involved in emotion processing. However, as we have emphasized the role of attention in campaign effectiveness, it could be interesting to assess and account for attention abilities using psychometric tools.

Overall, integrating these elements into future research can provide a comprehensive understanding of how individual characteristics shape affective responses and subsequent behaviors, offering valuable implications for various fields such as marketing, psychology, and public health.