1 Introduction

The global incidence of anxiety, depression, multiple sclerosis, pathological stress and different stress-related diseases has significantly increased in recent years; this is probably due to dramatic changes in the daily lives of citizens. It has been shown that stress generally has negative consequences in terms of mental health and well-being [1,2,3] and that chronic stress especially increases the risks of certain pathologies, such as cardiovascular diseases, strokes, diabetes and others [4, 5], and can even cause sudden death [6]. According to the World Health Organisation [7], social and medical problems associated with stress are clearly rising and seriously affecting mental health and well-being not only in adults but also in young people and children. Therefore, stress has become one of the significant focal points of research interest [8]. Reliable markers that allow to measure the body’s stress response could be utilized to monitor stress conditions in daily life. This would, in turn, enable an accurate monitoring of patients, prevent illnesses and detect pathological conditions in their early stages. However, to the best of our knowledge, no reliable tool yet exists for measuring levels of stress response.

Stress has been defined by Selye in 1950 as a state of biological activation triggered by the person interacting with external agents that force her or his capacity to adapt [9]. Stress is also defined as a disturbance of homeostatic balance of the individual [10] that provokes a stress response, which is the body’s (or individual’s) attempt to cope with the stressor. Stress can be classified as either acute or chronic [3, 11]. Acute stress is the immediate body’s response to a stressor. Chronic stress, on the other hand, is the state caused by a continuous stressor stimulus over a period of months (e.g. family caregivers of terminal patients) or the state reached when the body cannot to deal with the stressor and does not achieve a homeostatic balance state (e.g. traumatic stress of war veterans).

The response to acute stress is designed to provide the organism with the requisite alertness, energy, physiological regulation and immunological activation to counterbalance the effects of the stressor in order to reach a new adaptive state and survive [11]. Consequentially, the activation of stress response triggers changes in the autonomous nervous system (sympathetic and parasympathetic balances) and a hormonal response, as well as behavioural changes and a decrease in cognitive skill [3, 11]. Thus, a diversity of body changes is triggered, affecting different possible stress markers, which have been used to assess stress so far.

Psychiatry, psychology and the medical community in general have established, and are using, psychometric questionnaires as the appropriate tools to assess psychological stress [12], measuring mostly behavioural and cognitive skill changes. Among others, one of the most frequently used questionnaires are the subject’s self-reporting assessments (e.g. the Visual Analogue Scale for Stress, VASS). Following a different approach, researchers from biomedical fields are more enthusiastic about using biochemical markers from the hormonal response, such as cortisol, prolactin, α-amylase [13], interleukin-6, copeptin [14] and others [15, 16]. However, even though biochemical samples and psychometric questionnaires constitute well-known stress makers, they cannot be implemented in a continuous stress monitoring method. Nevertheless, they are suitable for specific applications and research studies. The limitations of these biochemical markers include invasiveness and the impossibility of being continuously measured. Additionally, it must be noted that their dynamic response should be considered in order to perform a proper measurement [17]. The relationship between the body’s activation of biochemical markers and the intensity of the stress perceived is both complex and understudied, which also makes the use of biochemical markers disadvantageous. The major drawbacks of the psychometric questionnaires are that they are not designed for frequent applications and they are subjective, depending on the self-perception of the individual about her or his condition.

On the other hand, diverse studies exist that assess stress using mainly markers extracted from biosignals and thus measure the physiological reactions of the stress response on the body (see [12, 18,19,20]). The most widely used biosignals are electrocardiography (ECG) [21, 22] and skin conductance (SC) [21, 23]. Some features extracted from these biosignals, such as the heart rate (HR) and mainly its variations (HRV), are driven by the autonomic nervous system (ANS) and, more precisely, by its sympathetic and parasympathetic components; thus, they constitute good stress markers. Other biosignals are also used such as electroencephalogram [24], skin temperature [25], pulse photoplethysmography (PPG), respiration [21, 22], pupil diameter, electromyography and blood pressure.

Most of these studies employed similar approaches. Firstly, while some physiological signals are measured, a stress state is induced under a real situation [21, 23, 26,27,28] or in a laboratory [22, 24, 29, 30], by, for example, a mental arithmetic [22, 23], the Stroop Test, film fragments or the Trier Social Stress Test [25]. Subsequently, a variety of features are used with either pattern recognition or machine learning techniques that allow the automatic classification of stress states. The applied techniques are also diverse, such as support vector machines [22], fuzzy logic, Bayesian networks, decision trees [23], artificial neural networks [31] and computer aided diagnostic tools [32]. Notwithstanding, most of these studies only allow stress detection (stress/ no stress) [23], and few of them identify three levels of stress [22, 29, 30], i.e. high, medium and low, normally related to the intensity of the stressor. Additionally, the methods that have been employed as the ground truth of the induced stress intensity reference are diverse. Several studies employ standard psychometric questionnaires [22, 24] or subject’s self-report assessments as a reference, regardless its previous mentioned limitations. Other studies employ as a reference the status of the stressor stimulus, i.e. active or not [33], while others use the amount of work load and cognitive demand that is being applied as the stressor agent [21].

Despite the efforts made, a scientifically supported, objective, reliable, repeatable and easily usable measurement method remains unavailable. Indeed, up to now, no agreement among professionals or within the research community exists, neither on a unique stress assessment method nor in a standard stress measurement method.

We propose to use only physiological biomarkers that can be unobtrusively monitored, aiming to obtain a reliable description of the stress response that can be easily monitored in daily life. Therefore, the main objectives of this study are the following: (1) to identify a set of features derived from physiological signals that comprehensively describe the physiological stress response according to established stress markers and (2) to then use them for a reliable quantitative assessment of stress level as a first attempt towards stress monitoring. Although the ultimate goal is to measure acute stress or chronic stress in daily life, a potential stress measurement method should first be validated under controlled stress conditions. Thus, this work contributes to the extant literature by:

  • Designing an experiment to quantify participants’ levels of acute psychological induced stress

  • Selecting a set of stress biomarkers from physiological signals that describe the stress response level in a similar range as do the standard psychological questionnaires and biochemical stress markers

  • Proposing a multimodal approach for stress-levels measurement based on a comprehensive assessment of the physiological response of the individual to a stressor agent

  • Proving that, in these experimental conditions, stress response levels can be unobtrusively and continuously measured from a set of physiological signals, as a viable alternative to the currently used stress assessment methods

2 Methods

The research presented in this paper involves experimental measurements of physiological signals, biochemical variables and psychometric questionnaires in healthy young students from the Autonomous University of Barcelona (UAB), who were under an acute psychological standard stressor.

Varieties of stressor stimuli have been employed in stress research. The Trier Social Stress Test (TSST) has been chosen in this study to induce acute psychological stress because TSST is one of the most widely used and documented protocols in stress research [34,35,36,37], and because it provides a robust and reliable acute stress inducement [38] compared with other stressors.

Two main stress states were induced to the participants, a basal state (BS) and an acute stress state (SS). Participant’s stress response was assessed using a set of stress reference variables (well-established stress markers). Suitable features derived from the physiological signals were selected and compared with a whole set of stress reference variables. Afterwards, these features were used to make an estimation of stress level.

2.1 A multimodal approach to stress measuring

The methodology and analysis of the results presented are focused on objectively quantifying the physiological components of the stress response, which can be understood as the degree of disturbance of homeostasis [11], i.e. how close or far an individual is from the state of homeostatic balance. The response to stress triggers biological changes that prepares the body to deal with a stressor agent, which provoke the homeostatic imbalance that will continue until the achievement of a new state that is adapted to the new situation.

Consequently, the stress level response is determined as a function of the particular reactions that the homeostatic imbalance provokes in the individual. Such stress reactions are dependent not only on the exposure time to stressors (acute or chronic) but also on certain characteristics of the stressor as they are perceived by the individual, such as its intensity, her or his own ability to control it and its predictability [39].

The physiological reactions of the homeostatic imbalance triggered by the stress response can be broadly summarized as follows [35]:

  1. 1)

    Neurohypophysis activation: vasopressin secretion leading to increased blood volume and pressure and increased peripheral blood vessels resistance

  2. 2)

    Hypothalamic-pituitary-adrenal axis (HPA axis) activation: cortisol secretion leading to increased catabolism, anabolism inhibition as well as immune system depression

  3. 3)

    Sympathoadrenal system activation: sympathetic nervous system sensitization

  4. 4)

    Nervous system activation: sympathetic nervous system activation and parasympathetic withdrawal, causing increased heart rate and respiratory frequency, bronchial and pupil dilation, sweating skin, peristalsis inhibition and hyperglycaemia and other symptoms

Thus, a single stress marker cannot globally assess the stress response of an individual, as it is also concluded in [20]. In order to be able to compare and assess the stress level response, a method that considers the different stress response reactions as a whole is requisite. Therefore, a multivariable approach is proposed in this study to assess the individual stress response to a stressor.

2.2 Study design

The study is a quasi-experimental pre-post study without a control group to elucidate the effect of a stressor on psychological, biochemical and physiological stress markers in a unique group of healthy students. The experiment design included a Relax Session, as a control condition, and a Stress Session, performed on different dates, so that participants could be compared to themselves in a basal state and a stress state.

A preparatory relaxation period of 10 min was conducted at the start of each session to achieve a fairly similar baseline state among participants for both sessions. The state reached after the Relax Session was considered as the basal state while the state reached at the end of the Stress Session was considered as the stress state. During both sessions, physiological signals were continuously recorded while standard psychological questionnaires were applied and biochemical markers collected at the end of each session, as shown Fig. 1. The experimental design shares its measurement framework with other studies within the ES3 Project [8], which aims to cover all complexities of stress.

Fig. 1
figure 1

Details of study protocol. The first saliva sample was taken after the baseline stages (BLR, BLS) and the second saliva sample at the end of the Relax and the Stress Sessions. A blood sample was taken and psychometric tests were applied at the end of each session. Physiological signals were measured during both the Relax Session and during the Stress Session stages: story telling (ST), memory test (MT), stress anticipation (SA), video display (VD), and arithmetic task (AT). Stress states reached at the end of the Relax and the Stress Sessions were basal state (BS) and acute stress state (SS), respectively

2.2.1 Participants

Participants were recruited using an advertisement on the UAB website, which was linked to the webpage of the ES3 Project [40]. All interested candidates completed an online form beforehand to ensure that they met the selection criteria for the study. After individually checking to determine whether they fulfilled the inclusion requirements, participants were included in the study consecutively according to when they were enrolled in the study. Appointments were made at random.

The inclusion criteria were to be between 18 and 30 years old and a non-regular consumer of psychotropic substances, alcohol or tobacco. The exclusion criteria were a body mass index higher than 30, any chronic disease or psychopathology and a stress level higher than 70% on a visual analogue scale.

The UAB Ethics Committee approved the study protocol. Participants gave their written informed consent. They were instructed to avoid the use of any psychotropic substance, alcohol or tobacco and to avoid doing any physical exercise for 24 h prior to each session, wake up 2 h before the start of the sessions and consume a light breakfast without coffee or tea.

2.3 Experimental protocol

The Relax and the Stress Sessions were performed on different dates, but at the same hour for each participant. The Relax Session comprised two stages, the baseline stage (BLR) and the relax stage (RS), while the Stress Session had a baseline stage (BLS) and five different stages, inducing acute psychological stress through a standard stressor.

The relaxation was induced by autogenic relaxation guided by an audio recording conforming to Schultz’s method [41] at stages BLR and RS in the Relax Session and BLS in the Stress Session.

Acute psychological stress was generated with a modification of TSST as follows. Instead of the speech, the participants performed a memory test while being video-recorded. Then, the video was shown to her or him in the company of an audience, after a prior stress anticipation period [42]. This modification was introduced to compare the results from the Memory Test with the study performed in [42] as part of the ES3 Project [8].

The stress test included five different stages over 25 min. Each stage had different demanding conditions, inducing different stress states in participants:

  1. 1)

    Story telling stage (ST): Three different short stories are told to the subjects, and they are asked to remember as many details as possible.

  2. 2)

    Memory test stage (MT): The subjects are requested to repeat each story aloud, including as many details as they can in a 30-s period per story. During this stage, the subjects are notified that they will be video-recorded to assess their performance.

  3. 3)

    Stress anticipation stage (SA): The participants are asked to go to an adjacent, empty room under the pretext of they should wait until the group of psychologists assesses their video.

  4. 4)

    Video display stage (VD): The participants return to the previous room, where an unexpected audience of at least three people is waiting for them. The video showing each participant’s storytelling is played in front of the audience. Each video is interwoven with another video, in which an actor of the same gender is telling most of the details of each story, to increase the subject’s arousal.

  5. 5)

    Arithmetic task stage (AT): The subjects are asked to count backwards from 1022 in increments of 13, in less than 5 min. If they make a mistake, they should start again from 1022.

At the end, the subject is required to complete the psychometric questionnaire. Participants were seated the entire time, except at the beginning and the end of the stress anticipation stage, when they walked into the adjacent room.

2.4 Physiological signals: acquisition and processing

Electrocardiogram, pulse photoplethysmography, respiration, skin conductance and face and finger temperature signals were selected due to their relation with stress response, and because they can be continuously and unobtrusively monitored. The main aspects concerning signal selection, processing and feature extraction methods used in this research are listed below. Among all those physiological signals that show changes related to stress [8], the signals that can be unobtrusively monitored and that comprehensively describe the stress response were selected. They are presented in Table 1.

Table 1 Selection of physiological signals

The ECG signal was sampled at 1 kHz, while the other signals were sampled at 250 Hz using the Medicom system, ABP-10 module (Medicom MTD Ltd., Russia). Most recordings of skin conductance had to be discarded in the reported experiment due to a sensor malfunction. Thus, skin conductance values were not considered in any further analyses. Respiration signal was also excluded from this analysis, as it is clearly artefacted during the MT and AT stages when the participant had to talk.

2.4.1 Skin temperature

Variations in face and finger temperature are associated with stress response due to the activation of the vasoregulatory system (i.e. a peripheral vasoconstriction together with a vasodilatation in the core and face) [25]. Both face (Tfa) and finger (Tfi) temperature signals were filtered using a one-dimensional median filter (50 orders), and then values out of the 20–40 °C range were excluded. Subsequently, the following features were computed:

  • Temperature gradient (∆T): temperature variation in sliding windows of 1 min time

  • Temperature power (T_Ptotal): defined as the average power of temperature measured via a rectangle approximation of the integral of the power spectral density in sliding windows of 1 min time

  • Temperature ratio (Tratio): ratio between mean values of finger (Tfa) and face (Tfi) temperatures in sliding windows of 1 min time

2.4.2 Electrocardiogram

The ECG signal was analysed as in [43] to obtain a time series of R to R-peak (RR series). The following features were computed, which are explained in detail in [44]:

  • Time features: From the fixed RR series, mean heart rate (HRmean), standard deviation of all RR intervals (RRSDNN) and root mean square of successive differences of RR (RRrMSSD) were computed.

  • Frequency features: low frequency (LF) power (PLF) and high frequency (HF) power (PHF), the LF/HF ratio and power in the very low frequency (PVLF) band

2.4.3 Pulse Photoplethysmogram

A filter was applied to the PPG signal, as well as an artefact detector as described in [45]. Fiducial points in pulse waves were automatically determined from each pulse beat by an implemented algorithm based on [46]. The detected points for the ith pulse in PPG signal are seen in Fig. 2.

Fig. 2
figure 2

Detection points in ECG and PPG signals. Each absolute maximum of the PPG signal between two successive QRS complexes detected in the ECG signal is considered an apex point nAi of a PPG pulse. nBi: the basal point is the absolute minimum of the PPG signal between two successive QRS complexes in the ECG signal. nMi: the middle point is the instant in time when the PPG pulse reaches half of its apex-to-basal amplitude. nARi: the apex point of the reflected wave is the maximum reflected pulse wave. nNi: the dicrotic notch point is the inflection point between the pulse wave and the reflected wave

Pulse period (PPi) is the time interval between two consecutives nAi, and the pulse rate (PRi) is the inverse of the PPi series. A subsequent analysis to obtain pulse period features was equal to the aforementioned HR time features from the ECG signal (PRmean, PPrMSSD, PPSDNN). Furthermore, additional time-based parameters were calculated (see Fig. 2), and their median values and standard deviations in sliding windows of 1 min time were assessed:

  • Pulse transit time (PTT): The PTT is inversely proportional to pulse wave velocity, which is associated with arterial stiffness and cardiovascular output. Therefore, a decrease in PTT is related to an increase in blood vessel resistance and cardiovascular output, as well as being inversely related to blood pressure [47].

  • Pulse wave rising time (PRT): The PRT is proportional to the pulse amplitude and is related to arterial stiffness, blood volume and systolic blood pressure peak [48];

  • Pulse Wave Decreasing Time (PDT): The PDT is an index of diastolic time, which is also highly related to PR.

  • Pulse Width until Reflected Wave (PWr): The PWr is a time index of the reflected pulse wave arrival and is associated with the PRT, the peripheral central vascular resistance, the cardiovascular output and the pulse wave velocity [49].

2.5 Acquisition and analysis of stress reference variables

The stress reference variables were selected from established psychometric questionnaires and biochemical stress markers taking into account a bibliographic review conducted and the criteria of medical doctors and psychologist involved in the ES3 Project, as described in [8]. These variables are used to assess the stress state reached at the end of each session, the basal state (BS) and stress state (SS), respectively.

2.5.1 Psychometric variables

The psychometric questionnaires applied were the Perceived Stress Scale (PSS), the Visual Analogue Scale (VASS) and the State-Trait Anxiety Inventory test (STAI). The PSS measures the degree of overall stress of the individual or the extent to which life situations are appraised as stressful [50]. The VASS records the stress level that is self-perceived by the participant. The STAI evaluates anxiety from two different points of view: (1) as a measure of the subject’s state (STAI-s) at a given time and (2) as the trait (STAI-t) or stable tendency of the individual to respond by increasing her or his level of anxiety in stressful situations [51]. These questionnaires were self-administered at the end of each session.

2.5.2 Biochemical variables

Biochemical markers collected were copeptin and prolactin obtained from blood samples and cortisol and α-amylase obtained from saliva samples. In accordance with the circadian rhythm of cortisol and α-amylase [52], the sessions were scheduled in the morning (09:45–10:45 h), 2 h after waking up. Two saliva samples were taken in each session: one before the stimulus (relaxation or stress) and the other at the end of the session [53] (i.e. 25-min in-between) to obtain cortisol (Δcortisol) and α-amylase (Δα-amylase) variation per session, see Fig. 1.

Cortisol and α-amylase were measured in the endocrinology and radioimmune analysis service of the UAB Neurosciences Institute. An immunoassay was performed to determine cortisol concentration, and an enzyme assay analysed the α-amylase enzyme kinetics (Salimetrics, State College, PA, USA). The analysis of prolactin and copeptin was quantified at the Centre of Biomedical Diagnostic at the Clinic Hospital of Barcelona. All of the samples were processed in the same test to avoid any inter-test variability, achieving intra-test variation coefficients lower than 5% in both cases.

2.6 Stress reference level

The stress reference variables were used as a reference to compute a stress reference scale applying the multivariable approach presented in Section 2.1. The stress references variables were re-scaled to a 0−100 range of arbitrary units. The stress reference level of participant p, in state x (BS or SS), Sref (p,x), is described as a linear combination of n stress reference variables [54], as expressed by the following equation:

$$ {S}_{\mathrm{ref}}\left(p,x\right)=\sum \limits_{j=1}^n{c}_j\cdot {{\mathrm{SRV}}_j}^{\prime}\left(p,x\right)\kern2em ;x\in \left\{\mathrm{BS}\right.,\left.\mathrm{SS}\right\} $$
(1)

where SRVj (p,x) is the re-scaled value of the jth reference variable for the participant p at state x; the coefficient cj represents the contribution of each SRVj to the state’s differentiation. This coefficient was assigned by using a ponderation of the first and second components obtained from the principal component analysis (PCA). Further information is described in [54].

The following analysis is focused on selecting a set of physiological features and a model able to provide similar values to the Sref (p,x) at the two identified states.

2.7 Stress estimation

A subset of the 27 features extracted from the physiological signals was selected according to the experiment conditions using the following criteria: (1) to show no significant differences between BLR and BLS stages’ values, (because equivalent states at the end of both baseline stages were assumed) and (2) to have a correlation coefficient higher than 0.4 with the stress reference level, to ensure that the feature selected and the stress level references share at least 15% of common values (value in which the coefficient of determination is at least 0.15).

Furthermore, in a second step, and to avoid linear dependencies and redundancies among features, Pearson’s correlation between them was computed at each stage. A correlation coefficient greater than ± 0.98 in all stages was considered to group-related features. One feature was selected per group of related features.

A linear relationship between this subset of features and the estimated stress level reference was assumed. Feature values of RS and AT stages were associated to BS and SS states, respectively. Coefficients of this linear function were computed using a linear regression model between Sref values at BS and SS and the values of the selected features at RS and AT. This is an alternative pathway to estimate the stress reference level using physiological signals.

Additionally, through the linear function obtained, stress level state or Sref was calculated for each stage of the experiment, and its values were compared between participants.

2.8 Statistical analysis

The participant’s state at the end of the Relax and the Stress Sessions (BS and SS) was taken as the lower and higher band of moderate the stress state, respectively. Variations in biochemical variables, psychometric questionnaire scores and stress reference levels between both sessions were analysed using the Student’s t test for paired samples as they have resulted in normally distributed data. The data were tested for normality using the Kolmogorov-Smirnov test prior to analysis.

Physiological feature values associated with each stage are the median of the feature values over the whole time stage interval (i.e. ST, MT, VD and AT) and for stages that last more than 5 min (i.e. BLR, BLS, RS and SA). The associated feature values were calculated over the last 5 min of the stage.

The statistical differences between the physiological feature values of two different stages were checked using the Wilcoxon signed rank test (non-parametric test), as they are considered non-normally distributed data. A Friedman’s test (repeated measurements analysis of variance on ranks, non-parametric test) was performed for each feature to determine whether there was any change in the different stages of the values (a p value < 0.05 was considered statistically significant, meaning that there are at least two different median stage values).

The correlation between psychometric, biochemical and proposed stress reference levels with the physiological features was studied through the linear Pearson’s correlation. Sessions were analysed using the Student’s t test for paired samples.

3 Results

The study included 40 healthy young participants. The group constituted a socio-demographic homogeneous data sample, including 23 females and 17 male subjects, with mean age of 21.3 ± 2.8 and a body mass index of 22.1 ± 2.7. For the subsequent analysis, only 38 of the 40 participants were considered, since two participants did not attend the second session.

3.1 Stress reference variables

The results of the psychometric questionnaires values, the biochemical variables and calculated stress reference level are presented in Table 2. Values for the STAI-s and VASS questionnaires were statistically significantly higher in the Stress Session than in the Relax Session, thus reflecting the subjects’ stress states. Results of PSS and STAI-t scales are similar in both sessions, in that they represent individuals’ traits thereby indicating coherence in the experiment. Values of Δcortisol, copeptin and prolactin were significantly different between sessions. In contrast, Δα-amylase did not exhibit significant differences, although median values showed a small increase. This may be because α-amylase concentration has a faster dynamic [55].

Table 2 Inter-subject median and median absolute deviation (MAD) of stress markers

3.2 Stress reference level

Reference variables showing statistically significant differences between BS and SS states were utilized to produce the stress level reference scale. The contribution of each reference variable to the quantitative stress level scale was estimated based on a PCA analysis. By evaluating Eq. (1) with the estimated coefficients of the reference variables, the stress level value (Sref (p,x)) of participant p at state x was calculated as follows:

$$ {S}_{\mathrm{ref}}\left(p,x\right)=0.25\cdot \mathrm{VAS}{\mathrm{S}}^{\prime}\left(p,x\right)+0.25\cdot \mathrm{STA}{\mathrm{I}}^{\prime}\left(p,x\right)+0.16\cdot \mathrm{Cortiso}{\mathrm{l}}^{\prime}\left(p,x\right)+0.16\cdot \mathrm{Prolacti}{\mathrm{n}}^{\prime}\left(p,x\right)+0.16\cdot \mathrm{Copepti}{\mathrm{n}}^{\prime}\left(p,x\right) $$
(2)

3.3 Stress estimation from physiological features

The most relevant features from the skin temperature, ECG and PPG signals that show variations due to the triggered stress response, according to the experimental conditions, were selected as stress markers from the whole set. The median and median absolute deviation (MAD) values of the features at the different stages of the experiment are shown in Table 2, which characterize the individuals’ states at each stage of the experiment. It can be seen how most of the features change from one task to another, and they have significant changes at MT and AT with respect to the other stages. Those features whose median at each stage is not significantly different from its median at RS are highlighted in Table 2. As a result of the Friedman test, each of the features has at least one difference per pair compared with their stage values (p value < 0.05), and therefore, they all changed in at least one stage of the experiment.

3.3.1 Stress biomarkers selection

At the start of each session, participants engaged a relaxation exercise to obtain a similar baseline state. Hence, at Baseline stages (BLR and BLS), the feature values should not have significant differences between them. Therefore, features showing statistically different values in both baseline stages were not taken into consideration as stress markers in this experiment assuming that they may instead reflect any other difference that is not related to stress. These discarded features from the baseline stages were ∆Tfa, HRPLF, HRLF/HF and PWrmean. Likewise, at stages RS and AT, considered basal stress and highest stress levels, respectively, the features should vary significantly. Accordingly, those features, whose medians were not significantly different between stages RS and AT (i.e. Tfi_Ptotal, HRLF/HF, PRLF/HF, PRTmean and PWrmean), were not taken into account as stress markers in this experiment.

So far, for each participant, two sets of reference variable values were obtained: one at the BS (minimum stress level) and one at the SS (showing the stress level reached after the complete TSST). Using these values, linear Pearson correlations between physiological features and stress reference variables were computed. The resulting correlation coefficients are found in Table 3.

Table 3 Pearson’s correlation between physiology parameters and stress reference variables

The set of features presented in Table 3 was analysed to discern features with linear dependences between them throughout this experiment, because such features will give redundant information to the stress measurement model in these specific experimental conditions. Three groups were found: (1) Tfi_Ptotal and Tratio; (2) HRmean, RRrMSSD, HRPVLF, PPrMSSD and PDTmean; and (3) PPSDNN and PDTSTD. From each group, the feature that had the highest correlation value with stress reference level was selected. These are Tratio, HRmean and PPSDNN.

From previously explained considerations and only including features that correlate more than 0.4 with stress reference Sref (r > 0.4), the features selected are as follows: Tratio, HRmean, PRTSTD, PTTmean, PTTSTD and PWrSTD.

3.3.2 Stress estimation

A stress estimation function (Sest) was implemented assuming a linear relation between selected features and stress level, see Eq. (3). A linear regression between Sref values at BS and SS and the values of the selected feature at RS and AT were computed and the coefficients extracted.

$$ {S}_{\mathrm{est}}=-40.9\cdot {T}_{\mathrm{ratio}}+0.25\cdot {\mathrm{HR}}_{\mathrm{mean}}+171.1\cdot {\mathrm{PRT}}_{\mathrm{STD}}-145.4\cdot {\mathrm{PTT}}_{\mathrm{mean}}-79.83\cdot {\mathrm{PTT}}_{\mathrm{STD}}-94.1\cdot {\mathrm{PWr}}_{\mathrm{STD}}+80.51 $$
(3)

Results of the stress level reference from reference variables against the estimated stress are presented in Fig. 3, in which two well-defined groups for basal and stress states can be observed. The RMS error was 9.57 (R2 0.67).

Fig. 3
figure 3

Results of stress estimations. a Stress level estimated from reference variables against the estimation performed through selected features. b Black-Altman plot of the same data as in a

The stress estimation results show an increase of 64.13% from basal to stress state generated, while the same results through the stress level reference values are 66.94%. So, differences between BS and SS are in the range of 18% which, as expected, represents a moderate but significant stress effect. At basal state, the values have less dispersion than those of the stress state because a stressor affects individuals differently, while basal states tend to be similar.

Through the linear equation obtained (Eq. (3)), the stress level state was calculated for each stage of the experiment, and its values are shown in Fig. 4. The stress level reached in the ST and SA task have similar median values although physiological stress markers do not have similar median values, indicating that similar stress level values could clearly be assigned to different states. The paired t test between estimated stress values for each stage was also calculated. Sest at each of the stages related to the TSST are statistically different from the Sest measured at relaxation stages (BLR, RS and BLS). Table 4 shows the p values of the paired t test. Stages with a relaxation exercise show values of approximately 29.06 au, while those in the Stress Session rise to values ranging from 34.58 to 47.55 au.

Fig. 4
figure 4

Stress measurement of a participant throughout both sessions

Table 4 p values of the paired t test among measured stages’ stress

As physiological signals were continuously recorded, all selected features were computed over a 1 min window for the whole session, and then a continuous stress estimation can be performed every minute using the Eq. (3). The Sest values from two different participants throughout both sessions are seen in Fig. 5. It shows the evolution of participants along the sessions.

Fig. 5
figure 5

Stress measurement from two subjects throughout both sessions. On the left for subject 1 and on the right for subject 2

Figure 6 presents the evolution of other participants along the Stress Session. Different stress responses were selected among the participants to show the diversity of stress responses generated throughout the session. It can be observed that some participants had a higher stress level than others at the beginning of the session. In addition, some of them greatly reduced their values in BLS, while others remained quite similar.

Fig. 6
figure 6

Stress measurements of different subjects throughout the Stress Session

4 Discussion

The stress concept establishes that stress arises when a specific event threatens the homeostatic stability of an individual. The ultimate goal of this work is to find an approximate but quantitative measure of that homeostatic imbalance. We faced a complex human system in which the stress response elicits multiple physiological and cognitive reactions [35] in the individual. This is why none of the established stress markers alone allow for reliable quantitative assessment of stress response intensity. In this regard, we propose to use a set of biomarkers that comprehensively assess the level of stress response and then employ them in a multivariable method of stress measurement. In this way, we could have a broad measurement of the stress response and consider extensive aspects triggered by the stressor (see Table 1).

Applying the proposed multimodal approach for stress measurement, a multivariable stress reference level is proposed by merging the above-mentioned stress reference variables (see Section 3.1) into a single scale. As a first approximation, a linear relation between stress level reference and each variable was assumed. The scale of stress level reference shows higher correlations with the physiologic features than with either the psychometric variables or the biochemical variables alone. This suggests that this multivariable approach might improve the most commonly used methods. The same approach is used to estimate the acute stress response through physiological biomarkers. Moreover, in view of a possible stress monitoring method, the biomarkers selected were limited to those that can be continuously monitored, and therefore, they were obtained from physiological signals that are not only continuously, but also unobtrusively, measured.

The stress estimation method presented in this paper gives continuous values of acute physiological stress every 1 min. Results prove its ability not only to measure changes from a basal state (27.86 au) to a stress state (47.55 au) but also to measure the stress level at different stress states produced throughout the sessions. This makes it possible to statistically discriminate between five stress levels induced in this experiment (BLR-RS-BLS; ST-SA; MT; VD; and AT). Moreover, although a tendency to increase the stress level in the ST and MT stages is found, in the SA and VD stages, different reactions are observed since these stages may be related to social phobia or assessment anxiety rather than stress. In the last stage, the AT also triggers different levels of stress, according to the participant’s capabilities to face this task.

The presented approach can establish the groundwork for applications that aim to monitor stress and improve research in the field of stress, because it might constitute a starting point for future inter-individual and intra-individual valid comparisons of facing different stressors.

Until now, the stress response level has not been quantitatively assessed as it was above-presented, and only stress detections and classifications have been made [12, 18,19,20]. To the best of our knowledge, there is no objective, reliable, repeatable and easily applicable method either to compare the values of the stress response level of an individual at different stressful moments or to compare the stress state of two different individuals for a fast and reliable follow-up. Indeed, current methods cannot define thresholds and have not provided quantitative and reliable estimations of the global stressing effects of an alleged stressor on a population sample.

No clear correlation pattern between measured physiological parameters and perceived levels of stress has yet been found [3, 12]. Moreover, no agreements exist among professionals or within the research community either on a standard ground truth of stress level assessment or in a standard stress measurement method. This is probably a consequence of the variety of stimulus that induce stress and the diversity of human reactions to each stressor [20]. Additionally, it might be also a result of the fact that the extant literature aims to assess one or very few reactions that arise from the stress response, rather than comprehensively describe the physiological stress response itself as we propose.

We used a reliable protocol and a well-documented stressor such as the TSST [37]. The purpose was to induce moderate, but significant, stress with a validated and widely documented tool for the generation of stress [16]. Results of both physiological features and stress reference values provide evidence of the two different stress levels of induced states, BS and SS, in agreement with previous studies [16, 37].

Features extracted from finger and face skin temperatures, as well as electrocardiogram and pulse wave signals, were selected to comprehensively identify the physiological stress response. Changes related to stress that these selected signals have are seen in Table 1 (Section 2.4). Most of the features exhibit statistically different values between the BS and the SS, and a significant linear correlation with the reference variables, as has been reported in others studies [16]. HR features are highly correlated with the stress reference level, as well as PR features, which are quite similar to results presented in [56]. A preliminary analysis of heart rate variability (HRV) as a stress marker has also already been reported in [43]. Nevertheless, respiratory information should be taken into account to increase the reliability of HRV as a marker of stress [57] as frequency features of HRV are influenced by respiration [58,59,60]. This analysis would be an important addition to futures studies.

Meanwhile, other features, such as Tratio, PRT and PTT, are first used as stress biomarkers in this study. In the BL and RS stages, both finger and face temperatures show low variations following the normal body thermoregulation (Tratio ≈ 1), but in stages in which a stressor is applied, for instance at ST, MT, VD and AT, face temperature increases while finger temperature decreases. Similar results are reported in [25]. PTTmean has the highest Pearson correlation value with the stress reference level, and it is also highly sensitive to the different states induced throughout the experiment. The lower PTTmean values occurs for stressful tasks, indicating an increase in both blood vessel resistance and blood pressure, while the highest PTTmean values occur for relaxing tasks.

The experiment described in this paper takes a particular sample (healthy young people) that was subjected to a particular stress stimulus, TSST. However, other physiological responses should be investigated using other groups of individuals facing another stressor. Moreover, in a real-life application, environmental conditions must be included, as well as certain characteristics such as age, gender, IMC, physical activity and diseases. In real-life conditions, more complex algorithms than the presented estimation of stress should be used, in which the previously mentioned conditions should be included.

Furthermore, only six features from the whole set have been retained, applying above explained selection criteria. However, it is important to note that this selection was tailored to these restricted experimental conditions, in which healthy young participants were sitting and only under the influence of the experimental tasks. Those restrictions were meant to eliminate from the equation features that, in this case, do not provide additional information or have not shown significant variations. When applying this method to a population with other characteristics and subjected to another stressor, however, these simplifications will not be valid and others will probably be applicable. Then, this method will provide, for example, a quantitative assessment of how far an individual is from the average of this population. Therefore, changes in features other than those selected in this experiment could be triggered by either other stressors or in different conditions than the ones used in this experiment, as well as acting in a different population sample or when a more intensive stress response arises.

As previously mentioned, the correlation analysis between physiological features reveals that some of them were changing in equal proportions during this experiment. Even if they met the selection criteria, only one feature per related group was used for the estimation performed, as their information was redundant. However, all others could be newly included in future studies under different conditions, such as real situations or long-term stress monitoring.

There are also some limitations to note regarding this study. Only linear relations between the stress response and the stress reference variables were analysed in this preliminary approach. Even though the results obtained strongly support this approach, other approaches, including non-linear relations, could probably yield better results. Likewise, it was assumed that the stress function is linear in the six features selected, but slightly different features and relations could be analysed in future research to improve on these results. A limitation of the protocol is that the relaxation period is restricted to only 10 min, and participants may need a longer time to physiologically accommodate to the experimental environment. The study design has two limitations concerning biochemical timing collection: (1) cortisol and alpha-amylase are measured in the morning hours when intra-individual variability is at its highest, and (2) alpha-amylase is known to peak at about 10–15 min after stressor onset, whereas it was assessed 25 min after the stressor onset.

Despite these limitations, this laboratory based experiment can provide a solid foundation for daily life stress monitoring, by first identifying the score function adapted to a particular individual and then by applying this function to monitor daily life stress.

5 Conclusions

A multivariable approach based on stress markers was proposed to estimate stress levels in order to obtain a broad measurement of the stress response, thus considering the multiple reactions triggered by the stressor. This approach may establish the groundwork for future applications that aim to monitor stress. It also benefits research in the field of stress by enabling intra-individual and inter-individual evaluations to be made using a common index of stress level.

The performed experiment proves that the level of stress response caused by a psychological stressor can be determined from a set of unobtrusively and continuously measured physiological parameters, as an alternative to the currently used stress assessment methods.

An innovative pioneering approach was proposed to estimate stress using a multivariable biomarker, constituted by features extracted from physiological signals. Results from this approach demonstrated its ability to continuously measure stress because different stages that occur sequentially in a short period of time can be easily distinguished.

A reliable and continuous stress measurement will be a useful tool for mental health care and well-being. It will also enable professionals to possess easy and precise communication methods in order to identify more suitable and personalized solutions for the treatment of pathological cases.

The results of the experiment presented in this article properly motivate further experiments with different stressors and populations in order to validate its extrapolation capabilities and to adapt its use to other conditions. Therefore, a new and more sensitive stress measurement method could emerge based on the presented results, which could be utilized effectively for unobtrusive stress monitoring.