1 Introduction

Analgesia, hypnosis and immobility are three important components of anesthesia [1, 2]. Clinically, the response resulting from neural processing of painful stimulus is defined as ‘nociception’, and the potential of analgesic agent to attenuate the response is defined as ‘anti-nociception’. An ideal analgesia depth means obtaining an adequate balance between nociception and anti-nociception. The accurate measurement of analgesia depth could reduce drug consumption and adverse impacts including postoperative hyperalgesia [3, 4]. Nevertheless, there is no standard indicator available for monitoring analgesia depth. Previous studies have suggested that the nociception-anti-nociception (NAN) balance is related to analgesics dose, the stimulus intensity, and sedation state [5], i.e. if there is an indicator dependent on analgesic concentration or stimulus intensity during muscle paralysis when sedation state is kept at a steady state, analgesia could be administrated by merely adjusting the dosage of analgesics.

The ability to monitor the physiological effect of the analgesic agent is of interest in clinical practice. Photoplethysmography (PPG) is mediated by sympathetic nervous system and its detection is readily accessible [6]. PPG has advantages of simplicity and rapidity in signal processing with simple morphology and without mixed signals. It has been proven that several parameters such as PPG amplitude (PPGA), pulse beat interval (PBI) and surgical pleth index (SPI) are correlated with the stimulus intensity or analgesia depth [7, 8], however, without desirable accuracy [9, 10].

In the field of anesthesia, nonlinear variables are mainly applied in evaluating unconsciousness and focused on EEG signal [5, 11]. Few studies have considered their relationship with analgesia depth or applied them to other physiological signals. Considering the disturbances caused by noxious stimulus are nonstationary, the nonlinear indices could quantify better the irregularity of dynamical signals [12]. Based on the prominent results in estimating levels of anesthetic-induced unconsciousness [13,14,15], approximate entropy (ApEn) and sample entropy (SampEn) are considered to have the potential in monitoring NAN balance under the constant hypnotic states.

The purpose of this paper was to examine the methodology of ApEn and SampEn calculations, focusing on the variations of recorded signal under various analgesics concentrations and the influence of different coefficients employed in ApEn and SampEn. The key question addressed was whether ApEn and SampEn had a monotonic response to analgesic depth. In this study, sedative properties were reflected using state entropy (SE) and anti-nociception was administrated using remifentanil [16].

2 Methods

2.1 Anesthesia and data collection

The study was approved by the Ethics Committee of Women’s Hospital School of Medicine Zhejiang University. Forty patients with American Society of Anesthesiologists (ASA) physical status I or II, scheduled for gynecologic laparoscopy surgery or breast surgery under general anesthesia were enrolled. We excluded patients with cardiovascular disease or autonomic nervous dysfunction, difficult airway (Mallampattis test III or IV), body mass index over 30 kg·m−2, and chronic use of alcohol or psychoactive medication. Patients were allocated randomly to receive one of four possible remifentanil effect-compartment target concentrations (Ceremi): 0 (n = 10, group R0), 1 (n = 10, group R1), 3 (n = 10, group R3), 5 (n = 10, group R5) ng·ml−1. In this study, the concentration of remifentanil at 0 and 1 ng·ml−1 represented underdosage of analgesics, whereas the concentration at 5 ng·ml−1 represented overdosage.

With no premedication administration, patients were anesthetized with propofol and remifentanil, and muscle relaxant was achieved with 0.15 mg·kg−1 cisatracurium. Propofol and remifentanil were administrated using target-controlled infusion (TCI) pump (Orchestra Primea, Fresenius, France) programmed with propofol pharmacokinetic dataset [17] and remifentanil pharmacokinetic dataset [18]. The target was adapted such that before laryngeal mask airway (LMA) insertion SE was maintained at 50 ± 10 and steady-state target Ceremi was obtained. LMA insertion was applied as a standard noxious stimulus.

Photoplethysmographic signal was monitored and collected using anesthesia monitoring (CARESCAPE B650, GE Healthcare, Finland) with sample rate at 100 Hz. Recordings were exported as comma-separated value formatted files in arbitrary unit. A 1-min window before and after LMA insertion (N = 12,000) was designated for off-line analysis.

2.2 ApEn

ApEn introduced by Pincus [19], is a measure quantifying the unpredictability of a finite length signal. For a time series X(i) = x(1), x(2),…, x(L) with L data points, ApEn could be calculated as follows.

  1. (i)

    Reconstitute the time series X(i) as m-dimensional sequence vectors Xm(1) to Xm(L − m + 1):

    $$ X_{m} (i) = \{ x(i),x(i + 1), \ldots ,x(i + m - 1)\} ,\quad i = 1,2, \ldots ,L - m + 1. $$
    (1)
  2. (ii)

    Compute the distance d between the vectors Xm(i) and Xm(j):

    $$ d\left[ {X_{m} \left( i \right),X_{m} \left( j \right)} \right] = \max \left( {\left| {x\left( {i + l} \right) - x\left( {j + l} \right)} \right|} \right),\quad l = 0,1, \ldots ,m. $$
    (2)
  3. (iii)

    Calculate the probability:

    $$ C_{i}^{m} (r) = \frac{{B_{i} }}{L - m + 1}, $$
    (3)

    where Bi is the number of Xm(j) satisfying d[Xm(i), Xm(j)] ≤ r. The tolerance r is defined as:

    $$ r = k\sigma , $$
    (4)

    where σ is the standard deviation of L data points, and k is a constant.

  4. (iv)

    We further define:

    $$ \phi^{m} (r) = \frac{1}{L - m + 1}\sum\nolimits_{i = 1}^{L - m + 1} {\ln C_{i}^{m} (r)} . $$
    (5)

    Set m = m + 1, and repeat steps (i)–(iv):

    $$ \phi^{{m{ + }1}} (r) = \frac{1}{L - m}\sum\nolimits_{i = 1}^{L - m} {\ln C_{i}^{{m{ + }1}} (r)} , $$
    (6)

    and calculate ApEn:

    $$ {\text{ApEn}}(m,r,L) = \phi^{m} (r) - \phi^{m + 1} (r). $$
    (7)

Previous researches [15, 20,21,22] have shown that the selection of parameters L, m, and r has a great influence on outcomes. The parameter L should be 10m20m to gain a valid statistics. Also, the choice of r value requires care as detailed system information would be lost if r is too large, whereas the extracted information would mainly be signal noise if r is too small. ApEn in this study was calculated using 10-s time windows (L = 1000) to track consistent changes in PPG recording with m = 2. The choice of L = 1000, rather than prolonged time period (L = 2000 or 3000) [23, 24], ensured less time delay when 1000 was sufficient to estimate ApEn and SampEn validly [22, 25]. The coefficient k was recommended to be 0.05 to 0.25. Given that its optimal value was different for various applications [15, 23], five different k values were tested: 0.05, 0.1, 0.15, 0.2, and 0.25.

2.3 SampEn

SampEn proposed by Richman and Moorman [25] is an improved algorithm from ApEn. SampEn differs from ApEn in two ways to reduce bias. Firstly, SampEn does not count self-matches. Bi is defined as the number of Xm(j) satisfying d[Xm(i), Xm(j)] ≤ r, i ≠ j. Secondly, the number of reconstituted vectors is L − m instead of L − m + 1 in embedding dimension m, so that the number of vector comparison would be equal for both embedding dimension m and m + 1. Accordingly, the probability is adjusted.

SampEn is calculated as follows:

$$ {\text{SampEn}}(m,r,L) = - \,\ln \frac{{\varphi^{m + 1} (r)}}{{\varphi^{m} (r)}}, $$
(8)

where

$$ \varphi^{m} (r) = \frac{1}{L - m}\sum\limits_{i = 1}^{L - m} {C_{i}^{m} (r)} , $$
(9)
$$ C_{i}^{m} (r) = \frac{{B_{i} }}{L - m - 1}. $$
(10)

The selection of parameters L, m, and r for SampEn was same to ApEn [26, 27].

2.4 Data processing and optimal timestamp selection

The off-line PPG signal with N data points (N = 12,000) was analyzed using Matlab software (version R2017b, Mathwork Inc., MA, USA). Generally, band-pass filter was applied to remove noise in PPG signal. Considering the good quality of the PPG signal acquired from monitoring, this step was simplified with moving average filter for its good performance in eliminating sharp noises. yn represented the raw off-line PPG while yyn represented the preprocessed PPG displayed in Fig. 1. The span of moving average filter was set as 5, as described in:

$$ \left\{ \begin{gathered} yy_{n} = y_{n},\quad n = 1,N \hfill \\ yy_{n} = \frac{{y_{n - 1} + y_{n} + y_{n + 1} }}{3},\quad n = 2,N - 1 \hfill \\ yy_{n} = \frac{{y_{n - 2} + y_{n - 1} + y_{n} + y_{n + 1} + y_{n + 2} }}{5},3\quad \le n \le N - 2 \hfill \\ \end{gathered} \right.. $$
(11)
Fig. 1
figure 1

An example of recorded photoplethysmography (PPG). The signal in the upper panel is the selected recording of one minute before and after LMA insertion. The lower panel presented the detailed signal sequence delimited by the vertical dashed lines

To observe the continuous response of variables to stimulus, ApEn and SampEn time series were constructed using PPG data. A 10-s time window (L = 1000) obtained one ApEn and SampEn value as their definitions. The window slid in L/10 steps with 900 data points overlapped [23]. Thus the length of ApEn and SampEn time series was (N − L)/(L/10) + 1 = 111, where the ‘1’ represented the last 10-s time window. Each measure contained four time series, describing the average variation under four levels of opioid dosage.

Based on the time series, two values were extracted for each variable to characterize the states before and after the stimulus. The baseline of all observation indices (bef) was defined as the ApEn and SampEn values calculated using the data of last 10 s before stimulus, and the values of ApEn and SampEn at optimal timestamp occurring after the stimulus were defined as aft. Rules were made for determining the optimal timestamp.

In consideration of the fact that Ceremi of 3 ng·ml−1 is adequate for the supposed impact of LMA insertion defined as a moderate noxious stimulus [7, 28], the time series of 5 ng·ml−1 Ceremi was not taken into account for the localization of the optimal timestamp. At one timestamp, the distance of variables’ time series between 0 and 1 ng·ml−1 was defined as d1 and the distance between 1 and 3 ng·ml−1 was defined as d2. The timestamp where satisfying max(d1 + d2) and \(\min\!\left( {d_{1}^{2} + d_{2}^{2} } \right)\) was considered as the optimal timestamp. Max(d1 + d2) was calculated for yielding the greatest possible distinction between 0 and 3 ng·ml−1, and then \(\min\!\left( {d_{1}^{2} + d_{2}^{2} } \right)\) rendered the difference of each comparison largest. If there was more than one eligible point, the median would be extracted.

2.5 Statistical analysis

Values were presented as mean ± standard deviation unless otherwise indicated. All tests were 2-tailed. The bef and aft values were both analyzed from four levels using Kruskal–Wallis H test (IBM SPSS Statistics 25.0, SPSS Inc., Chicago, IL, USA) for investigating the influence of Ceremi on indices during nonnoxious and noxious events. Wilcoxon signed-rank test (Graphpad Prism 7, Graphpad Software Inc., San Diege, CA, USA) was employed to test the response to LMA insertion by comparing bef and aft values. The detailed effect of remifentanil was examined using Mann–Whitney test by comparing any two aft values. A p value less than 0.05 (p < 0.05) was considered statistically significant.

Prediction probabilities (PK) were calculated as a criterion to assess the applicability of indicators in predicting analgesic depth by PKMACRO spreadsheets [29]. If the indicator composes a concordance, no matter in the same or opposite direction, with remifentanil concentration, the PK would be great. A PK value of 1.0 means a perfect prediction whereas a PK of 0.5 means no better prediction than a random guess. In case PK ∈ [0, 0.5], PK could be normalized by calculating 1-PK [15].

3 Results

Characteristic data of the 40 patients were illustrated in Table 1. The four groups showed no statistically significant differences in terms of age, weight, height, or the body mass index (BMI).

Table 1 Study patients and remifentanil groups

3.1 ApEn and SampEn time series

Shapes of the ApEn and SampEn time series with different k choices were presented in Fig. 2. The variation trends of ApEn and SampEn were generally similar when the k value was same. In the curves with positive slope, the sharp increase in low Ceremi groups (R0 and R1) and slight increase in high Ceremi groups (R3 and R5) were clearly suppressed in a dose-dependent manner. Switching the k value from 0.05 to 0.25, the curves had less overlaps among groups after stimulus.

Fig. 2
figure 2

ApEn and SampEn time series of four groups during the time of 1 min before and after LMA insertion. Time series were shown for five different k values (k = 0.05, 0.1, 0.15, 0.2, and 0.25)

The points conforming to max(d1 + d2) and min(d12 + d22) were summarized (Table 2). The location of optimal timestamp, ranging from 65 to 70, was comparatively farther away from the stimulus point with the increase of k value. The value of each measure at its optimal timestamp was selected as aft for later analysis.

Table 2 The selected optimal timestamp

3.2 The relationship between variables and Ceremi before and after LMA insertion

During the nonnoxious period, variables were not correlated with tested Ceremi. The bef values of measures were displayed in Table 3. The p values of Kruskal–Wallis H test for ApEn (0.567, 0.615, 0.748, 0.419, and 0.172 respectively) and SampEn (0.750, 0.131, 0.116, 0.121, and 0.082 respectively) with k = 0.05, 0.1, 0.15, 0.2, and 0.25 were apparently larger than 0.05.

Table 3 Bef and aft values of ApEn and SampEn responding to laryngeal mask airway insertion

Table 3 also displayed the reactions (aft values) of ApEn and SampEn after LMA insertion. The stimulus increased all variables dramatically during underdosage of analgesics (Ceremi = 0 and 1 ng·ml−1) except ApEn with k = 0.25. No significant increase was observed under the dosage of 3 and 5 ng·ml−1.

Examined variables at optimal timestamp of four groups were demonstrated in Fig. 3. The behaviors of ApEn and SampEn depended considerably on k values: Measures with all k values except k = 0.25 showed excellent capability in distinguishment between low Ceremi (0 and 1 ng·ml−1) and high Ceremi (3 and 5 ng·ml−1). The differences between 0 ng·ml−1 and high Ceremi were more and more significant with the increasing k value. However, the decreases of ApEn and SampEn responding to increasing Ceremi were not continuous when k = 0.05 and 0.1. ApEn with k = 0.15, 0.2, and 0.25 and SampEn with k = 0.2 and 0.25 could significantly differentiate group R0 from R1. The discrimination between R1 and R3 was not detected using indices with k = 0.25. No variables could reveal substantial differences between R3 and R5.

Fig. 3
figure 3

Boxplots of ApEn and SampEn (k = 0.05, 0.01, 0.15, 0.2, and 0.25) in group R0, R1, R3 and R5. Data were presented as median, interquartile range, and range. ****p < 0.0001, ***p < 0.001, **p < 0.01 and *p < 0.05

Figure 4 presented the predictive powers of variables in indicating analgesic depth. PK became larger first and then reduced with the increase of k value. The best value of k was determined based on the type of entropy parameter. For SampEn, k value of 0.2 provided the best monotonic response to Ceremi with PK = 0.867. ApEn with k = 0.15 yielded the largest PK value, 0.875.

Fig. 4
figure 4

The change curve of PK with k values. Value presented as PK-value (standard error)

4 Discussion

In this paper, the application of entropy methods to PPG signal was studied to assess the depth of analgesia during propofol–remifentanil anesthesia. We compared the effects of ApEn and SampEn with various coefficients to make a distinction among Ceremi, expecting the monotonicity of measures would be in accord with opioid concentrations.

4.1 Time series of entropies with no analgesics administration

The time series of entropies varied with the continuous transition from unconsciousness to pain, as illustrated in Fig. 2. The curves of variables were maintained at a specific state when num ≤ 51. After LMA insertion applied, entropies started to increase and then subsequently decreased to its original state after saturating to the maximum. It was obvious that entropy variables could be affected by nociceptive input and its diversification was consistent with that of the degree of nociception. Nevertheless, it was inevitable to observe that the values of entropies were relatively small less than 0.5. This could be ascribed to the choice of L (L = 1000) that the variation contained in PPG data within 10 s was limited. Despite that, the discrimination detected using entropies with L = 1000 was enough for studying the influence of drug concentrations.

4.2 Dependency of ApEn and SampEn on Ceremi

All measures remained roughly unaffected by remifentanil concentrations before LMA insertion. The results were in accordance with the previous studies [30, 31]. This demonstrated that parameters were reliable to provide relevant evaluation of patients’ response to stimulus as they were independent on Ceremi during nonnoxious events.

Remifentanil had dose-dependent influence on ApEn and SampEn after LMA insertion. The main observations from this study were that (1) stimulation-induced changes of entropies during ‘adequate’ analgesia (Ceremi = 3 and 5 ng·ml−1) markedly diminished in comparison with ‘inadequate’ analgesia (Ceremi = 1 and 0 ng·ml−1); and (2) ApEn and SampEn were significantly correlated to remifentanil concentration with appropriate k values. This follows that physiological signal tends to be more resistant to stimulus and more regular as the anesthesia deepens. However, the variables could not discern 3 from 5 ng·ml−1. This implied the incapacities of measures for evaluating over-dose opioid drugs.

Entropy measures were heavily relied on the choice of k values. When k = 0.05 and 0.1, the changes of ApEn and SampEn were not monotonous. We speculated that the variations caused by LMA insertion under Ceremi = 3 and 5 ng·ml−1 and the noise in the signal are both small, the extracted information decided by k = 0.05 mixed too much noise. ApEn and SampEn with 0.25 outperformed others in discerning 0 from 1 ng·ml−1 whereas other k values could perform better in differing 1 from 3 ng·ml−1. This might be attributed to the fact that the changes of PPG with no remifentanil or Ceremi of 1 ng·ml−1 were so intense that the distances defined for ApEn and SampEn with small k values were not long enough to observe the discrepancy. The PK values of indices with k ≥ 0.15 were all greater than 0.8. A choice of k = 0.15 for ApEn was superior to other measures with PK = 0.875. With the objective of obtaining large PK value, ApEn and SampEn were both good methods for indicating the analgesia depth.

On the other hand, compared with the good performances of EEG-derived ApEn and SampEn (PK values ranged from 0.86 to 0.89) in the previous studies of awareness monitoring, the efficiencies of PPG-derived ApEn and SampEn in qualification of analgesia depth during propofol–remifentanil anesthesia were similar. This indicated that both hypnotic and analgesic effects could be detected by ApEn and SampEn, though no one has ever studied how to distinguish the two effects contained in ApEn and SampEn so far.

4.3 Comparisons between entropies and other nociception-related physiologic variables

To examine the efficiencies of entropies, the contrast of variables and other nociception-related physiologic indices including PPGA and PBI was conducted. Given that each pulse could compute one PPGA and PBI value whereas the calculation of entropies requires a period of PPG data, we extracted the bef and aft values of PPGA and PBI using different method mentioned in the previous research [31].

PPGA and PBI were averaged the minute before (bef) and after (aft) LMA insertion, as presented in Fig. 5. PPGA could only distinguished the stimulus-induced differences at Ceremi of 0 and 1 ng·ml−1. PBI had a better capacity compared with PPGA. Diverse drug concentrations could be discriminated using PBI with PK value of 0.830. However, PBI was significantly affected by remifentanil concentrations during nonnoxious period. This would reduce the accuracy in the evaluation of patients’ responses during the presence of the stimulation.

Fig. 5
figure 5

PPGA and PBI before (white boxes) and after (gray boxes) stimulation at 0, 1, 3, and 5 ng·ml−1 remifentanil effect-compartment concentrations (R0, R1, R3, and R5 respectively). Data were median, interquartile range, and range. *p < 0.05 and **p < 0.01 vs. pre-stimulus value. #p < 0.05, ##p < 0.01, ###p < 0.001, and ####p < 0.0001 vs. lower remifentanil concentration

4.4 Related research

A few studies considering the nonlinear measures during propofol–remifentanil anesthesia have been published. In [23], EEG-derived ApEn, SampEn and permutation entropy were examined in three groups with various dosages of remifentanil. The results showed remifentanil had a consistent dose-dependent effect on ApEn and SampEn throughout the transition from awake to burst suppression state. No stimulation had been applied and analgesia depth was not assessed. In [32], spectral entropy was chosen to predict a motor response to noxious stimulation. In contrast to this paper, the hypnotic state was merely controlled by observing whether the patient lost consciousness, rather than an objective indicator such as SE. The predictive value of entropy was weak with PK < 0.8. Besides, researchers used to designate ‘movement’ as an end point and explored the relationship between measures and ‘movement’ [32,33,34]. The usefulness of these researches was limited in paralyzed patients on the one hand, on the other hand, muscle relaxants which were usually applied to make the operation easy would seriously affect the observation of movement. In general, nonlinear features have not been thoroughly studied in the field of analgesia and standard research procedure is needed to establish.

In our previous research, some outstanding linear measures including diastolic interval (DI) were extracted in the time domain. In future work, linear and nonlinear features would be combined, and more kinds of physiological signals would be added to monitor the underlying changes caused by the autonomic system during anesthesia.

4.5 Limitations

First, larger subjects would be needed to strengthen the present findings. The hoc-post power calculation was performed for Wilcoxon signed-rank test and Mann–Whitney test using G*power software. As to Wilcoxon signed-rank test, the associations between bef and after values of entropies with various k values were firstly assessed using Spearman’s rank correlation coefficient test. According to α = 0.05, sample size = 10 and the effect size reckoned using means, standard deviations and correlations, it was found that only the paired groups with statistical significance achieved the power of 0.8. This indicated that the differences before and after stimulus at Ceremi of 3 and 5 ng·ml−1 might be detected as the amount of data increased. As to Mann–Whitney test, the power values were all less than 0.8 when two groups with closer concentrations were compared. Hence larger sample size was needed to further verify the relationships of these groups.

Secondly, the study protocol could be further optimized. During the data collection in this study, sedation state (SE value) and stimulation type (LMA insertion) of all patients were held relatively constant, and different patients received different remifentanil doses. Based on this study protocol, the purpose of revealing the correlation between remifentanil doses and variables’ response to noxious stimulus could be achieved. Similar research has been conducted before [29]. In actual clinical practices, however, the nociception resulted from LMA insertion was not exactly the same. Though we had designated the same anesthesiologist to complete the LMA insertion in this pilot study, standard painful stimulation such as tetanic stimulation (STIM) was still required in further research [34,35,36].

5 Conclusion

The importance of our results was that PPG-derived nonlinear variables could correctly grade part of the relations of remifentanil concentrations and monitor the trend of analgesia depth. ApEn and SampEn with judicious choice of k value were both practical approaches in reflecting the underlying changes when different remifentanil concentrations were administrated.