Introduction

Inguinal hernia repair is one of the most common procedures in general surgery with > 20 million procedures performed worldwide each year [1]. Among men, the risk of inguinal hernia repair is around ten times higher than among women [2]. The only cure for symptomatic inguinal hernia is surgical treatment. Men with an asymptomatic or minimally symptomatic inguinal hernia can be considered for watchful waiting [1], but 70% of those men will be referred for surgery within five years because of pain [1].

Persistent postoperative pain (PPP) has been reported to affect 2–30% of patients after inguinal hernia repair six months postoperatively [3, 4], with variations primarily depending on the method of pain evaluation [5], whereas surgical techniques seem less important [6]. PPP 6 months after surgery adversely affects daily life activities for 10–15% of all patients after inguinal hernia repair and 1–3% suffer from severe PPP [1, 6]. Management of PPP after inguinal hernia repair remains difficult with inconclusive results of both pharmacological and surgical treatment [1, 7,8,9]. Since management is difficult, prevention of PPP is of major importance [1]. To this end, a prediction model might be helpful for the surgeon to identify high-risk patients and guide tailoring of their treatment [1]. We have recently found a relationship between lifting > 1,000 kg/day and PPP after inguinal hernia repair, which suggests a previously untapped preventive potential [10].

Risk stratification may be an essential step forward in the prevention of PPP [11, 12]. Using a predictive model in daily practice could lead to personalised patient information, choice of surgical technique, perioperative pain management, and recommendations regarding the duration of postoperative sickness absence or temporary work modifications for high-risk patients. Identification of preoperative risk factors seems most desirable for early prevention [11]. Previous prediction models have been too complex to apply in clinical settings [13, 14], or generic rather than procedure specific (i.e. concerning various kinds of surgery rather than inguinal hernia repair specifically) [15, 16], and none of the models concerning inguinal hernia repair have been validated externally [17]. Thus, prediction models of the risk of PPP after inguinal hernia repair have so far not been translated into clinical practice [12,13,14].

This study aimed to develop and validate a simple and clinically useful risk score to identify patients with an elevated risk of PPP 6 months after inguinal hernia repair.

Methods

Design and study populations

The predictive model was developed based on a 6-month follow-up study of a nationwide cohort of 2,508 Danish men (population I) and validated within a consecutive prospective cohort of 293 Danish men with follow-up at 6 months (population II). We followed the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement [18]. The Danish Data Protection Agency approved the study (j.no. 1-16-02-121-15). In Denmark, register and questionnaire studies do not require approval by health research ethics committees and consent is considered given when participants return the questionnaire.

Population I.

The establishment of population I has been described previously [10]. To summarise, the cohort consisted of all men with an inguinal hernia repair according to the Danish Hernia Database in the period January 1 2015 to October 31 2016, who were born between October 1 1949 and October 1 1998 and living in Denmark (excluding Greenland) five months postoperatively. The cohort excluded patients in population II (see below), patients who were < 18 years old at the time of surgery, patients whose data was entered into the database later than five months after their inguinal hernia repair, patients who had no available address or had emigrated or died before follow-up, and patients who were not active in the labour market in the week before their surgery. All members of the cohort received a questionnaire by mail six months after their inguinal hernia repair. Of these, 2,609 (54%) answered the questionnaire and 2,508 contributed to the analyses. The mean time since surgery was 6.5 (SD 1.0) months [10]. Non-response analysis showed minor differences with respect to age (respondents were on average three years older than non-respondents) and no differences concerning the region of residence, socioeconomic status, and surgical characteristics [10].

Population II

Population II consisted of all men scheduled for inguinal hernia repair in three public hospitals in Central and North Denmark Regions, who were 18–65 years old and active in the labour market at the time of surgery, had a cell phone, and were able to understand and read Danish. Eligible patients were included in the study if they returned the preoperative questionnaire before their surgery. At 6-month follow-up, they received a questionnaire by mail. Included patients also received up to 23 short text messages in a period of 6 months postoperatively, where they were asked to report their intensity of pain in the same way as in the questionnaire. If a participant did not respond to the questionnaire at 6 months, the last short text message response was used to determine the outcome. A total of 318 men answered the preoperative questionnaire, and 212 answered the follow-up questionnaire, while short text messages could be used for a further 81 men. Thus, 293 men (92%) contributed to the analyses at follow-up.

Outcome

The outcome was PPP during the last 24 h in terms of inguinal pain on the operated side with an intensity ≥ 2 on a numerical rating scale NRS-11 during activity 6 months postoperatively [3, 18]. The NRS-11 ranged from 0 (no pain) to 10 (worst imaginable pain). The cut-off of ≥ 2 has previously been associated with daily functional impairment [13], also in population I [10].

Demographics

Age at surgery was calculated from the date of birth and the date of surgery. In population I, socioeconomic status was categorised as academics and administrative/office workers, skilled workers, and unskilled workers based on unemployment insurance fund membership [19] in the year of the inguinal hernia repair [10]. In population II, the same socioeconomic status categories were applied, only based on self-reported job titles because information on unemployment insurance fund membership was not available.

Duration of preoperative pain

Information on the duration of preoperative pain was obtained by questionnaire and categorised as ≤ 1, 2–6, and 7–48 months. Of note, the information on preoperative pain was obtained retrospectively in population I.

Occupational mechanical and psychosocial exposures

Questionnaire information on job title at the time of surgery was collected six months postoperatively in population I and preoperatively in population II. After recoding to occupational titles represented in the Danish version of the International Standard Classification of Occupations from 1988, this information was combined with a job-exposure matrix (The Lower Body JEM) [20] to assess occupational mechanical exposures in terms of time spent standing/walking (h/day) and total load lifted (kg/day). The Lower Body JEM cross-tabulates 121 homogeneously exposed job groups with the mean of five experts’ ratings of the just-mentioned exposures [20]. Working in a bent-over position might also be related to PPP through a mechanism of increased intra-abdominal pressure [21], but was not represented in the JEM. Instead, we used the self-reported number of h/day spent working in this position [22]. As described previously, the exposure estimates were adjusted according to self-reported weekly working hours [10].

Information on job demands and job control was collected by questionnaire six months postoperatively in population I and preoperatively in population II. A job strain variable was then constructed as described previously: low strain (low demands, high control), passive (low demands, low control), active (high demands, high control), and high strain (high demands, low control) [10, 23].

Lifestyle factors

Information on present lifestyle factors was collected by questionnaire 6 months postoperatively in population I and preoperatively in population II. Body mass index (BMI) was calculated as weight/(height2) and descriptively categorised as < 25, 25- < 30, and 30–56 kg/m2 – in further analyses (see below), BMI was entered as a continuous variable; leisure-time physical activity was categorised as not active (< 2 h/week) and active (≥ 2 h /week); smoking status was categorised as never, ex-, and current smoker.

Surgical characteristics

For both populations, information on surgical characteristics was obtained from the Danish Hernia Database [24]. Type of hernia repair was categorised as laparoscopic, Lichtenstein, and other (Onstep, other open technique with mesh, open technique without mesh—i.e. types that are not gold standard for inguinal hernia repair in men > 30 years according to international recommendations from 2018 [1, 25]); anatomical hernia type was categorised as lateral, medial, pantaloon, and unspecified, and repair of recurrent hernia was categorised as no and yes.

Statistical analyses

To develop the predictive model, univariable logistic regression analyses were undertaken to assess the strength of the association between each of the potential predictor variables and PPP (population I). All potential predictor variables with p values < 0.2 in these analyses were selected to be simultaneously entered into a multivariable logistic regression model, except the occupational mechanical exposure variables and socioeconomic status. Variable selection was then performed using backward elimination, using a criterion of p value < 0.10 for retention in the model. Due to expected correlations, we a priori decided to attempt to enter each of the occupational mechanical exposure variables assessed with the JEM into our logistic regression model one at a time, and not to include socioeconomic status. Duration of preoperative pain fulfilled the above criteria for inclusion in the model but was only entered in supplementary analyses due to the risk of recall bias [26].

A multivariable logistic regression model was created (model 1) following the steps outlined above. We then created a PPP risk score based on the β coefficients for model 1. Points for this score were assigned by multiplying the respective coefficients by ten times the smallest predictor coefficient in the model and rounding the result to the nearest half-point. The PPP risk score was determined as the sum of the individual point values associated with each predictor. Next, a model with the PPP risk score as the single predictor was created (model 2).

Internal validation

Calibration of models 1 and 2 was assessed by the Hosmer–Lemeshow goodness of fit test [27] and calibration belt test and plot [28, 29]. A p value > 0.05 for these tests indicates that the model is well calibrated, i.e., the model produces probabilities that accurately reflect the actual outcome experience in the data [29, 30]. Calibration belt plots depict this relationship (i.e., the relationship between the probabilities according to the model and the actual outcome experience in the data) along with confidence bands [29]. The discriminatory capability of the models was assessed by receiver operating characteristic (ROC) curve analysis. Bootstrap analysis was performed to obtain 95% confidence intervals (CI) of the area under the curve (AUC) in both models; a total of 500 computer-generated samples from population I were derived by random selection with replacement, and AUC was calculated within each bootstrap sample [31, 32].

External validation

Both models were validated in population II using calibration belt test and plot and ROC curve analysis as described above.

Results

Table 1 presents the prevalence of PPP in population I (19%) and univariable associations of PPP with demographics, duration of preoperative pain, occupational exposures, lifestyle factors, and surgical characteristics. The prevalence of PPP was higher among men aged 18–49 and 50–59 years than among men aged ≥ 60 years. Both socioeconomic status and duration of preoperative pain were related to the outcome. Patients with occupational exposures in the highest categories had an elevated prevalence of PPP. Higher odds ratios (OR) were observed among patients who were not physically active in their leisure-time and patients with a BMI ≥ 30 kg/m2, while smoking was not related to the outcome. The prevalence of PPP did not differ between anatomical hernia types, except for an elevated prevalence in the small unspecified category. No statistically significant difference was found between laparoscopic repair and Lichtenstein repair, but patients with other types of hernia repair had an elevated prevalence of PPP. Patients with the repair of a recurrent hernia had a higher prevalence of PPP than patients with repair of a first-time hernia.

Table 1 Prevalence of persistent postoperative pain (PPP) and univariable odds ratios (OR) with 95% confidence intervals (CI) in population I (men only) in relation to demographics, duration of preoperative pain, occupational exposures, lifestyle factors, and surgical characteristics

Table 2 presents the multivariable model (model 1) as well as the model including only the PPP risk score (model 2). Model 1 included age, total load lifted per day, working in a bent-over position, leisure-time physical activity, BMI, and repair of recurrent hernia. Other surgical characteristics did not contribute to the model. Thus, the PPP risk score was based entirely on preoperative information. It ranged from 1.8 to 10.1 in population I corresponding to predicted risks of PPP of 6% to 61% according to model 2. The adjusted OR was 1.50 for an increment of 1 PPP risk score point. Figure 1 presents two personalised examples of how to calculate probabilities of PPP based on PPP risk scores, yielding risks of 7% and 41%.

Table 2 Multivariable prediction models of persistent postoperative pain (PPP) in population I (n = 2,508 men). CI: confidence interval
Fig. 1
figure 1

Personalised examples of the predicted risk of persistent postoperative pain (PPP) after inguinal hernia repair using the PPP risk score

Table 3 presents the PPP risk score performance in population I in terms of probability of PPP, sensitivity, specificity, and positive and negative predictive values at varying cut points of the score. Figure 2 presents observed and expected percentages with PPP according to the PPP risk score (model 2). The Hosmer–Lemeshow goodness of fit test demonstrated close agreement between the observed and expected counts and the p values (p = 0.85 for model 1 and p = 0.52 for model 2) indicated good calibration (cf. calibration belt test and plot in supplemental digital content) and acceptable discriminatory capability with an AUC of 0.64 (95% CI 0.61–0.67) for model 1 and 0.63 (95% CI 0.60–0.65) for model 2. In supplementary analyses, where we included duration of preoperative pain, the discriminative ability of model 1 improved (AUC = 0.69) slightly and the calibration was still excellent (p = 0.91).

Table 3 Risk score performance and cumulative percentages of persistent postoperative pain (PPP) at a different cut off values of the PPP risk score in population I, where the prevalence of PPP was 19%
Fig. 2
figure 2

Observed and expected percentages of patients with persistent postoperative pain (PPP) 6 months after inguinal hernia repair according to PPP risk score in population I

The prevalence of PPP was lower in population II (12% versus 19% in population I) and skilled workers constituted 48% of population II against 30% of the population I. The mean time since surgery was similar and population II also resembled population I in other respects (results not shown). External validation (population II) of both models using calibration belt test and plot showed good calibration (supplemental digital content) and the AUCs were similar to the AUCs in the internal validation: in model 1 the AUC was 0.64 (95% CI 0.54–0.76) and in model 2 the AUC was 0.62 (95% CI 0.52–0.72).

Discussion

We created and validated a PPP risk score, which was based entirely on information available preoperatively. The risk score showed good internal validity in the developing population and performed well in a preoperative setting with another population. In particular, the models could estimate probabilities of PPP accurately in both developmental and validation data sets. Predictors of PPP were age at surgery ≤ 59 years, total load lifted > 1,000 kg/day, working in a bent-over position > 1 h/day, leisure-time physical activity < 2 h/week, increasing BMI, and repair of recurrent hernia. The PPP risk score was independent of anatomical hernia type and type of hernia repair.

We defined PPP as an NRS ≥ 2, which reflects daily functional impairment [13, 33]. The outcome was based on NRS-11 [34] 6 months postoperatively as recommended previously [6, 11, 12, 18]. We chose to leave out the duration of preoperative pain due to the retrospective nature of this information in population I. The discriminative capability of model 1 improved slightly when preoperative pain was included, but even without the inclusion of preoperative pain, the calibration was good, and the model performed well in external validation. Preoperative pain has previously been left out of prediction modelling without reducing the predictive power [15].

The main limitation of this study was the relatively small size of population II, leading to wide calibration belts in the external validation of the prediction models. Nevertheless, the observed probabilities of PPP were accurately estimated even in population II. Regarding population I, a further limitation was that the percentage that participated was not optimal (54%), but previous non-response analyses (summarized in the methods section) have not indicated selection bias [10].

Occupational lifting and standing/walking were assessed using a JEM [20]. The validity of the JEM was originally assessed by two external experts, who in general agreed with the ranking of exposures [20]. The JEM has previously shown good predictive validity in studies of total hip replacement [35], surgery for varicose veins [36], inguinal hernia repair [37,38,39], and sickness absence and permanent work disability [40]. The strength of using a JEM is the elimination of recall bias which would be an issue in the case of retrospective exposure assessment based on self-report. Working in a bent-over position was based on self-report with a risk of recall bias in population I, but since model 1 performed well in a preoperative setting (the validation population), recall bias seemed limited. Total load lifted > 1,000 kg/day and standing/walking > 6 h/day have been identified as risk factors for lateral inguinal hernia repair [37, 38, 41], prolonged convalescence after inguinal hernia repair [39], and increased prevalence of PPP [10]. Thus, we expected these exposures and working in a bent-over position to be retained in model 1, whereas we were not surprised that job strain, which was included exploratively [10], did not make it into the final models. Occupational mechanical exposures have not been included in previous models to predict PPP after inguinal hernia repair.

Repair of recurrent hernia was the only surgical characteristic, which was included in our model. In Denmark, the type of repair of a recurrent hernia depends on the initial type of repair; if the initial repair was laparoscopic, the recurrence will typically be treated by a Lichtenstein repair, and vice versa. Future studies focusing on the risk of PPP in relation to the type of repair of recurrent hernias may be warranted. Laparoscopic repair has previously been associated with a lower risk of PPP than Lichtenstein repair [42, 43], which we could not corroborate. We speculate that this may be because laparoscopic repair was previously predominantly performed by highly experienced surgeons, while this difference between the two types of hernia repair no longer pertains. In the past two decades, intensive research in surgical methods and standardisation of inguinal hernia repair have taken place in Denmark [24, 44]. For that reason, the inguinal hernia repairs in our study were to a large extent performed according to the European Hernia Society Guidelines [1], e.g. 97% of the patients had a laparoscopic or a Lichtenstein repair. A high standardisation and volume for each centre must be expected to reduce the risk of PPP in relation to different methods of inguinal hernia repair to a minimum [1]. We did not incorporate surgical experience in our model building, but surgeons’ volumes of inguinal hernia repairs seem to be of minor importance [45] and, in general, inguinal hernia repair is performed in high volume centres in Denmark [46].

The data for the study was collected from 2015 to 2018 and the prevalence of PPP at six months was consistent with the prevalence in Sweden from 2012 to 2015 [42]. It seems likely that our results can be generalised to other countries with working conditions, surgical conditions, and a health care system comparable with the Danish. Our study was restricted to men, and the results may therefore not be generalisable to women.

The prediction models estimated probabilities of PPP accurately in both developmental and validation data sets. This is valuable with a view to personalised guidance to patients and planning of perioperative measures. The PPP risk score may even be useful as a screening tool although the score’s predicted risks did not separate very clearly between those patients who did and did not develop PPP (the AUCs were all around 0.65, which is not excellent). Different trade-offs between sensitivity and specificity (cf. Table 3) may be considered depending on the risks and costs connected with potential interventions to reduce the risk.

Preoperative risk stratification could support clinical decision-making with respect to pre-, peri-, and postoperative management of patients with inguinal hernia. E.g., high-risk patients may be recommended a longer convalescence or a transient reduction of occupational mechanical exposures postoperatively (for highly exposed individuals only). A more strict postoperative follow-up concerning pain and complications may also be considered to treat early postoperative pain more efficiently [1, 8]. We believe that our model has a potential for preoperative risk stratification in clinical settings, and the PPP risk score can easily be translated to an app for use in clinical practice. A logical next step would be an impact study [47] to evaluate if preoperative risk stratification and personalised treatment strategies do indeed improve clinical decision making and lead to a reduced risk of PPP after inguinal hernia repair.

In conclusion, our results suggest that the risk of PPP after inguinal hernia repair can be predicted using a preoperative risk score based on age, occupational mechanical exposures (total load lifted per day and time per day spent working in a bent-over position), BMI, leisure-time physical activity, and repair of recurrent hernia.