Introduction

Acute radiation dermatitis (RD) is a prominent adverse side effect of external beam radiotherapy (RT), with symptoms arising in up to 95% of patients [1]. While RT remains a promising intervention in the prevention of locoregional cancer recurrence, RD can worsen patient’s quality of life, lead to interruptions in treatment, and is often characterized by erythema, moist desquamation, edema, pruritus, and pain [2, 3]. Unfortunately, there is no “gold standard” intervention for the prevention or management of RD, despite decades of research on the topic.

Clinical practice guidelines on RD care have been published by the Multinational Association of Supportive Care in Cancer (MASCC) in 2013, along with several other institutions [4, 5]. Due to a lack of definitive guidelines and increasingly new research on RD care in the last decade, an update of guidelines was warranted. A team of experts across the globe, within and outside of MASCC, have developed the 2022 Clinical Practice Guidelines on RD Prevention and Management in accordance with the MASCC Guidelines Development Policy [6] to reflect the current literature and expert opinions on RD care. After a comprehensive literature search, further in-depth, quantitative analyses of certain types of RD interventions were highly warranted due to the high number of randomized trials available. To date, few meta-analyses have been published comparing interventions for RD care due to a lack of comparable evidence, but with increasingly available literature, systematic reviews and meta-analyses were conducted as a sub-study of the MASCC RD Guidelines Development Project. A panel of researchers assisted in the process of conducting meta-analyses to pool data across multiple primary studies. Ultimately, these reviews and meta-analyses will act as independent resources to guide clinical decision-making, supplementary to the upcoming MASCC Clinical Practice Guidelines.

The present paper provides an overview of the methodology used in conducting meta-analyses on various interventions used in RD prevention and management. The findings of these reviews will be published in a series of seven subsequent articles (six meta-analyses, one critical review) organized by RD treatment category.

Initial systematic review for guideline development

A comprehensive literature search was conducted for the purpose of developing updated RD guidelines by the MASCC Oncodermatology Study Group RD Guidelines Working Group [7, 8]. All study types were included in the systematic review, regardless of study design or type of skin intervention. The systematic review provided an overview of all interventions used in the prevention or management of RD. An intervention was considered preventive if administered prior to the onset of RD symptoms.

Scope of review

The scope of the review was kept broad to avoid exclusion of any relevant studies. The following inclusion and exclusion criteria were followed.

Inclusion criteria

  • Original research studies on an intervention aimed at preventing or managing RD in cancer patients undergoing external beam RT

  • English language

  • Full-text or abstract available

  • Human subjects

  • RD severity or RD-related symptoms measured as primary and/or secondary outcomes

Search strategy

A medical librarian conducted a comprehensive literature search of Ovid MEDLINE, Embase, and Cochrane Central Register of Controlled Trials Databases. The search was performed on September 21, 2020 (1946 to September 2020), and followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Table 1).

Table 1 Search strategy

Assessment of quality of evidence

In accordance with the MASCC Guideline Policy, quality of evidence was assessed using the Hadorn et al. (1996) criteria based on the presence of “major” or “minor” flaws across several diagnostic criteria (i.e., selection of patients, allocation of patients to treatment groups, study administration, etc.) (Table 2) [9]. Any studies with none or “minor” flaws were assigned a quality of evidence of “Adequate”, while studies with “major” flaws were assigned a quality of evidence of “Doubtful”, indicating potentially poor study methodology.

Table 2 Hadorn criteria (1996) for quality of evidence assessment [9]

Systematic reviews and meta-analyses

Study selection and logistics

The literature search results are summarized in Fig. 1. After the initial systematic review, 240 studies were identified for inclusion in the development of RD guidelines (151 randomized, 89 non-randomized). Each of the RD interventions assessed within the 240 articles were compiled in a list. If a skin intervention was investigated by two or more independent randomized controlled trials (RCTs) and reported quantitatively comparable data, those RCTs were included to be further analysed through meta-analyses. Only full-texts were included. Thus, 51 RCTs were chosen for inclusion in quantitative analysis. The rationale for solely including RCTs can be explained by the fact that RCTs are considered to be of higher quality than non-randomized studies. Additionally, studies were only included if they were one of two or more studies investigating the same intervention because two or more comparable studies are required to conduct a meta-analysis. If a quantitative analysis could not be done on a given skin intervention due to a lack of comparable outcomes between RCTs, a narrative review was synthesized.

Fig. 1
figure 1

PRISMA diagram

Among the RCTs included for quantitative analysis, a total of six RD intervention categories were identified: barrier films and dressings, photobiomodulation therapy, topical non-steroidal agents, topical steroidal agents, antiperspirant/deodorant, washing with water/soap, and natural and miscellaneous agents. A team of researchers across 10 countries was convened to lead each meta-analysis, with two to three co-leaders assigned to a single intervention category. Co-leaders led all stages from data collection to manuscript drafting under supervision of an advisory team within the MASCC Oncodermatology Study Group.

Data collection

Per intervention category, data extraction was completed by two to three independent reviewers to ensure consistency and accuracy. Data was collected on the patient and treatment characteristics (e.g., cancer site, type of RT, dose fractionation schedule) and the study characteristics (e.g., blinding versus open-label). Outcome measures and results were collected as categorical variables (i.e., event proportions) and/or continuous variables (i.e., mean scores). The number of patients who experienced an outcome of interest or mean scores in the experimental and control arms were collected to estimate the OR and 95% CI. Data from the intention-to-treat and per-protocol analyses was collected where available, and the study’s corresponding author was contacted in the event that both data sets were not readily available. Subgroups were determined a priori and analyses were conducted where applicable to assess the efficacy of interventions in different cancer sites or modes of administration (e.g., topical vs oral). Given the extensive variability in outcome assessment across RD trials, independent reviewers aimed to maximize the number of outcomes that could be compared across studies by assigning certain outcomes as equivalent where appropriate.

Certainty of evidence and risk of bias

To assess the certainty of evidence of each study included, the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) framework was followed, whereby a certainty of evidence could be assigned as “very low”, “low”, “moderate”, or “high” based on the independent assessment of two to three reviewers per intervention category [10]. GRADE ranks certainty of evidence based on several domains: risk of bias, imprecision, inconsistency, indirectness, publication bias, magnitude of effect, dose–response gradient, and residual confounding [10]. The risk of bias of each study was also assessed using the revised Cochrane risk-of-bias tool for randomized trials (RoB 2) by two to three independent reviewers [11]. Five domains in the RoB 2 were considered: bias from the randomization process; bias from deviations from intended interventions; bias from missing outcome data; bias in outcome measurement; and bias in selection of results reported [11]. Any disagreements between reviewers on a study’s uncertainty of evidence or risk of bias were resolved by consulting a third party to reach a consensus.

Statistical analysis

Forest plots were developed where possible using the Cochrane RevMan 5 software. Random effects models were used to generate 95% confidence intervals (CI). When categorical variables were included, the Mantel–Haenszel method was used to generate odds ratios (OR). When continuous variables were included, standard deviation and mean values were generated. I2 statistic was measured to indicate low heterogeneity (I2 < 0.25), moderate heterogeneity (I2 = 0.25–0.50), and high heterogeneity (I2 > 0.50). A p-value of less than 0.05 indicated statistical significance in the test for overall effect (Z). Based on the outcome of interest that was assessed between studies, a summary OR below 1.00 indicated that the intervention under study lowered the odds of having the outcome, while a summary OR above 1.00 indicated that the intervention increased the odds of having the outcome.

Conclusion

The methods described in this paper have allowed for the findings across many RCTs on various RD interventions to be pooled together. Findings of these systematic reviews and meta-analyses will supplement the updated MASCC RD Guidelines, which will be used to guide clinical decision-making in RD care.