Introduction

Nowadays, there is an increasing number of patients requesting treatment for dental discoloration due to recent health-related and esthetic demands [1, 2]. Among the therapies for dental discoloration, vital tooth bleaching is a safe and conservative approach. This procedure can be performed in the office by the dentist using high concentrations of hydrogen peroxide [HP] or can be applied by the patient at-home using low concentrations of carbamide [CP] and hydrogen peroxides [3].

Since the introduction of the at-home bleaching technique [3], successful results have been reported [46]. In a recent questionnaire-based survey conducted in the southern of Brazil, at-home bleaching was broadly preferred (78.1 %) for vital tooth bleaching compared with in-office therapy (21.9 %) [7]. This trend seems to occur worldwide. The advantages of the at-home technique include ease of application, reduced chair time and costs, high rates of success [811], and safety of materials used [10, 12]. Carbamide peroxide breaks down into HP and urea, with the HP concentration being approximately one third of the original CP percentage. There are many carbamide peroxide bleaching systems on the market with concentrations ranging from 10 to 22 % [13]. Irrespective of the concentration, these bleaching gels are recommended for periods ranging from 8 (overnight) to 2 h daily [6, 1315].

Even though at-home bleaching is the most frequently recommended treatment, some patients do not adapt to the technique, because they need to wear the bleaching tray for longer periods of time to see effective results in 2 to 3 weeks. These patients might request a method with better comfort that reaches the same bleaching efficacy.

In view of this market need, some companies have introduced low-concentrated strips or tray-delivered HP products for at-home bleaching. They claim that these new systems are as effective as CP in equivalent concentrations with the advantage of requiring less time of use. However, the literature is still controversial as to which product is actually the most effective one [1618]. Although an earlier systematic review of the literature has already evaluated the effectiveness of at-home bleaching agents [13], this review was published 8 years ago. Additionally, they did not aim to investigate a narrow research question as several comparisons were done, making the clinical decision harder.

Therefore, the aim of this systematic review of the literature was to answer the following focused question: Are there differences in color changes of tray-delivered CP versus HP for at-home dental bleaching in patients of any age group?

Material and methods

Protocol and registration

We registered the study protocol at the PROSPERO database under the number CRD42015008993, and we followed the recommendations of the PRISMA statement for the report of this systematic review [19].

Eligibility criteria

The controlled vocabulary (mesh terms) and free keywords in the search strategy (Table 1) were defined based on the following elements of the PICOS question:

Table 1 Electronic database and search strategy (1 June 2014)
  1. 1.

    Population (P): patients with dental discoloration of any age group

  2. 2.

    Intervention (I): at-home bleaching with HP-based products

  3. 3.

    Comparison (C): at-home bleaching with the tray-delivered CP products

  4. 4.

    Outcome (O): color change (primary outcome); Secondary outcomes (tooth sensitivity risk and gingival irritation) were also assessed

  5. 5.

    Study design (S): randomized clinical trials (RCTs)

A validated filter from the Cochrane Handbook for Systematic reviews of Interventions 5.1.0 (http://handbook.cochrane.org) for RCTs was employed for the PubMed database. The strategy keys of intervention and comparison were combined. The O was not used in the search strategy to maximize the sensitivity over the specificity of the search strategy.

Only RCTs that compared the color change of tray-delivered CP with HP-based products in permanent dentition of patients of any age group were eligible. We included parallel or split-mouth design in clinical human trials (Table 1).

No minimum follow-up was required for inclusion as we were interested in the immediate color change reached by the products. The color change in ΔE or in shade guide units (ΔSGU) was the primary outcome of the study, and the risk of tooth sensitivity and gingival irritation were the secondary outcomes. No restrictions regarding settings (academic university department, dental hospital, primary care, private practice, etc.) were established.

Non-controlled clinical trials, editorial letters, pilot studies, historical reviews, in vitro studies, cohort, and observational and descriptive studies, such as case reports and case series, were excluded. Additionally, RCT studies were excluded if (1) the experimental group was not an at-home HP bleaching product, (2) a control tray-delivered CP was not used, (3) when HP products were compared to placebo gels, and (4) over-the-counter instead of tray-delivered CP was used as control.

Information sources and search

To identify trials to be included for this review, we searched on the electronic databases MEDLINE via PubMeb, Scopus, Web of Science, Latin American and Caribbean Health Sciences Literature database (LILACS), Brazilian Library in Dentistry (BBO), and Cochrane Library (Table 1). We hand-searched the reference lists of all eligible primary studies for additional relevant publications and the related article links of each eligible primary study in the PubMed database. No restrictions were placed on the publication date or language.

The abstracts of the annual conference of the International Association for Dental Research (IADR) and their regional divisions (1990–2014) were also searched, and authors of relevant abstracts were contacted for further information. The grey literature was explored using the database System for Information on Grey literature in Europe (SIGLE). Dissertations and theses were searched using the ProQuest Dissertations and Theses Full Text database as well as the Periódicos Capes Theses database.

To locate unpublished and ongoing trials related to the review question, the following trials registry were also searched: Current Controlled Trials (www.controlled-trials.com), International Clinical trials registry platform (http://apps.who.int/trialsearch/), the ClinicalTrials.gov (www.clinicaltrials.gov), Rebec (www.rebec.gov.br), and EU Clinical Trials Register (https://www.clinicaltrialsregister.eu).

The search strategies defined for the databases described above are listed in Table 1. The search strategy was appropriately modified for each database and performed by two reviewers (I.L.-M. and A.R.) to identify eligible studies. Full-text versions of the papers that appeared to meet the inclusion criteria were retrieved for further assessment and data extraction.

Study selection and data collection process

Initially, the articles were selected by title and abstracts according to the previously described search strategy (Table 1). Articles appearing in more than one database were considered only once. Full reports were also obtained when there was insufficient information in the title and abstract to make a clear decision. Subsequently, full-text articles were acquired and two reviewers (I.L.-M and A.R.) classified those who met the inclusion criteria. We gave a study identification number for each eligible study, combining first author and year of publication. The collection form was pilot tested using a sample of study reports to ensure that the criteria were consistent to the research question.

Data were extracted using customized extraction forms and the following data recorded for each included study:

  • Details of the study including year of publication, author(s), setting, and evaluation criteria for color change

  • Details of study method such as the study design and setting

  • Details of participants including age (mean and range), sex, and number of patients per group

  • Details of the bleaching products used, including concentration and bleaching protocol (bleaching daily time, time of bleaching treatment)

  • Details of the methods used for evaluation of adverse effects such as tooth sensitivity and gingival irritation

  • Details of the outcomes including tooth used for color evaluation, color change in ΔE and ΔSGU, and number of dropouts: When more than one bleaching gel of each group was investigated, their values were combined to make a single entry per outcome

  • If the study reported any conflict of interest

Risk of bias in individual studies

Quality assessments of the included trials were evaluated by two independent reviewers (I.L.-M and A.R.), using the Cochrane Collaboration’s tool for assessing risk of bias in randomized trials [20]. The assessment criteria contained six items: sequence generation, allocation concealment, blinding of the outcome assessors, incomplete outcome data, selective outcome reporting, and other possible sources of bias. All these domains were evaluated at the study level. During data extraction and risk of bias assessment, any disagreements between the reviewers were resolved through discussion, and if needed, by consulting a third reviewer (L.C.M.).

For each aspect of the quality assessment, the risk of bias of each domain was scored following the recommendations of the Cochrane Handbook for Systematic reviews of Interventions 5.1.0 (http://handbook.cochrane.org). The judgment for each entry involved recording “yes” indicating low risk of bias, “no” indicating high risk of bias, and “unclear” indicating either lack of information or uncertainty over the potential for bias.

For the ΔSGU, tooth sensitivity risk and gingival irritation, which are subjective measures, three (random sequence generation, allocation concealment, and blinding) out of the six domains from the Cochrane risk of bias tool were considered the key domains for the assessment of the risk of bias. For the ΔE, which is an objective measure that cannot be influenced by the examiner’s awareness of the group assignment, we only considered adequate sequence generation and allocation concealment as the key domains.

Studies were considered to be at “low” risk of bias if there was adequate sequence generation, allocation concealment, and evaluator blindness, being the latter only considered for the ΔSGU outcome. When the study was judged as “unclear” in their key domains, we tried to contact authors to obtain more information and allow a definitive judgment of “yes” or “no.”

Summary measures and synthesis of the results

Data from eligible studies were either continuous (color change) or dichotomous (tooth sensitivity and gingival irritation). We performed subgroup analysis according to the risk of bias of the included studies (“low” risk of bias and “unclear/high” risk of bias). Although subgroup analyses were not pre-specified in the research protocol registered at PROSPERO, it allowed us to evaluate the impact of the exclusion of “unclear/high risk” of bias studies on the overall results.

To summarize the color change for each study, we calculated the standardized mean difference with a 95 % confidence interval (CI). For the tooth sensitivity and gingival irritation, we calculated the risk ratio along with the 95 % CI. The random effects models were employed. Heterogeneity was assessed using the Cochran Q test and I 2 statistics. All analyses were conducted using RevMan (Review Manager, version 5.3 software, Cochrane Collaboration, Copenhagen, Denmark). We performed a sensitivity analysis whenever decisions about the data had to be done so that the impact of such decision on the overall conclusion could be evaluated.

Results

Study selection

After the database screening and removal of duplicates, 1379 studies were identified (Fig. 1). After title screening, 195 studies remained and this number was reduced to 29 after careful examination of the abstracts. The full texts of these 29 studies were assessed to check if they were eligible. Among them, 16 were excluded due to the following reasons: (1) old systematic reviews [ [13, 21]], (2) case series studies [ [22]], (3) placebo-controlled studies [ [2325]], (4) studies that did not compare PC and PH products [ [2631]], and (5) studies that did not employ a control tray-delivered CP product [3235].

Fig. 1
figure 1

Flow diagram of study identification

Characteristics of included articles

The characteristics of the 13 selected studies are listed in Table 2. The parallel study design was predominantly used in these studies [9, 14, 15, 17, 18, 3640]. Most of the studies whose setting was reported were conducted in universities [6, 9, 11, 1418, 36, 40].

Table 2 Summary of the studies selected for this systematic review

Eight out of the 13 studies employed a VITA shade guide for color evaluation [6, 9, 11, 14, 1618, 36]. Four out of these eight studies added an objective instrument (spectrophotometer or colorimeter) for color assessment [11, 14, 16, 17]. Digital image analysis was employed in four studies [15, 3638] and only one employed a fluorosis scale [40] for color change assessment.

The number of patients included in these studies ranged from 16 to 114 participants. The mean age of all participants included in the clinical trials was approximately 35.7 years [9, 14, 15, 17, 18, 3640], except for three articles that did not report this information. In six [9, 15, 16, 18, 37, 40] out of the 13 studies, bleaching trays with reservoirs were fabricated (Table 2). Three studies [14, 17, 36] did not employ reservoirs, and four papers did not report this information [6, 11, 38, 39]. More than half of the studies were fully or partially supported by manufacturers, posing an important conflict of interest [11, 15, 16, 3739] (Table 2).

Bleaching protocol

As shown in Table 2, nine of the selected studies used HP in strips [9, 1517, 3640] and four papers used HP delivered in a tray [6, 11, 14, 18] (Table 2). The concentration of the CP gels varied from 5 to 35 %, but most of the studies employed a 10 % CP gel [6, 9, 14, 15, 17, 37, 40]. For HP, the concentrations varied from 2.5 to 14 %, being 7.5 % the concentration most used [11, 14, 36, 40]. The studies selected showed a protocol of treatment from 2 to 28 days, whereas 14 days was the bleaching period most used [11, 14, 17, 18, 37, 38] (Table 2).

Assessment of the risk of bias

The assessment of the risk of bias of the selected studies is presented in Fig. 2. Few full-text studies reported the method of randomization employed and how the allocation concealment was performed in their full texts. As these two items were the key domains of the current systematic review, authors were contacted for further information. E-mails were sent to 11 corresponding authors of the 12 different studies [6, 9, 11, 14, 15, 17, 18, 3740] to request further information, but only seven corresponding authors answered [6, 14, 15, 18, 3739]. Eight [6, 11, 1417, 36, 38] were considered to be at “low” risk of bias in these two domains and five [9, 18, 37, 39, 40] “unclear” for at least one of these two domains.

Fig. 2
figure 2

Summary of the risk of bias assessment according to the Cochrane Collaboration tool. Underlined authors provided extra information by e-mail to allow assessment of the risk of bias

In relation to blinding of participants and evaluators, this domain was judged “unclear” in one full-text article [39] and at “high” risk of bias in another one [17]. Regarding selective reporting, all studies, except one [39] were considered at “low” risk of bias.

In summary, from the 13 studies, five [9, 18, 37, 39, 40] were considered to be at “high” risk of bias in the key domains of the Cochrane risk of bias tool, yielding 8 studies [6, 11, 1417, 36, 38] which met the best requirement features (randomization, allocation concealment, and blinding) for meta-analysis of the color change in ΔE. From these eight studies classified as at low risk of bias, two [6, 15] did not report color change in terms of ΔE.

For ΔSGU, eight studies [6, 11, 1417, 36, 38] met the best requirements (randomization and allocation concealment) for meta-analysis, but two [15, 38] did not measure the color change in shade guide units.

Meta-analyses

In all meta-analyses conducted, subgroup analyses with studies classified as at “low” risk of bias and those classified as at unclear/high risk of bias were performed. In none of the cases, the incorporation of the studies as at “unclear/high” risk of bias changed the conclusions reached with the studies classified as at “low” risk of bias. In light of that, the description of the results section will focus on the subgroup of studies as at “low” risk of bias.

Color change

For both meta-analyses (ΔE and ΔSGU), we included data for only those participants whose results were known (available case analysis). The impact of this decision was evaluated in a sensitivity analysis, where an intention-to-treat protocol was applied. No change in the overall significance was shown (data not shown).

The forest plot of the ΔE for the subgroup of studies at “low” risk of bias indicated that the data was not heterogenous (chi2 test p = 0.20; I 2 = 31 %; Fig. 3). For ΔSGU, the heterogeneity among the studies classified as at “low” risk of bias was significant (chi2 test; p < 0.001; I 2 = 69 %; Fig. 4). This heterogeneity was caused by inclusion of the study data of a single paper [17], as detected by means of a sensitivity analysis.

Fig. 3
figure 3

Forest plot of the ΔE outcome for color change. The included studies not listed in the meta-analysis did not report the color change

Fig. 4
figure 4

Forest plot of the ΔSGU outcome for color change. The included studies not listed in the meta-analysis did not report the color change in shade guide units

In three studies, two for the ΔE [37, 38] and another for ΔSGU [6], the standard deviations were not reported in the original articles and we imputed an arbitrary value based on the standard deviations of the other studies (Figs. 3 and 4). Through a sensitivity analysis, we observed that imputations of half, the same, or twice the mean did not produce significant changes in the model (data not shown).

For ΔE, the standardized mean difference for the subgroup of studies at “low” risk of bias was −0.45 (95 % CI −0.69 to −0.21) (Fig. 3), being statistically significant (p < 0.001). In other words, bleaching with tray-delivered CP provided more color change than HP gels. For ΔSGU (Fig. 4), the standardized mean difference for the subgroup at “low” risk of bias was 0.08 (−0.32 to 0.48) and it was not statistically significant (p = 0.70), meaning that both groups yielded similar color changes in terms of shade guide units.

Tooth sensitivity

Figure 5 shows the forest plot for the tooth sensitivity. Data was not heterogeneous for the subset of studies at “low” risk of bias (chi2 test, p = 0.90; I 2 = 0 %). The risk rate was 0.98 (95 % CI 0.78 to 1.23), and no significant difference was detected among the study groups (p = 0.83).

Fig. 5
figure 5

Forest plot for the tooth sensitivity. The included studies not listed in the meta-analysis did not report the tooth sensitivity risk

Gingival irritation

Figure 6 shows the forest plot for the gingival sensitivity. Data was not heterogeneous for the subset of studies at “low” risk of bias (chi2 test, p = 0.17; I 2 = 38 %). The risk rate was 1.18 (95 % CI 0.62 to 2.23), and no significant difference was detected among the study groups (p = 0.62).

Fig. 6
figure 6

Forest plot for the gingival sensitivity. The included studies not listed in the meta-analysis did not report the gingival sensitivity risk

Discussion

Meta-analysis takes the advantage of aggregating information with a higher statistical power for any measure of interest, as opposed to a less precise measure derived from a single study [20, 41]. However, this method presents some weaknesses. Meta-analysis cannot control for sources of bias of individual studies: a good meta-analysis of badly designed studies will still result in bad statistics. This means that only methodologically sound studies should be included in a meta-analysis in a practice called “best evidence synthesis” [20]. Although we have performed subgroup analyses with studies classified as at “low” risk of bias and “unclear/high” risk of bias, the main conclusions should be based only in the studies classified as at “low” risk of bias in regard to randomization and allocation concealment, opposed to earlier studies that based their conclusions on all included studies regardless of their risk of bias [13, 21]. Although the inclusion of studies at “high” risk of bias did not change the overall conclusions of the present investigation, this type of inclusion tends to overestimate the effect size [42].

Randomization, when correctly performed, guarantees that the chances of a patient being allocated in either test or control group are the same for all participants, which means that both known and unknown prognostic factors are balanced between groups [20]. As important as randomization, the allocation concealment is necessary to protect the randomization process, since the treatment to be allocated is not known before the patient is enrolled into the study [20]. The adequate management of these two domains (randomization and allocation concealment) minimizes selection bias, as clinical investigators in RCTs often find it difficult to maintain impartiality when knowing treatment allocation and this may produce non-comparable groups in baseline features.

Randomization and allocation concealment were poorly described in the included studies. Although some investigators described that the participants were randomly distributed, they rarely described the method of randomization (random number tables, computer random number generator, coin tossing, shuffling cards or envelopes, throwing a dice, etc.). This prevented the authors from performing a straightforward evaluation of the risk of the bias. Description of the allocation concealment was even more infrequent in the studies, which required contact with authors for further information.

Blinding the study participants and personnel may reduce the risk that knowledge, of which intervention was received, rather than the intervention itself, affects outcomes and outcome measurements. In other words, blinding avoids performance bias [20]. For the present systematic review, participants’ blinding was difficult to perform in some studies due to the differences in the at-home protocols (strips vs. trays, for instance) [9, 1517, 3638, 40, 43], which could be easily identified by the patients.

In light of that, and considering that the primary outcome of this study was not a patient-centered response (not susceptible to bias by the patient’s knowledge of the technique), we did not consider participants’ blinding a key domain for this systematic review. On the other hand, evaluator blinding was considered important when the color change was subjectively assessed in shade guide units. For the ΔE, which derive from objective numbers produced by a spectrophotometer, evaluator blinding was not considered a key domain.

An earlier systematic review [13] also compared the effectiveness of at-home bleaching products for dental bleaching, but the authors included many different comparisons, which made it difficult to reach an overall conclusion for clinical practice. The authors concluded that most of the available dentist-supervised or over-the-counter bleaching products available in the market for at-home bleaching are effective, but this conclusion was based on studies classified as at “moderate” and at “high” risk of bias. In the present systematic review, we opted to answer a narrower PICO question, comparing the widely used tray-delivered CP bleaching protocol [3] with the most recent strips or tray-delivered HP at-home bleaching products released in the market.

In terms of ΔE, this meta-analysis revealed that tray-delivered CP showed a higher whitening efficacy than HP-based products. Regardless of the bleaching product used, the active whitening component of both types of bleaching gels is the same HP (a 10 % solution of CP is roughly 3.5 % HP and 6.5 % urea). Thus, one could expect that a better whitening efficacy would be observed for groups when higher active HP was employed.

A closer evaluation of the bleaching products used in the studies included in the meta-analysis of ΔE (Fig. 3) revealed that, opposed to previous expectation, the concentration of the active HP was higher in the HP-based products rather than in the tray-delivered CP gels. Additionally, this reduced concentrated of active HP in tray-delivered CP was not compensated for extended application time in three out of the five studies [11, 14, 16].

This highlights that other factors apart from active HP concentration are responsible for the better whitening degree achieved with the tray-delivered CP products. While both CP and HP are used for whitening, their properties are quite different. HP-based products are very unstable and release all of its active hydrogen peroxide in 30 to 60 min [44, 45]. As oxidization of organic substance involve a series of consecutive steps and takes time to occur, there might be a limit to which the rapid release a chemical reagent (in this case, HP) leads to faster reaction rate. Thus, part of the fast HP released from the HP-based products may even not have time to contact the organic substance of the teeth, being partially lost at the dental surface. This hypothesis, however, requires in vitro and clinical evaluations.

On the other hand, the release of active HP in tray-delivered CP gels is slower than in HP-based products: about 50 % of its peroxide is released in the first 2 to 4 h, then the remainder over the next 2 to 6 h [44, 46]. In this way, there will be always available HP for oxidization due to the slow release, allowing better oxidization of the organic matrix of the dentin.

Additionally, when applied on the dental surfaces, CP breaks down into HP and urea. The HP further reduces to water and oxygen and the urea to ammonia and carbon dioxide. This has an additional advantage: the ammonia yield pH increase, which favors the dissociation of HP into free radicals. It is known that in an alkaline media, the dissociation of HP into free radicals is the highest as the dissociation constant (pKa) of the HP is around 11.5. In a pH of 9, HP dissociates 2.7 times more than in a pH of 4.4 [47]. Maximum effectiveness of bleaching was shown to occur under an alkaline pH of 9.

The pH of the media not only affects the decomposition kinetics but also the type of by-products produced. While in an acidic solution, free oxygen radicals and hydroxyl anions are produced; in an alkaline medium, there is a higher concentration of perhydroxyl ions [48]. Whether or not the different free radicals produced plays a role on the whitening outcome is yet to be investigated.

From a clinical perspective, these findings should be cautiously interpreted. The lack of statistical difference between at-home protocols in the studies included in the meta-analysis of ΔSGU may suggest that the differences observed in ΔE may not be clinically significant. A change in one shade guide unit can be clinically detectable, but only changes in ΔE equal to or higher than 3 are detectable by visual inspection [49]. Perhaps on the long run, this slight difference may result in less color rebound and more stable results. However, this requires further long-term clinical trials for investigation.

ΔE values are obtained from objective measures from spectrophotometer while ΔSGU values are obtained from visual matching with shade guide units. Randomized clinical trials usually employ these two instruments (spectrophotometer and shade guide units) for color evaluation in order to increase the confidence in the results of the study. However, although similar results are obtained when they are compared, some papers show significant differences when these two measurement instruments are compared [5052]. Systematic reviews of the literature do not show new results but only express a summary of the results from primary RCTs, and this may be the reason of why the data of ΔE values reached significance and the ΔSGU did not. According to some studies, spectrophotometer readings provide more accurate results than visual shade matching with shade guides [53, 54], which is another reason for such difference between the two instruments in the present study.

Gingival irritation and tooth sensitivity, the most common side effects associated to bleaching therapy [13, 55] were not different among the at-home bleaching protocols. The gingival irritation associated with at-home bleaching is mainly related to two factors: trauma due to the tray/strip and the aggression of the hydrogen peroxide and derivates to gingiva tissue.

The use of a thick and rigid tray material, no scalloped (extended more than 1 mm onto the soft tissue), was responsible for gingival irritation at the past [31, 56]. Currently, many improvements in tray materials and designs have reduced gingival irritation, especially with the use of a soft, thin tray material [31, 56], which was used in the included clinical trials [6, 1518, 36, 37, 39].

On the other hand, bleaching strips have a predefined form and the contact of the hydrogen peroxide occurs directly with the gingival tissue during treatment. Although there is a consensus that this might be responsible for some degree of gingival irritation, we have confirmed this in the present and earlier meta-analyses of the literature [13, 57].

Regarding the tooth sensitivity, the etiology of tooth sensitivity induced by bleaching is not fully understood. Since the hydrodynamic theory of dentin sensitivity has been widely accepted as the explanation of dentinal sensation, some authors have used this theory to explain tooth sensitivity due to bleaching [58]. However, pain during and following bleaching treatment can affect intact teeth lacking dentin exposure and this is in sharp contrast with the hydrodynamic theory [59].

Kielbassa et al. in a recent review of the literature suggested that the most probable cause of tooth sensitivity is a reversible pulpitis. There is some degree of pulpal inflammation due to some amount of hydrogen peroxide that reaches the pulp. It is widely known that hydrogen peroxide can pass easily through the enamel and dentin to the pulp [60] and can cause damage to the pulp cells [61]. The inflammatory reaction into the pulp after bleaching was demonstrated by the presence of inflammatory mediators such as the cell-derived factor adenosine triphosphate [62] and prostaglandins, which excite and sensitize pulpal nociceptors [63] causing the tooth sensitivity.

However, although the risk of bleaching-induced tooth sensitivity in the primary studies was quite variable (13.8 to 66 %), the overall risk of tooth sensitivity was 41 % (95 % CI 36.0 to 46.1), which is lower from that reported for in-office bleaching that uses a higher concentration of hydrogen peroxide (62.9 %; 95 % CI 56.9–67.3) [64]. In the study of Rezende et al. [64], the authors observed that at-home bleaching was associated with reduced risk and intensity of tooth sensitivity compared to in-office bleaching, which is in agreement with the results of the present study. This is in line with the results of an in vitro study that demonstrated that the damage to the pulp cells is directly correlated with the amount of HP that reaches the pulp chamber [65, 66].

Finally, this review highlights an important problem in the evidence base of bleaching products. More than half of the studies were fully or partially supported by manufacturers. Currently, manufacturers conduct their own evaluations of products or fund researchers to test their products. While this approach to product development is standard in all industries, it may pose a significant publication bias, as studies with results that do not favor the manufacturer would not be published. Therefore, there is an urgent need for independent studies using commercially available products that follow the current standards for design and reporting of randomized controlled trials [13].

Conclusions

Tray-delivered CP gels showed a slightly better whitening efficacy than HP-based products when the color change was evaluated with a spectrophotometer; such superiority, however, could not be detected with shade guide units. Both whitening systems demonstrated equal level of gingival irritation and tooth sensitivity.