Introduction

Clinical evidence for the overall survival of direct light-cured resin composites restorations in posterior teeth has been well documented [1,2,3,4,5]. This material is considered to be the first choice for esthetic restorations in posterior teeth [6].

The main reasons for the clinical failure of direct resin composite restorations in posterior teeth over time are secondary caries and restoration fractures [1, 3]. Resin composite shrinkage stress may lead to the development of marginal gaps at the tooth–restoration interface, which may result in the development of secondary caries. Conversely, the occurrence of fractures is associated with the limitations of the materials’ mechanical properties, cavity configuration, quantity and quality of the remnant tooth structure, and a patient’s occlusion [7,8,9]. To reduce the polymerization shrinkage stress, incremental layering of the resin composites has been recommended for decades. The incremental technique reduces stress at the cavity wall interface and allows a more efficient light curing of the material and a lower gap formation at the interface [10,11,12].

Bulk-fill resin composites have been designed to simplify the restorative technique because they can be placed into posterior teeth cavities in a single increment of 4–5 mm [13,14,15]. These materials offer greater translucency, allowing greater light dissipation through the material; incorporation of more reactive photoinitiators, which enable a greater depth of cure; and include monomers that act as modulators of the polymerization reaction, achieving low polymerization shrinkage [8, 16]. Two types of these materials are commercially available: base and full-body bulk-fill resin composites. Base bulk-fill materials are low-viscosity resin composites and therefore are also known as flowable bulk-fill resin composites [17]. These materials involve lower filler loading than conventional/standard microhybrid or nanohybrid resin composites, which require incremental filling. Therefore, they are used as a liner/ base, followed by capping with the conventional resin composites. Full-body bulk-fill resin composites can be applied in one increment without the need for coverage or capping. Because of their viscosity, they are also referred to as sculptable or paste-like bulk-fill resin composites, allowing the reconstruction of the lost tooth structures. In addition, these materials have high inorganic filler loading and are therefore used in areas of high masticatory load [17,18,19,20].

The simplification of the operative procedures is desirable in clinical daily practice. In this context, bulk-fill resin composites are an attractive alternative for posterior restorations. However, a clearer understanding of the clinical performance of this relatively new class of materials in comparison to conventional resin composites is required. Therefore, the aim of the current systematic review and meta-analysis was to compare the clinical performance of bulk-fill and conventional resin composites in direct restorations in posterior teeth. The hypothesis of this study was that the clinical performance of these restorative materials is similar.

Materials and methods

Protocol and registration

This review was conducted following the recommendations of the Cochrane Collaboration for systematic reviews [21] and is reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [22]. The study was registered at the international prospective register of systematic reviews—PROSPERO under the registration number CDR42016053436.

Eligibility criteria

The research question of this study was “Do bulk-fill resin composites have a comparable clinical performance to conventional composites in posterior restorations?” The population/problem, intervention/exposure, comparison, and outcome of the study were established according to the PICO question. In this respect, the population consisted of patients with direct resin restorations in permanent posterior teeth. The intervention was posterior teeth restored with a bulk-fill resin composite, and a comparison was performed with posterior teeth restored with a conventional resin composite. Evaluated outcomes were failure such as anatomical shape, marginal adaptation and discoloration, surface roughness, color, secondary caries, loss of retention, fracture, and postoperative sensitivity.

The inclusion criteria used were as follows: (1) only randomized clinical trials (RCTs), (2) studies with a follow-up period of at least 1 year, and (3) studies evaluating class I and II direct restorations in permanent teeth restored with bulk-fill and conventional resin composites.

The exclusion criteria were as follows: (1) prospective studies without randomization and retrospective studies, (2) case reports, (3) reviews, (4) in vitro studies, and (5) studies reported in more than one publication with different follow-up periods, and studies evaluating only bulk-filled resin restorations without direct comparison with conventional resin composites.

Information sources and search strategy

The databases searched were PubMed/MEDLINE, Excerpta Medica Database (Embase), Cochrane Library, and Web of Science. The search strategies for each database are represented in Tables 1, 2, 3, and 4. A manual search through journals in the area of interest, even if they are indexed in the research databases, is recommended as an additional form of search [23,24,25]. Therefore, the electronic search was complemented by manual searches of the following journals: Journal of Orofacial Sciences, Operative Dentistry, Dental Materials, and Journal of Dentistry. Additionally, the reference lists of the included studies were checked to identify possible relevant studies. The electronic search was performed up to January 2018 and without any language restrictions.

Table 1 Search strategy for the PubMed/MEDLINE database
Table 2 Search strategy for The Cochrane Library
Table 3 Search strategy for the Embase database
Table 4 Search strategy for the Web of Science database

Study selection and data collection

Two independent researchers (S.R.M.V. and C.A.A.L.) performed the electronic search, selecting studies based on titles and abstracts that answered the research question. Relevant information extraction, as well as an assessment of the risk of bias of the articles, were performed by one of the researchers (S.R.M.V.) and verified by a second researcher (C.A.A.L.). Disagreements between the evaluators were resolved by consensus.

Relevant information was extracted using a form containing questions regarding the author, study design, number of patients and restorations, mean age, follow-up, bonding agent and resin composites of the control and intervention groups, number of surfaces and restoration locations, and restoration failures [26].

Risk of bias in individual studies

The risk of bias of the selected articles was assessed using the Cochrane risk of bias tool for randomized clinical trials, which uses a domain-based approach [21]. In this tool, the aspects of bias risk are evaluated individually without assigning scores and are divided into six domains: random sequence generation, allocation concealment, blinding of outcome assessors, blinding of participants and personnel, incomplete outcome data, and selective outcome reporting, comprising the assessment of selection, performance, attrition, reporting, and detection bias. Each domain is classified as having a low risk, unclear risk, or high risk of bias.

Summary measures

The data extracted were analyzed using the Review Manager (RevMan) 5.3 software (The Cochrane Collaboration, Copenhagen, Denmark). The relative risk (RR) and 95% confidence interval (CI) were calculated for each study. The data of the eligible studies were ordinal, referring to the scores for the characteristics of the restorations evaluated using the modified versions of the U.S. Public Health Service (USPHS) (Table 5). For further analysis, these data were dichotomized as either acceptable or unacceptable. The acceptable restorations were those that received the Alpha and Bravo scores. The unacceptable restorations were those that received the Charlie and Delta score in at least one of the characteristics. Therefore, failed restorations were those that were classified as unacceptable.

Table 5 Summary of the modified USPHS—United States Public Health Service—evaluation criteria (the reported scores were cited as either Alpha, Bravo, Charlie, and Delta or 0, 1, 2, 3, and 4)

The I2 index was used to measure the percentage of variation across studies that was due to heterogeneity, where 25% corresponded to low heterogeneity, 50% to moderate heterogeneity, and 75% to high heterogeneity. A fixed-effect model was used because no statistically significant heterogeneity was found among the studies (p > 0.10) [37].

Additional analysis

The kappa coefficient was calculated to determine inter-reader agreement in the initial selection of the articles in the PubMed/MEDLINE, Embase, The Cochrane Library, and Web of Science.

Results

Selection of studies

The initial search of the databases retrieved 522 articles, including 300 in PubMed/MEDLINE, 79 in Embase, 16 in Cochrane Library, 127 in Web of Science, and 2 in the manual search (Fig. 1). After the removal of duplicates, 356 articles remained and 15 were selected based on the research question. After reading the full text, five articles were excluded; one was excluded because of a short follow-up period (30 days) [38] and the other four were excluded because they contained the data reported in another study with a longer follow-up period [39,40,41,42]. Thus, 10 articles were selected for final analysis of the results. The kappa test was applied to determine inter-reader agreement in the initial selection of the studies, which showed a high level of agreement for PubMed/MEDLINE (kappa = 0.9), Embase (kappa = 1.0), The Cochrane Library (kappa = 1.0), and Web of Science (kappa = 0.83) [43].

Fig. 1
figure 1

Flow diagram of the article selection process

Characteristics of included studies

Ten studies were selected for qualitative and quantitative analysis. A total of 1076 restorations were performed in 459 patients, and 941 restorations were evaluated. Nine studies were split-mouth studies [27,28,29,30,31,32,33,34, 36] and one was a parallel group study [35]. The mean follow-up was 33.6 months (12–72 months). Five studies [27, 30, 32,33,34] did not report sample size calculation.

Rubber dam isolation was only reported in four studies [30, 34,35,36]; the remaining six studies used only cotton rolls and saliva ejectors. In one of these four studies [30], rubber dam isolation was only employed only when cotton rolls/saliva ejector were insufficient. Most studies did not report the use of a lining material. In two studies [27, 32], used calcium hydroxide cement was used as a liner in deep cavities, and in one study [33], calcium hydroxide cement and/or glass ionomer cement was used in deeper cavities. Regarding the bonding agent used, self-etch systems were used in both the intervention and control groups of six studies [27,28,29, 32,33,34], three studies [31, 35, 36] used etch-and-rinse systems, and one study [30] used an etch-and-rinse system in the control group and a self-etch system in the intervention group. All base/flowable bulk-fill resin composites were used with a 2-mm capping of a conventional resin composite [28, 29, 31]. One study [27] evaluated four groups: two full body/sculptable bulk-fill composites, a base/flowable bulk-fill composite covered with a conventional resin composite, and a conventional resin composite by itself. The remaining studies [30, 32,33,34,35,36] compared full-body/sculptable bulk-fill resin composites with conventional composites (incremental technique) (Table 6).

Table 6 Qualitative summary of the ten selected studies

Among the 941 restorations evaluated, 43 failures occurred, with a 5.57% (29 of 520) failure rate in bulk-fill resin composites and a 3.32% (14 of 421) failure rate in conventional resin composites. The causes of restoration failure were secondary caries (23%; n = 10), tooth and resin fractures (19%; n = 8, each), postoperative sensitivity (9%; n = 4), anatomical shape and poor marginal adaptation (7%; n = 3, each), marginal discoloration (9%; n = 4), caries associated with tooth fracture (5%; n = 2), and retention (2%; n = 1) (Table 7).

Table 7 Qualitative summary of restoration failures

Assessment of risk of bias

Six of the included studies did not describe the randomization process. Karaman et al. [31], Arhun et al. [32], and Çolak et al. [33] used coin tossing for randomization, and Yazici et al. [36] used a random number table of the groups. None of the studies mentioned the method used for allocation concealment. Four studies were unclear about the blinding of participants and personnel [30, 33,34,35], and the other studies reported that patients were blinded. Only one study was unclear about the blinding of outcome assessment [35]. The incomplete outcome data domain was unclear in four studies [27, 31, 32, 36] because of unexplained reasons for participant loss. All studies had a low risk of bias regarding selective reporting. Even when the study protocol was unavailable, it was clear that the published reports clearly included all expected outcomes, including those that were prespecified [21] (Fig. 2).

Fig. 2
figure 2

Risk of bias of the included studies

Meta-analysis

The meta-analysis included the 10 studies selected in the systematic review. The failure rates of bulk-fill and conventional resin composite restorations were evaluated using subgroups for the classification of the bulk-fill resin composites (base/flowable and full-body/sculptable). No significant differences were observed between conventional resin composites and base/flowable (p = 0.31; RR 1.49; 95% CI 0.69–3.25; p = 0.10; I2 = 56%) or full-body/sculptable bulk-fill resin composites (p = 0.12; RR 1.89; 95% CI 0.84–4.24; p = 0.51; I2 = 0%) (Fig. 3).

Fig. 3
figure 3

Forest plots of included studies

Publication bias

A funnel plot was used for assessing the publication bias. The funnel plot of the studies included in this review exhibited symmetry, indicating low heterogeneity and the possible absence of publication bias (Fig. 4).

Fig. 4
figure 4

Funnel plot

Discussion

The hypothesis of this study was not rejected. The clinical performance of bulk-fill resin composites is comparable to conventional resins in direct posterior restorations. The studies included in this review reported similar results with the use of bulk-fill resin composites, regardless of type (base/flowable and full-body/sculptable).

The greater translucency of bulk-fill resin composites [8, 44] and the use of more reactive photoinitiators [16] allow a higher depth of cure. The higher reactiveness enables the insertion of the material in thick increments of 4–5 mm, with uniform polymerization and degree of conversion. These factors are essential to obtaining satisfactory mechanical properties and, consequently, increasing the longevity of the restorations [45, 46]. In addition, bulk-fill resin composites contain polymerization modulators that achieve low contraction and less stress at the bonded interface [8, 16, 47]. The insertion of thicker increments also contributes to reducing the incorporation of air voids, forming a more homogeneous restorative unit [8, 16].

The longevity of restorations is also related to the operative technique, as well as the patients’ and cavity characteristics [27]. In this respect, a larger number of failures in restorations of posterior teeth seem to be related to parafunctional habits such as bruxism [5]. Van Djiken and Pallesen [28, 29] reported a considerable significant number of failures caused by fracture of the material and tooth, most of which occurred in patients with bruxism.

Failure caused by secondary caries was considered in the studies when it was clinically observed in continuity with the margins of the evaluated restoration [26]. Secondary caries can be associated with the presence of marginal defects in a restoration [6, 9] or with high caries risk patients [48]. However, characteristics such as a high caries index and poor oral hygiene were considered exclusion criteria in most studies [27, 30,31,32,33,34,35,36]. Van Djiken and Pallesen [28, 29] did not exclude patients with this condition and confirmed that failure caused by secondary caries was associated with patients at high risk of caries. Thus, secondary caries may be related to biological failure rather than with the restorative material used [5, 48, 49]. The presence of secondary caries may also be related to problems of marginal adaptation and local failures such as contamination with saliva during the restorative procedure [27]. A previous systematic review [50] observed that direct restorations performed with rubber dam isolation resulted in a lower failure rate than restorations performed with only cotton rolls and saliva ejectors.

The location and number of surfaces involved, as well as the extent of the cavities, also appear to be related to the overall survival of restorations. The studies included in this review evaluated class I and II direct restorations in posterior teeth (molars and premolars). Five studies evaluated only class II restorations. Three studies did not report failure in class I restorations [28, 29, 34] and two other studies did not observe any failure in class II restorations over the follow-up period [31, 36]. Class II restorations appear to be more prone to the development of secondary caries compared to class I restorations [48]. Thus, the failure rate seems to be lower for cavities where only one surface is affected instead of two [3].

Cavity depth [51] and extent [52] are factors that can influence postoperative sensitivity. Apart from the study conducted by Karaman et al. [31], which used endodontically treated teeth, only one study [34] radiographically confirmed that all were performed in 4- to 5-mm-deep cavities, and all other studies evaluating postoperative sensitivity did not describe the depth of the preparations. Two studies [28, 29] reported that most cavities were deep. Nevertheless, a low rate of failure caused by postoperative sensitivity was observed in the included studies. This result should be carefully analyzed because one study [35] contained many reports of an initial sensitivity with a Bravo score that then regressed but did not receive a Charlie score (intense and intolerable sensitivity).

The small number of failures caused by postoperative sensitivity may be associated with the use of lining materials in deep cavities [27, 33] and the use of self-etch adhesive systems [30]. Conversely, a systematic review found no statistically significant relationship between postoperative sensitivity and the bonding system used [53]. No association was found between the placement technique of the bulk-fill resin composite (incremental and bulk-filling) and the cavity depth on the postoperative sensitivity [52]. A Cochrane review found inconsistent evidence regarding the use of liners and restoration failure, particularly concerning postoperative sensitivity [54].

Air entrapment between increments can occur when the incremental technique is used, causing sensitivity and degradation of the restorative material degradation [10, 11, 16]. Although no differences were observed between bulk-fill and conventional resin composites, the lower technical sensitivity and the simplified procedure of the former support its use in restorations of posterior teeth.

Randomized clinical trials are a critical method for clinically evaluating new materials and treatments because they are standardized to achieve greater significant credibility and clinical reliability. However, a detailed description and similar methods should be employed to allow this comparison. All studies included in this review used the modified USPHS criteria, but observed variations resulted in a lack of standardization among the studies. Thus, the different analyzed aspects and various instruments and assessment criteria used hampered the comparison of the results.

The USPHS criteria are the most common method for evaluating restorations. However, they have shown a limited sensitivity, and the categories may not fully reflect the clinical success of restorations. Clinical trials that have used other criteria tend to detect failure rates more than four times higher than produced by the USPHS. However, this may be due not only to the greater sensitivity of these criteria but also to a large number of other factors, such as the fact that not all new systems are fully validated [55]. The World Dental Federation (FDI) criteria have been reported as an alternative. The categories can be selected according to the objective of the study, and the scores of 1 to 3 can also be simplified and considered as “clinically satisfactory” [56].

Similarly, the randomization procedure and allocation concealment are fundamental to the design of randomized clinical trials to avoid selection bias. Most of the included studies did not provide a full description of these steps. Göstemeyer et al. [55] reviewed the design and validity of randomized controlled dental restorative trials and observed a high risk of bias, mainly in the domains of allocation concealment (selection bias 93%) and blinding of participants and personnel (performance bias 99%) or blinding of outcome assessment (detection bias 46%). The blinding of the operator and examiners may, in certain cases, be more difficult or even impossible to do depending on the studied materials. However, allocation concealment can be implemented in all trials.

Sample size calculation was also not clearly described in five of the included studies [27, 30, 32,33,34]. In a previous review [55] of 114 studies, only 17% reported sample size calculations, with relatively small sample sizes varying from 8 to 456 participants (median 37). Additionally, most studies described sample size calculation inadequately. Although the absence of sample size calculation does not affect the risk of bias of clinical trials, it may result in underpowered studies that are unethical and wastes considerable resources. Small samples are unable to highlight small differences in the results [55, 57].

Thus, the results indicated the lack of adherence of the authors to the Consolidated Standards of Reporting Trials (CONSORT) statement for clear and transparent reporting of a randomized clinical trial, which would have aided the reader in forming a judgment and conclusions [58].

The follow-up periods observed in this study varied from 12 to 72 months. Differences in the efficacy of therapies can only be measured only after several years because failure behavior can vary, and one type of material may be more susceptible to long-term dental fracture and the other caries. Therefore, long periods of observation are essential (often more than 10 years) to observe all pertinent effects and differences. However, maintaining a population of participants over a prolonged period is extremely challenging, and attrition bias is very common [56].

The results of the present review should be interpreted with caution because of the small number of clinical studies evaluated. Further randomized clinical trials with longer observation periods are still needed before a full recommendation of clinical protocol change regarding direct restorations of posterior teeth with resin composites.

Conclusion

This systematic review and meta-analysis revealed that the clinical performance of bulk-fill and conventional resin composites in direct restorations of posterior teeth was similar, within a follow-up period of 12 to 72 months. However, randomized clinical trials with longer follow-up periods are still necessary.