Introduction

Root canal treatment (RCT) outcome can only be assessed through case follow-up [1]. According to the European Society of Endodontology (2006), follow-up should be regularly performed for at least 1 year after RCT conclusion, aiming to monitor apical health progress [1].

Several studies have suggested that the quality of RCT may influence its outcome, and RCT quality is frequently assessed through periapical radiographs (PA) [2,3,4]. The criteria mostly used include the incidence of procedural errors and obturation characteristics, such as length and density [5, 6].

Working length, as well as obturation extent, is traditionally defined by PA [4, 6, 7], and currently, the use of electronic apex locators has made those measurements more objective and reliable [8]. As reported by the American Association of Endodontists, underextended obturation is considered an incomplete obturation of the root canal space, and overextended obturation is described as material extending beyond the radiographic apex [9]. However, there is no consensus among researchers regarding the ideal obturation extent [3, 4, 6]. On the other hand, other investigators suggested that acceptable obturation extent should end less than 2 mm away from the radiographic apex [7].

A previous systematic review published in 2008 used the terminology of short, flush, and over-filling to categorize the characteristics of root canal obturation; however, the precise obturation extent (mm) was not reported or considered by the authors [4]. Nonetheless, according to the authors, short and flush obturations were related to higher odds of favorable outcome than over-filling. Conversely, when only teeth with preoperative periapical lesions were considered, the odds of a favorable outcome for short and over-filling obturations were not significantly different [4].

Because that study was published more than 10 years ago from this publication and new data on RCT outcomes have been published, a further systematic review with more recent data is proposed. Therefore, this systematic review aims to assess the influence of obturation extent in RCT outcome by answering the following question “among patients requiring RCT on fully formed permanent teeth, is there an association between obturation extent and final treatment outcome?”

Methods

This systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Checklist [10].

Protocol and registration

A systematic review protocol based on the PRISMA Protocols (PRISMA-P) [11] was registered at the International Prospective Register of Systematic Reviews (PROSPERO) under the number CRD42017079444 [12].

Eligibility criteria

The question of this systematic review was formulated using the acronym PECOS (Population, Exposition, Comparison, Outcomes, Studies), of which (P) human permanent teeth with complete root formation; (E) primary root canal treatment (with the root canal obturation apical extent reported); (C) none or studies comparing different apical extent groups; (O) endodontic therapy outcome (odds ratio, success rate, post-operative symptomatology, signs of infection, periodontal ligament status, or periodontal bone status); and (S) clinical trials and observational studies. It is important to emphasize that only primary RCT data were considered in this systematic review, and therefore, articles on root canal retreatment were excluded. No time restrictions for publications were applied.

The following exclusion criteria were applied: (1) Studies in primary teeth; (2) studies in open apex teeth; (3) studies in which the RCT outcome was not described or correlated with obturation extent; (4) studies with less than 12 months of clinical follow-up; (5) studies with mixed treatment and retreatment data; (6) studies in which data were not clearly reported; (7) reviews, case reports, protocols, short communications, personal opinions, letters, posters, conference abstracts, and laboratory research; (8) studies not in Latin (Roman) alphabet; (9) full copy not available.

Information sources and search strategy

Individual search strategies were developed for the following databases: Latin American and Caribbean Health Sciences (LILACS), LIVIVO, PubMed, Scopus, and Web of Science. A search in gray literature was performed, including Google Scholar, Open Grey, and ProQuest. All searches were conducted from the earliest date available until October 10th, 2017 (Appendix 1). All references were managed in a reference software (EndNote X7®, Thomson Reuters, Philadelphia, PA), in which collection of references and duplicate removal were conducted.

Study selection

Selection of studies was conducted in two phases. Phase 1 was performed in a systematic review web application (Rayyan®, Qatar Computing Research Institute), in which two authors reviewed the identified references’ titles and abstracts. In phase 2, the same reviewers applied the eligibility criteria for full-text studies. In addition, the reviewers screened the reference list of selected studies individually aiming to identify potentially relevant articles. Any disagreement was resolved between the two authors and, if there was not a consensus, a third author was consulted to make a final decision.

Data collection process and data items

Two authors collected key data from selected references; any disagreement was solved between the authors. Data collection consisted of study characteristics (author, year of publication, country, and study design), sample characteristics (size, obturation extent, RCT technique, and follow-up period), outcome assessment (clinical and radiographic criteria), and studies’ main results. If the required data were not complete or the data presented could not be extrapolated, attempts were made to contact the authors by e-mail to retrieve the missing information. No further information was obtained through these contact attempts.

Risk of bias in individual studies

The Meta-Analysis of Statistics Assessment and Review Instrument (MAStARI) for cohort studies was utilized to evaluate included studies [13]. Two reviewers assessed the risk of bias (RoB) for each study and crosschecked the information. RoB was categorized as “high” when the study reached up to 49% score “yes”; “moderate” when the study reached 50% to 69% score “yes”; and “low” when the study reached more than 70% score “yes.” The figures were generated with Review Manager 5.3 (RevMan 5.3, The Nordic Cochrane Centre, Copenhagen, Denmark).

Summary measures

The association between RCT outcome and obturation extent was the main outcome evaluated. RCT outcome was reported or calculated as odds ratio (95%CI), success rates, and healing rates (reported or calculated as percentage). For the purposes of synthesis, “favorable” outcome was defined according to the report of the included studies, based on the success and healing parameters. Studies were grouped based on the ideal endpoint reported for the obturation.

Synthesis of results

A qualitative analysis based on RCT outcome was performed. A meta-analysis was planned if sufficient data were available and whenever the studies were considered homogeneous regarding study design, interventions, and outcomes. A qualitative analysis of results based on the influence of obturation extent on RCT outcome was performed. In order to decrease heterogeneity among studies, results were separated according to the ideal endpoint reported.

Risk of bias across studies

The quality of evidence, as well as the strength of the recommendations, was assessed and a summary was presented using “Grading of Recommendations Assessment, Development and Evaluation” (GRADE) Summary of Findings (SoF) tables, from GRADEpro software (McMaster University, Hamilton, Canada) [14].

Additional analyses

Sensitivity analysis with regard to RoB subgroups (low, moderate, and high RoB) was conducted to verify whether the results of the systematic review were altered.

Results

Study selection

The search strategy resulted in 2170 references once the duplicates were removed. After phase 1, 61 articles were included for full-text assessment. In phase 2, 39 articles were excluded due to different reasons (Appendix 2). Finally, 22 cohort studies were included for final evaluation [3, 15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35] (Fig. 1).

Fig. 1
figure 1

Flow diagram of literature search and selection criteria*. *Adapted from PRISMA

Studies characteristics

The follow-up period ranged from 1 [15, 21, 24, 28, 31, 35] to 19 years [29]. The examined parameter varied, including teeth, roots, canals, and more than one of these parameters. Furthermore, the studies’ sample size varied from 85 [27] to 1139 teeth [20]; from 143 [26] to 551 roots [19]; and from 236 [35] to 1369 canals [30].

Obturation extent assessment

The obturation extent assessment varied between studies; variable endpoints were suggested as ideal. One study considered an adequate obturation within 3 mm from the radiographic apex [29], ten studies within 2 mm [3, 17, 21, 23, 25, 26, 28, 31, 33, 35], three within 1 mm [20, 22, 24], and three at the apex (0 mm) [15, 19, 30]. One study not stated an endpoint and considered an adequate obturation the range of 0.5 to 1.5 mm from the apex [32]. Two studies measured the obturation extent in mm from the radiographic apex and reported the results in means [16, 34]. Finally, one study classified as over-filling or underfilling, which was assorted in three categories: 0 to 0.4 mm, 0.5 to 1.0 mm, and 1.1 to 2.0 mm [27].

For pooled data analysis purposes, included studies were grouped based on the ideal endpoint reported. For studies that stated an endpoint (in mm) as adequate obturation extent, the underextended obturation was considered when that endpoint extent was not reached, and incomplete obturation was radiographically observed. In contrast, for all included studies, an overextended obturation was described as obturation material extending beyond the radiographic apex.

RCT outcome

RCT outcome evaluation was performed mainly by PA assessment. Only one study compared cone beam computed tomography (CBCT) and PA assessments [26]. Additionally, five studies classified the apical healing according to the Periapical Index (PAI) [17, 18, 23, 29, 31]. Three of them executed a dichotomized analysis grouping PAI 1 and 2 for normal periapical status, and PAI 3, 4, and 5 representing the presence of apical periodontitis [17, 29, 31].

Risk of bias within studies

Two studies were classified as high RoB [15, 24], seven as moderate RoB [20,21,22, 27, 28, 34, 35], and thirteen as low RoB [3, 16,17,18,19, 23, 25, 26, 29,30,31,32,33] (Appendix 3). For most studies, the questions “are confounding factors identified and strategies to deal with them stated?”, and “are outcomes assessed using objective criteria?” were scored as “high RoB”, since data were not presented separately for vital and non-vital teeth nor PAI criteria or other validated criteria were utilized for periapical healing assessment. The question “has bias been minimized in relation to selection of cases and of controls?” was considered as “not applicable” for all included studies as none of them presented data in regard to controls (Fig. 2).

Fig. 2
figure 2

Risk of bias graph. Q1 Is sample representative of patients in the population as a whole?; Q2 Are the patients at a similar point in the course of their condition/illness?; Q3 Has bias been minimized in relation to selection of cases and of controls?; Q4 Are confounding factors identified and strategies to deal with them states?; Q5 Are outcomes assessed using objective criteria?; Q6 Was follow-up carried out over a sufficient time period?; Q7 Where the outcomes of people who withdrew described and included in the analysis?; Q8 Were outcomes measured in a reliable way?; Q9 Was appropriate statistical analysis used?

Results of individual studies

Since most included studies considered that adequate obturation was within 2 mm from the radiographic apex, this endpoint was considered as “adequate” for the purpose of this section, unless otherwise stated. Detailed information about included studies’ results are available in Table 1.

Table 1 Summary of descriptive characteristics of included articles (n = 22)

Adequate obturation presented higher “healed” rates than inadequate obturations (which included both over and underextended) in the study of Chugal et al. (2003) [16], while both adequate (within 3 mm from the radiographic apex) and overextended obturations showed higher “healed” rates compared to underextended obturations in the study of Pirani et al. (2015) [29]. Moreover, Lee et al. (2012) suggested that adequate obturations presented higher rates of periapical healing than overextended obturations, although no differences in periapical healing were observed between underextended and overextended obturations [25].

Four studies found that the success rates were lower in under and overextended than in adequate obturations (at the apex [15], within 1 mm [20, 24], and within 2.5 mm from the radiographic apex [30]), from which one [20] found no statistical difference between under and overextended obturations. In addition, Helling et al. (2001) found no differences in the success rates between adequate, under, and overextended obturations [21], while Tamarut et al. (2006) suggested that teeth with unsuccessful outcomes presented on average with underextended obturations [34].

Ilgüy et al. (2012) showed that PAI of adequate obturations (within 1 mm from the radiographic apex) was lower than under and overextended obturations [23]. In addition, Ridell et al. (2006) reported that teeth with apical periodontitis in the follow-up period were more likely to have under or overextended obturations [31].

Halse and Molven (1987) reported that overextended obturations had lower rates of success than obturations ending at the radiographic apex [19]. Similarly, Matsumoto et al. (1987) concluded that overextended obturations presented lower repair rates than obturations within 0.5 and 1 mm from the radiographic apex [27]. In addition, two studies reported that the extrusion of endodontic cement was not significantly associated with lower healing rates [17, 18].

Liang et al. (2011) compared the influence of obturation extent in the presence of post-treatment periapical lesions using PA and CBCT [26]. As a result, in PA, the presence of under or overextended obturation presented higher rates of post-treatment apical lesions when compared to adequate obturations, while in CBCT analysis, obturation extent did not affect the presence/absence of post-treatment apical lesions (p > 0.05).

Regarding the influence of obturation extent on periapical repair, ten studies investigated teeth with preoperative normal/diseased apical tissues separately [3, 16, 18, 19, 22, 28, 30, 32, 33, 35]. Three of them reported that teeth with preoperative vital pulp or absence of radiolucency have higher rates of a favorable outcome related to underextended obturation [16, 22, 28], and Tani-Ishii and Teranaka (2003) reported that all teeth with normal preoperative apical conditions remained unchanged during the follow-up appointments [35].

Considering teeth with preoperative periapical lesions, Chugal et al. (2003) showed that obturation extent was closer to the radiographic apex in cases with favorable than with unfavorable outcomes [16]. Similarly, two other studies concluded that failure rates were higher on underextended or overextended than in adequate obturations [3, 33]. Additionally, Halse and Molven (1987) reported that overextend obturations have higher rates of an unfavorable outcome in teeth with preoperative periapical lesions [19]. Interestingly, Santos et al. (2010) found no association between obturation extent and maintenance of apical lesion [32].

Synthesis of results

Among all included studies, eleven studies described an endpoint (in mm) considered as adequate obturation extent and reported enough data to calculate the odds ratio of unfavorable outcome related to underextended and overextended obturations [3, 15, 20,21,22, 24, 26, 28, 29, 33, 35]. These studies were grouped by their ideal endpoint stated. Forest plots were constructed for the odds ratio measured (Figs. 3 and 4). High clinical and methodological heterogeneity across studies was found, reflecting differences in RCT techniques, outcome assessments, obturation extent assessments, and follow-up periods. Thus, a pooled quantitative synthesis was not recommended.

Fig. 3
figure 3

Subgroup analysis for underextended obturations* in included articles (n = 11). *Values considered as adequate by the articles. FO, favorable outcome

Fig. 4
figure 4

Subgroup analysis for overextended obturations* in included articles (n = 11). *Values considered as adequate by the articles. FO, favorable outcome

Underextended obturation

One study considered the adequate obturation extent within 3 mm from the radiographic apex, and showed that underextended obturation (less than 3 mm) presented increased odds (4.26; 95%CI = 2.1–8.7) of unfavorable outcome when compared to adequate obturation extent. For studies (n = 6) that considered the adequate obturation extent within 2 mm from the radiographic apex, odds of unfavorable outcome for underextended obturations (less than 2 mm) varied between 0.93 (95%CI = 0.5–1.6) and 6.94 (95%CI = 2.2–21.9) when compared to adequate obturation extent.

For studies (n = 3) that considered the adequate obturation extent within 1 mm from the radiographic apex, odds of unfavorable outcome for underextended obturations (less than 1 mm) varied between 1.73 (95%CI = 1.0–2.9) and 7.67 (95%CI = 0.9–60.2) when compared to adequate obturation extent. One study considered the adequate obturation extent at the apex (0 mm), in which the underextended obturation (shorter than 0 mm) presented increased odds (5.67; 95%CI = 2.1–15.5) of unfavorable outcome when compared to adequate obturation extent.

Overextended obturation

Odds of unfavorable outcome for overextended obturations (n = 11) varied between 0.72 (95%CI = 0.3–1.7) and 23.00 (95%CI = 5.6–94.7) when compared to adequate obturation extent.

Risk of bias across studies

The quality of evidence of this systematic review was classified as very low for all outcome measures (Table 2). The high and moderate RoB of some included studies and the high inconsistency between them influenced these results. Inconsistency was rated according to the heterogeneity of studies.

Table 2 GRADE assessment

Additional analyses

Among the eleven studies that enabled calculation of odds ratio of unfavorable outcomes related to under and overextended obturations, the odds of unfavorable outcomes for underextended obturations in studies classified as low RoB (n = 4) varied between 1.45 (95%CI = 0.2–8.3) and 6.94 (95%CI = 2.2–21.9), and for overextended obturation varied between 0.96 (95%CI = 0.4–2.5) and 4.60 (95%CI = 1.8–11.9) when compared to adequate obturation. Odds of unfavorable outcome for underextended obturations in studies classified as moderate RoB (n = 5) varied between 0.93 (95%CI = 0.5–1.6) and 7.67 (95%CI = 0.9–60.2), and for overextended obturation varying between 0.72 (95%CI = 0.3–1.7) and 23.00 (95%CI = 5.6–94.7). Only two studies were classified as high RoB; therefore, sensitivity analysis was not justifiable.

Discussion

Summary of evidence

This systematic review aimed to investigate the association between obturation extent and RCT outcome including studies with a minimum 1-year follow-up. The selected sample consisted of 22 studies, with a total of 1139 endodontic treated teeth. Among the included studies, various obturation extents were suggested as adequate. Thus, studies were grouped based on the endpoint (in mm) stated as adequate. Since these subgroups reflected differences in under and overextended parameters that differently interfer RCT outcome, and considering the high heterogeneity of the included studies, a meta-analysis was not executed.

The adequate obturation extent reported in the included studies ranged from 3 to 0 mm from the radiographic apex. Among them, only eleven studies reported enough data to enable calculation of the odds ratio, and were separated in the following subgroups according to the distance of the obturation from the radiographic apex: 3 mm [29], 2 mm [3, 17, 21, 23, 25, 26, 28, 31, 33, 35], 1 mm [20, 22, 24], and at the apex (0 mm) [15]. Underextended obturation was considered when obturation extent did not reach the ideal endpoint stated, and an incomplete obturation was radiographically observed, whereas, for all included studies, an overextended obturation was defined as obturation material extending beyond the apex.

As a result, seven studies reported a positive association between underextended obturation and increased odds of failure in RCT [3, 15, 20, 24, 28, 29, 33]. In these studies, underextended obturation was associated with higher odds of failure when compared with adequate obturations, with reported odds ranging between 1.73 (95%CI = 1.02–2.95) and 6.94 (95%CI = 2.2–21.9). The presence of underextended obturation may represent the presence of contaminated tissue in the final extension of the root canal system, which can contribute to the maintenance of apical bacterial biofilms, and therefore influence the outcome of RCT [36, 37]. However, due to the lack of standardized criteria for obturation extent assessment, future studies evaluating these criteria are required to reach safe conclusions.

Regarding overextended obturations, four studies reported increased odds of failure in overextended obturations compared to adequate obturation, with odds ranging between 1.90 (95%CI = 1.23–2.94) and 23.00 (95%CI = 5.58–94.75). The presence of overextended obturation represents the existence of foreign bodies in the periapical region, which may not be biocompatible, and therefore can increase inflammation and influence the healing rates in cases of RCT [38]. Two of the studies included [16, 34] provided data as the mean distance from the radiographic apex, which can facilitate comparisons with other studies and provide more reliable data.

Considering the data collected by the present study, obturation extent seems to influence the RCT outcome in function of the pulp tissue condition. Teeth with irreversible pulpitis (vital teeth) exhibiting bacterial colonization usually do not show colonies in the apical third of the roots [39]. Therefore, instrumentation and obturation at the foramen level is not necessary [3, 40]. In contrast, non-vital teeth show bacterial infection extending throughout the entire root canal system, with common formation of biofilms in the apical portion and beyond the apical foramen [41, 42]. Instrumentation at the apex level is highly recommended in teeth with periapical lesions, followed by apical enlargement attempting to reduce the microbial content and to facilitate periapical healing [43, 44]. In the present study, teeth without apical lesion demonstrated higher chances of favorable outcomes when obturation was placed shorter from the apex.

The authors of the present study understand that, for an effective decrease in the toxic content of the root canal system, not only instrumentation and obturation at a level close to the apical foramen are necessary to reach a favorable outcome in non-vital teeth [45]. Other factors, such as foraminal patency [43, 46], the use of intracanal medication prior to obturation [47], and the use of irrigation techniques with solutions that optimize sanification of the root canal system [48, 49], in addition to the advent of engine-driven instruments [50, 51], newer warm gutta-percha obturation techniques [52] and more tapered preparation [53], are also fundamental.

The findings of this study demonstrated that an obturation extent as close as possible to the apical foramen presented a positive correlation with treatment success in non-vital teeth. A retrospective study regarding 2000 clinical cases with proper samples and adequate exclusion and inclusion criteria reported high success rates when root canal instrumentation and obturation limits were established close to apical constriction [54]. Thus, it can be presumed that a root canal obturation close to the apical limit, in spite of other factors that may affect the success of the treatment, is indicative of a favorable prognosis.

The extrusion of endodontic sealers (cement) did not influence RCT outcome in the two studies [17, 18] that evaluated this association. Additionally, one study [19] investigated the influence of overextended obturation (both gutta-percha and sealer) on the rates of favorable outcomes and found that it did not influence the healing rates as most extruded material had been reabsorbed at the follow-up appointment. However, these findings are not consistent with the results of several studies, which demonstrated that the extrusion of obturation materials beyond the apical foramen is responsible for RCT complications [55,56,57,58]. Flare-ups and foreign body type reaction are the most common occurrences and are positively correlated with RCT complications [55]. Thus, due to the limited number of studies assessed in this systematic review, further prospective cohort studies are required to reach a definitive conclusion.

Most included studies used PA for RCT outcome assessment, which is limited by its bi-dimensional nature. However, a consensus regarding RCT outcome assessment is lacking. The PAI scoring system has been proposed for assessing periapical status through PA, and consequently addressing the apical healing status after RCT in teeth with apical lesions [59]. This system grades the apical lesions from 1 to 5, from which scores 4 and 5 at the follow-up are considered as an ultimate failure, and 1 or 2 as an absolute success. Score 3 is controversial, and some studies considered it as a success [59]. From the included studies, only five evaluated apical healing through the PAI [17, 18, 23, 29, 31], which emphasizes the need for standardized criteria for assessing RCT outcome. Moreover, the obturation extent assessment through PA is limited by the anatomical variation of the anatomic foramen [60]. Usually, the anatomical foramen does not coincide with the radiographic apex [61,62,63]. Thus, obturation distance from radiographic apex must be analyzed with caution. One included study [26] compared the periapical healing of vital teeth treated endodontically between PA and CBCT. CBCT detected periapical lesions in 25.9% of cases and PA in 12.6%; obturation extent did not influence RCT outcome in the CBCT analysis. Although CBCT provides useful information, it has been recommended for selected cases according to ALARA (As Low As Reasonably Achievable) where the radiation dose should follow safety principles [64,65,66].

Limitations

The 22 included studies were considerably heterogeneous, especially regarding study design, sample size, RCT technique, follow-up period, obturation extent, and outcome assessment criteria. Therefore, meta-analysis was not possible. Overextended and underextended obturation had a higher chance of association with a less favorable outcome than adequate obturation, as reported in the majority of the included articles. However, the methods used for investigating RCT outcome were usually poorly described and not validated. It is worth mentioning that most of included articles did not reported data on confounding factors such as preoperative pulpal status, coronal leakage, or carious involvement. As previous mentioned, several confounding factors can influence RCT outcome [2, 3], and therefore, they should be considered in future studies assessing RCT outcome characteristics. For these reasons, this systematic review’s conclusions must be interpreted with caution, and further studies with standardized and validated methods should be conducted. Future studies on this topic are required, aiming to reduce methodological heterogeneity in this topic. The authors suggest that these studies utilize the PAI method for the assessment of the endodontic outcome, and obturation extent should be measured in millimeter and present as mean values.

Conclusion

Obturation extent seems to influence the RCT outcome. Overextended and underextended obturation showed a higher chance of association with less favorable outcome than adequate obturation. However, this influence could not be categorically supported due to other factors which are crucial and may affect proper shaping and cleaning of the root canal system. Due to the limitations of this systematic review, this conclusion must be interpreted with caution.