Introduction

With the increase of esthetic requirements, the demand for orthodontic treatment has increased in both adult and young patients [1, 2]. Orthodontic fixed appliance treatment (FAT) is the most common and traditional method for comprehensive treatment [3]. However, conventional FAT has been associated with some compromises in terms of appliance appearance, patient comfort, and achieving adequate oral hygiene [4,5,6]. On the other hand, clear aligner treatment (CAT) has developed as an esthetic alternative option for patients seeking orthodontic treatment.

The idea of CAT was initially introduced by H.D. Kesling in 1945 [7], who suggested the use of a single positioner after orthodontic treatment with fixed appliances to achieve final settling/corrections. As thermoplastic materials became more widely available in orthodontics, Sheridan et al. [8] used a removable plastic retainer (Essix®; Dentsply, York, PA, USA) in combination with interproximal reduction (IPR) to apply force to teeth to make minor tooth movements. In 1997, commercially produced series of aligners were introduced by Invisalign® (Align Technology, Santa Clara, Calif.) as removable, clear semi elastic polyurethane aligners [9] and have been followed by many other variants produced by other manufacturers. These were initially indicated for cases with mild malocclusion. Subsequently, with the significant improvement in computer-aided design/computer-aided manufacturing (CAD/CAM) and materials, it has been able to forecast treatment outcomes and fabricate a series of custom-made aligners using a single silicone or digital impression [10] with the potential to treat more complex malocclusions.

Many positive advantages of CAT have been proposed, such as overall reduction in treatment and chair time [11], being safe and esthetic, more comfortable, reduced problems with eating, simplicity of ensuring adequate gingival/periodontal health, and optimal oral hygiene compared with FAT [4, 5, 12,13,14,15,16], and hence with a lower impact on patient’s quality of life [15]. In contrast, there are some reported limitations of CAT, for instance, limited control over root movements, limited intermaxillary sagittal/overjet correction, dependence on patient compliance on their wear with little or no control by the operator, reduced effectiveness in closing extraction spaces, and in achieving adequate occlusal contact and arch expansion when compared to FAT [17,18,19,20].

With the recent increase in popularity of CAT and the claims of Invisalign® to effectively perform major tooth and root movements [21], controversy exists in terms of whether CAT could be a suitable alternative to FAT. Several systematic reviews have been conducted to investigate different aspects of the effectiveness of CAT [22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42], but no single clear and conclusive result can be obtained from each of these reviews. Therefore, this study was designed to provide an overview (on a systematic review level) to answer the following research question: is CAT as effective as FAT? The strategy used in this study was to use the overview method to synthesize the evidence available (for patients of any age with any malocclusion).

Materials and method

Protocol and registration

The protocol for the present overview was registered in the International Prospective Register of Systematic Review (https://www.crd.york.ac.uk/prospero/; Registration number: CRD42021246855). This overview was conducted and reported in line with the Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA) checklist [43]. Ethical approval was not required as there was no individual participation, no intervention or personal data collection.

Eligibility criteria

The eligibility criteria were determined according to the Population, Intervention, Comparison, Outcome, and Study design (PICOS) scheme:

Population: patients of any age with any type of malocclusions undergoing orthodontic treatment.

Intervention: orthodontic treatment with clear aligners (any type).

Comparison: orthodontic treatment with fixed orthodontic appliances (any type) or untreated control group or predicted digitally planned tooth movement models.

Outcome: evaluation of the clinical effectiveness of CAT (treatment outcomes, treatment duration, and stability), and its effects on periodontal health, root resorption, and oral health-related quality of life (OHRQoL).

Study design: systematic reviews with or without meta-analysis, assessed as moderate or high-quality according to AMSTAR 2 quality assessment tool (A Measurement Tool to Assess Systematic Reviews) [44]. In case of Cochrane reviews, the most recent publication was included, and all previous versions were excluded. Studies with any other design were also excluded as well as any systematic review with only in-vitro or animal studies.

Information sources, search strategy, and study selection

A comprehensive literature search was conducted for systematic reviews published up to July 15, 2021, using the following key terms: “aligners,” “Invisalign,” “orthodontic aligner,” “thermoplastic aligner,” “clear aligner,” “teeth positioner,” “invisible appliance,” “orthodontic appliance,” “fixed appliance,” “removable appliance,” “braces,” “treatment outcome,” “systematic review,” and “meta-analysis.” This was accomplished using electronic databases: MEDLINE via OVID (1946 to July 15, 2021), EMBASE (1974 to July 15, 2021), AMED (Allied and Complementary Medicine Database) (1985 to July 15, 2021), PubMed (inception to July 15, 2021), Cochrane Library (inception to July 15, 2021), and Web of Science (1900 to 2021). Any ongoing systematic reviews were searched using Prospero and a gray literature search was undertaken using Google Scholar and OpenGrey (www.opengrey.eu/). The reference lists of the eligible reviews were also checked for additional relevant reviews. No restrictions were applied in terms of language, date, and status of publication, and age of treated patients. All relevant reviews were identified, retrieved, and assessed for eligibility of inclusion by two authors independently (Y.A.Y. and G.T.M.) who screened the titles, abstracts, and, if necessary, the full texts. Any disagreement was resolved by discussion to reach consensus or alternatively by consulting a third author (D.R.B.).

Data items and collection

After screening the eligible systematic reviews, the following data were extracted independently and in duplicate by two authors (Y.A.Y. and G.T.M.): (1) authors; (2) year of publication; (3) study design; (4) number of studies included; (5) type of studies; (6) number of participants; (7) period of search; (8) name of journal; and (9) objectives of the study (Table 1).

Table 1 Systematic reviews that have assessed orthodontic treatment with clear aligners

Quality assessment in individual studies

Two authors (Y.A.Y. and G.T.M.) assessed the included reviews independently using the AMSTAR 2 quality assessment tool (Table 2). Any disagreement was initially resolved by discussion or in conjunction with a third author (D.R.B.), if necessary.

Table 2 A Measurement Tool to Assess Systematic Reviews (AMSTAR 2) items

The level of evidence according to the AMSTAR 2 is presented in Table 3.

Table 3 Level of evidence according to the AMSTAR 2 assessment tool

Summary measures and approach to synthesis

Data pooling was planned to qualitatively assess the effectiveness of CAT as systematic reviews per se do not have primary data. However, according to the PROSPERO protocol and the data synthesis, quantitative analysis was also planned in case of clinical homogeneity between the reviews.

Orthodontic treatment with clear aligners

The following categories of treatment with CAT will be taken into consideration: treatment effectiveness/efficiency (treatment outcomes, treatment duration, and stability), periodontal health, root resorption, and OHRQoL.

Results

Study selection and characteristics

A total of 361 potentially eligible studies were identified. After excluding the duplicates, 299 studies were left. Then, initial screening for the titles and abstracts reduced the number to 29 reviews. Following full-text assessment, eight reviews were excluded (three reviews included laboratory-based in vitro studies and five reviews were not relevant to the aim) leaving 21 systematic reviews for inclusion in the quality assessment. The PRISMA flow diagram of the literature selection process is presented in Fig. 1.

Fig. 1
figure 1

PRISMA flow diagram of the literature selection process

A summary of systematic review characteristics is presented in Table 1. Only systematic reviews that followed the PRISMA checklist were included and these were then evaluated for their quality with the AMSTAR2 tool. The included reviews were published from 2005 to 2021. Meta-analyses were carried out in 42.9% of the included systematic reviews (9/21 systematic reviews). Nine of the reviews included studies with Invisalign® only and the rest included Invisalign® among other types of aligners.

Quality of the evidence

According to the AMSTAR 2 checklist, the quality of the included reviews was variable: three reviews (14.3%) were considered of low/critically low quality of evidence, 17 reviews (80.9%) were considered of moderate quality of evidence, and one review (4.8%) was considered of high quality of evidence (Table 1). Most of the AMSTAR 2 items were covered to a varying degree (Table 2). There was complete consensus between the reviewers regarding the quality assessment.

In this overview review, the main findings from the moderate and high-quality systematic reviews were considered in the thematic synthesis (18 systematic reviews). Therefore, the results of Mortazavi et al. (2020) [31] (low quality), Lagravère and Flores-Mir (2005) [22], and Elhaddaoui et al. (2017) [36] (critically low quality) were not considered further in the results and discussion.

Data synthesis

Due to the lack of primary data, standardized treatment protocols, differences in interventions, clinical and methodological heterogeneity across the studies, further meta-analysis was not feasible. The data were, therefore, synthesized qualitatively using thematic synthesis by identifying the most prominent and important themes with the findings summarized accordingly.

Orthodontic treatment with clear aligners

The results of orthodontic treatment with clear aligners in terms of: treatment effectiveness/efficiency (treatment outcomes, treatment duration, and stability), periodontal health, root resorption, and OHRQoL are presented in Table 4.

Table 4 The results of the included systematic reviews in terms of CAT vs. FAT outcomes

Discussion

Summary of evidence

Clear aligners represent an esthetic and more comfortable alternative to fixed orthodontic appliances [5, 6]. This study was designed to overview the available evidence-based information regarding the clinical effectiveness of clear aligners, mainly in comparison with fixed orthodontic appliances. Therefore, data synthesis was based on systematic reviews only. The 18 included systematic reviews were published during the last 6 years in high-quality journals. About half of these systematic reviews have investigated Invisalign® only, and the rest investigated different types of aligners (but mainly Invisalign®); therefore, the results can be inferred in general to Invisalign®, although other aligner types could confound the results.

Treatment effectiveness/efficiency (CAT vs. FAT)

Treatment outcomes

It had been concluded that CAT is effective for mild to moderate cases of malocclusion in patients where simple tooth movement is required without the need of bodily/root movement and/or extraction, while, it is not as successful as fixed appliances for more severe crowding and complex cases, especially when extractions are included [24, 27, 29]. Generally, in minor cases (without extraction), aligners can increase arch perimeter, but in severe crowding (greater than 6 mm), this can result in significant incisor proclination and protrusion which is neither desirable nor stable [24, 30].

The capability of achieving intrusion with CAT was found to be comparable to that of FAT. This can be explained by the ability of force application directed apically through the incisal edge with or without attachments. Therefore, mild deep bite cases that can be treated with true intrusion can be managed with CAT. Paradoxically, extrusive movements have been found to be the least precisely achieved by CAT as it is difficult to apply the force in a suitable direction and magnitude to extrude teeth. Despite this neither the American Board of Orthodontics Cast-Radiograph Evaluation Index (ABO CR-EVAL) nor Peer Assessment Rating Index (PAR index) revealed any discrepancy with CAT in vertical tooth movement. Consequently, treatment of open bite cases that require extrusion of anterior teeth is still not recommended with CAT [29]. Galan-Lopez et al. (2019) [30] also agreed that one of least predictable movements by CAT is vertical movement, especially for posterior teeth. They recommended incorporation of attachments to improve outcomes. With fixed appliances, bracket position can be adjusted with or without the use of auxiliaries and elastics to intrude or extrude teeth as required. With aligners these movements, especially extrusion can be difficult to achieve. Besides, the presence of aligner material that covers the occlusal surfaces of the teeth as a physical barrier prevents adequate settling of the teeth during the finishing stage of treatment [25, 27]. This could explain the inability of CAT to establish adequate occlusal contacts, as found by several reviews [24,25,26, 30].

It was evident from the literature that there is limited control of rotational tooth movement with CAT for canines and premolars. This may be related to the difficulty in applying a force couple system on a round surface with an aligner [43, 45], but the use of IPR, attachments (vertical ellipsoid, centrally located) [45], overcorrection, and reduced staging to less than 1.5˚/aligner can improve CAT ability to deliver such movement [24, 27, 29, 30].

Both controlled and uncontrolled tipping can be achieved by CAT. This is logical as tipping movements are the simplest types of tooth movement and can be achieved by applying a single force on the crown; however, the ability for bodily tooth movement is limited due to the difficulty in controlling root movement where a force couple is required [24,25,26,27, 29, 30]. Likewise, arch perimeter can be increased effectively with aligners by increasing intercanine, interpremolar, and intermolar widths [30], whereas arch expansion with bodily tooth movement represents one of the limitations of CAT [24, 29]. The main reason for the limitation in achieving bodily tooth movement is due to the difficulty in applying a moment of couple to control root movement with aligners. With fixed appliances, the predetermined tooth inclination (within the bracket) when using a rectangular wire can deliver not only tipping of teeth but also torquing of their roots [25]. The use of attachments (ellipsoid precision attachments) [46], altered aligner geometries, and reducing the amount of activation per aligner could improve root control [29]. In spite of this, it has been shown that aligners are capable of controlling bodily distalization of maxillary molars of 1.5 mm or ≤ ½ cusp [24, 29, 30]. This may be due to the anatomy of the molar crown allowing greater engagement with the aligner to generate more complex forces.

Robertson et al. (2020) [28] found that CAT may produce clinically acceptable outcomes which are comparable to FAT regarding buccolingual inclination (torque) of maxillary incisors and proclination of mandibular incisors in mild to moderate malocclusions. Nevertheless, these findings were considered to be at a low level of evidence. Moreover, their finding of buccolingual inclination contradicts the abovementioned evidence. Hence a decision algorithm by Jadad et al. (1997) [47] has been applied, indicating that the evidence of the inferior buccolingual inclination control by CAT which was supported by a greater number of systematic reviews which included studies with higher levels of evidence should be accepted.

According to the ABO CR-EVAL index, treatment outcomes were worse with CAT than with FAT, especially when treatment included extractions. This is due to difficulty in achieving complex tooth movements with aligners. However, Zheng et al. (2017) [23] in their meta-analysis could not reach a clear conclusion regarding which technique is better because of insufficient evidence. On the other hand, the meta-analysis by Ke et al. (2019) [25] found that both CAT and FAT did not show statistically significant differences in the quality of treatment outcomes measured with the ABO CR-EVAL and PAR indices. Nevertheless, by looking at the mean difference of the ABO CR-EVAL index, which is 8.38 points (95% CI − 0.17 to 16.93), in favor of FAT, this could be of clinical significance as it has been determined that a score difference of five points can be considered to be of clinical importance [48]. Moreover, the consistently lower scores in the buccolingual inclination and occlusal contact with CAT may highlight the limitation of CAT in achieving root control and extrusion movement as noted above, agreeing with other reviews [24, 26, 27, 30]. The meta-analysis by Papageorgiou et al. (2019) [26] found a mean difference of 9.91 points when using ABO CR-EVAL in favor of FAT, which is comparable to that of Ke et al. (2019) [25]. This was considered of statistical and clinical significance. It is worth noting that the meta-analysis by Papageorgiou et al. (2019) [26] included 11 studies (including four RCTs), while that by Ke et al. (2019) [25] included five studies (including two RCTs). Therefore, the evidence of the Papageorgiou et al. (2019) [26] study can be deemed to be of a higher level. The lower quality of finish with CAT compared to FAT in terms of unacceptably finished cases (score > 30 points) or with significantly worse occlusal outcomes (with the ABO CR-EVAL index) is in line with other reviews regarding the limitations of CAT. On the other hand, the non-significant difference between CAT and FAT in the PAR scores may be attributed to several reasons. The ABO index included greater and more objective components that provide a detailed assessment of tooth relationship in all three planes (first, second, and third order), while the PAR index provides a less detailed assessment of occlusion and neglects some important aspects such as buccolingual inclination, root angulation, occlusal contacts, interproximal contacts, and leveling of posterior teeth [26]. Therefore, it would not be expected to find the same result for these indices as it has been noted that they are not significantly correlated [49, 50]. The systematic review by Pithon et al. (2019) [27] reported from one RCT that both CAT and FAT groups were successful with the ABO CR-EVAL scores (achieved scores below 30 points), but this was only for class I malocclusion cases, and also noted the CAT group had worse scores in the buccolingual inclination and occlusal contact components, and to some extent in the overjet and occlusal relationship components.

It should be kept in mind that patient’s motivation and compliance with wearing aligners could have an impact on the treatment outcome and explain in part the comparison with the FAT.

All the above findings can also explain the variation in the accuracy of CAT (i.e., the amount of deviation between the achieved and the planned tooth movements) [24, 30]. Many of the predicted tooth movements are not achieved as planned with CAT, with some exceptions, notably minor horizontal tooth movement [28]. Koletsi et al. (2021) [43] found that prediction of rotational tooth movements with CAT did not appear accurate, especially for canines and for rotational movement of more than 15˚. However, Galan-Lopez et al. (2019) [30] mentioned that the precision of tooth movement and malocclusion correction have improved with the development of new generations of aligners (for example those launched after Invisalign G5 in 2014), although these still do not reach the accuracy of FAT yet. Future further improvement in aligner design, materials, computer programming, and attachments could enhance the ability to perform difficult tooth movement and overcome these limitations.

Treatment duration

Total treatment duration was reported to be shorter with CAT, but this was mainly for mild to moderate cases, as any extractions can render treatment with aligners longer than that with FAT. This can be explained by the difficulties in achieving bodily or root movement with aligners as most extraction or complex cases require root torquing or uprighting movement which was found to be limited with aligners [23,24,25]. Similarly, it would be expected that CAT is associated with significantly less chair time, fewer appointments, and less emergency visits than FAT. From the initial appointment, aligners need reduced chair time for insertion compared to direct bracket bonding. Moreover, aligners usually require less adjustment than fixed appliances which require adjustment of archwires and/or brackets at each appointment. The regular interval between appointments with CAT is 10–12 weeks, while with FAT it is 4–8 weeks and this explains the difference in the appointment number. Regarding emergency visits, it is uncommon to experience emergencies with CAT due to the flexibility of the material with a relatively short wear time for each set of aligners, and adding fewer auxiliaries that are prone to breakage. This compares with FAT with brackets and tubes/bands that can become debonded. Also, the removal of aligners during eating reduces the risk of breakage when compared to fixed appliances [23].

The conflicting findings regarding treatment duration between CAT and FAT, from three reviews [26, 27, 30], was due to heterogenous results from different study designs. Therefore, the appliance alone may not be the key influence on treatment duration but this could also be due to different patient- and treatment-related factors that were not investigated in the available studies. For example, severity of malocclusion, extractions, patient age, gender, and compliance. These factors can influence treatment duration even when the appliance is not a variable [30, 51].

Treatment stability

Despite the ability to align teeth with mild to moderate crowding effectively, the tendency for relapse was shown to be higher with CAT than with FAT. This may be related to the limitation of CAT in obtaining solid occlusal contact and controlling root movement which in turn could increase the tendency of teeth to return to their original position. Therefore, the full sequence of tooth tipping and root uprighting that usually occurs with FAT does not occur with CAT [23,24,25, 27, 29, 30]. In addition to the above, the tendency of aligners to increase proclination of mandibular canines and mandibular intercanine width is greater than with FAT [25] and produces significant proclination of incisors when severe crowding cases are treated [24, 30]. Therefore, this may position the canines and incisors in a less stable zone and increase the probability of relapse. As alveolar bone resorption requires 7–14 days and almost an equal time is needed for periodontal tissue regeneration, Zheng et al. (2017) [23] and Ke et al. (2019) [25] postulated that this higher tendency of relapse with CAT could be related to the relatively short time for progressing aligners every 2 weeks. This frequent activation produces undermining resorption, root and bone resorption by truncating the repair process which leads to poor bone formation and greater relapse than with fixed appliances which are usually adjusted every 4–8 weeks. This postulation should be considered in detail, but it can also be refuted when other findings regarding orthodontically induced inflammatory root resorption (OIIRR) is regarded in the current overview. CAT was found to deliver a force comparable to that of the light forces with fixed appliances and a lower risk of root resorption.

The recent decision by Align Technology for weekly aligner changes is not to be recommended due to the problems of inadequate tooth adaptation, recovery from the force applied, and consequently reduced stability. Overcorrection is therefore recommended to decrease the relapse with CAT [30].

Periodontal health (CAT vs. FAT)

The majority of periodontal health problems with orthodontic treatment are related to plaque retention and inadequate oral hygiene measures. The finding that CAT was associated with lower scores of periodontal indices, more favorable quality and quantities of plaque which overall result in better periodontal health as compared to FAT can be explained as follows. The removable nature of aligners allows adequate tooth brushing, dental flossing, and extra-oral appliance cleansing. The smooth surfaces of aligners also play an important role in reducing plaque retention. In contrast, plaque retention with fixed appliance components is greater and these components can decrease natural self-cleansing by the saliva and tongue and impede toothbrushing and maintaining adequate oral hygiene [33,34,35]. The longer the time of plaque retention, the greater the possibility of more harmful plaque quality, and hence periodontal breakdown [33]. It is worth mentioning that in both treatment modalities good oral health is required to maintain periodontal health; moreover, periodontitis is a contraindication to start orthodontic treatment, irrespective of the type of appliance used.

The outcome of the systematic review by Rossini et al. (2015b) [33] (CAT is associated with better periodontal health outcomes than FAT) was further confirmed in two meta-analyses [34, 35]. The quality of evidence of these reviews was considered to be of moderate level. On the other hand, the meta-analysis by Papageorgiou et al. (2019) [26] reported no significant differences between these two treatment techniques in terms of gingival recession (2 years after treatment) and periodontal health. However, the evidence regarding these findings was limited as each was derived from a single study. Generally, gingival recessions do not result from orthodontic treatment, as long as correct biomechanics considering individual anatomical borders are applied.

An important point raised by Jiang et al. (2018) [34] is the need to measure periodontal health status before and after treatment in order to ensure adequate blinding without any chance of identifying the identity of the appliance used as well as assessing the periodontal health status over a long-term follow-up.

The effect of orthodontic appliances on periodontal health depends on the forces that are applied. This is influenced by the magnitude of tooth movement and in the case of aligners that are not constructed by the clinician, the level of forces is unknown. Variability in FAT due to labial or lingual appliances, use of the 0.018 or 0.022-inch slot and other factors can influence force levels.

Root resorption (CAT vs. FAT)

The evidence related to OIIRR with aligners is scarce, however all of the available studies have revealed that aligners are associated with a lower risk of OIIRR when compared to fixed appliances [26, 37, 38, 52, 53], and this may be due to the nature of intermittent (due to occasional removal during food consumption and hygiene maintenance) and relatively light forces with aligners, as well as the simpler types of tooth movement which result in smaller amounts of tooth movement/apical displacement (approximate velocity 0.5 mm/month for CAT and 1 mm/month for FAT). Hence, there is a greater chance of root healing by promoting the cementum repair process. Moreover, aligners are usually indicated for cases with minor crowding where no extractions are indicated and with shorter treatment duration [38, 39, 54]. Aligners are also subject to patient compliance, and any lack of compliance results in more intermittent force delivery with shorter duration of force application resulting in reduced OIIRR [55], but this may also produce jiggling forces which may increase the risk of OIIRR [38]. This may not be easily determined and hence is not readily reported [53]. The most recent systematic review and meta-analysis could not find a significant difference between preadjusted edgewise fixed appliance and aligners in terms of OIIRR, except for the maxillary right lateral incisor which showed reduced OIIRR with aligners. However, the level of incisor root resorption was not of clinical significance (less than 1 mm) [39]. It has been highlighted that the variation in the detection methods of OIIRR (such as periapical radiographs, orthopantomogram, cephalometric radiographs, microscopic investigation, and CBCT) could influence the results of the available studies among other factors that could produce risk of bias to the outcomes [37,38,39].

In light of the above, the comparison between CAT and FAT regarding OIIRR should be interpreted with caution [38, 53], especially because these two treatment techniques differ in their indications, treatment duration, and mechanism of action [26].

Oral health-related quality of life (CAT vs. FAT)

Pain and discomfort represent two important aspects that can affect quality of life [56], and fear of pain is one of the reasons for rejection or discontinuation of treatment [57]. The highest level of pain perception is usually felt during the initial phase/days of treatment, due to release of inflammatory mediators such as prostaglandin E and interleukin 1β as a result of the initial orthodontic forces that compress the periodontal ligament and lead to ischemia, edema, and release of these mediators (which sensitize nociceptors of the periodontal ligament). The level of these mediators reaches their peak during the first 24 h after the onset of orthodontic force and returns to normal level after about 7 days [58,59,60]. Patients who have been treated with aligners reported lower levels of pain during the first few days of treatment than those who were treated with fixed appliances [40, 41]. This can be attributed to several reasons. The lower level of pain associated with aligners may be related to the removable nature of the appliance and therefore the intermittent forces which produce less pressure, tension, and sensitivity than the continuous forces with fixed appliances [61]. Additionally, aligners can be removed by patients when they feel pain in order to relieve it, thus reducing pain experience by patients [40]. Consequently, this may explain the lower consumption of analgesics by patients treated with aligners.

The systematic review by Zhang et al. (2020) [42] found no significant differences in pain perception between CAT and FAT, but their results were only obtained from one cohort and one cross-sectional study and this could explain the minor differences, if present, between the two appliances. Interestingly, the meta-analysis by Pereira et al. (2020) [41] reported that placing attachments at the beginning of CAT is associated with higher pain perception due to greater pressure applied at the insertion of an aligner. Generally, pain perception is a subjective process and the difference between CAT and FAT was not noted in the medium or longer term (after the first month). Moreover, pain is influenced by several factors, such as age, gender, pain threshold, magnitude of force applied, emotional status, cultural differences, and individual’s personality [40, 41].

Zhang et al. (2020) [42] concluded that the difference between CAT and FAT on the overall OHRQoL was not conclusive due to lack of evidence. However, their results, from two studies, regarding eating and chewing disturbance and discomfort during treatment was in favor of CAT. This can be related to the difference in the design of two appliances, where the aligners can be removed during eating and allow normal function without the possibility of food accumulation around the appliance or fear from appliance breakage. Similarly, the smooth surfaces and margins of aligners can allow lower oral symptoms on the tongue, cheek, and lips compared to fixed appliances.

It is worth noting that in addition to different sources of bias, variation in types of appliances, sequence of archwires, type of malocclusion, and consumption of analgesics could have influenced the results of individual studies [40, 41].

Strengths and limitations

The protocol of this overview has been registered a priori [62]. This overview has been conducted following the PRISMA guidelines and using the AMSTAR 2 quality assessment tool to evaluate the published systematic reviews concerning the comparison of CAT with FAT. The number of the included systematic reviews in this discussion was considerable (18 systematic reviews). These were only of moderate- and high-quality level reviews in order to keep the level of evidence as high as possible.

Several limitations have been identified by the included systematic reviews as follows:

  1. 1.

    There are few prospective RCTs that investigated the difference between CAT and FAT. With many of the available studies are retrospective in design or case reports.

  2. 2.

    The available studies are influenced by different types of bias due to the followings:

    • Lack of randomization and allocation concealment (selection bias).

    • Lack of blinding (detection bias).

    • Lack of adequate experience with CAT and lack of standardized treatment protocol (performance bias).

    • Several potential confounders were not considered in the available studies and could bias their results. For example, severity of malocclusion, oral hygiene, patient’s compliance, number of aligners, aligner change frequency, psychosocial status, and other patient and treatment-related factors.

    • Inadequate power (due to small sample size) and lack of method of error analysis of the clinical studies.

    • The majority of studies with CAT investigated mild to moderate malocclusion only.

    • Heterogeneity in terms of methodology, outcome reporting, types of fixed appliances, and types aligners used (like Invisalign®, Smart Track®, ClearSmile®, F22 aligner, and Nuvola system). However, most of them investigated Invisalign® only and this might cause some bias as only a specific material and planning software were used.

    • The continuous improvement of the aligners may hamper the direct comparison between older studies with the most recent ones. Robertson et al. (2020) [28] stated that the difference in aligner production processes and material properties can affect the force levels and thus, the predictability of tooth movements.

    • Variation in the detection methods of OIIRR.

    • Few validated OHRQoL instruments used, in addition to their heterogeneity in terms of types and time of assessment

  3. 3.

    The comparison of cost effectiveness was not undertaken yet between CAT and FAT.

Therefore, owing to the above limitations, the results should be considered with caution as further studies may alter the findings.

Suggestions for future studies

The following suggestions can be made. Further high-quality RCTs that follow the CONSORT statement with rigorous methodology and appropriate sample sizes are needed to further evaluate the treatment effectiveness of CAT with FAT. These trials should assess the most recent aligners, include different types of malocclusion, control confounding variables, use ABO CR-EVAL (to evaluate treatment outcomes and post-treatment retention), measure periodontal health status before and after treatment, use CBCT (to evaluate root resorption where appropriate), and use a validated tool to assess OHRQoL.

Clinical recommendations

  1. 1.

    Based on to the available evidence, aligners are indicated in the following cases:

    • Patients with mild-to-moderate malocclusion (especially in non-extraction cases or where extrusive, severe rotation, bodily, and root movements are not required).

    • Minor relapse post-orthodontic treatment.

    • Patients with compromised periodontal health.

    • Patients who have social or emotional problems with fixed appliances.

  2. 2.

    The orthodontist should have sufficient knowledge and clinical experience before practicing CAT.

  3. 3.

    The use of precision attachments, IPR, and reduction of the amount of activation/aligner are important and should be used to achieve adequate control of tooth movement.

  4. 4.

    Overcorrection should be considered when using CAT to reduce the impact of relapse.

Conclusions

According to the available evidence, the following conclusions can be drawn:

  1. 1.

    The level of evidence regarding CAT is moderate.

  2. 2.

    Aligners can be a successful alternative to fixed appliances to treat patients with mild to moderate malocclusion.

  3. 3.

    The quality of treatment with CAT is inferior to that of FAT in the following clinical conditions:

    • Severe malocclusion cases (including extraction cases)

    • When extrusion, severe rotation, bodily tooth movement including expansion, and root movements are required

  4. 4.

    The accuracy of CAT can be improved with the utilization of advanced computer programming, improvement in aligner design and materials, reduced amount of activation of each aligner, and use of precision attachments and IPR.

  5. 5.

    There is inconclusive evidence regarding treatment duration with CAT; however, it tends to be shorter than that with FAT in mild to moderate cases and longer in severe cases.

  6. 6.

    The tendency of relapse is greater with CAT.

  7. 7.

    Aligners can allow better maintenance of oral hygiene and hence better periodontal health status compared to FAT.

  8. 8.

    Due to the intermittent force and mild-to-moderate malocclusions that can be treated with CAT, the risk of OIIRR tends to be lower with CAT and is comparable to that of light forces in FAT.

  9. 9.

    Due to the nature of aligners, CAT can be considered more comfortable for patients and with minimal impact on eating, chewing, and OHRQoL compared to FAT.