Introduction

Tendinopathies are multifactorial disorders affecting tendons, mainly associated to overuse, which result in pain, reduced function and exercise tolerance [1]. Tendinopathy is quite common, with increasing prevalence worldwide [2]. Diagnosis is based on clinical and imaging evaluation, and early recognition is important to minimize patient pain and disability [3]. Ultrasonography is the gold standard technique to image tendons [4,5,6,7], while conventional radiography and magnetic resonance may be helpful to rule out other causes of pain [8,9,10,11,12,13,14]. Shear-wave elastography (SWE) is an ultrasound technique that allows measuring the elasticity properties of tissues, by assessing the displacement of particles after an acoustic beam is transmitted from an ultrasound transducer [15, 16]. Despite SWE has been introduced first to evaluate other tissues (liver, breast, kidney, etc.) [17,18,19,20], several authors have investigated the potential of SWE in musculoskeletal disorders. For what concerns tendinopathies, interesting studies have been focused on the value of SWE, particularly about the evaluation of the supraspinatus, Achilles, and patellar tendons. As a matter of fact, tendons present a characteristic elasticity coefficient that is related to their structure and mechanical properties. It can be impacted by degenerative modifications, including an increased concentration of collagen type III fibers and fibrocartilaginous changes, due to microtrauma, overload, and vascular alterations, which result in different elasticity patterns and decreased tendon stiffness [21]. In this setting, SWE may support B-mode ultrasound to detect tendon disease and may be used as an imaging biomarker to assess treatment response of tendinopathies. To date, evidence supports the use of sonoelastography as a first-choice imaging modality in Achilles tendinopathy, medial and lateral epicondylitis [4]. Nevertheless, published results about the application of SWE in tendinopathies are still sparse and no clear SWE stiffness and velocity cut-off values have been established to differentiate pathologic and normal tendons. Hence, this systematic review and meta-analysis was performed aiming to firstly investigate differences in pathologic and healthy tendons’ characteristics assessed by SWE. Secondarily, we aim to assess intra- and inter-observer reproducibility of SWE modulus and SWE velocity in tendon evaluations.

Methods

Study design

We conducted this systematic review of the literature according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) and the Meta-analysis of Observational Studies in Epidemiology (MOOSE) guidelines [22, 23]. The protocol was registered in the International Prospective Register of Systematic Reviews database (PROSPERO identifier: CRD42023405410). Full PRISMA and MOOSE checklists can be found in Additional file 1.

Literature search strategy

Our search was done on PubMed and Embase databases from inception to September 2022. Further, the references of included studies and reviews were thoroughly checked to find eligible publications. The keywords “tendon” or “tendinopathy” and “shear wave” were used. We limited our search strategy to English language. We screened all studies by title and abstract, then the full text of eligible studies was checked for review, as well the references to identify any additional publications to be included. The literature search was carried on by one reviewer and checked by a senior researcher, with two and 10 years’ experience, respectively.

Inclusion and exclusion criteria

We applied the following inclusion criteria: (i) involvement of human participants; (ii) studies reporting outcome data of SWE velocity (m/s) or stiffness (kPa) of tendons affected by tendinopathy and normal tendons used as healthy controls (iii)Prospective, retrospective and cross-sectional studies dealing with the evaluation of tendons by means of SWE. Exclusion criteria were: (i) studies on patients subjected to surgery; (ii) studies reporting no quantitative SWE values; (iii) studies that used strain elastography; (iv) case report, case series involving < 10 subjects, narrative reviews, guidelines, consensus statements, editorials, letters, comments, or conference abstracts.

Data extraction

Data concerning these parameters were collected and analyzed:

  1. (a)

    General characteristics of the paper: first author, year of publication, study design;

  2. (b)

    study population characteristics: number of patients and controls, age, gender, and tendon status (normal vs abnormal);

  3. (c)

    measurement methods: position (rest, tension), SWE scan protocol and any intervention;

  4. (d)

    outcomes:

    1. (a)

      SWE modulus (kPa) and SWE velocity (m/s) of pathologic tendons and healthy tendons;

    2. (b)

      inter-observer reproducibility measured by intraclass correlation coefficient (ICC);

    3. (c)

      intra-observer reproducibility of measurements done in different sessions measured by ICC.

For what concerns prioritization of outcome data, we adopted the following a priori strategies:

  • when both longitudinal and axial values were reported, only longitudinal values have been considered;

  • when both baseline and after-treatment values were reported, only baseline values have been considered;

  • when measurements were done with the subject lying in different positions, only the neutral position has been considered;

  • when both proximal, mid and distal scans of tendons were reported, only mid-proximal scans have been considered;

  • when active and passive tendon measurements were reported, only active ones were considered.

  • we considered studies as cross-sectional given that we focused on a specific time point (i.e., we considered only baseline values when dealing with interventional studies).

Quality assessment

Two researchers evaluated study quality and differences were solved after discussion. The Methodological index for non-randomized studies (MINORS) was used [24] to evaluate quality of the observational studies, which was adapted to the cross-sectional observation at baseline removing item 6 and 7 that required follow assessment for judgement. Our adapted MINORS contains 10 items, and the quality of studies was scored from 0 to 20. All discrepancies were resolved through discussion. For each item, a score of 0 means that it was not described in the paper, 1 means that it was described inadequately, and 2 mean that it was described measured by adequately. Studies with inadequate control groups (i.e., contralateral tendon as controls) and missing baseline equivalence between groups were judged as proxy for poor methodological quality.

Statistical analysis

Studies characteristics were resumed in tables using the number of tendons assessed as unit of analysis [25]. If at least two papers showed the same outcome, data from each study were pooled in a meta-analysis using a random effect model [26]. We used the mean difference (MD) with 95% confidence interval (CI) in case the same measurement method was used. Statistical heterogeneity was evaluated through the I2 statistic [27]. We also performed subgroup analysis for each tendon (i.e., Achilles, patellar) with the relative estimated effect. In case of high heterogeneity (> 75%), we explored methodological issues and we carried on a sensitivity analysis after excluding poor quality studies [25]. We used Review Manager (RevMan) software version 5.3 for this meta-analysis.

Certainty of evidence

We used the five Grades of Recommendation, Assessment, Development and Evaluation (GRADE) [28] to evaluate the certainty of evidence for each study in the meta-analysis through the GRADEpro software (https://www.gradepro.org). When more than 10 studies were included in meta-analysis, we evaluated the publication bias using funnel plot [29] and Egger’s tests [30].

Results

Study selection

Excluding all duplicates, 292 records were screened and 233 were discarded. The text of the remaining 59 potentially eligible paper was checked, 43 of which were not in accordance with our inclusion criteria. Last, we included 16 studies (reported in Additional file 2). Figure 1 shows the flow diagram of our study selection.

Fig. 1
figure 1

Flow chart of study selection

General characteristics

Overall, we included 16 studies with 676 pathological tendons (188 Achilles, 142 patellar, 96 supraspinatus, 250 mixed) and 723 control tendons (484 healthy; 239 contralateral tendons). Six were prospective studies, 8 case control, 1 cross-sectional and 1 mixed retrospective and prospective published between 2015 and 2022. Mean age of subjects was 43 years, with the median percentage of males being 58% (IQR 30–86%). SWE was applied with longitudinal scan in all studies. Table 1 reports the general characteristics of included papers.

Table 1 Characteristics of the 16 papers included in our analysis

Quality assessment

Table 2 reports the quality of all included studies. The median MINORS assessment score was 17 out of 20 (IQR 16–18). Five studies (31.3%) had a poor methodological quality.

Table 2 Quality assessment

Shear modulus (kPa)

Overall, 11 studies assessed this outcome with 8 including 371 pathologic tendons and 313 contralateral/healthy tendons providing useful outcome data. Meta-analysis showed substantial heterogeneity (I2 = 99%) (Fig. 2). According to a sensitivity analysis, excluding three studies [31,32,33] with poor methodological quality, heterogeneity was still substantial (I2 = 99%;) and remained unresolved.

Fig. 2
figure 2

Meta-analysis on shear modulus. Legend: APL, abductor pollicis longus; CET, common extensor tendon; CFT, common flexor tendon; CI, confidence interval; EPB, extensor pollicis brevis; LHBT, long head of biceps tendon; SD, standard deviation. Humeral epicondylar tendons includes radial/ulnar epicondylar tendons

Shear-wave velocity (m/s)

Overall, eight studies assessed this outcome with seven including 316 pathologic tendons and 306 contralateral/healthy tendons providing useful outcome data. Meta-analysis showed substantial heterogeneity (I2 = 93%) (Fig. 3).

Fig. 3
figure 3

Meta-analysis on shear-wave velocity (m/s). Legend: APL, abductor pollicis longus; CET, common extensor tendon; CFT, common flexor tendon; CI, confidence interval; EPB, extensor pollicis brevis; LHBT, long head of biceps tendon; SD, standard deviation. Humeral epicondylar tendons includes radial/ulnar epicondylar tendons

Sensitivity analysis: shear-wave velocity (m/s)

According to a sensitivity analysis, excluding two studies [31, 34] with poor methodological quality, we pooled data from five studies including 114 pathologic tendons and 160 healthy tendons. Overall, patients with tendinopathy had a lower shear wave velocity, with a MD of − 1.69 m/s (95% CI − 1.85; − 1.52; I2 = 50%) compared to healthy tendons (Fig. 4).

Fig. 4
figure 4

Sensitivity analysis on Shear-wave velocity (m/s). CET, common extensor tendon; CFT, common flexor tendon

Certainty of evidence

The certainty of evidence was not assessed for the primary analysis due to very substantial heterogeneity that prevent us to show overall quantitative results. For sensitivity analysis on Shear-wave velocity (m/s), the certainty of evidence was very low, due to serious imprecision ( − 1 level due to optimal information size not met, less than 400 participants). GRADE summary of findings is in supplementary Additional file 3, Table S1.

Publication bias

Funnel plots (Fig. 5) and Egger’s test (coefficient − 4.01; 95% CI − 7.64; − 0.38; p = 0.03) indicated there was presence of publication bias in Shear modulus (kPa). Publication bias for Shear-wave velocity (m/s) was not assessed (less then 10 studies).

Fig. 5
figure 5

Funnel plot for shear modulus (kPa). Notably, ten studies are included in this evaluation. Of these, Dirrichs 2016 participates with three different cohorts (Achilles, patellar, humeral epicondylar tendons) considered as three studies

Reproducibility

Among the selected studies, just four papers assessed the reproducibility of SWE values reporting highly heterogeneous in modalities of reporting results. Specifically, regarding intra-observer reproducibility of measurements done in different settings, Aubry et al. [35] reported ICC = 0.46 for Achilles tendon evaluation, Breda et al. [36] reported ICC = 0.95 for patellar tendon, Zhu et al. [34] reported ICC = 0.986 for common extensor tendon (CET), and Hackett et al. [37] reported ICC = 0.96 for supraspinatus tendon. Concerning inter-observer reproducibility, Breda et al. [36] reported ICC = 0.79 for patellar tendon, Zhu et al. [34] reported ICC = 0.993 for common extensor tendon, and Hackett et al. [37] reported ICC = 0.45 for supraspinatus tendon.

Discussion

The main result of our analysis is that pathologic tendons present lower SWE velocity than healthy tendons, but the effects are very uncertain due to serious inconsistency of findings (i.e., high heterogeneity, sample size less the 400 tendons) that may limit clinical application of SWE.

SWE has been introduced as a more reliable quantitative imaging modality with respect to strain elastography, given that the compressions of tissues are produced directly by the ultrasound beam provided by the probe [38]. This has been proven in other organs, but in tendons, according to our data, there is still no evidence to support such statement. Despite many papers have investigated the role of SWE for evaluating tendon, the reproducibility of SWE values still needs to be proven [39]. In fact, only four out of the 16 included studies assessed intra- and/or intra-observer reproducibility, reporting highly heterogenous findings. For this reason, data could not be meta-analyzed. To be more conservative, we preferred not pooling together data on stiffness and velocity. In fact, the equation to convert SWE velocity to the Young’s modulus is based on tissue density, but this parameter is not constant, given that tendon structure can be altered due to pathology and anisotropy making this conversion unreliable [40, 41]. For this reason, the evaluation of SWE velocity should be preferred in future studies.

Several drawbacks of SWE may partly justify the heterogeneity of published data on tendons. The elastogram is affected by tensile loads, which are related to joint angle, but also to rest, concentric-eccentric contraction and stretching. In this regard, for instance, it is recommended to evaluate Achilles tendons in relaxed or plantar flexion to not stretch the tendon, thereby avoiding a false increase of SWE velocity [42, 43]. Further, non-negligible differences in SWE velocity measurements have been reported when comparing different machines and when changing the depth of the investigated structure [44]. In this context, significant statistical differences have been noted in the quantification of elasticity based on the vendor system and the measurement depth when using SWE. Indeed, potential dissimilarities in the settings may potentially elucidate the scanner disparities. Additionally, variations might arise due to differences in the speckle tracking algorithms employed by different scanners. Nonetheless, there remains a necessity for studies that examine the reproducibility across different systems to ascertain the reliability of SWE measurements. Moreover, other factors may impact on SWE results, like the probe pressure and tilt, the interposition of a spacer, or the orientation of the probe that may affect the shear modulus [16]. Another point is where the tendon should be evaluated for estimating SWE velocity, specifically, how many and how large the regions of interest should be, and which portion of the tendon must be examined. Looking at previous papers, an extreme heterogeneity of methods can be observed, with some studies having focused on the whole thickness of the tendon, while others having used several regions of interest of different sizes. It has been shown that significantly different SWE results can be obtained applying regions of interest of different dimensions and with different time of acquisition [45]. As a matter of fact, the lack of consensus on a standard protocol to apply when using SWE in clinical and research practice limits the reliability and accuracy of this tool. Future studies and evidence-based recommendations are needed to standardize these important technical issues. Now, in clinical studies it should be essential to report detailed information about region of interest placement (location, number and dimensions), in order to apply the same methods to all subjects and to make the methods reproducible in further studies [46].

Another issue to consider when dealing with SWE is that all studies attempted to use it as a standalone method to detect tendon abnormalities, while none proposed it as an adjunct tool to complement B-mode US. Once resolved reproducibility and standardization issues, SWE might be used to supplement the B-mode examination, especially when negative but patient is still symptomatic [47].

Some limitations can be acknowledged at review level as well as study level. For the review we limited our search to English records. Then, we included different study design in order to achieve more literature however this choice has incorporated some methodological heterogeneity. For example, 5 studies (31.3% of the whole sample) used a contralateral limb as control group explaining part of heterogeneity by the sensitivity strategy. Further, we planned ICC in our protocol, but we found poor reporting of this outcome. This can be related to missing evidence as already known in literature. In addition, we did not search grey literature, and studies with negative findings may have not been retrieved as showed by both funnel plot and Egger’s test. At study level most studies have been focused just on Achilles, patellar and supraspinatus tendons, while not so many works have focused on the performance of SWE in different tissues. Hence, further studies should be carried on understanding how to apply SWE in other tendons, since each presents specific biomechanical and structural properties.

In conclusion, our meta-analysis underlines that pathological tendons may have reduced SWE velocity compared to controls, but the evidence is very uncertain. We have highlighted the low number and limited quality of studies focused on this topic, due to non-negligible differences of methods in the application of SWE and estimation of results of velocity and stiffness. SWE velocity still remains an interesting quantitative imaging tool for tendinopathies that could be helpful for diagnosis, prognosis, and assessment of treatment response, but future robust high-quality longitudinal studies and clear technical indications on the use of this tool are warranted to understand its actual role in tendon imaging.