Introduction

The assessment of mobility is an important part in clinical and physical examinations. The recognition of impairments in joint range of motion (ROM) assists clinicians in their diagnoses, measuring improvements or determining functional limitations [1]. Muscle strength testing is widely used by clinicians for both diagnostic purposes, as well as determining a patient’s joint function prior to, during and following an intervention [2, 3]. Muscle performance testing is used in a variety of patients to detect the presence of impairments and the ability to produce movements that are required for function [4]. In addition, the establishment of objective measurements of ROM and strength functions is a fundamental tool for identifying risk factors of joint disorders and pain, i.e. shoulder and elbow problems, particularly in an active and athletic population [511]. Therefore, it is necessary for clinicians and researchers to have reliable examination tools to accurately and objectively assess the functional status of a joint [1, 4, 6, 12, 13]. Methods in evaluation for shoulder and elbow ROM are the visual observation, linear measures, inclinometry and goniometry [14]. Goniometry has been used widely due to its portability and low cost [15]. As a limitation, the clinician is required to use both hands, making stabilization of the extremity more difficult and increasing the risk of error in reading the instrument. It also induces the need for a second person as an assistant [16]. The assessment of ROM and strength is considered to be integral components of outcome measures of shoulder and elbow function besides self-report outcome scores and subjective clinical examinations [6, 1719].

Manual muscle testing is one of the most commonly used methods for assessing muscular strength [20]. Although isokinetic dynamometry is known as a criterion standard in measuring muscle strength, upper extremity muscles never generate isokinetic muscle actions in daily life [20]. Besides these physiological considerations, isokinetic testing is often not feasible because of the high costs and the laboratory setting required [21]. Hand-held dynamometry (HHD) has been described to be superior to manual muscle testing in identifying differences or impairments in muscle strength, such as those found after an injury [22, 23]. Stark et al. [21] concluded in their systematic review (19 studies) correlating the isokinetic dynamometer with the HHD that HHD for upper extremities was valid for measuring muscle strength. Limitations of HHD include tester strength [24, 25], lack of stabilization [24, 25] and inconsistency with testing procedures [25].

The purpose of this study was to examine the intrarater reliability for measuring shoulder and elbow active ROM using a clinical goniometer, as well as isometric strength using a portable HHD. Two testing sessions were completed with 7 days between sessions. We determined reliability (ICC, SEM) for the examination of asymptomatic female team handball athletes and an asymptomatic control group.

Materials and methods

Participants

22 female handball athletes (age: 21.0 ± 3.7 years, body height: 1.7 ± 0.1 m, body mass: 69.1 ± 6.2 kg, body mass index 22.8 ± 1.6 kg m−2) of a local 2nd-league team and 25 asymptomatic volunteers (13 women, 12 male; age: 21.9 ± 1.2 years, body height: 1.76 ± 0.1 m, body mass: 68.2 ± 14.1 kg, body mass index 21.7 ± 2.2 kg m−2) participated following written informed consent. Each candidate underwent a bilateral clinical analysis for pre-existent neck, shoulder and elbow pathologies. Subjects were excluded based on the following criteria: persistent or recurrent pain, subjective limitations in any ROM and discomfort which lead to interruptions for training or activity.

Instruments

Shoulder and elbow flexibility were measured with a standard baseline goniometer. Technical accuracy was provided with 1° discrimination and a range of 180°. Isometric muscle strength was determined with the IsoForceControl EVO2 hand-held dynamometer (Medical Device Solutions AG, Oberburg, Switzerland). This dynamometer is capable of measuring up to 400 N.

Procedures

After a supervised standardized warm-up consisting of multi-planar shoulder movements and bi-planar elbow exercises for 2 min, the participants completed all measurements bilaterally. Active shoulder ROM and isometric strength measurements consisted of internal rotation (IR) and external rotation (ER), while elbow ROM and strength measurements consisted of flexion (FLEX) and extension (EXT). All ROM and isometric strength measurements were conducted by one experienced examiner with support from an assistant familiar with the complete protocol. All assessments were performed bilaterally in the same order over three trials with 1 week between the two testing sessions. All participants refrained from performing any overhead throwing activities during the testing period [9].

For shoulder ROM measurements the participants were in a supine position with 90° of shoulder abduction, 90° of elbow flexion and a neutral wrist position. The assistant stabilized the test shoulder as described by Wilk et al. [16] to prevent any accessory movement [9, 16]. The goniometer was positioned with the fulcrum over the lateral humeral epicondyle, the stationary arm vertically and the moveable arm over the ulnar styloid process (Figs. 1, 2) [6].

Fig. 1
figure 1

Active shoulder external rotation ROM

Fig. 2
figure 2

Active shoulder internal rotation ROM

Separate calculations were completed for glenohumeral internal rotation deficit (GIRD, difference between IR throwing arm and IR non-throwing arm, described in negative values) and external rotation gain (ERG, difference in ER between limbs, describes in positive values).

For assessment of elbow ROM, the subjects were also in a supine position with the humerus stabilized against the side of the subject’s body. The goniometer was positioned with the fulcrum over the lateral humeral epicondyle, the stationary arm aligned with the midline of the humerus and the moveable arm aligned with the radial styloid process (Figs. 3, 4).

Fig. 3
figure 3

Active elbow extension ROM

Fig. 4
figure 4

Active elbow flexion ROM

To measure shoulder strength, each subject was again placed supine with their shoulder in 90° of abduction, elbow in 90° of flexion and neutral wrist position, with an assistant stabilizing the shoulder. The strap of the HHD was then held by the test hand of each subject as they isometrically contracted into either internal or external rotation against the counter force applied by the investigator holding the HHD (Figs. 5, 6).

Fig. 5
figure 5

Examination for isometric shoulder strength in external rotation

Fig. 6
figure 6

Examination for isometric shoulder strength in internal rotation

Determination of the isometric elbow strength was made with the subject seated and their humerus resting at 90° of forward flexion and stabilized against the examination table by the assistant (Figs. 7, 8).

Fig. 7
figure 7

Examination for isometric elbow strength in flexion

Fig. 8
figure 8

Examination for isometric elbow strength in extension

The strap of the HHD was held by the test hand of each subject as they isometrically contracted into either flexion or extension. All contractions lasted for 10 s for a total of three trials per movement. These trials were then averaged and used for analysis.

Statistical analysis

Descriptive statistics were calculated across participants for the dependent variables (active ROM and absolute isometric strength). Differences between groups (control vs. handball) were tested using a general linear model and were considered statistically significant at p values <0.05. Partial eta-squared (η 2) values were also provided to represent the level of clinical significance, if η 2 > 0.10.

Intraclass correlation coefficient (ICC) and standard error of measurement (SEM) were used to describe test–retest reliability [20]. Interpretation of ICC values was based on guidelines provided by Portney and Watkins, where a value above 0.75 was classified as good or excellent reliability [26]. ICC values may be influenced by inter-subject variability of scores, because a large ICC may be reported despite poor trial-to-trial consistency if the inter-subject variability is too high [26, 27]. However, the SEM is not affected by inter-subject variability [27]. Therefore, SEM was reported in conjunction with the ICC’s using the formula: \({\text{SEM}} = \; {\text{SD}}\sqrt { 1 - r}\) [26].

95 % limits of agreement (LOA) were calculated using the formula:

95 % limits of agreement = mean difference ± 2 SD [28].

Finally, one-way ANOVAs were conducted to determine if significant differences existed between test sessions.

All statistical analyses were performed using SPSS version 22.0 for Windows (SPSS Inc., Chicago, IL, USA).

Results

There were no significant differences between groups for any anthropometric data. Reliability data are shown (Tables 1, 2).

Table 1 Comparison of measurements [° and N] obtained from the two testing sessions for healthy subjects
Table 2 Comparison of measurements [° and N] obtained from the two testing sessions for team handball players

The intrarater reliability varied from 0.94 to 0.97 (SEM 1.07 N–4.76 N) for the shoulder joint samples from the handball athletes and 0.96–1.00 (SEM 0 N–4.48 N) from the control group. The ICC and SEM for the elbow joint samples differed between 0.79 and 0.97 (SEM 0.98°–5.94 N) for the athletes and 0.87–1.00 (SEM 0 N–5.43 N) for the volunteer subjects. The test–retest results were highly reliable for both samples. Based on the recommendation (ICC > 0.75) of Portney and Watkins [20], all parameters showed good reliability. The SEM differed in similar ranges in both groups. For female handball players, we found the lowest SEM with 0.98° in elbow ROM extension (throwing arm). In contrast, the highest SEM was calculated in elbow strength extension (non-throwing arm) with 5.94 N. Comparable results were observed for the control subjects. The lowest SEM was 0° and 0 N for five different parameters (Table 1); in contrast, elbow flexion strength of the throwing arm had the highest SEM (5.43 N). There were no significant differences between testing sessions (Tables 1, 2). 75 % of the parameters (12/16) showed larger limits of agreement in the handball sample than in the control sample (Tables 1, 2).

Overall (about all parameters), we could not find a group effect regarding intrarater reliability. For comparing absolute numbers concerning ROM and strength at the shoulder and elbow joint for athletes and control subjects, we separated the sample of volunteers into males and females and analysed the female data as shown in Table 3.

Table 3 Descriptive statistics (mean ± SD; 95 % confidence interval) calculated for each sample

We found significant differences in the throwing arm elbow isometric strength between the handball athletes (flexion 150.0 ± 19.3 N, extension 116.2 ± 21.3 N) and control subjects (flexion 117.0 ± 24.3 N, extension 82.3 ± 18.2 N). There was also a significantly higher amount of throwing arm isometric strength for shoulder external and internal among the athletes (athletes IR 92.6° ± 20.0°, ER 105.9° ± 19.2°; controls IR 71.9° ± 16.4°, ER 82.8° ± 16.5°) (Table 3). For the non-throwing arm, we determined significant differences in all parameters except for ER ROM at the shoulder between the groups. Although only elbow extension and elbow ROM were significantly larger in the control group (Table 3). There were no significant differences for the calculated parameters GIRD, ERG, or total ROM in either arm between groups (Table 3). We found significant differences for all strength parameters between groups. The largest difference (η 2 = 0.533) was observed for elbow extension strength (controls: 72.5 ± 14.5 N vs. handball: 112.1 ± 20.4 N). In contrast, only 50 % (4/8) of flexibility parameters showed significant differences. 75 % (3/4) of these parameters were found in the throwing arm.

Discussion

This study investigated the parameters for intrarater reliability of measuring active ROM and strength in the shoulder and elbow using standardized positions among female handball athletes and a group of controls. The study revealed good-to-excellent reliability for active shoulder internal and external ROM as well as elbow flexion and extension ROM, measured with a goniometer. The study also established good-to-excellent reliability for isometric shoulder IR and ER strength as well as elbow flexion and extension strength, using a HHD.

Cools et al. [6] measured reliability in shoulder external/internal rotation strength and passive ROM among asymptomatic individuals with two examiners at a single time evaluation. They concluded that different patient’s positions and technical equipment might influence the outcome measures of the assessment for shoulder external and internal rotation [6]. Kolber et al. [1] described interchangeable results when using goniometry and a digital inclinometer. Kolber et al. [4] also reported a high range of reliability for evaluating healthy male and female, using a HHD. Most previous studies have been focused on shoulder passive ROM. However, our results show similar reliability to those of Kolber et al. [29] and Fieseler et al. [9] for measuring active shoulder ROM. Our reliability is also consistent with other studies using passive motion [1, 4, 6]. Because different subject positions have been shown to influence ROM during evaluation, we attempted to standardize our tests using a supine position for shoulder ROM and strength, and elbow ROM as well as sitting for elbow strength. According to the work from Cools et al. [6], we kept the subject’s position constant for practical utility, which has been an important condition to reveal reproducible results. Owing to technical difficulties of requiring both hands for goniometry and HHD, we conducted the whole study with one examiner and an assistant, who only needs to be familiar with the purpose of the study and the stabilization techniques. According to Wilk et al. [16] and based on our good reliability, the positioning and stabilization techniques we used should be considered in the prevention and evaluation of various shoulder and elbow pathologies among athletes and non-athletes alike. There is no research available that has used a time interval similar to ours of 7 days in between test sessions. Although previous research has investigated male team handball players [9], this is the first study to investigate the reliability in shoulder and elbow joint active ROM and isometric strength among female handball athletes.

There is a similar lack of research investigating the reliability of shoulder and elbow strength measurements using a HHD. Gajdosik et al. [30] showed fair-to-excellent reliability when measuring very young children (age 28–50 months). Similarly, Rex Wong et al. [31] and Vermeulen et al. [32] reported good reliability, but tested a non-throwing adult population. Therefore, our results are the first to show that using HHD can produce reliable results for isometric strength among female athletes.

In summary, Schrama et al. [20] concluded in a recent review article that intrarater reliability of muscle strength testing with HHD was only acceptable for elbow measurements in asymptomatic individuals. There has been a wide range of grades of ICCs for shoulder strength among different studies, especially for athletes or patients with pain [20]. These authors have also recommended future research and pointed out that clinicians should not rely on HHD measurements for evaluation of treatment effects in patients with upper extremity disorders [20]. The results of our study prove that reliable measurements for shoulder and elbow ROM and strength among female athletes and asymptomatic non-throwing individuals can be conducted. Our results also underscore the need for consistent conditions during evaluation like standardizing subject’s position, proper stabilization techniques and defined time intervals between measurement sessions.

As a limitation, we know that the end ROM measurements taken during our flexibility tests were determined by subjective criteria. Future studies should also examine the reliability and validity of ROM and isometric strength testing among populations with joint disorders, pain or under high demands, such as athletes during competitive seasons.

Conclusion

We demonstrated a clinically applicable and standardized protocol for determining ROM and isometric strength in the shoulder and elbow. We revealed good-to-excellent reliability for shoulder and elbow ROM and strength in female team handball athletes and control subjects using a standard goniometer and HHD. Measurements for ROM and strength are recommended in the supine position for the shoulder and sitting for elbow strength because of the practical applicability and trunk stabilization [6].