Introduction

Over the past 200 years, descriptions and concepts of the many integrated components of human sacroiliac joints (SIJ) and pelvic girdle systems have emerged. In particular, the concept of motion in the SIJ has undergone a slow, convoluted evolution. Although initial writings suggested that motion is normally present between the ilia and sacrum, this idea fell out of favor in the mid-20th century [37]. Now, current research supports the existence of a limited motion in the average range of 2–4° in all three planes of this joint, but only in a lying position [15].

The SIJs are highly specialized joints that permit stable (yet flexible) support of the upper body. In bipeds, the pelvis serves as a basic platform with three large levers acting on it (the spine and two legs). Thus, for bipeds submitted to gravity in an upright position, both the necessary tightness of the well-developed fibrous apparatus and the specific architecture of the SIJ result in limited mobility. Sacral movement involves the SIJ, and most likely directly influences the discs and the higher lumbar joints as well [38].

The SIJ appears as a typical joint from the second month in utero onwards and development of the joint cavity is completed by months 7–8. Examination of 200 anatomical specimens revealed that the bony surfaces of the joint are smooth until puberty [37]. At a later age, different combinations of bony ridges and grooves occur. The most frequent location of the ridges appears to be on the ilium. These ridges are not identical with osteoarthritis. However, the interdigitating symmetrical grooves and ridges [6] of the SIJ articular surfaces contribute to the highest coefficient of friction of any di-arthrodial joint. This property enhances the stability of the joint against shearing [37]. The keystone-like bony anatomy of the sacrum further contributes to stability within the pelvic ring. At its base, the sacrum is wider superiorly than inferiorly; it is also wider anteriorly than posteriorly, permitting the sacrum to act like a “wedge” towards caudally and dorsally between the ilia within the pelvic ring. This anatomical structure of the sacrum in humans is adapted to resist shearing from vertical compression (e. g., gravity) and anteriorly directed forces on the spine. Therefore, in an upright position, the mobility is even more restricted and the total degree of mobility is less than 1° [37].

Immediately after birth, the general orientation of the human SIJ is very similar to that of quadrupeds. The articular surfaces have the same angular orientation as the zygapophyseal joints of the lumbar vertebrae. Change begins as soon as the child starts to walk upright. The sacrum enlarges laterally, and the articular surfaces modify to a more complex adult curvature, resulting in the surface profiles of the joint bearing resemblance to a propeller-like shape. Comparative anatomical and paleontological research indicate that these changes are the result of mechanical factors, such as the supine position, body weight, load on the femur, and strain on the pubic symphysis [27].

Typically, the SIJ includes the sacral segments S1, S2, and S3, although inclusion of the complete S3 segment in the SIJ is not common for females [38]. In general, fusion of the sacral vertebra begins early in the second decade. The bony anatomy of the joint surface is highly variable in size, shape, and contour among individuals, and the shape of the joint changes markedly from infancy to adulthood [6]. The sacral auricular part is generally concave; however, often an intra-articular bony tubercle is present ventrally, in the middle aspect of the auricular surface of the sacrum. The iliac part is predominantly convex. Large variations of the auricular surfaces exist, resulting in intra-individual variations of motion axes.

The SIJ is unique in having elements of a combined synarthrosis and a diarthrosis—hence resulting in the term amphiarthrosis. The main portion of the joint is surrounded by a complex capsule and lined with cartilage (diarthrosis). Its shape is auricular, and “opens” posteriorly. The sacrum and ilia have an extracapsular, dorsally located articulation (synarthrosis), which is augmented by the vast iliosacral ligament (ISL) that provides considerable internal stability. Essentially, the SIJ is encased in a capsule that has a smooth anterior wall and irregular bands comprising the posterior wall. The capsule is innervated, at least from the dorsal lumbosacral rami (L5–S3) and is surrounded by several strong ligaments, such as the sacrotuberal ligaments, sacrospinal ligaments, long dorsal sacroiliac, and iliolumbar ligaments, which therefore influence its range of motion. In turn, these ligaments are related to a complex thoracolumbar fascia composite derived from the aponeuroses of several large muscles (like the paraspinal and the abdominal muscles) that surround the joint at a distance [37].

Diagnostics of SIJ dysfunction

What is meant by the “dysfunction” of a joint that has so little mobility? Currently, there are no data to prove one or the other hypothesis—is it really the very small mobility, is it the affection of the capsule, or is it just a soft tissue reaction in segmental connection with the SIJ? Therefore, we rely on observations of pain and restriction in the region of the pelvic girdle and the related connective tissues.

On the one hand, structural changes exist in the bony parts of the joint, the synovia, and the capsule, which are caused either by long-lasting mechanical overcharge in the sense of degenerative osteoarthritic changes altering the mechanics by osteophytes. Alternatively, pathological autoimmune-reactive processes (by inflammation) also lead to a structural change of the joint, in the most severe situation represented by a synostotic fusion (i. e., ankylosing spondylitis). These changes are not reversible. However, they only occur in a small proportion of patients with respective complaints. On the other hand, the vast majority of patients present a reversible dysfunction during the examination, which sometimes disappears spontaneously, but more often will dissolve only after external influence (mechanical, medication, etc.).

Similar to the intervertebral joints and using the forces of compression or distraction, these reversible dysfunctions have the property to react to a respective mechanical provocation with a diminution of pain in at least one direction.

The reversible SIJ dysfunction follows the pattern of a complex arthro-neuromuscular reaction as an answer to nociception [4]. This complex nocireaction comprises the segmentally organized muscles that show a painful protective contraction, as well as the vast ligamentous apparatus of the pelvis that will develop painful insertions (here from segments L5–S3).

Last but not least, the multilayered collagenous fibers have to reorient themselves according to the altered direction of mechanical load within the system of the deep fascia of the abdominal, back, and leg muscles. In addition, a densification of the fascial layers through physical forces or biomechanical processes will occur that reduces the gliding, with the result that a spontaneous return to normal function is impeded [28, 29].

Which changes occur in the joint itself? Even today, we can only answer this question hypothetically. Based on the abovementioned data, it is quite plausible that the grooves and ridges that form themselves individually in every person during upright walking on the corresponding surfaces will always move in the same way, they will follow a uniform pattern.

In case external influences disturb this pattern, the grooves and ridges will no longer fit, leading to a disturbance of the movement—perhaps similar to a train that has been derailed. We can usually influence this biomechanical part of the complex nocireaction by mobilizing the joint partners (sacrum and ilium) in a direction previously established to be pain-free or just by separating them from each other.

The very successful manual therapeutic techniques to treat SIJ-related pain based on a reversible dysfunction that we teach today follow this hypothesis. Therefore, it is an indispensable precondition for treatment to differentiate between structural damage and reversible dysfunction by diagnostic steps, and furthermore to identify the pain-free direction of any movement in the latter. Although this sounds plausible, in daily practice it is not so easy.

Therefore, SIJ diagnostics remain difficult and uncertain. There is no typical pain pattern; there are substantial contradictions in the analysis of the innervation [10,11,12, 14, 24, 27]. Today, there is still no gold standard in SIJ diagnostics. Even Rx(fluoroscopically)-guided intraarticular injections do not have reliable results [9]. Many irritation zones/points have been described [5]. The clinical examination of function is not reliable and reproducible [21, 25, 26]. In addition, we have to take into consideration that the perception of pain at a certain point of the anatomy does not mean that the nocigenerator of this pain is exactly where the brain locates that pain. This means that a chronic pain in the area of the SIJ may also be a referred pain of any nocigenerator in any structure segmentally innervated from L5 to S3 [23]. Is there reliable information from the clinical findings at all? In this respect, the publications are conflicting and not unanimous [2, 8, 13, 19, 20, 32,33,34, 40]. Nevertheless, these studies present some general ideas:

  • Not reliable are:

    • One single test

    • An inexperienced examiner

    • SIJ pain combined with low back pain (LBP)

  • Reliable are:

    • At least three pain tests with the same result

    • An experienced examiner

    • Gluteal pain, also pseudo-radicular leg pain

    • Good positive or negative pre-test probability

    • Pain provocation is more reliable than palpation of mobility

The real functional mobility of the SIJ in a supine position is a complex three-dimensional nutation and counternutation movement of the sacrum towards the ilium in the range of 2–4°. This can only be tested in a lying position, using a helicoidal axis that varies largely even between the SIJs of the same person [3].

The axis even depends on the actual function (standing on both or one leg, lying in supine or prone position; [15]). During the nutation, the base of the sacrum moves cranially and anteriorly, and during counternutation, the tip of the sacrum moves caudally and anteriorly. During aging, the mobility decreases in men. In young subjects and in the horizontal position of the body we find 4°, in older men only 2°.

With only 2–4° of three-dimensional mobility in a supine position and less than 1° in an upright position, it is very difficult to produce clinically reliable signs of articular movements. In fact, clinical manual movement tests are completely unreliable for the SIJ [39]. Commonly used diagnostic tests are 1. the standing-flexion test (also called bending-forward-test or Gillet-test or Piedallu-test) and 2. the spine-test (also called hip flexion test or stork-/flamingo-test). Unfortunately, these tests have low reliability [22] and low reproducibility [21]. Sturesson et al. checked these tests in 22 patients with severe SIJ pain using radio-stereometric analysis (RSA). The results show minimal change of movements during the test, and no differences between symptomatic and asymptomatic sides. When the pelvis is loaded in a one-leg standing position while flexing the contralateral leg, patients are physically challenged, which leads to bilateral increased force closure of the SIJ [1, 3, 30]. During this test, no SIJ motion occurs. The visibly changing position of bony landmarks like the upper posterior iliac spine (SIPS) that is apparently derived from the SIJ is in fact a movement of the external pelvis relative to the hips. This gives the manually perceived but misleading illusion that the SIJs are moving [30]. Perhaps these functional tests react positively in the presence of SIJ dysfunction by changing the behavior of the connective tissues of the whole pelvic girdle.

Study aim and design

Several studies have checked the reliability of SIJ tests. In general, pain provocation tests seem to be more reliable than functional tests in the diagnosis of SIJ dysfunction without structural lesion [35, 39]. The most frequently used tests are the compression test, the femur-thrust test, the pelvic torsion test, the FAbER test (for flexion, abduction, and external rotation; [17]), and the hip extension test [35]. Based on this specific differential diagnosis, the results of manual therapy for SIJ-related gluteal and leg pain in a prospective single-blinded trial were better (72% success) than those of injections (50%) or physiotherapy (20%; [36]).

In Germany, physicians have been competent in the use of manual medicine (MM) and have been teaching the method for 65 years. They have developed these tests through empirical observations and good practical results. Nobody has ever evaluated these tests in comparison to other diagnostic techniques. Therefore, in the world of evidence-based medicine and reliability studies, evaluation of these tests in comparison to each other has become inevitable. The described approach uses three steps to identify SIJ dysfunction, including an exact medical history and a neurological examination:

  1. 1.

    Check for articular mobility (M; supposedly unreliable),

  2. 2.

    Check for articular painful irritation points (I; supposedly quite reliable)

  3. 3.

    Functional pain provocation of irritation points (P; supposedly highly reliable)

This is a diagnostic study with the primary aim of evaluating the interrater reliability between two independent examiners with respect to the principles of the three-step diagnosis. The secondary aim was to evaluate the validity of the three-step diagnosis compared to the already evaluated SIJ testing procedures (here: FAbER test, pelvic torsion test, femur-thrust test). In the case of the two pain provocation tests of the Physicians’ Seminar for Manual Therapy of Spinal and Periperal Joints (MWE; checking a gluteal irritation point for pain-free direction cranial–caudal and ventral–dorsal) reaching sufficient reliability in Cohen’s kappa [7], these two tests were to be added to the set of tests, reaching a set of five tests that are reliable for SIJ diagnostics.

Methods

Patient recruitment and rationale for sample size

Recruitment of a total of 75 cases and 75 additional controls was planned.

  • The symptomatic subjects were recruited from patients who came to a back pain rehabilitation clinic for a 3-week inpatient treatment because of chronic low back pain, not all had an SIJ dysfunction.

  • The healthy subjects (controls) were recruited from physicians attending MM courses; not all of them were really completely without signs and symptoms of dysfunction.

The sample size was chosen to provide reasonable accuracy for estimation of the kappa coefficient. In the planning stage, it was expected that the level of agreement would be higher than 0.65, i. e., almost perfect. The kappa value is calculated as κ = (p0 − pc)/(1 − pc), where p0 is the total agreement rate and pc the random agreement rate for a given cross-table. As we assume equal numbers of pathological and neutral cases, pc is expected to be 0.5. As a consequence, for κ = 0.65, it follows that p0 would be 0.825. The estimated standard deviation (SD) is given by SD(k) = √(p0(1 − p0)/(1 − pc)^2). For a kappa value of 0.65, the expected SD is thus 0.76. The two-sided 95% confidence interval (CI) for Cohen’s kappa coefficient is given as [k − 1.96 * SD(k)/√n; k + 1.96 * SD(k)/√n]. For a total number of 150 subjects (75 cases, 75 controls) the expected 95% CI will thus be given by [0.53; 0.77]. Therefore, we planned to recruit at least 150 subjects, which seems sufficient and reasonable for evaluation of interrater reliability.

During the recruitment phase, we were finally able to recruit 161 participants (80 healthy subjects, 81 with low back pain or pelvic girdle problems).

Diagnostic procedures

The two raters have both been teaching MM for more than 3 years with the same approach, so there was a short formation of only one weekend. The almost perfect overall agreement in the formation period was established during an educational course for teachers acting as instructors in MM. There were 46 participants (all physicians), who were investigated by the two raters.

The inclusion criteria were an age between 18 und 60 years, a body mass index (BMI) between 18 and 30 kg/m2, no radicular compression signs from L5 or S1. The subjects were assigned at random as either patients or controls, waiting outside of the two separated examination rooms for the next free rater. They were examined at random first by either rater 1 or by rater 2. In addition, the raters changed the sequence of tests at random to avoid the bias of the subjects expecting the same sequence. The two raters investigated all included subjects. Blinded to each other, they checked the subjects in a short time using a set of 10 diagnostic tests (one test was used in two versions), i. e., they finalized both examinations with a difference of less than 5 min. No treatment was applied. There was no follow-up.

The 10 tests:

  1. 1.

    Standing forward flexion test (also called Piedallu test, “Vorlaufphänomen”): the subject standing upright, the examiner fixates both posterior upper iliac spines (SIPS), the subjects bends forward; any development of asymmetry of the SIPS’ positions is registered as a positive (pathological) sign (Fig. 1).

  2. 2.

    Spine test (also called Gillet test, stork/flamingo test): the subject standing upright, one hand supported, the examiner fixates on the side to be tested the PSIS with one thumb and with the other thumb the sacral crest. During bending the leg of that side in hip and knee over 90°, the thumb on the SIPS should move caudally in normal function (Fig. 2).

  3. 3.

    Variability of leg length (also Derbolowsky test, sit-up test): the subject lying in supine position, the examiner takes both ankles with his hands, the thumbs in the same height at the ankle (Fig. 3a); the subject then sits up with eyes closed and no contact between the teeth; any developing asymmetry of the “length” of the legs is registered as a positive (pathological) sign (Fig. 3b).

  4. 4.

    Irritation point A: the subjects lying relaxed in prone position, the examiner presses on both sides the indexes in projection to the gluteal muscles, with the exact point to be found three finger-widths laterally of the SIJ (Fig. 4a), four finger-widths below the iliac crest (Fig. 4b). At the meeting point of the index fingers, the irritation point is expected (Fig. 4c). Any asymmetry in tissue consistency and pain is registered as a positive (pathological) irritation.

  5. 5.

    Irritation point B: the subjects lying relaxed in prone position, the examiner presses on both sides the indexes in projection to the gluteal muscles. He finds the exact point by dividing in two the distance between the upper end of the anal cleft and the lateral edge of the iliac crest (Fig. 5a, b). Any asymmetry in tissue consistency and pain is considered as a positive (pathological) irritation (Fig. 5c).

  6. 6.

    Functional pain provocation in horizontal (frontal) plane of the lying subject: traction of the leg caudally (patient’s ankle between physician’s thighs) will induce a nutation movement of the sacrum; the reaction is checked at the positive irritation point (4 and/or 5). The examiner registers an increase or decrease of tension and pain at the irritation point. Pushing the leg to cranial direction will indicate the reaction to a counternutation movement of the sacrum (Fig. 6a, b).

  7. 7.

    Functional pain provocation in vertical (sagittal) plane of the lying subject: compression to the sacral base (S1) will activate a nutation movement; compression to the tip of the sacrum (S5) will activate a counternutation movement. These directions of intraarticular forces will give information to the palpating finger at the irritation point, indicating a painful or pain-free direction of articular function provocation (Fig. 7a, b).

  8. 8.

    Flexion abduction external rotation test (FAbER test; also called sign of the 4 or Patrick test): with the subject lying in supine position, the examiner lays the lateral ankle of the side to be tested at the level of the patella of the opposite leg; he fixates the pelvis with one hand to the opposite anterior superior iliac spine (ASIS) and presses the ipsilateral knee towards the table. As the test also reacts to hip pain and its osteoarthritic stiffness, the degree of flexion, abduction, and external rotation is not important. Only the pain in the region of the sacroiliac joint counts for SIJ pathology (Fig. 8).

  9. 9.

    Pelvic torsion test (also called Gaenslen test): with the subject lying in supine position just on one edge of the table, the examiners positions this side’s leg besides the table in an extension position of the hip; then he bends the other leg in hip and knee as much as possible. After checking for pain in the sacral region, the test is repeated in the other direction of pelvic torsion from the other side of the table (Fig. 9).

  10. 10.

    Femur-thigh thrust (also called Ostgaard test or 4P test = posterior pelvis pain provocation test): with the subject lying in supine position, the examiner bends the leg opposite to him 90° in the hip and rolls the pelvis to his side, so he can position one hand’s contracted thenar under the sacrum, leaving the ilium free (Fig. 10a). Then he rolls the pelvis back to a vertical position of the femur and gives with the weight of his body a thrust to the knee in direction of the own hand under the sacrum (Fig. 10b). After registering “pain” or “no pain” in the region of the SIJ, he will repeat the test with the other femur.

Fig. 1
figure 1

Standing-flexion test: both examiner’s thumbs pressed to the upper posterior iliac spine the subjects bends forward – check for asymmetrical movement of the thumbs. With kind permission of J.-M. Werner

Fig. 2
figure 2

Spine-test: one examiner’s thumb pressed to the upper posterior iliac spine, the other to the sacral crest, the subject flexes his leg on that side up to 90°; for free function of the SIJ, the iliac thumb should move downwards. With kind permission of J.-M. Werner

Fig. 3
figure 3

Variability of leg length. a view from the side: start position with the subject lying supine, the legs of the subject are slightly lifted to have no contact with the table, b view from above: after the subject achieved the sitting position, the thumbs of the examiner should remain on the same level as in the lying position; if not, the test is registered as “positive ”. With kind permission of J.-M. Werner

Fig. 4
figure 4

Access “A” to SIJ-irritation. a Three finger withs laterally of the medial iliac rim, b four finger withs below the iliac crest, c the meeting-poiunt of the indexes is the point of the irritation. With kind permission of J.-M. Werner

Fig. 5
figure 5

Access “B” to SIJ-irritation. a position the middle fingers on the upper lateral iliac crest, b position the thumbs to the cranial end of the anal cleft, c bisecting the lines between middle finger and thumb you may find the SIJ-irritation bilaterally. With kind permission of J.-M. Werner

Fig. 6
figure 6

Check for pain-free direction cranial-caudal. a The middle finger on the irritation, the subject’s leg is tract down (nutation) or pushed upwards (counternutation) at the ankle between the examiners thighs, b position of the middle finger in relation to the pelvic bones. With kind permission of J.-M. Werner

Fig. 7
figure 7

Check for pain-free direction ventral-dorsal. a With the middle finger to the irritation, compression to the ilium will provoke a counternutation direction, b compression to the sacral base (S1) will provoke a nutation direction. With kind permission of J.-M. Werner

Fig. 8
figure 8

FAbER-test: the ankle of the side to be tested on level of the opposite patella, the hip is in flexion, abduction, external rotation; the pelvis is fiexed on the opposite side, the end-feel is tested. With kind permission of J.-M. Werner

Fig. 9
figure 9

Pelvic torsion test: one leg is positioned laterally of the table in hip extension, the other leg is flexed as much as possible in the hip; check for pain. With kind permission of J.-M. Werner

Fig. 10
figure 10

Femur-thigh-thrust. a The examiners contracted thenar is positioned exactly under the sacrum, b with the weight of his body the examiners thrusts the subject’s knee/femur in the direction of his hand. With kind permission of J.-M. Werner

Statistical analysis strategy

All analyses were performed for each diagnosed and evaluated side (right, left) separately. The interrater agreement for the three-step diagnostic approach was evaluated by means of Cohen’s kappa coefficient along with a corresponding 95% CI. Cohen’s kappa coefficients were also calculated for each of the established tests separately. Sensitivity and specificity for the three-step diagnostic approach in comparison to several well-established tests were not evaluated separately, as the number of positive diagnoses was too small, especially for the left side. Nevertheless, the result of the diagnostic procedure was defined as positive if at least three of the five established tests (irritation point craniocaudal, irritation point ventrodorsal, FAbER test, Gaenslen test, and the 4P or femur-thrust test according to Ostgaard) showed a positive result.

Results

Generally, the magnitude of Cohen’s kappa coefficients for the agreement of the two raters is interpreted as follows: kappa values between 0 and 0.20 indicate poor agreement, values between 0.21 and 0.40 indicate sufficient agreement, values between 0.41 and 0.6 are interpreted as a substantial agreement, and kappa coefficients above 0.61 are interpreted as almost perfect agreement [16].

Although 168 subjects signed the informed consent to participate, 7 had to be excluded from evaluation as they did not fulfill the inclusion criteria, mainly because they were too old or too heavy. Of the included subjects, 82 (50.9%) were male, 79 (49.1%) were female. The mean age was 44.4 ± 8.9 years, the mean weight was 76.3 ± 13.4 kg, the mean height was 174.6 ± 10.0 cm, and the mean BMI was 24.8 ± 3.1 kg/m2. The general physical examination found in 3 (1.9%) subjects a pathologic condition in the central nervous system, 9 (5.6%) with cardiac problems (high blood pressure), 2 (1.2%) with asthma, 1 (0.6%) with nephrologic problems, 11 (6.8%) had previous gastric ulcers, and 11 (6.8%) had thyroid disorders. They were included nevertheless, as all of them used the respective medication to control their specific problems.

In 8 (5.0%) subjects there were signs of some nerve root compression, but none from the lumbosacral region, so they were not excluded. Another 31 (19.3%) showed signs and symptoms of intervertebral dysfunction in the segments T12 to L5, which was also not a criterion for exclusion.

For the three functional tests (standing forward flexion, spine test, and checking the variability of leg lengths), we did not expect good results for interrater agreement. They are not based on real mobility within the SIJ, but rather on connective tissue reactions. Therefore, they may probably but not reliably be related to SIJ dysfunction. Although the observed kappa values were not poor for each of these functional tests, the 95% CI was too wide to allow an appropriate interpretation. Nevertheless, the observed kappa values for the FAbER test and the pelvic torsion test were almost equal to the data from the references [18]. In our study, we only found similarly good and precise results for the femur-thigh thrust as for the pain provocations to the gluteal muscular irritation. All results are compiled in Table 1.

Table 1 Cohen’s kappa coefficients of the specific sacroiliac joint (SIJ) diagnostics; κ‑values and 95% confidence interval (CI) ranges

These results show that SIJ diagnosis using the gluteal irritation in combination with the pain provocation in the cranial–caudal plane as well as in the ventral–dorsal plane have an almost perfect kappa coefficient and are even more reliable than the already examined tests.

We accepted the precondition [18] that there is a need for at least three positive tests out of five for the diagnosis of SIJ dysfunction. Using the pain provocation test cranial–caudal, pain provocation test ventral–dorsal, FAbER test, pelvic torsion test, and femur thrust test, the interrater reliability for one of the possible three diagnoses (no SIJ dysfunction, dysfunction right SIJ, and dysfunction left SIJ) was evaluated:

  • No SIJ dysfunction (80 subjects): Cohen’s κ = 0.93 ± 0.03; 95% CI [0.87; 0.99]

  • Right-side SIJ dysfunction (61 subjects): Cohen’s κ = 0.95 ± 0.03; 95% CI [0.89; 1.00]

  • Left-side SIJ dysfunction (20 subjects): Cohen’s κ = 0.94 ± 0.04; 95% CI [0.86; 1.00]

Discussion and conclusion

Limitations

The study is limited to a group of inpatients supposed to be impaired and to a group of physicians supposed to be healthy subjects. However, there is no evidence that these groups react differently than other patients or other healthy controls. There was no treatment at all and no follow-up to check the reliability of the diagnoses by post-therapeutic outcome.

Although the study was designed as a blinded study regarding the allocation of the subjects to the raters, some symptomatic subjects tried to start a discussion with the raters to get more information about themselves. Therefore, it was easy for the raters to uncover the respective allocation. As there were also many symptomatic subjects among the “controls” and almost the same number of asymptomatic subjects among the “patients,” we estimate this bias to be negligible. As both groups presented both results regarding the SIJ, the raters had to examine each subject without predisposition.

The number of subjects included was not as high as necessary for sufficient evaluation of a left SIJ pathology. The relation between symptomatic right-side and the symptomatic left-side SIJ dysfunction confirmed the empiric observation that the dysfunction of the right SIJ occurs much more often in our population (4:1) than the left SIJ. Unfortunately, the sensibility and specificity could not be evaluated reliably, because of the small number of positive diagnoses for the left side. Nevertheless, the model “3 out of 5 positive pain provocation tests” has already been evaluated with an estimated sensitivity of 93.8% and a specificity of 78.1% [18].

Regarding the evaluated manual tests, only the most commonly used tests of one physicians’ association for MM were investigated. However, the implementation of all other manual diagnostic techniques would have made it impossible to achieve reliable results. Already the 10 tests are supposed to be the maximum possible in one setting.

Generalizability

The methods of examination evaluated to show sufficient agreement were very efficient and can be generalized. Based on the results, the clinical examination of a supposed SIJ dysfunction can be performed reliably using the gluteal irritation of muscles with provocation in two planes in combination with the 4P (i. e., femur-thrust) test. In cases where these three do not have the same positive answer, they can nevertheless be used in a set with two additional tests (FAbER and Gaenslen tests) in the model “3 out of 5 positive pain-provocation tests.”

Regarding the results, there was an excellent reliability for the gluteal irritation point in combination with pain provocation in two planes. The consistent data of all pain provocation tests for the SIJ fit nicely. The study terminates with a very robust observation.

Interpretation and recommendation

This study shows that including the traditionally used irritation points (or zones) in the gluteal muscles with pain provocation according to the possible directions within the SIJ (nutation or counternutation) within a set of five tests is highly reliable and will lead to good results for the diagnostic determination of a neutral function or a unilateral dysfunction of the SIJ.