Inter-rater reliability of diagnostic criteria for sacroiliac joint-, disc- and facet joint pain
Abstract
BACKGROUND/OBJECTIVE: Several diagnostic criteria sets are described in the literature to identify low back pain subtypes, but very little is known about the inter-rater reliability of these criteria. We conducted a study to determine the reliability of diagnostic tests that point towards SI joint-, disc- or facet joint pain.
METHODS: Inter-rater reliability study alongside three randomized clinical trials. Multidisciplinary pain center of general hospital. Patients aged 18 or more with medical history and physical examination suggestive of sacroiliac joint-, disc- and facet joint pain on lumbar level. Making use of nowadays most common used diagnostic criteria, a physical examination is taken independently by three physicians (two pain physicians and one orthopedic surgeon). Inter-rater reliability (Kappa (κ) measure of agreement) and significance (p) between raters are presented. Strengths of agreement, indicated with κ values above 0,20, are presented in order of agreement.
RESULTS: One hundred patients were included. None of the parameters from the physical investigation had κ values of more than 0.21 (fair) in all pairs of raters. Between two raters (C and D), there was an almost perfect agreement on three parameters, more specifically ``Abnormal sensory and motor examination, hyperactive or diminished reflexes'', ``Sitting exam shows no reflex, motor or sensory signs in the legs'' and ``Straight leg raising (Laségue) negative between 30 and 70 degrees of flexion''. The ``Drop test positive'' parameters had moderate strength of agreement between raters A and D and fair strength between raters A and B. The ``Digital interspinous pressure test positive'' had moderate strength of agreement between raters C and D and fair strength of agreement between raters A and B as well as raters B and C. Three other parameters had a fair strength of agreement between two raters, all other parameters had a slight or poor strength of agreement. Inter-rater reliability, confidence intervals and significance of pooled items for SI joint-, disc- and facet joint pain are represented; κ values for the pooled parameters of the physical examination suggestive of SI joint pain stayed below 0.20 between all raters. The same applies for the pooled parameters of the physical examination suggestive of facet joint or disc pain.
CONCLUSIONS: The poor reliability of the diagnostic parameters seriously limits their predictive validity, and as such their use in patients with low back pain for more than 3 months.
References
[1] | Robinson HS, , Brox JI, , Robinson R, , Bjelland E, , Solem S, , Telje T. The reliability of selected motion- and pain provocation tests for the sacroiliac joint. Manual Therapy (2007) ; 12: : 72-9. |
[2] | Young S, , Aprill C, , Laslett M. Correlation of clinical examination characteristics with three sources of chronic low back pain. The Spine Journal (2003) ; 3: : 460-5. |
[3] | Hancock MJ, , Maher CG, , Latimer J, , Spindler MF, , McAuley JH, , Laslett M, et al. Systematic review of tests to identify the disc, SIJ or facet joint as the source of low back pain. Eur Spine J (2007) ; 16: : 1539-50. |
[4] | Kottner J, , Audigé L, , Brorson S, , Donner A, , Gajewski BJ, , Hróbjartsson A, et al. Guidelines for reporting reliability and agreement studies (GRRAS) were proposed. J Clin Epidemiol (2011) ; 64: : 96-106. |
[5] | Slipman CW, , Sterenfeld EB, , Chou LH, , Herzog R, , Vresilovic E. The predictive value of provocative sacroiliac joint stress maneuvres in the diagnosis of sacroiliac joint syndrome. Arch Phys Med Rehabil (1998) ; 79: : 288-92. |
[6] | Robinson HS, , Brox JI, , Robinson R, , Bjelland E, , Solem S, , Telje T. The reliability of selected motion- and pain provocation tests for the sacroiliac joint. Manual therapy (2007) ; 12: : 72-9. |
[7] | Laslett M, , Williams M. The reliability of selected pain provocation tests for sacroiliac joint pathology. SPINE (1994) ; 19: : 1243-9. |
[8] | Dreyfuss P, , Michaelsen M, , Pauza K, , McLarty J, , Bogduk N. The value of medical history and physical examination in diagnosing sacroiliac joint pain. SPINE (1996) ; 21: : 2594-2602. |
[9] | Laslett M, , Aprill CN, , McDonald B, , Young SB. Diagnosis of sacroiliac joint pain: validity of individual provocation tests and composites of tests. Manual Therapy (2005) ; 10: : 207-18. |
[10] | Kundel HL, , Polansky M. Measurement of Observer Agreement. Radiology (2003) ; 228: : 303-8. |
[11] | Siegel TS, , Castellan Jr NJ. Nonparametric statistics for the behavioral sciences, second edition, New York: McGraw-Hill, (1988) . |
[12] | Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas (1960) ; 20: : 37-46. |
[13] | Landis JR, , Koch GG. The measurement of observer agreement for categorical data. Biometrics (1997) ; 33: : 159-74. |
[14] | Potter NA, , Rothstein JM. Intertester reliability for selected clinical tests of the sacroiliac joint. Phys Ther (1985) ; 65: : 1671-5. |