Your English writing platform
Discover LudwigExact(11)
Intra-rater reliability was evaluated for raters 1 and 2, who evaluated twice the same 50 random radiographs 4 months apart (test retest reliability).
The rater severity measures range from -4.59 logits (for the most lenient rater, Rater 10) to 1.40 logits (for the most severe raters, Raters 1 and 11).
However, the overall ICC of the three raters was only 0.58 in the upper end of the fair category with the ICC between raters 1 and 3 and 1 and 2 being only 0.50 and 0.58, respectively.
Comparing the fair averages of the most severe and most lenient raters, we would conclude that, on average, Rater 10 tended to give ratings that were 3.16 raw score points higher than Raters 1 and 11 (i.e., 8.90 5.74 = 3.16).
When we examine the map, we see that the harshest raters (Raters 1 and 11) had a severity measure of about 1.4 logits, while the most lenient raters (Raters 3, 10 and 13) had a severity measure of about -4.0 logits.
To prevent rater-introduced bias, a random sample of images from each expression category was flipped on the left-right axis, prior to re-measurement by rater 2. A Pearson Product Moment correlation coefficient of the FAI measurements between raters 1 and 2 was positive and significant (r = .725, df = 126, p<.01), suggesting good agreement between the raters.
Similar(49)
Table 5 Intra-rater (test retest) reliability and inter-rater reliability (measured by intraclass correlation coefficient — ICC) for posterior tibial cortex (PTC) and tibial proximal anatomical axis (TPAA) methods obtained by repeated measurements in 50 random radiographs PTC TPAA Intra-rater (rater 1) 0.67 0.79 Intra-rater (rater 2) 0.89 0.93 Inter-rater (among 3 raters) 0.81 0.81.
The majority of raters (4 out of 5) preferred the MBPS.
For determination of intra- and inter-rater reliability, 50 random X-rays were selected, and blindly measured by two other raters (2 and 3).
Out of the 4,200 pairs (300 samples graded by 14 raters), 21.7% had perfect agreement.
The correlation between three raters (0.66) made reliability index of 0.85.
Write better and faster with AI suggestions while staying true to your unique style.
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com