Purpose: The purpose of this study was to evaluate the reliability of a newly developed AI-algorithm for the evaluation of long leg radiographs (LLR) after total knee arthroplasties (TKA).
Methods: In the validation cohort 200 calibrated LLRs of eight different common unconstrained and constrained knee systems were analysed. Accuracy and reproducibility of the AI-algorithm were compared to manual reads regarding the hip-knee-ankle (HKA) as well as femoral (FCA) and tibial component (TCA) angles. In the evaluation cohort all institutional LLRs with TKAs in 2018 (n = 1312) were evaluated to assess the algorithms' ability of handling large data sets. Intraclass correlation (ICC) coefficient and mean absolute deviation (sMAD) were calculated to assess conformity between the AI software and manual reads.
Results: Validation cohort: The AI-software was reproducible on 96% and reliable on 92.1% of LLRs with an output and showed excellent reliability in all measured angles (ICC > 0.97) compared to manual measurements. Excellent results were found for primary unconstrained TKAs. In constrained TKAs landmark setting on the femoral and tibial component failed in 12.5% of LLRs (n = 9). Evaluation cohort: Mean measurements for all postoperative TKAs (n = 1240) were 0.2° varus ± 2.5° (HKA), 89.3° ± 1.9° (FCA), and 89.1° ± 1.6° (TCA). Mean measurements on preoperative revision TKAs (n = 74) were 1.6 varus ± 6.4° (HKA), 90.5° ± 3.1° (FCA), and 88.9° ± 4.1° (TCA).
Conclusions: AI-powered applications are reliable for automated analysis of lower limb alignment on LLRs with TKAs. They are capable of handling large data sets and could, therefore, lead to more standardized and efficient postoperative quality controls.
Level of evidence: Diagnostic Level III.
Keywords: Artificial intelligence; Knee alignment; Large data; Long leg radiograph; Standardization; Total knee arthroplasty.
© 2022. The Author(s) under exclusive licence to European Society of Sports Traumatology, Knee Surgery, Arthroscopy (ESSKA).