-
How do Patient-reported Outcome Scores in International Hip and Knee Arthroplasty Registries Compare?
Patient-reported outcome measures (PROMs) are the only systematic approach through which the patient's perspective can be considered by surgeons (in determining a procedure's efficacy or appropriateness) or healthcare systems (in the context of value-based healthcare). PROMs in registries enable international comparison of patient-centered outcomes after total joint arthroplasty, but the extent to which those scores may vary between different registry populations has not been clearly defined.
(1) To what degree do mean change in general and joint-specific PROM scores vary across arthroplasty registries, and to what degree is the proportion of missing PROM scores in an individual registry associated with differences in the mean reported change scores? (2) Do PROM scores vary with patient BMI across registries? (3) Are comorbidity levels comparable across registries, and are they associated with differences in PROM scores?
Thirteen national, regional, or institutional registries from nine countries reported aggregate PROM scores for patients who had completed PROMs preoperatively and 6 and/or 12 months postoperatively. The requested aggregate PROM scores were the EuroQol-5 Dimension Questionnaire (EQ-5D) index values, on which score 1 reflects "full health" and 0 reflects "as bad as death." Joint-specific PROMs were the Oxford Knee Score (OKS) and the Oxford Hip Score (OHS), with total scores ranging from 0 to 48 (worst-best), and the Hip Disability and Osteoarthritis Outcome Score-Physical Function shortform (HOOS-PS) and the Knee Injury and Osteoarthritis Outcome Score-Physical Function shortform (KOOS-PS) values, scored 0 to 100 (worst-best). Eligible patients underwent primary unilateral THA or TKA for osteoarthritis between 2016 and 2019. Registries were asked to exclude patients with subsequent revisions within their PROM collection period. Raw aggregated PROM scores and scores adjusted for age, gender, and baseline values were inspected descriptively. Across all registries and PROMs, the reported percentage of missing PROM data varied from 9% (119 of 1354) to 97% (5305 of 5445). We therefore graphically explored whether PROM scores were associated with the level of data completeness. For each PROM cohort, chi-square tests were performed for BMI distributions across registries and 12 predefined PROM strata (men versus women; age 20 to 64 years, 65 to 74 years, and older than 75 years; and high or low preoperative PROM scores). Comorbidity distributions were evaluated descriptively by comparing proportions with American Society of Anesthesiologists (ASA) physical status classification of 3 or higher across registries for each PROM cohort.
The mean improvement in EQ-5D index values (10 registries) ranged from 0.16 to 0.33 for hip registries and 0.12 to 0.25 for knee registries. The mean improvement in the OHS (seven registries) ranged from 18 to 24, and for the HOOS-PS (three registries) it ranged from 29 to 35. The mean improvement in the OKS (six registries) ranged from 15 to 20, and for the KOOS-PS (four registries) it ranged from 19 to 23. For all PROMs, variation was smaller when adjusting the scores for differences in age, gender, and baseline values. After we compared the registries, there did not seem to be any association between the level of missing PROM data and the mean change in PROM scores. The proportions of patients with BMI 30 kg/m² or higher ranged from 16% to 43% (11 hip registries) and from 35% to 62% (10 knee registries). Distributions of patients across six BMI categories differed across hip and knee registries. Further, for all PROMs, distributions also differed across 12 predefined PROM strata. For the EQ-5D, patients in the younger age groups (20 to 64 years and 65 to 74 years) had higher proportions of BMI measurements greater than 30 kg/m² than older patients, and patients with the lowest baseline scores had higher proportions of BMI measurements more than 30 kg/m² compared with patients with higher baseline scores. These associations were similar for the OHS and OKS cohorts. The proportions of patients with ASA Class at least 3 ranged across registries from 6% to 35% (eight hip registries) and from 9% to 42% (nine knee registries).
Improvements in PROM scores varied among international registries, which may be partially explained by differences in age, gender, and preoperative scores. Higher BMI tended to be associated with lower preoperative PROM scores across registries. Large variation in BMI and comorbidity distributions across registries suggests that future international studies should consider the effect of adjusting for these factors. Although we were not able to evaluate its effect specifically, missing PROM data is a recurring challenge for registries. Demonstrating generalizability of results and evaluating the degree of response bias are crucial in using registry-based PROMs data to evaluate differences in outcome. Comparability between registries in terms of specific PROMs collection, postoperative timepoints, and demographic factors to enable confounder adjustment is necessary to use comparison between registries to inform and improve arthroplasty care internationally.
Level III, therapeutic study.
Ingelsrud LH, Wilkinson JM, Overgaard S, Rolfson O, Hallstrom B, Navarro RA, Terner M, Karmakar-Hore S, Webster G, Slawomirski L, Sayers A, Kendir C, de Bienassis K, Klazinga N, Dahl AW, Bohm E
... -
《-》
-
Do the Revision Rates of Arthroplasty Surgeons Correlate With Postoperative Patient-reported Outcome Measure Scores? A Study From the Australian Orthopaedic Association National Joint Replacement Registry.
Patient-reported outcome measures (PROMs) are a pragmatic and efficient means to evaluate the functional quality of arthroplasty beyond revision rates, which are used by most joint replacement registries to judge success. The relationship between these two measures of quality (revision rates and PROMs) is unknown, and not every procedure with a poor functional result is revised. It is logical, although still untested, that higher cumulative revision rates correlate inversely with PROMs for individual surgeons; that is, more revisions are associated with lower PROM scores.
We used data from a large national joint replacement registry to ask: (1) Does a surgeon's early THA cumulative percent revision (CPR) rate and (2) early TKA CPR rate correlate with the postoperative PROMs of patients undergoing primary THA and TKA, respectively, who have not undergone revision?
Elective primary THA and TKA procedures in patients with a primary diagnosis of osteoarthritis that were performed between August 2018 and December 2020 and registered in the Australian Orthopaedic Association National Joint Replacement Registry PROMs program were eligible. THAs and TKAs were eligible for inclusion in the primary analysis if 6-month postoperative PROMs were available, the operating surgeon was clearly identified, and the surgeon had performed at least 50 primary THAs or TKAs. Based on the inclusion criteria, 17,668 THAs were performed at eligible sites. We excluded 8878 procedures that were not matched to the PROMs program, leaving 8790 procedures. A further 790 were excluded because they were performed by unknown or ineligible surgeons or were revised, leaving 8000 procedures performed by 235 eligible surgeons, including 4256 (53%; 3744 cases of missing data) patients who had postoperative Oxford Hip Scores and 4242 (53%; 3758 cases of missing data) patients who had a postoperative EQ-VAS score recorded. Complete covariate data were available for 3939 procedures for the Oxford Hip Score and for 3941 procedures for the EQ-VAS. A total of 26,624 TKAs were performed at eligible sites. We excluded 12,685 procedures that were not matched to the PROMs program, leaving 13,939 procedures. A further 920 were excluded because they were performed by unknown or ineligible surgeons, or because they were revisions, leaving 13,019 procedures performed by 276 eligible surgeons, including 6730 (52%; 6289 cases of missing data) patients who had postoperative Oxford Knee Scores and 6728 (52%; 6291 cases of missing data) patients who had a postoperative EQ-VAS score recorded. Complete covariate data were available for 6228 procedures for the Oxford Knee Score and for 6241 procedures for the EQ-VAS.
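The exclusion counts above can be cross-checked with simple arithmetic. The sketch below uses only numbers reported in the paragraph; the function and variable names are illustrative and are not part of the study.

```python
def cohort_flow(performed, unmatched, excluded):
    """Return (matched, analysed) counts after the two exclusion steps."""
    matched = performed - unmatched      # procedures matched to the PROMs program
    analysed = matched - excluded        # after removing unknown/ineligible surgeons and revisions
    return matched, analysed

def percent(part, whole):
    """Percentage rounded to the nearest whole number, as in the text."""
    return round(100 * part / whole)

# Counts as reported in the abstract.
tha_matched, tha_analysed = cohort_flow(17_668, 8_878, 790)
tka_matched, tka_analysed = cohort_flow(26_624, 12_685, 920)

# Patients with a postoperative score plus missing cases should equal the analysed total.
tha_ohs_total = 4_256 + 3_744
tka_oks_total = 6_730 + 6_289

print(tha_matched, tha_analysed, percent(4_256, tha_analysed))
print(tka_matched, tka_analysed, percent(6_730, tka_analysed))
```

Each reported subtotal is consistent with the stated exclusions, and the response proportions round to the 53% (THA) and 52% (TKA) given in the text.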
The Spearman correlation between the operating surgeon's 2-year CPR and 6-month postoperative EQ-VAS Health and Oxford Hip or Oxford Knee Score was evaluated for THA and TKA procedures where a revision had not been performed. Associations between postoperative Oxford and EQ-VAS scores and a surgeon's 2-year CPR were estimated based on multivariate Tobit regressions and a cumulative link model with a probit link, adjusting for patient age, gender, ASA score, BMI category, preoperative PROMs, as well as surgical approach for THA. Missing data were accounted for using multiple imputation, with models assuming they were missing at random and a worst-case scenario.
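As a rough illustration of the correlation step, the sketch below computes a Spearman coefficient as the Pearson correlation of average ranks. It is not the authors' code. The input values are hypothetical, and ties are given average ranks, which matters here because every procedure by the same surgeon shares that surgeon's CPR.

```python
from math import sqrt

def average_ranks(values):
    """Assign 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

def spearman(x, y):
    """Spearman rho: Pearson correlation computed on average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Hypothetical data: one row per procedure (surgeon 2-year CPR, 6-month Oxford score).
surgeon_cpr = [1.2, 1.2, 2.5, 2.5, 3.1, 3.1]
oxford_score = [44, 40, 43, 39, 41, 42]
rho = spearman(surgeon_cpr, oxford_score)
print(round(rho, 3))
```

A negative rho would mean that procedures by surgeons with higher CPR tend to have lower postoperative scores. With samples as large as those in the study, even a coefficient close to zero can reach statistical significance, so the magnitude rather than the p value carries the clinical interpretation.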
Of the eligible THA procedures, postoperative Oxford Hip Score and surgeon 2-year CPR were correlated so weakly as to be clinically irrelevant (Spearman correlation ρ = -0.09; p < 0.001), and the correlation with postoperative EQ-VAS was close to zero (ρ = -0.02; p = 0.25). Of the eligible TKA procedures, postoperative Oxford Knee Score and EQ-VAS and surgeon 2-year CPR were correlated so weakly as to be clinically irrelevant (ρ = -0.04; p = 0.004 and ρ = 0.03; p = 0.006, respectively). All models accounting for missing data found the same result.
A surgeon's 2-year CPR did not exhibit a clinically relevant correlation with PROMs after THA or TKA, and all surgeons had similar postoperative Oxford scores. PROMs, revision rates, or both may be inaccurate or imperfect indicators of successful arthroplasty. Missing data may limit the findings of this study, although the results were consistent under a variety of different missing data scenarios. Innumerable factors contribute to arthroplasty results, including patient-related variables, differences in implant design, and the technical quality of the procedure. PROMs and revision rates may be analyzing two different facets of function after arthroplasty. Although surgeon variables are associated with revision rates, patient factors may exert a stronger influence on functional outcomes. Future research should identify variables that correlate with functional outcome. Additionally, given the gross level of function that Oxford scores record, outcome measures that can identify clinically meaningful functional differences are required. The use of Oxford scores in national arthroplasty registries may rightfully be questioned.
Level III, therapeutic study.
Hoskins W, Bingham R, Corfield S, Harries D, Harris IA, Vince KG
... -
《-》
-
Poor Knee-specific and Generic Patient-reported Outcome Measure Scores at 6 Months Are Associated With Early Revision Knee Arthroplasty: A Study From the Australian Orthopaedic Association National Joint Replacement Registry.
The ability to identify which patients are at a greater risk of early revision knee arthroplasty has important practical and resource implications. Many international arthroplasty registries administer patient-reported outcome measures (PROMs) to provide a holistic assessment of pain, function, and quality of life. However, few PROM scores have been evaluated as potential indicators of early revision knee arthroplasty, and earlier studies have largely focused on knee-specific measures.
This national registry-based study asked: (1) Which 6-month postoperative knee-specific and generic PROM scores are associated with early revision knee arthroplasty (defined as revision surgery performed 6 to 24 months after the primary procedure)? (2) Is a clinically important improvement in PROM scores (based on thresholds for the minimal important change) after primary knee arthroplasty associated with a lower risk of early revision?
Preoperative and 6-month postoperative PROM scores for patients undergoing primary knee arthroplasty were sourced from the Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR) and Arthroplasty Clinical Outcomes Registry National. Between January 2013 and December 2020, PROM data were available for 19,402 primary total knee arthroplasties; these data were linked to AOANJRR data on revision knee arthroplasty. Of these, 3448 procedures were excluded because 6-month PROM data were not available, the procedure had not reached the 6-month postoperative point, the patient had died before 24 months, or revision knee arthroplasty had been performed before the 6-month PROMs assessment. After these exclusions, data were analyzed for 15,954 primary knee arthroplasties. Associations between knee-specific (knee pain, Oxford Knee Score, and 12-item Knee injury and Osteoarthritis Outcome Score [KOOS-12]) or generic PROM scores (5-level EuroQol quality of life instrument [EQ-5D], EQ VAS, perceived change, and satisfaction) and revision surgery were explored using t-tests, chi-square tests, and regression models. Ninety-four revision procedures were performed at 6 to 24 months, most commonly for infection (39% [37 procedures]). The early revision group was younger than the unrevised group (mean age 64 years versus 68 years) and a between-group difference in American Society of Anesthesiologists (ASA) grade was noted. Apart from a small difference in preoperative low back pain for the early revision group (mean low back pain VAS 4.2 points for the early revision group versus 3.3 points for the unrevised group), there were no between-group differences in preoperative knee-specific or generic PROM scores on univariate analysis. As the inclusion of ASA grade or low back pain score did not alter the model results, the final multivariable model included only the most clinically plausible confounders (age and gender) as covariates.
Multivariable models (adjusting for age and gender) were also used to examine the association between a clinically important improvement in PROM scores (based on published thresholds for minimal important change) and the likelihood of early revision.
After adjusting for age and gender, poor postoperative knee pain, Oxford, KOOS-12, EQ-5D, and EQ VAS scores were all associated with early revision. A one-unit increase (worsening) in knee pain at 6 months was associated with a 31% increase in the likelihood of revision (RR 1.31 [95% confidence interval (CI) 1.19 to 1.43]; p < 0.001). Reflecting the reversed scoring direction, a one-unit increase (improvement) in Oxford or KOOS-12 score was associated with a 9% and 5% reduction in revision risk, respectively (RR for Oxford: 0.91 [95% CI 0.90 to 0.93]; p < 0.001; RR for KOOS-12 summary: 0.95 [95% CI 0.94 to 0.97]; p < 0.001). Patient dissatisfaction (RR 6.8 [95% CI 3.7 to 12.3]) and patient-perceived worsening (RR 11.7 [95% CI 7.4 to 18.5]) at 6 months were also associated with an increased likelihood of early revision. After adjusting for age and gender, patients who did not achieve a clinically important improvement in PROM scores had a higher risk of early revision (RR 2.9 for the knee pain VAS, RR 4.2 for the Oxford Knee Score, RR 6.3 to 8.6 for KOOS-12, and RR 2.3 for EQ-5D) compared with those who did (reference group).
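The per-unit relative risks above translate into percentage changes as (RR − 1) × 100. The sketch below also shows how a per-unit RR would compound over several points, under the assumption (not stated in the abstract) that the regression model is log-linear so that per-unit effects multiply. The helper names are illustrative only.

```python
def percent_change(rr):
    """Percentage change in risk implied by a relative risk, rounded to whole percent."""
    return round((rr - 1) * 100)

def rr_for_change(rr_per_unit, units):
    """RR implied for a multi-unit change, ASSUMING multiplicative per-unit effects."""
    return rr_per_unit ** units

# Per-unit relative risks as reported in the abstract.
print(percent_change(1.31))   # knee pain, per 1-point worsening
print(percent_change(0.91))   # Oxford Knee Score, per 1-point improvement
print(percent_change(0.95))   # KOOS-12 summary, per 1-point improvement

# Hypothetical extrapolation: a 5-point higher Oxford score under the log-linear assumption.
print(round(rr_for_change(0.91, 5), 2))
```

Under that assumption, a 5-point higher Oxford Knee Score would correspond to an RR of roughly 0.62. This is an extrapolation for illustration rather than a result reported by the study.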
Knee-specific and generic PROM scores offer an efficient approach to identifying patients at greater risk of early revision surgery, using either the 6-month score or the magnitude of improvement. These data indicate that surgeons can use single- and multi-item measures to detect a patient-perceived unsuccessful surgical outcome at 6 months after primary knee arthroplasty. Surgeons should be alert to poor PROM scores at 6 months or small improvements in scores (for example, less than 2 points for knee pain VAS or less than 10.5 points for Oxford Knee Score), which signal a need for direct patient follow-up or expedited clinical review.
Level III, therapeutic study.
Ackerman IN, Harris IA, Cashman K, Rowden N, Lorimer M, Graves SE
... -
《-》
-
Who Benefits From Hip Arthroplasty or Knee Arthroplasty? Preoperative Patient-reported Outcome Thresholds Predict Meaningful Improvement.
Hip arthroplasty (HA) and knee arthroplasty (KA) are high-volume procedures. However, there is a debate about the quality of indication; that is, whether surgery is truly indicated in all patients. Patient-reported outcome measures (PROMs) may be used to determine preoperative thresholds to differentiate patients who will likely benefit from surgery from those who will not.
(1) What were the minimum clinically important differences (MCIDs) for three commonly used PROMs in a large population of patients undergoing HA or KA treated in a general orthopaedic practice? (2) Do patients who reach the MCID differ in important ways from those who do not? (3) What preoperative PROM score thresholds best distinguish patients who achieve a meaningful improvement 12 months postsurgery from those who do not? (4) Do patients with preoperative PROM scores below thresholds still experience gains after surgery?
Between October 1, 2019, and December 31, 2020, 4182 patients undergoing HA and 3645 patients undergoing KA agreed to be part of the PROMoting Quality study and were hence included by study nurses in one of nine participating German hospitals. From a selected group of 1843 patients with HA and 1546 with KA, we derived MCIDs using the anchor-based change difference method to determine meaningful improvements. Second, we estimated which preoperative PROM score thresholds best distinguish patients who achieve an MCID from those who do not, using the preoperative PROM scores that maximized the Youden index. PROMs were Hip Disability and Osteoarthritis Outcome Score-Physical Function short form (HOOS-PS) (scored 0 to 100 points; lower indicates better health), Knee Injury and Osteoarthritis Outcome Score-Physical Function short form (KOOS-PS) (scored 0 to 100 points; lower indicates better health), EuroQol 5-Dimension 5-level (EQ-5D-5L) (scored -0.661 to 1 points; higher indicates better health), and a 10-point VAS for pain (perceived pain in the joint under consideration for surgery within the past 7 days) (scored 0 to 10 points; lower indicates better health). The performance of derived thresholds is reported using the Youden index, sensitivity, specificity, F1 score, geometric mean as a measure of central tendency, and area under the receiver operating characteristic curve.
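The threshold search described above can be sketched as follows: for each candidate preoperative score, classify patients as expected to reach the MCID or not, then keep the cut point with the largest Youden index J = sensitivity + specificity − 1. This is a minimal illustration with hypothetical data, not the study's implementation. The `positive_if_high` flag is an assumption introduced here to handle scales on which worse preoperative status means a higher score (pain, HOOS-PS, KOOS-PS) versus a lower score (EQ-5D-5L).

```python
def youden_threshold(scores, reached_mcid, positive_if_high):
    """Return (cut, J) for the cut point maximizing the Youden index.

    A patient is predicted to reach the MCID if score >= cut when
    positive_if_high is True, or if score <= cut otherwise.
    Ties in J keep the lowest cut point.
    """
    positives = sum(reached_mcid)
    negatives = len(reached_mcid) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("both outcome classes are required")
    best = None
    for cut in sorted(set(scores)):
        if positive_if_high:
            predicted = [s >= cut for s in scores]
        else:
            predicted = [s <= cut for s in scores]
        tp = sum(p and y for p, y in zip(predicted, reached_mcid))
        tn = sum((not p) and (not y) for p, y in zip(predicted, reached_mcid))
        j = tp / positives + tn / negatives - 1
        if best is None or j > best[1]:
            best = (cut, j)
    return best

# Hypothetical preoperative pain scores (0 to 10) and whether the MCID was reached.
pain = [2, 3, 4, 5, 6, 7, 8, 9]
reached = [False, False, False, True, False, True, True, True]
cut, j = youden_threshold(pain, reached, positive_if_high=True)
print(cut, j)
```

A Youden index of 1 would indicate perfect separation and 0 indicates no better than chance, which is why the text treats the higher index for EQ-5D-5L as evidence of a better-discriminating threshold.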
MCIDs for the EQ-5D-5L were 0.2 for HA and 0.2 for KA, with a maximum of 1 point, where higher values represented better health-related quality of life. For the pain scale, they were -0.9 for HA and -0.7 for KA, of 10 points (maximum), where lower scores represent lower pain. For the HOOS-PS, the MCID was -10, and for the KOOS-PS it was -5 of 100 points, where lower scores represent better functioning. Patients who reached the MCID differed from patients who did not reach the MCID with respect to baseline PROM scores across the evaluated PROMs and for both HA and KA. Patients who reached an MCID versus those who did not also differed regarding other aspects including education and comorbidities, but this was not consistent across PROMs and arthroplasty type. Preoperative PROM score thresholds for HA were 0.7 for EQ-5D-5L (Youden index: 0.55), 42 for HOOS-PS (Youden index: 0.27), and 3.5 for the pain scale (Youden index: 0.47). For KA, the thresholds were 0.6 for EQ-5D-5L (Youden index: 0.57), 39 for KOOS-PS (Youden index: 0.25), and 6.5 for the pain scale (Youden index: 0.40). A higher Youden index for EQ-5D-5L than for the other PROMs indicates that the thresholds for EQ-5D-5L were better for distinguishing patients who reached a meaningful improvement from those who did not. Patients who did not reach the thresholds could still achieve MCIDs, especially for functionality and the pain scale.
We found that patients who experienced meaningful improvements (MCIDs) mainly differed from those who did not regarding their preoperative PROM scores. We further identified that patients undergoing HA or KA with a score above 0.7 or 0.6, respectively, on the EQ-5D-5L, below 42 or 39 on the HOOS-PS or KOOS-PS, or below 3.5 or 6.5 on a 10-point joint-specific pain scale presurgery had no meaningful benefit from surgery. The thresholds can support clinical decision-making. For example, when thresholds indicate that a meaningful improvement is not likely to be achieved after surgery, other treatment options may be prioritized. Although the thresholds can be used as support, patient preferences and medical expertise must supplement the decision. Future studies might evaluate the utility of using these thresholds in practice, examine how different thresholds can be combined as a multidimensional decision tool, and derive presurgery thresholds based on additional PROMs used in practice.
Preoperative PROM score thresholds in this study will support clinicians in decision-making through objective measures that can improve the quality of the recommendation for surgery.
Langenberger B, Steinbeck V, Busse R
《-》
-
What Are the Minimal and Substantial Improvements in the HOOS and KOOS and JR Versions After Total Joint Replacement?
Lyman S, Lee YY, McLawhorn AS, Islam W, MacLean CH
... -
《-》