Skip to main content

Official Journal of the Italian Society of Orthopaedics and Traumatology

  • Original article
  • Open access
  • Published:

Not all questions are created equal: the weight of the Oxford Knee Scores questions in a multicentric validation study

Abstract

Background

The Oxford Knee Score (OKS) has been designed for patients with knee osteoarthritis and has a widespread use. It has 12 questions, with each question having the same weight for the overall score. Some authors have observed a significant ceiling effect, especially when distinguishing slight postoperative differences. We hypothesized that each questions’ weight will depend significantly on the patient’s sociodemographic data and lifestyle.

Methods

In this international multicentric prospective study, we included patients attending a specialist outpatient knee clinic. Each patient filled out 3 questionnaires: (a) demographic data and data pertaining to the OKS, (b) the standard OKS, and (c) the patient gave a mark on the weight of the importance of each question, using a 5-point Likert scale (G OKS). Linear regression models were used for the analysis.

Results

In total 203 patients (106 female and 97 male) with a mean age of 64.5 (±12.7) years and a mean body mass index (BMI) of 29.34 (±5.45) kg/m2 were included. The most important questions for the patients were the questions for pain, washing, night pain, stability, and walking stairs with a median of 5. In the regression models, age, gender, and driving ability were the most important factors for the weight of each of the question.

Conclusion

The questions in the OKS differ significantly in weight for each patient, based on sociodemographic data, such as age, self-use of a car, and employment. With these differences, the Oxford Knee Score might be limited as an outcome measure. Adjustment of the OKS that incorporates the demographic differences into the final score might be useful if the ceiling effect is to be mitigated.

Level of Evidence: Level II prospective prognostic study

Introduction

The Oxford Knee Score (OKS) was first described by Dawson et al. [1] in 1997 and is one of the most common patient-reported outcome measures (PROM). The OKS was specifically developed and validated for measuring outcomes of knee replacement surgery, designed to be used pre- and postoperatively [1, 2]. The OKS [1] consists of 12 questions assessed on a Likert scale, each question valued from 0 to 4. A summative score is calculated, with 48 points as the best possible score (least symptomatic) and 0 points as the worst possible score (most symptomatic) [1,2,3].

Patients who underwent a knee arthroplasty are in general satisfied in terms of pain reduction and overall improved function, due to improvements in materials, surgical technique, and fixation [6,7,8]. But in the past 20 years, the dissatisfaction has remained stagnant at 10–20% [9]. Furthermore, the improvements resulted in ceiling effects in commonly applied PROMs, including OKS and WOMAC [10, 11]. Ceiling or floor effects occur when a considerable number of patients score the maximum or minimum score [2]. As a result, the outcome measure is unable to discriminate between subjects at the top or bottom of the scale [12, 13].

PROMs are also often unable to detect subtle differences in patient satisfaction between different designs or implantation techniques [11]. Alternative alignment strategies such as kinematic alignment are increasingly used to improve patient outcome. Results are mixed without clear statistically significant differences in postoperative outcome between different alignment techniques [14]. To detect subtle differences, new PROMs were subsequently developed, such as the Forgotten Joint Score 12 (FJS-12) [11]. The FJS-12 is only used postoperatively and aims at different questions compared with established scores such as the OKS or Western Ontario and McMaster Universities Osteoarthritis (WOMAC) score [11], with significantly lower ceiling effects compared with the OKS [15].

The OKS rates each of its question with the same numeric value. We hypothesized that each question does not have the same weight for each patient and that the weight depends significantly on the patient’s sociodemographic data and lifestyle. Therefore, this study was conducted to evaluate the weight of all questions of the OKS in patients with osteoarthritis of the knee.

Material and methods

Study setting and design

This prospective multicentric cohort study recruited patients coming to two specialist outpatient knee clinics in two German-speaking countries. The patients attended the clinics because of symptoms associated to osteoarthritis of the knee. All patients received X-rays of the knee to validate the osteoarthritis of the knee. The patients were asked to complete three questionnaires, in this consecutive order: First, they provided demographic data (age, gender, height, weight, and BMI) as well as data pertaining to the OKS. At this point, the patient was blinded to the subsequent questionnaires (stairs at home, self-use of a car, use of walking aids, working status, and previous surgery). Then, the patient filled out the standard validated OKS in the German language [1, 4]. Finally, the patient gave a grade to the personal importance of each question’s topic, using a 5-point Likert scale (G OKS).

Ethics

Ethics approval was obtained from the local ethics committee in both centers (1286/2020 and 2021-2439-evBO) prior to conducting the study. All patients gave informed consent for participating in this study.

Statistical analysis

Data analysis was performed using SPSS version 27.0 (IBM SPSS statistics, Armonk, NY, USA). A power analysis was performed to achieve a 0.3 Pearson correlation coefficient using a beta of 0.8 and an alpha of 0.05. At least 85 patients were required. Descriptive analysis was conducted for patient demographics. Normality distribution was assessed using the Shapiro–Wilk test. Normally distributed data is presented using mean [± standard deviation (SD)], whereas non-normally distributed data are presented using median [interquartile range (IQR)]. Testing for differences in patient demographics between both centers was done with Mann–Whitney-U-test for continuous variables and with Fisher’s exact test for categorical variables. A 5-point Likert scale was used for measuring the importance of each question (Q) of the OKS. Linear regression models were created to predict the weight of each of the questions. A p-value of < 0.05 was defined as significant.

Results

Demographics

All patients screened for inclusion were included in the study. In total, 203 patients were included in this study, with a median age of 66 years [13] – 100 patients in Center A and 103 patients in Center B. The study group consisted of 106 female patients and 97 male patients with a median BMI of 28.10 [6.53] kg/m2; see Table 1. As for the OKS-related demographic data, 92.1% of patients had stairs at home, 71.4% were still driving a car, 26.6% were using walking aids, 33.5% were still working, and 62.6% underwent previous knee surgery (Table 1).

Table 1 Patient demographics for the general cohort separated by center; p-value for testing for difference between both centers

For demographic data, there were differences between both centers in age (p < 0.001), the use of walking aids (p = 0.036), and previous surgeries (p = 0.031; Table 1).

Oxford Knee Score and scored weight

The highest average score in the OKS was found for pain, with 3.61 points, and for kneeling, with 3.29 points (Table 2). The lowest average scores were found for washing, with 1.56 points, and walking for a longer distance, with 1.90 points (Table 2). The most important questions for the patients were the questions for pain, washing, night pain, stability, and walking stairs, with a median of 5. The scores for the OKS and the Likert scale on importance are given in Table 2.

Table 2 Mean and median values for the questions of the OKS and their importance according to the 5-point Likert scale

Predictive analysis

Question 1: Pain

In the regression models for the weight of each of the questions, pain was associated with older age (p = 0.009; B = −0.013) and gender (p = 0.015, B = 0.301; Table 4).

Question 2: Washing

Washing was associated with older age (p = 0.013, B = −0.010) and gender (p < 0.001, B = 0.387) in the regression model (Table 4).

Question 3: Transport

The question on transport was associated with the patients’ self-use of a car (p < 0.001; B = 0.707) in the regression model (Table 4).

Question 4: Walking for a longer distance

The question on walking was associated with older age (p = 0.041, B = −0.013), gender (p = 0.046, B = 0.347), height (p = 0.015, B = 0.022), and self-use of a car (p = 0.002, B = 0.496) in the regression model (Table 4).

Question 5: Standing up after eating

Getting up after eating was associated with older age (p = 0.009, B = −0.013) and self-use of a car (p = 0.031, B = 0.285) in the regression model (Table 4).

Question 6: Limping

Limping was associated with older age (p = 0.011, B = −0.016), gender (p = 0.031, B = 0.385), and self-use of a car (p = 0.037, B = 0.334) in the regression model (Table 4).

Question 7: Kneeling

In the regression model, kneeling was associated with the self-use of a car (p = 0.027, B = 0.394; Table 4). A significant negative correlation between the recorded OKS score and recorded OKS weight was found for the question on kneeling (r = −0.158, p = 0.024; Table 3).

Table 3 Predictive analysis between the recorded OKS score and recorded OKS weight (significant values in bold letters)

Question 8: Night pain

A significant positive correlation between the recorded OKS score and recorded OKS weight was found for the question on night pain (r = 0.143, p = 0.043) (Table 3). In the regression model, night pain was associated with older age (p = 0.037, B = −0.009) (Table 4).

Table 4 Regression coefficient B and p-values for the regression models for each question of the OKS and age, gender, height, weight, BMI, stairs, driving, use of walking aids, working status, and previous knee surgeries

Question 9: Working

In the regression model, the weight of the question on work was associated with gender (p = 0.004, B = 0.345), height (p = 0.041, B = 0.013), and self-use of a car (p = 0.014, B = 0.269; Table 4=). A significant positive correlation between the recorded OKS score and recorded OKS weight was found for the question on working (r = 0.174, p = 0.014; Table 3).

Question 10: Stability

In the regression model, instability was associated with older age (p < 0.001, B = −0.015), self-use of a car (p = 0.007, B = 0.301), and previous surgery (p = 0.040, B = 0.192; Table 4). A significant positive correlation between the recorded OKS score and recorded OKS weight was found for the question on stability (r = 0.166, p = 0.018; Table 3).

Question 11: Shopping

Household shopping was again influenced by gender (p = 0.019, B = 0.304) and previous surgery (p = 0.028, B = 0.217; Table 4).

Question 12: Walking stairs

Walking down a flight of stairs was associated with older age (p = 0.025, B = −0.010), having a set of stairs at home (p = 0.008, B = 0.470), self-use of a car (p = 0.038, B = 0.248), and previous surgery (p = 0.021, B = 0.224; Table 4).

Discussion

The most important finding of the present study is that age, driving ability, using walking aids, and working status were the most important factors for the weight of each of the questions.

There are mixed results for a possible ceiling effects of the OKS [2, 5, 10, 15, 16]. Marx et al. [10] noted that 5% of patients at 6 months and 7% of patients at 12 months post-surgery achieved the top score, indicating the presence of a ceiling and floor effect of the OKS. Hamilton et al. [17] reported 8% of top possible score in 4709 patients undergoing total knee arthroplasty (TKA). Postoperative ceiling effects for the OKS have been reported up to 27% [18] and 33% [16]. Harris et al. [2] did not find a ceiling effect for the OKS. However, in subgroup analyses males tended to score higher on postoperative OKS than females. The proportion of patients achieving postoperative top scores in males was almost double that of the female population, with 3.8% compared with 2% [2]. The results in the present study indicate differences according to gender. The questions on pain and washing were more important for female patients in the regression models. Therefore, female patients may have different expectations preoperatively and focus on different daily activities than male patients. As the OKS does not differentiate between different genders, this may result in an inherent difference in the OKS, leading to generally lower average mean scores because of the conception of questions and equal importance. A distinction between both genders and a consecutive adjustment for gender could be a possible improvement for the OKS and other PROMs.

Another ceiling effect for the OKS is described for age [6]. Harris et al. [6] found the highest ceiling percentage of 3% in the subgroup of patients between 60 and 79 years of age. These findings are also supported by the results of the presented study. Older age was the most important factor in the regression models in this study. This indicates that patients with older age are more focused on pain and night pain and focus their importance weight more on these categories. Therefore, a benefit in analyzing the pain and function subscales of the OKS separately after TKA is suggested [2]. The higher focus on pain due to older age demonstrated by the results in the presented study backs this suggestion. The equal scoring systems of the OKS might lead to ceiling effects because of missing representation of different age groups, which could be a possible target for adjustment of PROMs such as the OKS.

Ceiling and floor effects are initially addressed in the conception of a PROM [2]. In the conception of the OKS, a number of different items have been rejected [2]. However, ceiling and floor effects in PROMs are common [10]. Ceiling effects are also related to the number of items in a scale [11]. The more items a scale addresses, the less likely the patient chooses the highest or lowest response category in every single item [11]. However, the WOMAC score is outperformed by the FJS-12, which contains only half as many questions with only half as many patients achieving the best possible score [11]. In evaluating the Danish version of the FJS-12, the ceiling effect was significantly lower, with 16% compared with 37% for the OKS [15]. In scoring a PROM, the scoring systems is equally important as the quality of the question itself. The question itself can discriminate patients according to age or gender or activity level. Furthermore, an equally scaled scoring system can also lead to discrimination, as not all questions are equally important, which is demonstrated by the results presented in this study, as, e.g., the question on driving is more important for patients with self-use of a car. The FJS-12 also uses the same 5-item scoring scale for each question [11]. The questions are conceived to overcome discrimination such as in case of asking for awareness of the artificial joint while doing the patients’ favorite sports. It tries to overcome the different level of activities by asking for the patients’ favorite sport. However, in case of elderly patients, this question might impair the final score because the patients are not able to answer this question adequately, as they do not engage in sports at all.

One major advantage of the OKS lies in the simplicity in completing it [1]. Problems with possible ceiling effects or floor effects of one PROM could be addressed by using different outcome scores [3]. Some authors suggest using both joint-specific and generic health measures to assess the outcome after TKA [20, 21]. Measurements of quality of life such as the 36-Item Short Form Survey (SF-36) are commonly used in combination with PROMs [3]. However, the SF-36 was not developed specifically for total joint arthroplasty (TJA), leading to susceptibility to other influences such as pain and disability from other weight-bearing joints and other symptomatic conditions [1, 22]. Additionally, utilizing too many questionnaires can lead to response burden of the patient [23,24,25]. Response burden depends on the cognitive function of the patient [24] and is merely relying on the base decisions on use of instruments on the content rather than the length of the questionnaire [25].

Although pain relief and improvement in function after TJA is generally achieved to a satisfying extent for patients undergoing TJA, nuisance symptoms are generally common. As an alternative to PROMs focusing on pain or stability of the joint, newer PROMs such as the Forgotten Joint Score (FJS-12) focus on other aspects in TJA [11]. Noble et al. [19] found patients who had undergone arthroplasty to have a lower functional level than age- and sex-matched healthy controls. This indicates that the current concepts of TKA are not able to restore normal healthy joint function with an artificial joint in the near future [11, 19]. This is also an important aspect for PROMs, as it is elusive to ask questions about daily activities that are not possible or not relevant for patients after TKA. This would be an important aspect for future adjustments to new PROMs: to distinguish more clearly activities that are more important for arthroplasty patients.

Some limitations need to be noted. Firstly, the study was powered on an r-value of 0.3, which is low correlation. However, we found statistically significant correlation and significant predictive values for several sociodemographic aspects and several OKS questions. Secondly, the sociodemographic data of some of the categories was collected using a binary questionnaire. Even so, the outcomes even for these variables were significant in a number of instances. A further breakdown of the number of stairs at home and the number of kilometers driven would certainly provide more information. Due to a single-site data collection, the results might differ in other centers and countries.

Conclusion

The questions in the OKS differ significantly in weight for each patient, based on sociodemographic data, such as age, self-use of a car, and employment. With these differences, the Oxford Knee Score might be limited as an outcome measure. Adjustment of the OKS that incorporates the demographic differences into the final score might be useful if the ceiling effect is to be mitigated.

Availability of data and materials

Data and materials are available on request.

References

  1. Dawson J, Fitzpatrick R, Murray D, Carr A (1998) Questionnaire on the perceptions of patients about total knee replacement. J Bone Joint Surg Br 80(1):63–69. https://doi.org/10.1302/0301-620x.80b1.7859

    Article  CAS  PubMed  Google Scholar 

  2. Harris K, Lim CR, Dawson J, Fitzpatrick R, Beard DJ, Price AJ (2017) The Oxford Knee Score and its subscales do not exhibit a ceiling or a floor effect in knee arthroplasty patients: an analysis of the national health service PROMs data set. Knee Surg Sports Traumatol Arthrosc 25(9):2736–2742. https://doi.org/10.1007/s00167-015-3788-0

    Article  PubMed  Google Scholar 

  3. Clement ND, MacDonald D, Simpson AH (2014) The minimal clinically important difference in the Oxford Knee Score and short form 12 score after total knee arthroplasty. Knee Surg Sports Traumatol Arthrosc 22(8):1933–1939. https://doi.org/10.1007/s00167-013-2776-5

    Article  CAS  PubMed  Google Scholar 

  4. Naal FD, Impellizzeri FM, Sieverding M, Loibl M, von Knoch F, Mannion AF, Leunig M, Munzinger U (2009) The 12-item Oxford Knee Score: cross-cultural adaptation into German and assessment of its psychometric properties in patients with osteoarthritis of the knee. Osteoarthr Cartil 17(1):49–52. https://doi.org/10.1016/j.joca.2008.05.017

    Article  CAS  Google Scholar 

  5. Jenny JY, Diesinger Y (2012) The Oxford Knee Score: compared performance before and after knee replacement. Orthop Traumatol Surg Res 98(4):409–412. https://doi.org/10.1016/j.otsr.2012.03.004

    Article  PubMed  Google Scholar 

  6. Nilsdotter AK, Toksvig-Larsen S, Roos EM (2009) Knee arthroplasty: are patients’ expectations fulfilled? A prospective study of pain and function in 102 patients with 5-year follow-up. Acta Orthop 80(1):55–61. https://doi.org/10.1080/17453670902805007

    Article  PubMed  PubMed Central  Google Scholar 

  7. Laupacis A, Bourne R, Rorabeck C, Feeny D, Wong C, Tugwell P, Leslie K, Bullas R (1993) The effect of elective total hip replacement on health-related quality of life. J Bone Joint Surg Am 75(11):1619–1626. https://doi.org/10.2106/00004623-199311000-00006

    Article  CAS  PubMed  Google Scholar 

  8. Noble PC, Conditt MA, Cook KF, Mathis KB (2006) The John Insall Award: patient expectations affect satisfaction with total knee arthroplasty. Clin Orthop Relat Res 452:35–43. https://doi.org/10.1097/01.blo.0000238825.63648.1e

    Article  PubMed  Google Scholar 

  9. Dunbar MJ, Richardson G, Robertsson O (2013) I can’t get no satisfaction after my total knee replacement: rhymes and reasons. Bone Joint J 95-b(11 Supple A):148–152. https://doi.org/10.1302/0301-620X.95B11.32767

    Article  CAS  PubMed  Google Scholar 

  10. Marx RG, Jones EC, Atwan NC, Closkey RF, Salvati EA, Sculco TP (2005) Measuring improvement following total hip and knee arthroplasty using patient-based measures of outcome. J Bone Joint Surg Am 87(9):1999–2005. https://doi.org/10.2106/JBJS.D.02286

    Article  PubMed  Google Scholar 

  11. Behrend H, Giesinger K, Giesinger JM, Kuster MS (2012) The “forgotten joint” as the ultimate goal in joint arthroplasty: validation of a new patient-reported outcome measure. J Arthroplast 27(3):430-436 e431. https://doi.org/10.1016/j.arth.2011.06.035

    Article  Google Scholar 

  12. Stucki G, Liang MH, Stucki S, Katz JN, Lew RA (1999) Application of statistical graphics to facilitate selection of health status measures for clinical practice and evaluative research. Clin Rheumatol 18(2):101–105. https://doi.org/10.1007/s100670050065

    Article  CAS  PubMed  Google Scholar 

  13. McHorney CA, Tarlov AR (1995) Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res 4(4):293–307. https://doi.org/10.1007/BF01593882

    Article  CAS  PubMed  Google Scholar 

  14. Young SW, Sullivan NPT, Walker ML, Holland S, Bayan A, Farrington B (2020) No difference in 5-year clinical or radiographic outcomes between kinematic and mechanical alignment in TKA: a randomized controlled trial. Clin Orthop Relat Res 478(6):1271–1279. https://doi.org/10.1097/CORR.0000000000001150

    Article  PubMed  PubMed Central  Google Scholar 

  15. Thomsen MG, Latifi R, Kallemose T, Barfod KW, Husted H, Troelsen A (2016) Good validity and reliability of the forgotten joint score in evaluating the outcome of total knee arthroplasty. Acta Orthop 87(3):280–285. https://doi.org/10.3109/17453674.2016.1156934

    Article  PubMed  PubMed Central  Google Scholar 

  16. Jenny JY, Louis P, Diesinger Y (2014) High activity arthroplasty score has a lower ceiling effect than standard scores after knee arthroplasty. J Arthroplast 29(4):719–721. https://doi.org/10.1016/j.arth.2013.07.015

    Article  Google Scholar 

  17. Hamilton DF, Lane JV, Gaston P, Patton JT, Macdonald D, Simpson AH, Howie CR (2013) What determines patient satisfaction with surgery? A prospective cohort study of 4709 patients following total joint replacement. BMJ Open. https://doi.org/10.1136/bmjopen-2012-002525

    Article  PubMed  PubMed Central  Google Scholar 

  18. Impellizzeri FM, Mannion AF, Leunig M, Bizzini M, Naal FD (2011) Comparison of the reliability, responsiveness, and construct validity of 4 different questionnaires for evaluating outcomes after total knee arthroplasty. J Arthroplast 26(6):861–869. https://doi.org/10.1016/j.arth.2010.07.027

    Article  Google Scholar 

  19. Noble PC, Gordon MJ, Weiss JM, Reddix RN, Conditt MA, Mathis KB (2005) Does total knee replacement restore normal knee function? Clin Orthop Relat Res 431:157–165. https://doi.org/10.1097/01.blo.0000150130.03519.fb

    Article  Google Scholar 

  20. Bombardier C, Melfi CA, Paul J, Green R, Hawker G, Wright J, Coyte P (1995) Comparison of a generic and a disease-specific measure of pain and physical function after knee replacement surgery. Med Care 33(4 Suppl):AS131-144

    CAS  PubMed  Google Scholar 

  21. Guyatt GH, Feeny DH, Patrick DL (1993) Measuring health-related quality of life. Ann Intern Med 118(8):622–629. https://doi.org/10.7326/0003-4819-118-8-199304150-00009

    Article  CAS  PubMed  Google Scholar 

  22. Dawson J, Fitzpatrick R, Murray D, Carr A (1996) The problem of “noise” in monitoring patient-based outcomes: generic, disease-specific and site-specific instruments for total hip replacement. J Health Serv Res Policy 1(4):224–231. https://doi.org/10.1177/135581969600100408

    Article  CAS  PubMed  Google Scholar 

  23. Mes MA, Chan AHY, Wileman V, Katzer CB, Goodbourn M, Towndrow S, Taylor SJC, Horne R (2019) Patient involvement in questionnaire design: tackling response error and burden. J Pharm Policy Pract 12:17. https://doi.org/10.1186/s40545-019-0175-0

    Article  PubMed  PubMed Central  Google Scholar 

  24. Atkinson TM, Schwartz CE, Goldstein L, Garcia I, Storfer DF, Li Y, Zhang J, Bochner BH, Rapkin BD (2019) Perceptions of response burden associated with completion of patient-reported outcome assessments in oncology. Value Health 22(2):225–230. https://doi.org/10.1016/j.jval.2018.07.875

    Article  PubMed  Google Scholar 

  25. Rolstad S, Adler J, Ryden A (2011) Response burden and questionnaire length: is shorter better? A review and meta-analysis. Value Health 14(8):1101–1108. https://doi.org/10.1016/j.jval.2011.06.003

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Supported by Johannes Kepler Open Access Publishing Fund.

Funding

The study was conducted without any funding or benefits from a commercial party.

Author information

Authors and Affiliations

Authors

Contributions

ML: drafting of the article and statistical expertise. CS: study design and final approval of the article. ESK: collection and assembly of data, provision of study materials or patients, and final approval of the article. MM: collection and assembly of data, provision of study materials or patients, and final approval of the article. TH: study design, critical revision of the article for important intellectual content, and final approval of the article. TG: data collection and final approval of the article. AK: data collection, conception and design, statistical expertise, and final approval of the article.

Corresponding author

Correspondence to Matthias Luger.

Ethics declarations

Ethical approval and consent to participate

Ethics approval was obtained from the local ethics committee in both centers (1286/2020 and 2021-2439-evBO) prior to conducting the study. Informed consent was obtained from every participant prior to inclusion in the study. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Informed consent was obtained by all participating patients.

Consent for publication

Consent for publication was given by the all authors.

Informed consent

Informed consent was obtained from every participant prior to inclusion in the study. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Informed consent was obtained by all participating patients.

Competing interests

T.G. declares personal fees paid during the conducting of the study outside the submitted work from Zimmer Biomet, Europe, Depuy Synthes Orthopädie Gmbh, Peter Brehm GmbH, and ImplanTec GmbH. T.H. declares paid presentations and research support from Smith&Nephew, ZimmerBiomet, and Implantcast, as well as paid consultancy for Smith&Nephew. All financial benefits have been received outside the submitted work. All other co-authors declare no financial support related or nonrelated to the study.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Luger, M., Schopper, C., Krottenthaler, E.S. et al. Not all questions are created equal: the weight of the Oxford Knee Scores questions in a multicentric validation study. J Orthop Traumatol 24, 44 (2023). https://doi.org/10.1186/s10195-023-00722-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s10195-023-00722-6

Keywords