- Research article
- Open Access
(Dis)agreement of polymyalgia rheumatica relapse criteria, and prediction of relapse in a retrospective cohort
BMC Rheumatology volume 6, Article number: 45 (2022)
To develop and assess a prediction model for polymyalgia rheumatica (PMR) relapse within the first year of glucocorticoid (GC) treatment.
A retrospective PMR cohort (clinical diagnosis) from a rheumatology department was used. All visits > 30 days after starting GC treatment and with > 2.5 mg/day oral prednisolone were used as potential relapse visits. Often used relapse criteria (1) rheumatologist judgement, (2) treatment intensification-based relapse) were assessed for agreement in this cohort. The proportion of patients with treatment-based relapse within 1 and 2 years of treatment and the relapse incidence rate were used to assess unadjusted associations with candidate predictors using logistic and Poisson regression respectively. After using a multiple imputation method, a multivariable model was developed and assessed to predict the occurrence (yes/no) of relapse within the first year of treatment.
Data from 417 patients was used. Relapse occurred at 399 and 321 (of 2422) visits based on the rheumatologist judgement- and treatment-based criteria respectively, with low to moderate agreement between the two (87% (95% CI 0.86–0.88), with κ = 0.49 (95% CI 0.44–0.54)). Treatment-based relapse within the first two years was significantly associated with CRP, ESR, and pre-treatment symptom duration, and incidence rate with only CRP and ESR. A model to predict treatment intensification within the first year of treatment was developed using sex, medical history of cardiovascular disease and malignancies, pre-treatment symptom duration, ESR, and Hb, with an AUC of 0.60–0.65.
PMR relapse occurs frequently, although commonly used criteria only show moderate agreement, underlining the importance of a uniform definition and criteria of a PMR specific relapse. A model to predict treatment intensification was developed using practical predictors, although its performance was modest.
Polymyalgia rheumatica (PMR) is an inflammatory rheumatic disease characterized by complaints of the neck, shoulder and hip girdle and a rise in acute phase reactants (APR) . The mainstay of treatment is glucocorticoids (GC), which are tapered after achieving remission to reduce frequently occurring adverse events . However, during this tapering, disease relapses occur in up to 55% of patients, necessitating an increase of therapy and increasing risk of adverse events [3,4,5]. Therefore, prediction of relapse would help identify patients benefitting more from intense early treatment—like starting a GC sparing agent or faster tapering.
However, the value of predictors for PMR relapse is still unclear, as studies show conflicting results [3, 5,6,7,8,9,10]. This may be due to the relatively limited number of patients and relapses or due to the heterogeinity of relapse definitions and criteria, which hinders comparison between studies . Moreover, a multivariable prediction model combining risk factors, which may assess the overall combined value of predictors, has not been implemented.
Using a prediction model to estimate a patient’s risk of PMR relapse at the start of therapy may help in the decision of starting disease-modifying antirheumatic drugs (DMARDs) at onset or taper GC more judiciously. In addition, such a model may help stratify patients by prognosis for clinical studies. Therefore, we set out to develop and assess a prediction model for relapse within the first year of treatment. Furthermore, we will describe our choices leading to two different criteria for relapse and assess agreement between these criteria.
Design and setting
Our goal was to develop and assess a prediction model for relapse within the first year of treatment, based on data from a routine care retrospective cohort of PMR patients visiting the outpatient rheumatology department of the Sint Maartenskliniek, Netherlands, between April 2008 and January 2017. Since multiple criteria have been used to measure relapse, we will explain our choices leading to two different criteria and assess agreement between these. By doing so, we hope to gain insight in the degree of heterogeneity in this outcome and improve comparison between our study and future studies. Furthermore, since we wanted to identify patients who will benefit most from intense early treatment (e.g., starting a DMARD), we chose one criterion for relapse which we think is most closely related to treatment, and thus has most implications, for studying association and modelling. The transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) guideline was used for reporting purposes, and the TRIPOD checklist is shown in Additional file 1 .
Study population and data collection
Records of patients with a clinical PMR diagnosis (electronically registered by treating physicians) were reviewed for cohort eligibility. Patients were excluded if an alternative diagnosis was more likely (judged initially by the treating rheumatologist and retrospectively reviewed by the research physician), if follow-up was less than nine months, or if patients were deceased. Patients were treated and assessed in accordance with international recommendations .
Baseline assessment was the first visit that prednisolone treatment started and follow-up thereafter ended either at the end of the study period (January 2017) or when lost to follow-up. Follow-up visits more than 90 days after oral prednisolone treatment ended were also excluded. We chose to do this, since recurrences after GC free remission have less treatment implications and less impact on GC-related adverse events when compared with relapses during GC tapering . Furthermore, recurrences after treatment cessation may differ pathophysiologically when compared with relapses [2, 12].
Primary outcome: relapse
Based on previous studies, we pragmatically chose a period of initial response of 30 days after starting prednisolone during which no relapse could occur, so each patient had a period of potential remission before a relapse [4, 13]. We also only used visits with > 2.5 mg/day oral prednisolone to exclude relapses at relatively low-dose prednisolone, since these have fewer implications for treatment and treatment related adverse events. Additionally, this may help exclude pragmatic prednisolone dose raising or maintenance due to concomitant or mimicking diseases flaring at this low dose in regular care. After these exclusions, we chose two approaches identifying a PMR relapse, based on The Outcome Measures in Rheumatology (OMERACT) definition for a rheumatoid arthritis flare .
A rheumatologist judgement-based relapse consisted of either (a) judgement of the treating rheumatologist that a patient is not in remission during a visit, or (b) judgement of the treating rheumatologist and/or patient that a patient had a relapse between visits, both of which in combination with no RJ at the previous visit.
A treatment intensification-based relapse consisted of either (a) an increase in prednisolone dose (between visits) or rheumatologist advice to increase prednisolone dose, (b) starting/increasing dose of a DMARD due to treatment inefficacy, or (c) addition of a local or intra-muscular GC injection, all in combination with no TI at the previous visit.
We chose the treatment based criteria to study associations and for further prediction modelling, since this is a more objective and pragmatical outcome closely related to treatment. Therefore, prediction of TI based relapse may have the most treatment implications.
Candidate predictors for treatment intensification-based relapse were measured at baseline, and thus before GC treatment started. The following predictors were assessed: (1) age, (2) sex, (3) medical history of cardiovascular disease, i.e. a previous myocardial infarction, angina, cerebrovascular events, peripheral arterial disease, heart failure, or thoracic aortic aneurysm, (4) medical history of malignancies, (5) smoking defined as either never, stopped, or current smoker, (6) pre-treatment symptom duration in months, (7) clinical disease severity based on presence of pain/stiffness and movement restriction at both shoulder- and hip girdles, resulting in a 0–8 range sum score, (8) presence of clinically diagnosed peripheral arthritis, (9) presence of systemic symptoms, based on the presence of either fever, cold shivering, night sweats, unexplained weight loss, or unexplained fatigue, (10) C-reactive protein (CRP) level in mg/L determined by the chemical analyzer Olympus type AU400, (11) Erythrocyte Sedimentation Rate (ESR) level in mm/h determined by the 30-min automated and extrapolated version of the Westergren method, and (12) hemoglobin (Hb) level in mmol/L. ESR and CRP were divided by a factor 10 for use as predictors to improve coefficient interpretability.
Previous studies reported relapse percentages between 27.6 and 55% within the first (five) year(s) of treatment [3,4,5]. Therefore, with the 450 patient cohort, approximately 33% of patients having an event (relapse) within the first year of treatment, and a rule of thumb of 10 events per variable, we estimate 15 predictors could be included .
STATA/IC version 13.1 for Windows was used for all statistical analysis. Descriptive statistics were examined as appropriate. Numbers and reason for exclusion were recorded to ensure internal validity. Relapse on a visit level was identified for both criteria, and agreement was assessed using percentage of agreement and Cohen’s kappa. Relapse cumulative incidence (proportion of patients) after one and two year(s) of treatment and relapse incidence rate (IR) per patient per year was assessed for both rheumatologist judgement and treatment intensification relapse.
Unadjusted associations between the proportion of patients with a treatment intensification-based relapse and predictors was assessed using logistic regression. Unadjusted associations between the incidence rate of treatment intensification-based relapse and predictors was assessed using Poisson regression. Results were displayed using odds ratios (OR), incidence rate ratios (IRR), 95% confidence intervals in brackets, and we considered p values < 0.05 as significant.
After using a multiple imputation method, a multivariable model was established to predict the occurrence (yes/no) of relapse within the first year of treatment and this model’s predictive performance and internal validity was assessed thereafter. Furthermore, a sensitivity analysis was performed in which only patient fulfilling EULAR/ACR core classification criteria (without scoring scale) were used to develop and assess a multivariable model. Further detail on model development and assessment is given in Additional file 2.
From the cohort, 417 patients with 4375 visits were included (Fig. 1). There were 2422 visits that met criteria for a potential relapse (≥ 30 days after baseline and treated with > 2.5 mg oral prednisolone). Details of the cohort at baseline are shown in Table 1: age was comparable to previous PMR studies, although sex differed somewhat, with a lower percentage of females [3, 5, 12, 16, 17]. A total of 329 (79%) patients met 2012 European League Against Rheumatism/American College of Rheumatology (EULAR/ACR) core classification criteria and 353 (90%) had raised APR from those with APR available . There were missing values (n; %) for smoking (81; 19%), pre-treatment symptom duration (6; 1%), CRP (53; 13%), ESR (21; 5%), and Hb (100; 24%).
There were 399 and 321 relapse visits based on rheumatologist judgement and treatment intensification respectively. Agreement between criteria was 87% and κ = 0.49 (95% CI 0.44–0.54). Therefore, agreement was low to moderate based on kappa, although not on percentage. A reason for the low kappa when compared to percentage agreement may be the relatively high predicted probability of agreement due to chance for visits without a relapse (72%). Based on rheumatologist judgement and treatment intensification, relapse cumulative incidence was 156 (37%) and 133 (32%) after one year, and 202 (48%) and 183 (44%) after two years of treatment, respectively. Furthermore, agreement between criteria for cumulative incidence after one and two years of treatment was 83% (95% CI 0.79–0.87) with κ = 0.63 (95% CI 0.55–0.71) and 84% (95% CI 0.80–0.87) with κ = 0.68 (95% CI 0.61–0.75) respectively. Total follow-up duration was 911 a patient years, resulting in a rheumatologist judgement and treatment intensification- based relapse IR of 0.44 and 0.35 (per patient per year) respectively.
The treatment intensification criterion was used to study univariable associations with potential predictors (Table 2). Relapse cumulative incidence after both one, and two, year(s) of treatment was significantly associated with pre-treatment symptom duration, CRP, and ESR. For example, the odds of having a relapse, after both 1 and 2 years of treatment, is 8% lower per week of pre-treatment symptom duration. Total relapse IR per patient per year was significantly associated with CRP and ESR, although, interestingly, not with pre-treatment symptom duration. For example, the number of relapses per patient year was 4% higher per 10 mg/L increase in CRP.
A multivariable prediction model for relapse within the first year of treatment was established with the variables sex, medical history of cardiovascular disease, medical history of malignancies, pre-treatment symptom duration, ESR serum level, and Hb serum level. Although this model was stable, its predictive performance was only modest (e.g., the concordance statistic or area under the curve ranged between 0.60 and 0.65). A sensitivity analysis, using only patients fulfilling EULAR/ACR core criteria, resulted in a model with the variables sex, medical history of cardiovascular disease, pre-treatment symptom duration, ESR serum level, and Hb serum level. Medical history of malignancies was included in only 40% of imputation sets and thus not included in the final model. However, (optimism corrected) performance sensitivity analysis model was comparable to model of the full population. Further detail on the model, its performance, and an example application is given in Additional file 2.
We found a high number of relapses, although agreement between different criteria was limited, with a lower amount of treatment intensification compared to clinical relapse. The number of relapses is in line with previous studies, although agreement between criteria has not been studied before [3,4,5].
The limited agreement between both relapse definitions may suggest treatment intensification is avoided, even though a physician may suspect a relapse to be present. This may be because of the expected exacerbating or development of concomitant disorders, as PMR is a disease of the elderly, and therefore patients and physicians may choose self-management over treatment intensification [16, 19]. Indeed, the higher agreement after one and two years compared visits, may support the hypothesis that a part of patients and physicians wait before ultimately intensifying treatment. Another reason may be, a PMR specific relapse may be difficult to discern from a flare up due to a concomitant disorder, especially when relying on non-specific symptoms [3, 5, 20, 21]. Discerning a relapse may therefore be facilitated by a composite measure—like the PMR-activity score—although further clinimetric research is required before one can be recommended [20, 22,23,24,25,26,27].
We found an unexpected association with relapse and used multiple predictors to develop a multivariable model, although with relatively modest performance. Increased pre-treatment symptom duration was associated with lower odds of relapse within 1–2 years after start of treatment, but not with number of relapses per year. This may be due to residual confounding or lead time bias, although the effect persisted when corrected for level of serum inflammatory markers, age and sex (not shown). Furthermore, it may be that the rate of relapse is highest in the first (two) years of PMR symptoms , and therefore lead time could explain why PMR patients with longer existing symptoms before treatment may have a head start to a relatively self-limiting disease course . An important reason for the potentially limited performance of the prediction model may be the exclusion of some strong predictors that are not either not routinely available (e.g., baseline angiopoietin-2 levels), or not available before treatment (e.g., GC tapering speed) [4, 12].
Some limitations and strengths should be noted. A limitation of this study with regards to diagnosing a relapse is its retrospective design, although this limitation is found more often in studies on relapse [4, 16]. Furthermore, this design excluded health status before treatment based on EQ-5D as a potential predictor . A strength of this study is the clear description of relapse and use of pragmatic predictors, and how this relates to our goal of predicting treatment intensification at the start of therapy. Furthermore, our large sample size and use of all data through multiple imputation resulted in a valid and stable model.
This retrospective study emphasizes the lack of a uniform definition of relapse and a current inability to stratify patients by prognosis. Therefore, focus should be on reaching consensus on a definition and criteria—preferably based on a composite measure—for relapse. Thereafter, focus should shift on developing a model to identify patients with worse or better prognosis who might benefit more from more intense treatment (e.g., DMARDs) or faster tapering. Furthermore, to ease implementation, practical predictors that are both widely available and available at the start of treatment are preferable, especially considering a large proportion of patients is treated in primary care.
Availability of data and materials
The datasets used and analyzed in the study are available from the corresponding author on reasonable request.
Acute phase reactants
Disease-modifying antirheumatic drugs
Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis
Outcome Measures in Rheumatology
Rheumatologist judgement-based relapse
Treatment intensification-based relapse
Erythrocyte Sedimentation Rate
Incidence rate ratio
European League Against Rheumatism/American College of Rheumatology
González-Gay MA, Matteson EL, Castañeda S. Polymyalgia rheumatica. The Lancet. 2017;390(10103):1700–12.
Dejaco C, Singh YP, Perel P, Hutchings A, Camellino D, Mackie S, et al. 2015 recommendations for the management of polymyalgia rheumatica: a European League Against Rheumatism/American College of Rheumatology collaborative initiative. Ann Rheum Dis. 2015;74(10):1799–807.
Hutchings A, Hollywood J, Lamping DL, Pease CT, Chakravarty K, Silverman B, et al. Clinical outcomes, quality of life, and diagnostic uncertainty in the first year of polymyalgia rheumatica. Arthritis Care Res. 2007;57(5):803–9.
Kremers HM, Reinalda MS, Crowson CS, Zinsmeister AR, Hunder GG, Gabriel SE, et al. Relapse in a population based cohort of patients with polymyalgia rheumatica. J Rheumatol. 2005;32(1):65–73.
Mackie SL, Hensor EMAA, Haugeberg G, Bhakta B, Pease CT. Can the prognosis of polymyalgia rheumatica be predicted at disease onset? Results from a 5-year prospective study. Rheumatology (Oxford). 2010;49(4):716–22.
Cantini F, Salvarani C, Olivieri I, Macchioni L, Ranzi A, Niccoli L, et al. Erythrocyte sedimentation rate and C-reactive protein in the evaluation of disease activity and severity in polymyalgia rheumatica: a prospective follow-up study. Semin Arthritis Rheum. 2000;30(1):17–24.
Cimmino MA, Parodi M, Caporali R, Montecucco C. Is the course of steroid-treated polymyalgia rheumatica more severe in women? Ann N Y Acad Sci. 2006;1069:315–21.
Dejaco C, Singh YP, Perel P, Hutchings A, Camellino D, Mackie S, et al. Current evidence for therapeutic interventions and prognostic factors in polymyalgia rheumatica: a systematic literature review informing the 2015 European League Against Rheumatism/American College of Rheumatology recommendations for the management of polymyalgia rheumatica. Ann Rheum Dis. 2015;74(10):1808.
Lee JH, Choi ST, Kim JS, Yoon BY, Kwok S-KK, Kim H-SSH-RH-SR, et al. Clinical characteristics and prognostic factors for relapse in patients with polymyalgia rheumatica (PMR). Rheumatol Int. 2013;33(6):1475–80.
Salvarani C, Cantini F, Niccoli L, Macchioni P, Consonni D, Bajocchi G, et al. Acute-phase reactants and the risk of relapse/recurrence in polymyalgia rheumatica: A prospective followup study. Arthritis Care Res. 2005;53(1):33–8.
Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1-73.
Van Sleen Y, Boots AMH, Abdulahad WH, Bijzet J, Sandovici M, van der Geest KSM, et al. High angiopoietin-2 levels associate with arterial inflammation and long-term glucocorticoid requirement in polymyalgia rheumatica. Rheumatology (Oxford). 2020;59(1):176–84.
Kimura M, Tokuda Y, Oshiawa H, Yoshida K, Utsunomiya M, Kobayashi T, et al. Clinical characteristics of patients with remitting seronegative symmetrical synovitis with pitting edema compared to patients with pure polymyalgia rheumatica. J Rheumatol. 2012;39(1):148–53.
Bingham CO, Pohl C, Woodworth TG, Hewlett SE, May JE, Rahman MU, et al. (eds) Developing a standardized definition for disease "flare" in rheumatoid arthritis (OMERACT 9 special interest group), 2009/10.
Steyerberg E. Clinical prediction models: a practical approach to development, validation, and updating. New York: Springer; 2009.
Do-Nguyen D, Inderjeeth CA, Edelman J, Cheah P. Retrospective analysis of the clinical course of patients treated for polymyalgia. Open Access Rheumatol. 2013;5:33–41.
Salvarani C, Boiardi L, Mantovani V, Ranzi A, Cantini F, Olivieri I, et al. HLA-DRB1 alleles associated with polymyalgia rheumatica in northern Italy: correlation with disease severity. Ann Rheum Dis. 1999;58(5):303–8.
Dasgupta B, Cimmino MA, Kremers HM, Schmidt WA, Schirmer M, Salvarani C, et al. 2012 Provisional classification criteria for polymyalgia rheumatica: A European League Against Rheumatism/American College of Rheumatology collaborative initiative. Arthritis Rheum. 2012;64(4):943–54.
Hewlett S, Sanderson T, May J, Alten R, Bingham CO III, Cross M, et al. ‘I’m hurting, I want to kill myself’: rheumatoid arthritis flare is more than a high joint count—an international patient perspective on flare where medical help is sought. Rheumatology. 2011;51(1):69–76.
Binard A, de Bandt M, Berthelot J-M, Saraux A. Performance of the polymyalgia rheumatica activity score for diagnosing disease flares. Arthritis Rheum. 2008;59(2):263–9.
Cleuziou C, Binard A, De Bandt M, Berthelot JM, Saraux A. Contribution of the polymyalgia rheumatica activity score to glucocorticoid dosage adjustment in everyday practice. J Rheumatol. 2012;39(2):310–3.
Dejaco C, Duftner C, Cimmino MA, Dasgupta B, Salvarani C, Crowson CS, et al. Definition of remission and relapse in polymyalgia rheumatica: data from a literature search compared with a Delphi-based expert consensus. Ann Rheum Dis. 2011;70(3):447–53.
Duarte C, De Oliveira Ferreira RJ, Mackie SL, Kirwan JR, da Silva JAP, Ferreira RJO, et al. Outcome measures in polymyalgia rheumatic. A systematic review. J Rheumatol. 2015;42(12):2503–11.
Twohig H, Owen C, Muller S, Mallen CD, Mitchell C, Hider S, et al. Outcomes measured in polymyalgia rheumatica and measurement properties of instruments considered for the OMERACT core outcome set: a systematic review. J Rheumatol. 2021;48(6):883–93.
Binard A, De Bandt M, Berthelot J-M, Saraux A. Usefulness of the disease activity scores for polymyalgia rheumatica for predicting glucocorticoid dose changes: a study of 243 scenarios. Arthritis Rheum. 2007;57(3):481–6.
Leeb BF, Bird HA. A disease activity score for polymyalgia rheumatica. Ann Rheum Dis. 2004;63(10):1279–83.
Leeb BF, Rintelen B, Sautner J, Fassl C, Bird HA. The polymyalgia rheumatica activity score in daily use: proposal for a definition of remission. Arthritis Rheum. 2007;57(5):810–5.
Mackie SL. Polymyalgia rheumatica: pathogenesis and management. Clin Med. 2013;13(4):398–400.
Muller S, Whittle R, Hider SL, Belcher J, Helliwell T, Morton C, et al. Longitudinal clusters of pain and stiffness in polymyalgia rheumatica: 2-year results from the PMR Cohort Study. Rheumatology (Oxford). 2020;59(8):1906–15.
We would like to thank and acknowledge all the patients and colleagues that made this study possible. A special thanks to Else van Gerresheim, Remi Reurink, Vera de Graaf, Minka den Broeder, Beau de Korte, Michiel Minten, Lise Verhoef and Demi Donner for their hard working and dedication.
This was an investigator-initiated study funded by the Sint Maartenskliniek Nijmegen. The Sint Maartenskliniek had no role in the study design, and collection, analysis, and interpretation of data, and in writing of the manuscript.
Ethical approval and consent to participate
The medical ethical committee of research with human subjects (CMO region Arnhem-Nijmegen) waived the need for formal approval, under number 2017-3506, as the study did not fall under the Medical Research Involving Human Subjects Act (WMO). Patients’ consent to obtain data was acquired through an opt-out procedure and patient data was anonymized before use. No administrative permission was required to access the raw data in this study. All relevant legislation was followed as appropriate.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Bolhuis, T.E., Marsman, D., van den Hoogen, F.H.J. et al. (Dis)agreement of polymyalgia rheumatica relapse criteria, and prediction of relapse in a retrospective cohort. BMC Rheumatol 6, 45 (2022). https://doi.org/10.1186/s41927-022-00274-y
- Polymyalgia rheumatica
- Risk factors