Published ahead of print on March 27, 2008, doi:10.1164/rccm.200706-877OC
© 2008 American Thoracic Society doi: 10.1164/rccm.200706-877OC
Interstitial Lung Disease in Systemic SclerosisA Simple Staging System1 Royal Brompton Hospital and National Heart and Lung Institute, London, United Kingdom; 2 King's College Hospital, London, United Kingdom; 3 Hammersmith Hospital, London, United Kingdom; and 4 Royal Free Hospital, London, United Kingdom Correspondence and requests for reprints should be addressed to Athol U. Wells, M.D., Interstitial Lung Disease Unit, Royal Brompton Hospital and NHLI, Imperial College, Emmanuel Kaye Building, 1B Manresa Road, London SW3 6LP, UK. E-mail: a.wells{at}rbht.nhs.uk
Rationale: In interstitial lung disease complicating systemic sclerosis (SSc-ILD), the optimal prognostic use of baseline pulmonary function tests (PFTs) and high-resolution computed tomography (HRCT) is uncertain. Objectives: To construct a readily applicable prognostic algorithm in SSc-ILD, integrating PFTs and HRCT. Methods: The prognostic value of baseline PFT and HRCT variables was quantified in patients with SSc-ILD (n = 215) against survival and serial PFT data.
Measurements and Main Results: Increasingly extensive disease on HRCT was a powerful predictor of mortality (P < 0.0005), with an optimal extent threshold of 20%. In patients with HRCT extent of 10–30% (termed indeterminate disease), an FVC threshold of 70% was an adequate prognostic substitute. On the basis of these observations, SSc-ILD was staged as limited disease (minimal disease on HRCT or, in indeterminate cases, FVC Conclusions: An easily applicable limited/extensive staging system for SSc-ILD, based on combined evaluation with HRCT and PFTs, provides discriminatory prognostic information.
Key Words: prognosis high-resolution computed tomography pulmonary function test limited extensive
Pulmonary disease is the leading cause of mortality in patients with systemic sclerosis (SSc) (1–3). Accurate prognostic evaluation allows the selection of higher risk patients who may benefit from treatment. Baseline gas transfer (DLCO) and FVC levels have traditionally been used as measures of disease severity and reductions in both parameters have been associated with increased mortality in the interstitial lung disease of systemic sclerosis (SSc-ILD) (3–7). However, the wide range of normal pulmonary function test (PFT) values (80–120% of predicted) is a major constraint. For example, a mildly reduced FVC level of 75% represents a clinically insignificant reduction from a premorbid FVC level of 80% but a striking decline from a premorbid value of 120%; thus, baseline FVC levels are most likely to be misleading when disease is either minimal or severe. In principal, the quantification of disease extent on high-resolution computed tomography (HRCT) would improve the accuracy of staging in these scenarios. However, because formal scoring of disease extent on HRCT is unrealistic in routine clinical practice, the optimal approach is likely to require a combination of semiquantitative HRCT evaluation and PFTs. In this study, we identified the optimal HRCT extent threshold against mortality. For the purposes of analysis, HRCT subthresholds were set, such that disease extent was clearly below or above the threshold value and readily classifiable by less experienced observers as minimal or severe. A prognostic algorithm was created in which disease was classified as limited or extensive when disease extent on HRCT was overtly minimal or severe. In the remaining cases, with disease extent not readily classifiable as minimal or severe (i.e., "indeterminate" in extent), the distinction between limited and extensive disease was based on FVC threshold values. This staging system was compared with the use of HRCT and FVC in isolation. Some of this work has been presented in abstract form (8).
Patients Consecutive patients (n = 330) referred to the Royal Brompton Hospital (London, UK) between January 1990 and December 1999 met criteria for SSc (9) with no overlap features: 277 (84%) had evidence of interstitial lung disease on HRCT. Investigations were performed as part of a routine clinical protocol. Exclusion criteria consisted of the following: (1) HRCT scans performed elsewhere, unavailable for scoring (n = 53); (2) baseline investigations separated by more than 90 days (n = 8); and (3) no follow-up (n = 1). The remaining 215 patients made up the study population. Excluded patients did not differ from the study cohort (see online supplement). Forty-four of the 46 patients with complete data and no pulmonary fibrosis or pulmonary hypertension, as judged by HRCT and echocardiography, were compared with the study cohort.
Clinical Data PFTs (% predicted), echocardiography, and HRCT were performed as previously reported (10, 11). Pulmonary hypertension was defined as a pulmonary arterial systolic pressure of 40 mm Hg or greater (12). HRCT scans were scored by two observers (A.U.W., S.R.D.) at five levels for total disease extent, extent of reticulation, proportion of ground glass, and coarseness of reticulation (see online supplement).
Outcome
Data Analysis Optimal HRCT extent and FVC thresholds, identified against mortality using proportional hazards analysis, were compared with alternative thresholds using a modified time-dependent receiver operating characteristic curve (ROC) analysis (14) (see online supplement). The prognostic value of a limited/extensive classification for disease extent, integrating HRCT observations and FVC values (as shown in Figure 1A), was compared with HRCT and FVC thresholds.
Observations were tested in two patient subgroups, matched for disease extent on HRCT (the patient with the median value was assigned to subgroup B). In subgroup A, the above analyses were repeated as a random subgroup test. In subgroup B, limited and extensive disease were defined as shown in Figure 1B: HRCT extent was graded as definitely greater than 20%, definitely less than 20%, or indeterminate by two radiologists, two physicians with more than 18 months of experience in interstitial lung disease and four trainees (two radiologists and two physicians, with <6 mo experience in interstitial lung disease). All sections from the arch of the aorta to the top of the hemidiaphragm were evaluated. Instruction was confined to three illustrative examples (with less than 10 min discussion).
Patient Characteristics As shown in Table 1, there was a female predominance, and a wide range of disease extent on HRCT (as shown in Figure E1 of the online supplement) and PFTs in the study cohort (n = 215). Age and male to female ratio did not differ significantly between the study cohort and the comparative group of 44 patients with no cardiopulmonary disease (age, 45.9 ± 11.5 yr; male to female ratio, 7:37). Nineteen of 215 patients (9%) had pulmonary hypertension. Sixty-one patients (28%) were receiving treatment at presentation and 38 patients (18%) had treatment introduced within the next 3 months. Seventy-five of 215 patients died (35%) (median follow-up, 89 mo; 10-yr survival, 59%). Decline in FVC was observed in 109 of 194 patients (56%; median time to decline, 61 mo). Decline in DLCO was also observed in 109 of 194 patients (56%; median time to decline, 64 mo). Progression (defined as death or deterioration in either FVC or DLCO) occurred in 150 of 194 patients (77%; median progression-free survival, 36 mo).
Determinants of Outcome Mortality was strongly linked to baseline DLCO levels, the extent of disease on HRCT, the extent of a reticular pattern on HRCT, and the presence of pulmonary hypertension on univariate analysis (Table 2). These severity variables were examined in separate multivariate models, with adjustment for age, sex, smoking status, the proportion of ground glass on HRCT, and the coarseness of reticulation on HRCT. HRCT disease extent was as predictive of mortality as DLCO levels and the presence of pulmonary hypertension (all P < 0.0005) and more predictive than the extent of a reticular pattern (P = 0.001) (Table E2).
On univariate analysis, baseline PFTs, the extent of disease on HRCT, and the extent of a reticular pattern on HRCT were linked to decline in FVC, decline in DLCO and progression-free survival. The extent of a reticular pattern was the strongest determinant of time to decline in FVC and progression-free survival, whereas the presence of pulmonary hypertension was the strongest determinant of time to decline in DLCO. Full details are provided in the online supplement (Table E3).
HRCT Threshold Values against Survival
On the basis of these observations, a cardinal HRCT extent threshold of 20% was selected. This threshold was further evaluated by comparing areas under the curve in a modified time-dependent ROC analysis for thresholds of 15, 20, and 25%, as shown in the online supplement (Figure E2). ROC values were significantly higher for a threshold value of 20% than with the use of threshold values of 15 or 25%. Parallel analyses were performed in an attempt to identify an optimal threshold against mortality for the extent of a reticular pattern on HRCT. However, examining threshold values of 5, 10, 15, and 20% did not identify an optimal value (as shown in Table E4).
FVC Threshold Values against Survival
A Clinical Algorithm for Routine Staging In keeping with the optimal HRCT extent threshold of 20%, subthreshold values of 10 and 30% were chosen for the purpose of further analysis, as indicative of easily classifiable minimal (limited; n = 87; see Figure E4) or severe (extensive; n = 36; see Figure E6) disease on HRCT. In the remaining 92 patients (43%) with an indeterminate extent of disease on HRCT (10–30%) (see Figure E5), FVC values were used to stage disease as limited or extensive as shown in Figure 1A. The following analyses were performed before the adaptation of the staging system for routine scoring.
Final Routine Staging Algorithm Assessed by Eight Observers in Subgroup B (n = 108) For use in routine practice, formal HRCT scoring was substituted by an estimation of disease extent as definitely less than 20%, definitely more than 20%, or indeterminate (i.e., observer difficulty in extent subcategorization). Each observer staged disease severity as limited or extensive based on disease extent on HRCT and FVC levels, as shown in Figure 1B.
HRCT scans, evaluated by two radiologists, two physicians, and four trainees, required 1.49 ± 0.93 minutes/scan for evaluation. Interobserver agreement was good for practitioners (
We propose an easily applicable prognostic algorithm for SSc-ILD, validated against mortality and requiring (1) the rapid identification of overtly minimal or severe disease on HRCT, based on a disease extent threshold of 20%, and (2) the use of an FVC threshold of 70% in the remaining cases (with an indeterminate extent of disease on HRCT). Thus, HRCT is used to stage severity, provided that findings are clear-cut, with recourse to an FVC threshold of 70% in marginal cases requiring greater radiologic expertise for HRCT classification. The staging of interstitial lung disease as limited or extensive had a greater prognostic value than either HRCT or FVC thresholds in isolation, when HRCT extent was formally scored. The prognostic separation between limited and extensive disease was retained when tested by experienced practitioners, confirming that formal HRCT scoring, which is not practicable in routine practice, can be avoided. The HRCT staging of patients with obviously minimal or severe disease reduces misclassification of severity by an FVC threshold (due to the wide range in normal premorbid values). More importantly, the proposed system provides, for the first time in SSc-ILD, an easily applicable and rapid means of integrating HRCT extent and functional severity in routine prognostic evaluation. Formal HRCT scoring was used to construct the staging system and to perform preliminary ancillary analyses. For this purpose, HRCT disease extent thresholds of 10 and 30%, equidistant to 20%, were used to identify patients readily classifiable as having minimal or severe disease, with FVC levels used for staging when disease extent lay between 10 and 30%. The prognostic distinction between limited and extensive disease was not weakened when the presence of pulmonary hypertension and demographic factors were taken into account. Furthermore, the prognostic separation was equally striking in patients considered to require immediate treatment and in the remaining patients in whom observation was initially believed to be appropriate.
However, the true test of the limited/extensive system was its application using rapid semiquantitative estimation of disease extent by four practitioners and four trainees. Strikingly, the two physicians with less than 2 years' experience in interstitial lung disease achieved prognostic separations, both in mortality and progression-free survival, comparable to those of two experienced radiologists, despite a training period of only 10 minutes. It is possible that the group HRCT threshold values tested in analysis were confined to those that can practicably be identified on rapid semiquantitative HRCT evaluation. Thus, prognostic distinctions were examined using 5% increments in disease extent. The chosen threshold of 20% was selected because it signaled a substantial increase in mortality, compared with patients with no cardiopulmonary disease. The use of alternative thresholds of 15 or 25% failed to capture this key threshold effect, and this was confirmed using a recently described time-dependent ROC methodology (14). An HRCT extent threshold of 30% was equally discriminatory, but with this threshold a much smaller subgroup with extensive disease (n = 36) was selected, and in patients with less extensive disease, subgroups with very heterogeneous outcomes (<20%, 20–30%) were amalgamated. The FVC threshold of 70% corresponded exactly to an HRCT disease extent of 20% on linear regression and could be justified on two other grounds. First, an examination of FVC thresholds at 5% increments (to parallel the HRCT thresholds) showed that an FVC threshold of 70% was optimal on proportional hazards analysis, and this observation was confirmed using time-dependent ROC methodology (14). Second, in the recent scleroderma lung studies (16, 17), an FVC level of 70% was the threshold below which therapy was effective in preventing disease progression, an effect that was lost with the cessation of treatment. It should be stressed that the aim of this study was to construct a simple staging system and not a complex multivariate index in which all possible prognostic information is included. In idiopathic pulmonary fibrosis, the clinical-radiographic-physiologic indices have provided important insights into the disease (18, 19) but are not applied in routine practice because of their complexity. Furthermore, clinical-radiographic-physiologic scores and other multivariate systems are continuous and do not separate patients into discrete groups according to disease severity in a clinically useful fashion. The aim of our staging system was to enable clinicians to readily classify patients into two groups ("high risk" and "low risk"), based on obvious differences in outcome, using HRCT extent and FVC thresholds that can be realistically evaluated in clinical practice. Importantly, in preliminary proportional hazards analyses, disease extent on HRCT was shown to provide prognostic information that was comparable to other severity variables that are not specific to the interstitium (DLCO levels, the presence of pulmonary hypertension) or require expert radiologic assessment (the extent of reticulation on HRCT). One limitation of our study is that the extent threshold at which treatment should be instituted cannot be distilled from our data. Patients with extensive disease had a worse outcome, despite the fact that the majority were treated from presentation, and this suggests that, ideally, treatment should be instituted earlier in the disease course, in the hope of preventing progression to the 20% HRCT extent/70% FVC thresholds. However, the therapeutic approach in limited disease was too variable to allow subanalysis, because it was also influenced by other prognostic factors, including observed disease progression. Therapeutic status was categorized coarsely, according to whether treatment was believed to be warranted at presentation. The staging system provided equivalent prognostic information regardless of whether patients were treated or observed. This broad distinction was necessary because the later introduction of treatment could not be accounted for in analysis. The choice, timing, and duration of treatment varied widely during follow-up, with introduced therapy often modified or withdrawn due to side effects. Our approach has the advantage of categorizing therapeutic status at the time at which baseline information was available, thereby taking treatment into account using an "intention to prognosticate" approach. A further limitation related to time to decline in PFTs. This variable is a relatively coarse outcome measure, compared with mortality, for which follow-up time is generally collated at monthly intervals. Patients with mild disease are often monitored infrequently. Thus, it is possible that, in some cases, functional decline might have been missed for many months. However, to put this limitation in perspective, it should be stressed that the maximum time interval for PFT follow-up and thus the maximum inaccuracy in time to decline analyses was 12 months, with the total follow-up averaging over 5 years. Furthermore, this imprecision in outcome applies equally to assessment of all baseline variables and should not, in principle, favor one variable over another in prognostic evaluation. In conclusion, we propose a simple staging system for SSc-ILD as limited or extensive disease, based on simplified HRCT evaluation and FVC estimation, that provides more powerful prognostic information than either component in isolation. The system was readily applied by more experienced observers, who had been minimally trained in its use. In principle, this means of risk stratification is equally applicable to routine practice and the enrollment of patients in pharmaceutical studies.
Supported by Raynaud's and Scleroderma Association, Alsager, Cheshire, United Kingdom. This article has an online supplement, which is accessible from this issue's table of contents at www.atsjournals.org Originally Published in Press as DOI: 10.1164/rccm.200706-877OC on March 27, 2008 Conflict of Interest Statement: None of the authors has a financial relationship with a commercial entity that has an interest in the subject of this manuscript. Received in original form June 16, 2007; accepted in final form March 7, 2008
Related articles in AJRCCM:
This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||