Beruflich Dokumente
Kultur Dokumente
123
1106 Author's personal copy Qual Life Res (2010) 19:1105–1113
123
Qual Life Res (2010) 19:1105–1113 Author's personal copy 1107
Pages as the sampling frame resulting in a representative established in previous studies, a CES-D score of C16 was
population sample for the Northwest region of Adelaide as used to define the presence of depression [34, 35]. From the
detailed in previous studies [29, 30]. A selection of patients ROC analysis on the IDACC data, a specific threshold
recruited for baseline examination was defined as the car- value was derived demonstrating a maximal Youden Index
diac subset. Studies have indicated that self-reported car- (sensitivity ? specificity - 1) [37]. This optimal threshold
diovascular events show good concordance with medical from the IDACC cohort was then evaluated in the NWAHS
records and hospital diagnostic codes [31–33]. Therefore, and coronary angiogram cohorts in conjunction with their
the inclusion criterion for this group was the self-report of respective ROC curves. In addition, positive predictive
patients responding to a history of events, either angina or values (PPV) and negative predictive values (NPV) were
myocardial infarction, as diagnosed by a doctor. Several measured for different threshold scores in the central range
days prior to an appointment for biomedical examination, of the SF-36 MCS scores. Independent samples’ t-tests
participants were mailed a questionnaire pack containing compared SF-36 MCS mean scores between depressed and
the SF-36 and CES-D to be self-administered and then not depressed groups according to CES-D scoring. Clinical
checked for accuracy during the examination. characteristics between the three groups were compared
using Fishers’ exact test for categorical variables and
Coronary angiogram cohort ANOVA for continuous data.
123
1108 Author's personal copy Qual Life Res (2010) 19:1105–1113
Table 1 Clinical characteristics and HRQoL scores (mean ± SD) for the study cohorts
Characteristic IDACC (n = 1,221 NWAHS (n = 222) Angiogram (n = 80)
score C 16), (39 ± 11, 30 ± 10, 35 ± 6, respectively) Using the maximal Youden Index, the optimal SF-36
compared with those not depressed (52 ± 9, 49 ± 10, MCS threshold value for detecting depressive symptoms
48 ± 8, respectively). was 45 (PPV = 72%, 95% CI = 68–76; NPV = 79%,
95% CI = 75–82. At this threshold, sensitivity was 73%
ROC characteristics in IDACC sample (95% CI = 69–77%), so that 27% of patients were falsely
identified as experiencing depressive symptoms. Specific-
Using ROC analysis, overall there was a significant pre- ity was increased to 77% (95% CI = 74–80%), and 76% of
diction for the CES-D scores as measured by the SF-36 patients were appropriately classified. The operating char-
MCS scores (AUC = 0.82 SE 0.01, 95% CI 0.8–0.85). acteristics, NPVs and PPVs are summarised in Table 2.
Utilising a high specificity (90%, 95% CI = 87–92%), at
the SF-36 MCS threshold value of 39, 72% of patients were Accuracy of the SF-36 mental summary score
appropriately assigned as being depressed as per the CES- in NWAHS and coronary angiogram cohorts
D scores. However, the sensitivity of this SF-36 MCS
threshold value was 50% (95% CI = 46–54%), indicating To establish the usefulness of the SF-36 MCS in detecting
that half of the patients depressed as per the CES-D scores depressive symptoms, the ROC curves were analysed
were identified correctly. separately in a cohort with a low prevalence of depression
123
Qual Life Res (2010) 19:1105–1113 Author's personal copy 1109
Table 2 Operating characteristics for the central range of SF-36 MCS cut-points in the IDACC cohort
SF-36 MCS cut-off Sensitivity 95% CI Specificity 95% CI Appropriately classified (%) PPV 95% CI NPV 95% CI
Table 3 Screening abilities of the central range of SF-36 MCS cut-points for the NWAHS and angiogram cohorts
SF-36 MCS cut-off NWAHS Coronary angiogram
Sensitivity Specificity PPV NPV Sensitivity Specificity PPV NPV
B40 86% (67–95) 79% (73–85) 38% (27–51) 97% (93–99) 77% (57–89) 86% (73–94) 77% (57–89) 86% (73–94)
B41 86% (67–95) 77% (70–82) 36% (25–48) 97% (93–99) 80% (61–92) 86% (73–94) 77% (58–90) 88% (75–95)
B42 86% (67–94) 75% (68–80) 34% (23–46) 97% (93–99) 80% (61–92) 84% (70–92) 75% (56–88) 88% (74–95)
B43 90% (72–97) 73% (66–79) 33% (23–45) 98% (93–99) 90% (72–97) 74% (59–85) 68% (51–81) 93% (79–98)
B44 93% (76–99) 71% (64–77) 33% (23–44) 99% (94–100) 90% (72–97) 70% (55–82) 64% (48–78) 92% (78–98)
B45 100% (85–100) 68% (61–74) 32% (23–43) 100% (96–100) 93% (76–99) 64% (49–77) 61% (45–75) 94% (79–99)
SF-36 MCS, SF-36 mental summary score; PPV, positive predictive value; NPV, negative predictive value
Data obtained from receiver-operating characteristics for 222 observations for the NWAHS sample and for 80 observations in the coronary
angiogram cohort. Sensitivity, specificity, PPV and NPV are presented with 95% CI
(NWAHS cohort) and one with a similar prevalence to the the coronary angiogram cohort, the optimal threshold was
IDACC cohort (the coronary angiogram cohort). The 41 (sensitivity = 80%, 95% CI 61–92; specificity = 86%,
overall accuracy for predicting depressive symptoms as 95% CI = 73–94; PPV = 77%, 95% CI 58–90; and
measured by the AUC for the NWAHS and coronary NPV = 88%, 95% CI = 75–95).
angiogram populations were 0.91 (SE 0.02, 95% CI 0.87– At the optimal threshold of 45, the proportion of patients
96) and 0.89 (SE 0.04, 95% CI 0.8–0.95), respectively. appropriately identified as depressed on the SF-36 MCS
Using the optimal SF-36 MCS threshold value of 45 according to CES-D scoring for the NWAHS and coronary
derived from the IDACC cohort, ROC analysis of the angiogram cohort was 71 and 75%, respectively, compared
NWAHS cohort elucidated a sensitivity of 100% (95% to 76% in IDACC (Table 4).
CI = 85–100%) and specificity of 68% (95% CI = 61–
74%). Positive and negative predictive values were 32%
(95% CI = 23–43%) and 100% (95% CI = 96–100%), Discussion
respectively. Similar levels of accuracy were observed for
the coronary angiogram cohort with this SF-36 MCS This investigation has first established that the SF-36 MCS
threshold value of 45 (sensitivity = 93%, 95% CI = 76– is correlated with the CES-D scores in CHD populations.
99%, specificity = 64%, 95% CI = 49–77%; PPV = 61%, Secondly, the SF-36 MCS threshold value of 45 derived
95% CI = 45–75%; and NPV = 94%, 95% CI = 79–99%, from ROC analysis according to the maximal Youden
Table 3). Index could identify a depressed subgroup in the index
According to maximal Youden Index, the optimal (IDACC) cohort (Table 2), as defined by the CES-D, with
threshold found in the NWAHS cohort was also 45, and in sensitivity of 73% and specificity of 77%. Moreover, when
123
1110 Author's personal copy Qual Life Res (2010) 19:1105–1113
Table 4 Proportion of patients appropriately classified by the SF-36 As illustrated in Table 2, assigning a threshold value for
MCS for the central range of cut-points for each study cohort the SF-36 MCS is difficult considering the trade-off
SF-36 MCS Appropriately classified between sensitivity and specificity. In order to achieve a
cut-off balanced threshold, the Youden Index was employed. This
IDACC (%) NWAHS (%) Angiogram (%)
(n = 1,221) (n = 222) (n = 80) index provides the optimal threshold value for which sen-
sitivity ? specificity - 1 is maximised, representing the
B40 73 80 83 point on the ROC curve farthest from chance. Although
B41 74 78 84 other indices are available to establish threshold values, the
B42 74 76 83 Youden Index was chosen as it is well established and not
B43 75 75 80 dependent on prevalence rates [40] and has been previously
B44 75 73 78 used in cardiovascular studies [41].
B45 76 71 75 The relationship of the SF-36 MCS to depressive
Data obtained from receiver-operating characteristic symptoms has been reported previously. The Medical
SF-36 MCS, SF-36 mental summary score Outcomes Study (MOS) investigated the functioning and
well-being of depressed patients and showed that depres-
sion has an additive adverse effect on patient functional
prospectively applying this threshold value to CHD cohorts status and well-being as indicated by scores on the SF-36
with a low and high prevalence of depression (NWAHS [25]. In addition, the SF-36 MCS scale has been evaluated
and coronary angiogram cohorts, respectively), similar as a screening tool for clinical depression using a two-stage
levels of accuracy were observed (Table 3). To our screening approach in which positive CES-D results were
knowledge, this is the first study to establish the accuracy followed by a clinical interview. ROC analysis showed that
of the SF-36 MCS as an indicator of depressive symptoms the best all around cut-off for the SF-36 MCS for detecting
in a cardiac population. It must be recognised that the use depressive disorder was at a score of 42 or below in a
of the SF-36 MCS threshold has not been confirmed as a community sample of MOS patients with chronic condi-
screening or diagnostic tool but as a research tool for group tions [16].
observations utilising a generic measure and that interpre- Other studies have investigated the use of the SF-36
tations of individual scores should not be made. Accord- MCS in disease-specific samples. Discriminate function
ingly, studies where specific depression questionnaires analysis showed that the SF-36 MCS had a positive
were not administered to CHD cohorts may utilise the SF- predictive value of 98% in differentiating major depres-
36 MCS as a surrogate method to define a subgroup sion from minor or no depression in patients with chronic
experiencing depressive symptoms. This will enable fur- pain after controlling for age, gender and pain diagnosis
ther investigation of depression in cardiac patients partic- [27]. In a sample of patients with asthma, depressive
ularly in studying the relationships between depressive symptoms had an important role in patient-derived
symptoms and other variables. functional status and health-related quality of life as
The strength of the above findings includes the follow- measured by the MCS of the SF-36, even after adjusting
ing: (a) three well-characterised study cohorts, (b) use of an for current disease activity [26]. Walsh et al. [38] con-
index cohort to establish the SF-36 MCS threshold criteria firmed the effectiveness of the SF-36 MCS in detecting
with subsequent validation in independent populations and depression in a chronic spinal pain population. The CES-
(c) variable prevalence of depression within the study D was also used a gold standard and a score of 35 on the
cohorts. This later feature is particularly important as MSS was identified as an optimal cut-off. This lower
disease prevalence within a population substantially influ- threshold could be attributed to a higher cut-off of 19 on
ences the predictive ability of the instrument. The differ- CES-D scoring to indicate depression. Thus, the SF-36
ence between the prevalence of depression among the has been shown to be a good indicator of depression in
cohorts reflects the recruitment strategy and further illus- both general and diseased populations. Previous studies
trates the diversity of the study population. The IDACC have demonstrated the ability of the various depressive
cohort was recruited from in-hospital patients, which may screening instruments to identify cardiac patients with
account for the higher prevalence of depression. In con- depression [42, 43]; however, there are no previous
trast, the NWAHS cohort was recruited from the general studies confirming this for the SF-36 as a generic health
population by telephone interview. The coronary angio- measure in cardiac patients.
gram cohort was recruited from those undergoing angiog- Our study has evaluated the effectiveness of the SF-36
raphy with 59% being undertaken electively as day cases. MCS in detecting depressive symptoms in cardiac patients,
Hence, the cohorts are representative of the spectrum of by determining an optimal threshold value on one patient
patients with CHD. population and then applying this value to two different
123
Qual Life Res (2010) 19:1105–1113 Author's personal copy 1111
‘‘test cohorts’’ with clinically diverse characteristics. Thus, validated against the structured diagnostic interview [35].
we have illustrated the variation in sensitivity, specificity The CES-D was chosen as it was available in the cohorts
and predictive value between the different samples, and studied and compares favourably with other depression-
also for different optimal thresholds (Table 5). Accord- specific instruments (Table 6). We acknowledge that the
ingly, differences in the populations being sampled will threshold presented may vary with an alternative diagnostic
lead to differences in levels of accuracy and misclassifi- tool, such as the DSM criteria, resulting in varying esti-
cation. The prevalence of depression (CES-D C 16) was mates of sensitivity and specificity. Our results are also
similar in the IDACC and coronary angiogram cohorts, but limited by the lack of generalizability since the cohorts
it was much lower in the NWAHS cohort, reflecting the were recruited from a single region. In addition, wide
lowest PPV. There were also several differences in the confidence intervals noted in the NWAHS and angio-
average of SF-36 MCS and CES-D values, and also in the graphic cohorts limit the precision of the estimates.
age and risk factor distribution between the three samples. Another limitation of this study relates to the potential of
Since sensitivity and specificity are independent of disease the SF-36 MCS to misclassify depression. However, since
prevalence, these differences may contribute to the vari- depression is a disorder which lies on a continuum scale,
ability noted in the capabilities of the SF-36 MCS in every dichotomous assessment, even when developed from
detecting depressive symptoms. the gold standard, will contend with inappropriate classi-
The main limitation of this study is the absence of a fication to some degree. In this case, it would introduce a
structured diagnostic interview to formally confirm the degree of non-differential misclassification, which would
presence of depression in the study cohorts. However, as tend to bias any difference towards the null and therefore
discussed previously, such a comparison is unlikely to be potentially miss relationships that may exist with depres-
undertaken in population studies given the extensive sion. Lastly, it would have been desirable to confirm the
workload and costs that would be involved. As an alter- self-reported CHD in the NWAHS cohort. Future studies
native, we have used the CES-D questionnaire which is a are needed to validate the findings in large and diverse
well-established, reliable instrument that has been CHD populations.
123
1112 Author's personal copy Qual Life Res (2010) 19:1105–1113
This study has described a SF-36 MCS threshold that Index-Cardiac Version III. Journal of Clinical Epidemiology,
will enable CHD populations to be classified into depressed 51(7), 569–575.
15. Smith, H. J., Taylor, R., & Mitchell, A. (2000). A comparison of
and non-depressed groups relative to CESD scoring, four quality of life instruments in cardiac patients: SF-36, QLI,
thereby enabling important analyses to be undertaken in QLMI, and SEIQoL. Heart, 84(4), 390–394.
large cardiac cohorts that have completed the widely uti- 16. Ware, J. E., Jr., Kosinski, M., & Keller, S. D. (1994). SF-36
lised SF-36 questionnaire. The data presented defines an Physical, mental health summary scales: A user’s manual. Bos-
ton, Massachusettsd: The Health Institute, New England Medical
SF-36 MCS threshold value of 45 as being the optimal Center.
threshold; however, the results for other thresholds are also 17. Ware, J. E., Jr., & Gandek, B. (1998). Overview of the SF-36
presented should future investigators wish to adopt alter- health survey and the international quality of life assessment
native thresholds based upon their study sensitivity/speci- (IQOLA) project. Journal of Clinical Epidemiology, 51(11),
903–912.
ficity requirements. 18. McCallum, J. (1995). The SF-36 in an Australian sample: Vali-
dating a new, generic health status measure. Australian Journal of
Public Health, 19(2), 160–166.
19. Failde, I., & Ramos, I. (2000). Validity and reliability of the
SF-36 Health Survey Questionnaire in patients with coronary
References artery disease. Journal of Clinical Epidemiology, 53(4), 359–365.
20. Permanyer-Miralda, G., et al. (1991). Comparison of perceived
1. Penninx, B. W., et al. (2001). Depression and cardiac mortality: health status and conventional functional evaluation in stable
Results from a community-based longitudinal study. Archives of patients with coronary artery disease. Journal of Clinical Epi-
General Psychiatry, 58(3), 221–227. demiology, 44(8), 779–786.
2. Wulsin, L. R., & Singal, B. M. (2003). Do depressive symptoms 21. Berwick, D. M., et al. (1991). Performance of a five-item mental
increase the risk for the onset of coronary disease? A systematic health screening test. Medical Care, 29(2), 169–176.
quantitative review. Psychosomatic Medicine, 65(2), 201–210. 22. Failde, I., Ramos, I., & Fernandez-Palacin, F. (2000). Compari-
3. Glassman, A. H., et al. (2002). Sertraline treatment of major son between the GHQ-28 and SF-36 (MH 1–5) for the assessment
depression in patients with acute MI or unstable angina. JAMA, of the mental health in patients with ischaemic heart disease.
288(6), 701–709. European Journal of Epidemiology, 16(4), 311–316.
4. Berkman, L. F., et al. (2003). Effects of treating depression and 23. McHorney, C. A., Ware, J. E., Jr., & Raczek, A. E. (1993). The
low perceived social support on clinical events after myocardial MOS 36-item short-form health survey (SF-36): II. Psychometric
infarction: The Enhancing Recovery in Coronary Heart Disease and clinical tests of validity in measuring physical and mental
Patients (ENRICHD) Randomized Trial. JAMA, 289(23), 3106– health constructs. Medical Care, 31(3), 247–263.
3116. 24. Ware, J. E., Jr., et al. (1995). Comparison of methods for the
5. Schrader, G., et al. (2005). Effect of psychiatry liaison with scoring and statistical analysis of SF-36 health profile and sum-
general practitioners on depression severity in recently hospita- mary measures: Summary of results from the Medical Outcomes
lised cardiac patients: A randomised controlled trial. Medical Study. Medical Care, 33(4 Suppl), AS264–AS279.
Journal of Australia, 182(6), 272–276. 25. Wells, K. B., et al. (1989). The functioning and well-being of
6. McDowell, I., & Newell, C. (1996). Measuring health: A guide to depressed patients. Results from the medical outcomes study.
rating scales and questionnaires (p. 523). New York: Oxford JAMA, 262(7), 914–919.
University Press. 26. Mancuso, C. A., Peterson, M. G., & Charlson, M. E. (2000).
7. Kavan, M. G., et al. (1990). Screening for depression: Use of Effects of depressive symptoms on health-related quality of life
patient questionnaires. American Family Physician, 41(3), 897– in asthma patients. Journal of General Internal Medicine, 15(5),
904. 301–310.
8. Attkisson, C. C., & Zich, J. M. (1990). Depression in primary 27. Elliott, T. E., Renier, C. M., & Palcher, J. A. (2003). Chronic
care. National institute of mental health (U.S.), University of pain, depression, and quality of life: Correlations and predictive
California San Francisco. New York: Routledge. value of the SF-36. Pain Medicine, 4(4), 331–339.
9. Weissman, M. M., et al. (1977). Assessing depressive symptoms 28. Cheok, F., et al. (2003). Identification, course, and treatment of
in five psychiatric populations: A validation study. American depression after admission for a cardiac condition: Rationale and
Journal of Epidemiology, 106(3), 203–214. patient characteristics for the Identifying Depression As a
10. Roberts, R. E., & Vernon, S. W. (1983). The center for epide- Comorbid Condition (IDACC) project. American Heart Journal,
miologic studies depression scale: Its use in a community sample. 146(6), 978–984.
American Journal of Psychiatry, 140(1), 41–46. 29. Grant, J. F., et al. (2006) The North West Adelaide Health study:
11. Schulberg, H. C., et al. (1985). Assessing depression in primary Detailed methods and baseline segmentation of a cohort for
medical and psychiatric practices. Archives of General Psychia- selected chronic diseases. Epidemiologic Perspectives Innova-
try, 42(12), 1164–1170. tions 3(4).
12. Radloff, L. S. (1977). The CES-D scale: A self-report depression 30. Grant, J.F., et al. (2008). Cohort profile: The North West Ade-
scale for research in the general population. Applied Psycholog- laide Health study (NWAHS). International Journal of
ical Measurement, 1(3), 385–401. Epidemiology.
13. Spertus, J. A., et al. (2002). Health status predicts long-term 31. Bergmann, M. M., et al. (1998). Validity of self-reported diag-
outcome in outpatients with coronary disease. Circulation, noses leading to hospitalization: A comparison of self-reports
106(1), 43–49. with hospital records in a prospective study of American adults.
14. Dougherty, C. M., et al. (1998). Comparison of three quality of American Journal of Epidemiology, 147(10), 969–977.
life instruments in stable angina pectoris: Seattle Angina Ques- 32. Heckbert, S. R., et al. (2004). Comparison of self-report, hospital
tionnaire, Short Form Health Survey (SF-36), and Quality of Life discharge codes, and adjudication of cardiovascular events in the
123
Qual Life Res (2010) 19:1105–1113 Author's personal copy 1113
Women’s Health Initiative. American Journal of Epidemiology, 41. Schisterman, E. F., et al. (2001). TBARS and cardiovascular
160(12), 1152–1158. disease in a population-based sample. Journal of Cardiovascular
33. Okura, Y., et al. (2004). Agreement between self-report ques- Risk, 8(4), 219–225.
tionnaires and medical record data was substantial for diabetes, 42. Stafford, L., Berk, M., & Jackson, H. J. (2007). Validity of the
hypertension, myocardial infarction and stroke but not for heart hospital anxiety and depression scale and patient health ques-
failure. Journal of Clinical Epidemiology, 57(10), 1096–1103. tionnaire-9 to screen for depression in patients with coronary
34. Ensel, W. (1986) Measuring depression: The CESD scale, in artery disease. General Hospital Psychiatry, 29(5), 417–424.
social support, life events and depression. In Lin, N., Dean, A., 43. Strik, J. J., et al. (2001). Sensitivity and specificity of observer and
Ensel, W. M. (Eds.). Academic Press: New York. self-report questionnaires in major and minor depression follow-
35. Zich, J. M., Attkisson, C. C., & Greenfield, T. K. (1990). ing myocardial infarction. Psychosomatics, 42(5), 423–428.
Screening for depression in primary care clinics: The CES-D and 44. Lowe, B., et al. (2004). Measuring depression outcome with a
the BDI. International Journal of Psychiatry in Medicine, 20(3), brief self-report instrument: Sensitivity to change of the Patient
259–277. Health Questionnaire (PHQ-9). Journal of Affective Disorders,
36. ABS. (2003). Socio-Economic Indexes for Areas, Australia, 2001. 81(1), 61–66.
Canberra: Australian Bureau of Statistics. 45. Oliver, J., & Simmons, M. (1984). Depression as measured by the
37. Youden, W. J. (1950). Index for rating diagnostic tests. Cancer, DSM-III and the beck depression inventory in an unselected adult
3(1), 32–35. population. Journal of Consulting and Clinical Psychology,
38. Walsh, T. L., et al. (2006). Screening for depressive symptoms in 52(5), 892–898.
patients with chronic spinal pain using the SF-36 Health Survey. 46. Parikh, R. M., et al. (1988). The sensitivity and specificity of the
Spine J, 6(3), 316–320. Center for Epidemiologic Studies Depression Scale in screening
39. Schatzkin, A., et al. (1987). Comparing new and old screening for post-stroke depression. International Journal of Psychiatry in
tests when a reference procedure cannot be performed on all Medicine, 18(2), 169–181.
screenees. Example of automated cytometry for early detection 47. Gerety, M. B., et al. (1994). Performance of case-finding tools for
of cervical cancer. American Journal of Epidemiology, 125(4), depression in the nursing home: Influence of clinical and func-
672–678. tional characteristics and selection of optimal threshold scores.
40. Fluss, R., Faraggi, D., & Reiser, B. (2005). Estimation of the Journal of the American Geriatrics Society, 42(10), 1103–1109.
Youden index and its associate cutoff point. Biometric Journal, 48. Okimoto, J. T., et al. (1982). Screening for depression in geriatric
47(4), 458–472. medical patients. American Journal of Psychiatry, 139(6), 799–802.
123