Beruflich Dokumente
Kultur Dokumente
Chronic Illness
9(3) 179–190
‘Just give me the best quality ! The Author(s) 2012
Reprints and permissions:
of life questionnaire’: the sagepub.co.uk/journalsPermissions.nav
DOI: 10.1177/1742395312466903
Karnofsky scale and the chi.sagepub.com
Abstract
Objectives: To use the history of the Karnofsky Performance Scale as a case study illustrating the
emergence of interest in the measurement and standardisation of quality of life; to understand the
origins of current-day practices.
Methods: Articles referring to the Karnofsky scale and quality of life measurements published
from the 1940s to the 1990s were identified by searching databases and screening journals, and
analysed using close-reading techniques. Secondary literature was consulted to understand the
context in which articles were written.
Results: The Karnofsky scale was devised for a different purpose than measuring quality of life: as a
standardisation device that helped quantify effects of chemotherapeutic agents less easily
measurable than survival time. Interest in measuring quality of life only emerged around 1970.
Discussion: When quality of life measurements were increasingly widely discussed in the medical
press from the late 1970s onwards, a consensus emerged that the Karnofsky scale was not a very
good tool. More sophisticated approaches were developed, but Karnofsky continued to be used.
I argue that the scale provided a quick and simple, approximate assessment of the ‘soft’ effects of
treatment by physicians, overlapping but not identical with quality of life.
Keywords
History of medicine, quality of life, clinical scales, cancer, chemotherapy
Received 26 July 2012; accepted 28 September 2012
Centre for the History of Science, Technology and
Medicine, University of Manchester, Manchester, UK
Introduction Corresponding author:
Carsten Timmermann, Centre for the History of Science,
Monitoring and assessing the quality of life Technology and Medicine, University of Manchester, Simon
of patients being treated over long periods of Building, Manchester M13 9PL, UK
time in ways that are practical, reproducible Email: carsten.timmermann@manchester.ac.uk
and meaningful, has been a focus of much more frequent references to performance
interest and debate in the management of scales, along with calls for other ways of
chronic conditions over the past few dec- assessing quality of life, deemed more
ades. It is perhaps surprising that the earliest adequate.4–7 I will address changes in inter-
devices used to quantify and standardise actions between cancer professionals and
quality of life were developed for trials of patients in the 1970s, which accompanied
experimental chemotherapy, involving and informed the new interest in measuring
patients suffering from terminal cancer. In the quality of life of patients undergoing
this article, I will trace the history of the highly interventionist and often debilitating
numerical scale devised in the late 1940s to courses of therapy, at a time when for some
record what its inventors, David Karnofsky cancers remission appeared to turn into
and Joseph Burchenal termed the ‘perform- cure.8
ance status’ of cancer patients.1 The
Karnofsky scale covered 11 stages from
100 (normal health) to 0 (death). An even
Methods
simpler, five-grade performance scale was Standard historical methods were applied:
developed by Zubrod et al.2 for the Eastern articles and books referring to performance
Cooperative Oncology Group (ECOG) in scales and quality of life measurements
the 1950s and endorsed by the World Health published from the 1940s to the 1990s were
Organization (WHO) in a 1979 handbook.3 identified by searching databases and
Interestingly, their inventors did not con- screening journal literature. Pubmed (key-
ceive of these scales as devices to record word) and Science Direct (full text) searches
quality of life. They were used only sporad- for both ‘quality of life’ and ‘performance
ically in the 1950s and 1960s, and more or status’ were performed in August 2011.
less exclusively around experimental cancer Texts were analysed using close-reading
chemotherapy. Only in the 1970s were they techniques. Suitable secondary literature
reinterpreted, retrospectively, as simple was identified on broader developments in
quality of life indices. I suggest that histor- medicine and wider society during the time
ical case studies such as the one I present span covered, and consulted in order to
here can help us understand the origins of understand the context in which perform-
current-day clinical practices. ance scales were developed and concerns
In the article, I will discuss two stages in with quality of life emerged.
the history of the Karnofsky performance
scale. In the first part I will discuss the scale
in the context in which it was developed, in Results
the early days of experimental cancer
1940–1970: standardisation of
chemotherapy, as a standardisation device
that helped quantify effects of cytotoxic-
marginal effects
chemicals on patients. In the second part I The Karnofsky scale had its origins in
will address how and why, in the 1970s and experiments with nitrogen mustard (a close
1980s, the scale turned into a (somewhat relative of the mustard gases used in chem-
crude) marker of quality of life. Just as there ical warfare). The man who gave his name
was increased interest in measuring quality to the scale, David Karnofsky received
of life more broadly, as I will point out, the his medical degree from Stanford
frequency of publications in medical jour- University in 1940. As a resident at the
nals on the quality of life of cancer patients Colis P. Huntington Memorial Laboratory
rose markedly. This went along with much at Harvard University he became interested
in clinical cancer research. During the non-smoker – died in 1969 from lung
Second World War he joined renal physi- cancer, a disease that had been the focus of
ologist Homer Smith’s team at New York much of his work in the 1940s.
University, for secret research on the bio- The performance status scale that came
logical effects of mustard gases funded by to carry Karnofsky’s name was developed in
the US Office of Scientific Research and the late 1940s for a trial of nitrogen mustard
Development. As a first Lieutenant with the in the treatment of lung cancer patients. The
Army Chemical Warfare Service under historian Robert Bud points to another
Cornelius Packard Rhoads, Karnofsky factor that made the Sloan-Kettering
spent months studying what happened to Institute different from earlier institutions:
goats exposed to mustard gas. Rhoads the industrial work models applied to cancer
was an influential figure in the history of research.11 The institute was named after
cancer chemotherapy.9 Having begun his the President of the General Motors
professional career in haematology at the Corporation, Alfred P. Sloan, a trustee of
Rockefeller Institute in the 1920s, in 1940 the hospital who donated four million dol-
Rhoads had been appointed director of the lars, and his Director of Research, Charles
Memorial Hospital for Cancer and Allied F. Kettering. By suggesting that the institute
Diseases in New York, just before he should carry Kettering’s name besides his
became Chief of the Medical Division of own, Sloan signaled that he saw industrial
the Chemical Warfare Service.10 The budget R&D as a model for medical science. A
of the Chemical Warfare Service, one mil- press release on occasion of the opening
lion dollars per annum, was roughly the of the institute in 1945 announced that it
same as the combined budget for cancer was to ‘concentrate on the organization of
research in the US at the time.11 Still in the industrial techniques for cancer research’.11
army, Karnofsky became interested in It is fitting, thus, that Sloan-Kettering
exploring the potential of nitrogen mustard researchers engaged in attempts to stand-
as an anti cancer drug. ardise aspects of clinical medicine that
Karnofsky was assigned to temporary previously had not been subject to much
duty at Memorial Hospital to initiate a standardisation.
clinical study using nitrogen mustard. The use of nitrogen mustard for cancer
Following his discharge from the Army, he chemotherapy is often viewed as the result of
returned to Memorial, like many others who a chance finding following the bombing of
had served under Rhoads in the Chemical an American ship in the Italian port of Bari
Warfare Service. He joined Rhoads as one in 1943, which carried some of the poison-
of the core staff members at the Division ous chemical in its hull. Army doctors, the
of Chemotherapy Research at the new story goes, observed that soldiers and sailors
Sloan-Kettering Institute, an institution pulled out of the waters at Bari after the
whose defining feature was, according to bombing, showed alarmingly low white
Karnofsky’s colleague Joseph Burchenal, blood cell counts, and these doctors then
that ‘biochemical and animal studies could made the connection with the chemicals in
be immediately translated to practical appli- the water.13 In fact, as Ilana Löwy14 points
cation in the patient’.12 Karnofsky spent his out, the capacity of certain chemicals to kill
whole career at Memorial Sloan-Kettering, white blood cells selectively had been
culminating in a position as chief of the observed much earlier. However, this had
chemotherapy division of the institute and not been the object of systematic investiga-
the Medical Oncology Service at Memorial tion until 1941, when a team at Yale
Hospital. Karnofsky – a life-long University led by the pharmacologists
Alfred Gilman and Louis Goodman effect of the nitrogen mustard was that
obtained funding for such a study from patients lost white blood cells. This was and
the Office of Scientific Research and is the central dilemma of cancer chemother-
Development. They found that injections apy: the clinicians had to choose a dose high
with the substance induced remissions in enough to kill the tumour cells and low
laboratory mice with lymphoma and also enough to avoid killing their patients.
managed, in 1942, to induce remission in a Following injections the clinicians measured
human lymphoma patient.15 These results the immediate ‘objective improvement’ (OI),
were not published until 1946, but they were measurements that only the physicians could
discussed at medical meetings and confirmed perform: decrease in the size of the lung
by other groups. Nitrogen mustard was a lesions, reductions in the volume of fluid in
highly toxic substance, and not very specific, the chest, shrinking metastases, etc. They also
but the limited success with this poison – noted if the patients gained weight. Objective
especially in the treatment of lymphoma improvement was classified as 0, 1þ or 2þ.
patients – boosted the search for better The measurement of ‘subjective improve-
chemotherapeutic agents. It also encouraged ment’ (SI), obviously, depended on what
clinical cancer researchers to try nitrogen the patients reported: did they feel better;
mustard as a treatment of last resort for did they regain strength; how was their
other malignancies, including lung cancer. appetite; were symptoms such as pain,
These were the early days of modern cough, shortness of breath or headaches
cancer research, the all-out assault on affected? SI was classified as G (good),
cancer that saw vast amounts of money F (fair), or 0 (none). Finally they noted the
channeled into both clinical and fundamental patients’ performance status: in how far were
science.16,17 In 1948, in the first volume of the patients able to carry on with their daily
new journal Cancer, Karnofsky et al.18 at activities; how dependent were they on help?
Memorial Hospital published first results of a This was what later became known as the
series of attempts to treat lung cancer Karnofsky scale (Table 1).
patients with nitrogen mustard. This highly The results of these nitrogen mustard trials
experimental treatment was a last attempt to were compiled in one big table; but many
intervene rather than let the disease take its individual cases were also discussed in con-
course. The 35 patients – 1 woman and 34 siderable detail in the article. In most cases
men between 42 and 75 years of age – had the response to the injections was clear
been declared inoperable. Surgery by then and initially encouraging: tumours shrank,
had become the default treatment for lung patients felt better and coughed less.
cancer.19 The patients were either considered Unfortunately the responses tended to be
unsuitable for X-ray therapy, had already short-lived. In the longest-surviving patients
received a course of radiation without bene- the tumours did not start growing again for
fit, or had relapsed after a temporary eight months or so after the treatment, but
response. They were given the toxic chemical more common were one or two months.
intravenously, by injecting a solution into the Statistics did not play an important role in
tubing of a running intravenous infusion in the evaluation as there were not many trial
order to avoid damaging the vein or trigger- subjects. The study might be best described as
ing thromboses. Each injection was followed a ‘traditional’ clinical study – there was no
by nausea and vomiting within 1 to 8 h (the formal ‘protocol’. Performance status was
severity of these effects could sometimes be used as one out of a set of measurements that
alleviated by administering sedation with allowed the standardisation of responses to
barbiturates). The most important toxic treatment (Table 2). Karnofsky and his
Table 1. Performance status as defined by Karnofsky, Burchenal and their colleagues in 1948
Performance status
Definition % Criteria
Able to carry on normal activity and to work. 100 Normal; no complaints; no evidence of disease
No special care is needed. 90 Able to carry on normal activity; minor signs or
symptoms of disease.
80 Normal activity with effort; some signs or symp-
toms of disease.
Unable to work. Able to live at home, care for 70 Cares for self. Unable to carry on normal activity
most personal needs. A varying amount of or to do active work.
assistance is needed. 60 Requires occasional assistance, but is able to care
for most of his needs.
50 Requires considerable assistance and frequent
medical care.
Unable to care for self. Requires equivalent of 40 Disabled; requires special care and assistance.
institutional or hospital care. Disease may be 30 Severely disabled; hospitalisation is indicated
progressing rapidly. although death not imminent.
20 Very sick; hospitalisation necessary; active sup-
portive treatment necessary.
10 Moribund; fatal processes progressing rapidly.
0 Dead.
Table 2. Use of performance status along with criteria for objective and subjective improvement by
Karnofsky and colleagues in their trial of nitrogen mustard in patients with carcinoma of the lung
Percent of patients showing objective and subjective improvement and change in performance
status following HN2 therapy
% Improvement
Anaplastic 7 43 28 29 28 29 43 0 29 29 14 28 0 0
Epidermoid 10 30 50 20 30 10 60 20 10 30 30 10 0 0
Total group 21 28 43 29 24 14 62 14 24 29 14 14 5 0
colleagues did not explicitly discuss the qual- A similar scale, but even simpler, was
ity of life of their patients (they did not use devised by Zubrod et al.2 for the Eastern
the term in their publications), but if asked, Cooperative Cancer Chemotherapy Group
they probably would have pointed to the (later Eastern Cooperative Oncology
Subjective Improvement criterion rather than Group, ECOG) about a decade later. The
the Performance Status. Group was formed in 1955 under the
their way into cancer medicine, a domain author of a Lancet editorial in 1991 counted
dominated by an ideology of progress and 207 in Index Medicus with the keyword
intervention at all costs? ‘Assessment of ‘quality of life’ for 1980 alone, and 846
quality of life and, more generally, cost- for 1990.7
benefit evaluations in the health area have Bardelli and Saracci offer no reflections
become a subject of increasing interest in on the accuracy and appropriateness of
recent years’, wrote the epidemiologists performance scale type measurements
Bardelli and Saracci4 in an article for the when it comes to representing quality of
journal of the International Union against life – they appear to be glad to find param-
Cancer, the UICC Technical Reports, pub- eters at all in 1960s trials that they could
lished in the year of the Uppsala sympo- interpret as quality of life measurements.
sium, in 1978. Comparing data from trials This situation changed in the 1980s. The
published in six cancer journals with wide need to measure quality of life during clin-
international circulation in 1965 and 1966, ical trials of cancer therapy, according to the
and in 1975 and 1976, Bardelli and Saracci authors of a paper published in the British
found, not surprisingly, that randomised Journal of Cancer in 1988, was now ‘widely
clinical trials had become more common in recognised’ as treatment was often toxic and
the 1970s, both in relation to non-rando- frequently given with palliative rather than
mised trials and as a portion of all papers curative intentions.22 Hang on, did not
published. More surprising to them was that Karnofsky and Burchenal in the late 1940s
the proportion of trials whose organisers also argue that the effects of their experi-
chose ‘hard’ parameters such as survival ments with nitrogen mustard – a very toxic
time as endpoints had increased from 33% chemical – had had predominantly palliative
to 78%. They had expected a shift towards effects? What had changed?
‘soft’ parameters, in line with the increased There had been major developments both
interest in quality of life. in cancer chemotherapy and palliative care.
The use of what Bardelli and Saracci took Medical oncology, the medical specialism
to be indicators of quality of life – direct or organised around chemotherapy and clinical
indirect – had in fact increased as well, but trials, somewhat marginal in the 1960s, was
far less than that of ‘hard’ parameters. now increasingly firmly established.23 With
Where side effects were recorded, or disease radical courses of chemotherapy, often
free intervals, Bardelli and Saracci inter- involving combinations of drugs, medical
preted this as indirect indicators of quality oncologists had managed to reduce recur-
of life. Direct indicators, to them, were the rence and increase survival rates for leukae-
performance status scales devised by mia, lymphomas and some solid tumours
Karnofsky and Burchenal or ECOG, or (predominantly in children), enough to sup-
some closely derived variations. These were port claims that these diseases had now
used in 6% of the trials examined for 1965– become curable.8 Having established
1966 and in 10% for the years 1975–1976. chemotherapy firmly as a treatment modal-
While this may seem low, the article by ity, medical oncologists in the 1980s and
Bardelli and Saracci in itself is evidence for 1990s increasingly turned to solid cancers in
the new interest in assessing the quality of adults, with some success, but also much
life of patients. Indeed, the authors of a 1983 disappointment. Lung cancers, for example,
review article in Statistics in Medicine, continued to defy them. Nevertheless,
Fayers and Jones6 found more than 200 courses of chemotherapy became part of
papers measuring ‘quality of life’ in cancer many lung cancer patient pathways –
clinical trials between 1978 and 1980; the usually towards the end.19 While benefits
be able to report on their experiences more the late 1960s and 1970s? Not clear.
adequately than their physicians. The word Doctors? Nurses? Or patients themselves,
‘cancer’ did not appear on the cards as, in some of them increasingly vocal and well
the 1980s, still, some patients were unaware informed? In any case, the goal was to find
of what they were suffering from – or an instrument that translated feelings, per-
preferred not to think about it.33 The card ceptions, hope, and in some cases very
addressed ‘obvious and relatively objective significant implications for a person’s iden-
side effects’, such as nausea and vomiting, as tity into numerical values on a scale, fit to be
well as less objective questions regarding the used in statistical calculations. There was no
subjects’ mood and general condition. Its really good quality of life questionnaire;
designers recognised the ambiguity and they all just produced scores that were more
context-dependency of categories such as or less fit for the purpose. The Karnofsky
‘average mood’, suggesting that ‘perhaps scale did the job it was designed for in the
‘normal for me before my present illness’ 1940s, but it was no longer entirely appro-
would be preferable’. However, they found, priate by the 1970s, after the end of what
‘such distinctions [were] difficult to convey historians have described as the ‘Golden
succinctly and clearly on a simple card or Age’ of modern medicine, in a medical world
form’. This statement points to a dilemma that has become much more pluralistic, with
central to all such forms of assessment: if doctors no longer unchallenged with regard
they were to be useful they had to com- to treatment decisions.35
promise between efficiency and adequacy. The proliferation of chemotherapeutic
A long and very detailed form would reflect regimes and their inclusion in more and
the feelings and perceptions of trial subjects more patient pathways may make it neces-
more adequately, but would be impractical. sary to think about quality of life and
In fact, among the most practical forms of performance issues in new ways. Children
assessment, still, were performance indices: treated forleukemia in the 1970s and 1980s,
physicians did not even have to talk to when the disease was declared curable, have
patients to score them. To be sure, most grown into adults and found that they
agreed that performance scales were poor, needed to develop means to manage the
unreliable instruments for assessing quality late effects of their treatment.8 Patients
of life.34 However, quick, easy and intuitive, diagnosed with solid cancers that used to
they continued to be used where trial sub- lead to fairly rapid deaths a few years ago,
jects were treated as inpatients and remained quite frequently survive now for 5 years or
under observation by clinicians. more.36 A recent, high-profile example is the
American feminist writer and academic
Susan Gubar, who was diagnosed with
Discussion ovarian cancer in 2008 and has published a
The changes in the tools and techniques memoir on her experiences.37 While Gubar
viewed as legitimate means for assessing and others in similar situations know that
quality of life reflect broader changes in their cancers are ultimately not curable,
medicine. In the rather patriarchal 1940s, if these cancers become manageable in ways
one wanted to know how a patient felt about that make them comparable to conditions
the effects of an experimental treatment, which were transformed from acute to
who did one ask? The expert, of course: the chronic through the introduction of new
physician administering that treatment. treatment regimes years or even decades
Who would one turn to following the chal- ago, such as diabetes,38,39 kidney failure,40
lenges to established authorities that marked or more recently HIV-AIDS.41 Rather than