Beruflich Dokumente
Kultur Dokumente
How to Write a High Quality Multiple Choice Question (MCQ): A Guide for
Clinicians
a,* b
P.A. Coughlin , C.R. Featherstone
a
Department of Vascular Surgery, Addenbrookes Hospital, Cambridge, UK
b
The Royal College of Surgeons of Edinburgh, Edinburgh, UK
Despite the variety of assessment tools available, multiple choice questions (MCQs) still play an integral part in
examinations at both a national and speciality board level. MCQs have a number of methodological advantages
yet their strength is related to the quality of the question posed. Specifically, there has been a move towards the
MCQ testing a taxonomically higher order concept of integration-interpretation and problem solving. This paper
focuses on question development and the potential pitfalls to avoid.
Ó 2017 European Society for Vascular Surgery. Published by Elsevier Ltd. All rights reserved.
Article history: Received 15 May 2017, Accepted 17 July 2017, Available online 1 September 2017
Keywords: Multiple choice question, Question writing
STEM: An 84 year old women presents with a fall, landing on her left arm. The
hand is cool, pale and there is no palpable pulses at the wrist or in the
antecubital fossa. Furthermore, she is unable to extend the hand at the wrist
joint yet has maintained sensation in the palm of the hand and in the ingers.
Imaging shows a displaced mid humeral fracture and a short mid brachial artery
occlusion.
LEAD IN: Which nerve is likely to have been damaged with the associated fall?
The answer is D, which is the single best answer. The rest of the answers act as
distractors.
Options:
A. Deep venous insuficiency
B. Diabetes
C. Lower limb arterial disease
D. Lymphoedema
E. Necrobiosis Lipoidica
F. Neuropathy
G. Non Hodgkins Lymphoma
H. Squamous Cell Carcinoma
I. Vasculitis
Lead In: For each of the clinical scenarios below, select the single most
appropriate diagnosis from the list of options above.
Stems:
1. An 82 year old man with a history of ischaemic heart disease and a
previous carotid endarterectomy had now developed a painful shallow
ulcer on the dorsal aspect of his foot. (Answer C)
2. A 78 year old woman previously underwent a right total hip replacement
10 years ago since which she has had permanent right leg swelling. Over
the last 7 months she has developed an extensive area of super icial
ulceration over the medial aspect of her calf and ankle. (Answer A)
3. A 93 year old lady with a history of right leg DVT 25 years ago has had a
non healing static ulcer around the right medial malleolus for the last ten
years. It has been resistant to treatment and has maintained a size of
between 1 and 2cm in diameter. Over the last three months it has rapidly
increased in size becoming circumferential and is becoming increasingly
painful. (answer H).
Figure 1. (continued).
higher order concept of integration-interpretation and patient has a wrist drop means that the candidate has to
problem solving.3 This approach to question writing can use higher order thinking to work out that the radial artery
turn the cost effective MCQ into a powerful simulation tool, is a risk with this type of injury and that the clinical findings
increasing both the validity and the educational impact of confirm the injury.
the assessments. The anatomy of an EMQ is somewhat different. EMQs
In driving education and assessing candidates at the level tend to focus a number of questions/clinical scenarios
of consultant, the aim of an MCQ should be to assess clinical around a particular theme. The layout of an EMQ starts with
judgement and problem solving abilities in complex situa- the identification of the theme, then gives a number of
tions. We want the candidate to be able to assimilate the potential options or answers. There is then a lead-in ques-
pieces of information given within a question and to use their tion followed by (usually) a number of different stems. For
clinical judgement to determine the most correct answer. The each stem, the candidates choose the most appropriate or
two most common types of multiple choice questions in likely option. In the example in Fig. 1C, the theme is lower
postgraduate medical examinations are single best answer limb ulceration, the options are listed from A to E and the
questions (SBAs) and the extended matching questions lead-in poses a clear question. The stems, in this case a
(EMQs). Both of these question types allow assessment of couple of clinical scenarios, are then given.
the candidates’ ability to select the most appropriate option
available, given a clinical situation they would be likely to THE PITFALLS OF MCQ WRITING
encounter in the workplace. The best starting point for Although perceived to be relatively straightforward, the
writing a good MCQ is therefore a situation recently writing of a high quality MCQ question often requires a
encountered in the workplace in which clinical judgement number of iterations. In addition to the difficulty of
was required in determining the appropriate next step. providing enough plausible distractors, there are a number
of pitfalls that must be avoided, either because they give
FORMAT OF MOST COMMONLY USED MCQS away the answers to “testwise” candidates, or because they
The anatomy of a SBA consists of a stem and a lead-in make the question more complicated than it needs to be
question both of which should pose a clear question that through “irrelevant difficulty.”
can be answered without looking at the options (Fig. 1B). “Testwise” candidates typically have good examination
Then a list of options should be given, which includes dis- technique, and can sometimes identify the correct answer
tractors along with the single best answer. The distractors all to a question purely on test taking skills alone and not
need to be plausible and lie on a continuum of least correct because of underlying knowledge or the ability to apply that
to most correct. In the example in Fig. 1B, the stem is the knowledge to real situations. Question writing flaws that
clinical scenario, the lead-in is the specific question being give away the answers to “testwise” candidates are there-
asked, and there are a series of distractors (all nerves) that fore a threat to the validity of the assessment. Pitfalls which
could theoretically be damaged, yet the fact that the play into the hands of “testwise” candidates include
How to Write a High Quality Multiple Choice Question 657
grammatical cues, logical cues, the use of absolute terms, assessment. It is therefore also a threat to the validity of the
longer correct answers, word repeats, and the concept of assessment. Irrelevant difficulty can arise from negatively
convergence strategy. These pitfalls, which are defined and phrased questions, the use of “all/none of the above,” long
illustrated in Case and Swanson’s Item Writing Manual1 are and overly complicated questions or the use of vague and
summarised below. imprecise terms.
As the question writer naturally pays more attention to Negatively phrased questions should be avoided for two
the correct answer rather than the accompanying dis- reasons. The first reason is the difficulty caused by requiring
tractors, grammatical errors are more likely to occur within candidates to switch from looking for the best option to
the distractors. As such, care needs to be taken that all looking for the worst option. The second and arguably more
options (best answer and distractors alike) follow on important reason is that these questions do not mimic
grammatically and logically from the main stem. On a typical reasoning in the workplace: the normal approach is
similar theme, as the question writer pays more attention to look for the best, not the worst outcome for patients.
to the correct answer, they are more likely to include “All/none of the above” can either be easily discarded or
additional instructional material and caveats. Given this, causes confusion in the candidates, who may wonder
correct answers may be longer, more specific and more whether an option other than the ones provided by the
complete than the other options and therefore easier to question writer should be the correct answer. As candidates
spot as being the right answer. As a basic rule therefore, typically have a minute in which to read and answer each
answers should be of roughly the same length and should question, long and overly complicated options are in danger
be as short as possible with most of the detail placed within of testing reading speed rather than clinical judgement.
the question stem. Finally, vague and imprecise terms mean that the candi-
If a subset of the options are along a similar theme or dates have to guess the question writers’ personal defini-
mention a particular word, this can form a logical cue, tion of such terms. Indeed, a question is enhanced when
highlighting to the candidate that the correct answer is the options include more accurate information including
likely to be within this subset. The use of absolute terms more specific numerical values wherever possible. When
(e.g. the words “always” or “never”) within a subset of using numerical answers, these should be standardised e
options typically allows the testwise candidate to eliminate listed in numeric order and using a similar format wherever
these options as they are less likely to be true than an possible. Overall, the best MCQs tend to have straightfor-
option that has a less absolute statement. Furthermore, ward, short, and easy to read options.
word repeats, namely the use of a word of phrase in both Having explored the theory behind asking “higher order”
the stem and in the correct answer will tend to highlight the MCQs, and the common pitfalls of question writing, what
correct answer and should be avoided. should question writers aim for, practically speaking, in
Convergence strategy can be used when the correct producing a good MCQ?
answer includes the most elements in common with the The validity and educational impact of an assessment will
other options. This concept is more subtle than the other be enhanced if its questions focus on concepts that are
flaws in question writing that have already been discussed important and relevant either because of the frequency
and tends to originate from the question writer starting with which they appear in the workplace or because of the
with the correct answer and then next writing a number of critical need to know the answer. Having said this, question
allied distractors. As such, the correct answer will have writers should bear in mind that not all clinical or scientific
common elements with the distractors. A very simple problems are amenable to testing by MCQ. Other assess-
example of this would be if the available options were (a) ment tools may provide a more valid measurement,
aspirin and clopidogrel, (b) aspirin and dipyridamole, (c) particularly where the “right answer” is one which is under
aspirin and ticagrolor, (d) clopidogrel and statin, (e) clopi- discussion in the field, or where the candidate’s approach,
dogrel and a fibrate. A testwise candidate would look at the reasoning and/or justification is more important than the
answers and by a simple count would see that aspirin and answer they arrive at.
clopidogrel both appeared three times, more than any of Having identified a learning objective that can be
the other options, and as such the answer would be (a). assessed using an MCQ, question writers should start from
While this is an overt simplification it can be seen how such a scenario in which that learning objective would be used.
a situation could arise if the question writer used differing For example, if candidates need to know about how drug A
variations of the correct answer, for a subset of different interacts with drug B, rather than asking “how does drug A
steps in a treatment plan, as distractors. Similar to this is interact with drug B?,” start with a scenario in which
the posing of two questions within a single MCQ, which a patient, whose presentation would normally lead to
often leads to stems including such answers as “both un- prescribing drug B, is already taking drug A. Clinical sce-
true” or “both true.” Again, “testwise” candidates are more narios such as these are often the optimal way of delivering
likely to spot question trends and as such will score highly information within the stem in a way that mimics how in-
on such questions. formation is presented in the workplace. Such information
“Irrelevant difficulty”, as its name suggests, is used to will include the presenting problem followed by (where
describe a question, which is made difficult for reasons that relevant) a focused clinical history, physical findings, and
are unrelated to the trait that is the focus of the diagnostics/imaging. If the question is more related to
658 P.A. Coughlin and C.R. Featherstone
basic science, a more relevant laboratory vignette may be encourage relevant learning, evaluate whether learning
required. goals have been reached, and identify areas for develop-
The stem must be clear and focused, and should capture ment in training programmes and in trainees.
with sufficient authenticity the information that the candi-
date would typically use in the workplace to determine the CONFLICT OF INTEREST
best course of action. The lead-in should allow the candi- None.
dates to answer the question without looking at the op-
tions, to allow them to come up with an answer themselves FUNDING
before checking it against the list of provided options. The
None.
options must be standardised and consistent with all an-
swers coming from a similar category (e.g. all diagnoses or
REFERENCES
all procedures). They also must be plausible and of similar
length and orders in a logical order (often alphabetical). 1 Case S, Swanson D. Item writing manual. 3rd ed. NBME; 2002
[available online].
CONCLUSION 2 Naeem N, van der Vleuten C, Alfaris EA. Faculty development on
item writing substantially improves item quality. Adv Health Sci
A good MCQ is one that mimics the workplace or a situation Educ Theory Pract 2012;17:369e76.
in which the candidate must apply their knowledge, which 3 Maguire T, Skakun EN, Triska OH. Student thought processes
elicits the cognitive processes that we want to measure, evoked by multiple choice and constructed response item. In:
and which rewards candidates for demonstrating relevant The Seventh Ottawa Conference on Medical Education and
skills. It is with these attributes that good MCQs allow us to Assessment. Maastricht; 1996. p. 253.