Beruflich Dokumente
Kultur Dokumente
Objectives: The objective of this study was to evaluate the efficacy of countries, such as the United Kingdom (Smeeth et al. 2002)
a word-based auditory-training procedure for use with older adults and Australia (Ward et al. 1993).
who have impaired hearing. The emphasis during training and The most common communication complaint of older
assessment is placed on words with a high frequency of occurrence adults with impaired hearing (OHI) is that they can hear speech
in American English.
but cannot understand it. This is especially true when there are
Design: In this study, a repeated-measures group design was used competing sounds, typically other speech sounds, in the back-
with each of the two groups of participants to evaluate the effects of ground. Hearing aids, to be effective, must address this
the word-based training regimen. One group comprised 20 young common complaint and improve speech understanding, espe-
adults with normal hearing and the other consisted of 16 older adults cially in backgrounds of competing speech. The primary
with impaired hearing. The group of young adults was not included
for the purpose of between-group comparisons. Rather, it was
function of hearing aids is to provide level-dependent and
included to demonstrate the efficacy of the training regimen, should frequency-dependent gain to the acoustic input arriving at the
efficacy fail to be demonstrated in the group of older adults, and to hearing aid microphone(s). The gain provided has the potential
estimate the magnitude of the benefits that could be achieved in to improve speech communication by restoring the audibility of
younger listeners. speech sounds rendered inaudible by hearing loss (Humes
Results: Significant improvements were observed in the group means
1991; Humes & Dubno Reference Note 1). For older adults, it
for each of five measures of post-training assessment. Pretraining and is mainly higher frequency lower-amplitude consonants (e.g., s,
post-training performance assessments were all based on the open-set f, t, “sh,” and “th”) that are made inaudible by hearing loss
recognition of speech in a fluctuating speech-like background noise. (e.g., Owens et al. 1972).
Assessment measures ranged from recognition of trained words and The hearing loss of older adults is greatest in the frequency
phrases produced by talkers heard during training to the recognition of region for which the amplitude of speech is the lowest;
untrained sentences produced by a talker not encountered during specifically, frequencies ⱖ2000 Hz (ISO 2000). As a result, a
training. In addition to these group data, analysis of individual data via large amount of hearing aid gain is required in the higher
95% critical differences for each assessment measure revealed that 75
frequencies to restore full audibility of the speech signal
to 80% of the older adults demonstrated significant improvements on
most or all of the post-training measures. through at least 6000 Hz. This is challenging for the clinician
to accomplish, even with the use of amplitude compression. In
Conclusions: The word-based auditory-training program examined fact, this difficulty has led to the development of alternatives to
here, one based on words having a high frequency of occurrence in conventional amplification, such as frequency-compression
American English, has been demonstrated to be efficacious in older
hearing aids or short-electrode cochlear implants (Turner &
adults with impaired hearing. Training on frequent words and
frequent phrases generalized to sentences constructed from fre- Hurtig 1999; Simpson et al. 2005; Gantz et al. 2006), for adults
quently occurring words whether spoken by talkers heard during with severe or profound degrees of high-frequency hearing
training or by a novel talker. loss. Even for many older adults with less severe high-
(Ear & Hearing 2009;30;613– 627)
frequency hearing loss, for whom conventional amplification
may be the best alternative, restoration of optimal audibility
will not always be possible in the higher frequencies (ⱖ2000
INTRODUCTION Hz; Humes 2002). Under such circumstances of less than
In the 2000 U.S. Census, 35 million Americans older than optimal audibility, a better than normal speech to noise ratio is
65 years were counted, representing 12.4% of the U.S. popu- required to achieve “normal” or near-normal speech-under-
lation (Hetzel & Smith 2001). Approximately 30% of those standing performance; that is, to achieve performance that is
older than 65 years in the United States, or more than 9 million equivalent to that of young normal-hearing adults (Lee &
older Americans, have a significant hearing loss that is suffi- Humes 1993; Humes 2008; Humes & Dubno Reference
cient to make them hearing aid candidates (Schoenborn & Note 1). Nonetheless, even with the less than optimal
Marano 1988). Yet, only about 20% of those older Americans audibility provided by hearing aids fit in clinical settings
who could benefit from hearing aids actually seek them out, (Humes 2002; Humes et al. 2000, 2004), nonrandomized
and of those only about 40 to 60% are satisfied with them and interventional studies of hearing aid outcomes in older
use their hearing aids regularly (Kochkin 1993a,b,c, 2000, adults have demonstrated that, on average, clinically fit
2005). Moreover, these figures are typical of those in other bilateral hearing aids provide significant and long-term
benefit. This benefit has been demonstrated in quiet and in
Department of Speech and Hearing Sciences, Indiana University, Bloom- backgrounds of steady-state noise or multitalker babble
ington, Indiana. (Humes et al. 1997, 1999, 2001, 2002a,b, 2004, 2009; Larson et
0196/0202/09/3005-0613/0 • Ear & Hearing • Copyright © 2009 by Lippincott Williams & Wilkins • Printed in the U.S.A.
613
614 HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627
al. 2000), although aided speech-recognition performance is protocol was based on closed-set identification of words
seldom restored to normal. Moreover, in laboratory studies of under computer control, which enabled automation of pre-
aided speech understanding with optimal audibility achieved sentation, scoring, and feedback; (3) multiple talkers were
with earphones, it has been observed that older adults fre- included in training, which facilitated generalization to
quently require a better than normal speech to noise ratio to novel talkers; (4) all training was conducted in noise, the
perform the same as young normal-hearing adults (Humes et al. most problematic listening situation for older adults; and (5)
2006, 2007; Jin & Nelson 2006; George et al. 2006, 2007; after correct/incorrect feedback on every trial, both auditory
Humes 2007; Amos & Humes 2007). There are many possible and orthographic feedback associated with the correct and
reasons underlying the need for a better than normal speech to incorrect responses was provided to the listener after incor-
noise ratio by older adults wearing hearing aids, including rect responses.
peripheral, binaural/central auditory, and cognitive factors, but Although the word-based auditory-training method has
there is mounting evidence that the deficits are cognitive in many novel features that make it unique among auditory-
nature, rather than modality-specific deficits in central-auditory training procedures, it is important to note that auditory
function, especially for fluctuating background sounds, includ- training has had a long history in audiology. For the most part,
ing competing speech (Lunner 2003; Humes 2002, 2005, 2007; however, the focus historically has been placed on young
George et al. 2006, 2007; Humes et al. 2006, 2007; Pichora- children with profound or severe to profound hearing loss.
Fuller & Singh 2006; Lunner & Sunderwall-Thoren 2007; Foo Limited efforts have been directed to adults with impaired
et al. 2007; Ronnberg et al. 2008; Humes & Dubno Reference hearing, with still fewer studies directed toward OHI. A recent
Note 1). evidence-based systematic review of the literature on the
As noted previously, much of everyday communication benefits of auditory training to adult hearing aid wearers
takes place in backgrounds of competing speech, and older (Sweetow & Palmer 2005) found only six studies that met
adults typically express frustration with this listening situ- criteria for valid scientific evidence and subsequent analysis,
ation. Further, hearing aids alone do not address this three of which were conducted with older adults. After review
situation adequately. Although contemporary digital hearing of these six studies, Sweetow and Palmer (2005, p. 501)
aids implement various noise-reduction strategies or direc- concluded that “this systematic review provides very little
tional-microphone technologies, for the most part, these evidence for the effectiveness of individual auditory training.
approaches have had only limited success in improving the However, there is some evidence for efficacy.” The distinction
speech to noise ratio and the benefit to the hearing aid made by these authors is that benefits of auditory training had
wearer (Walden et al. 2004; Cord et al. 2004; Bentler 2005; been observed under optimal conditions in a restricted research
Nordrum et al. 2006). This is especially true when the environment (efficacy), but not under routine clinical or field
competing stimulus is comprised of speech produced by one conditions (effectiveness), the latter typically requiring some
or more talkers; particularly, when the target talker at the form of clinical-trial research. Sweetow and Palmer also noted
moment may become the competing talker a moment later. that there was some indication that synthetic approaches, those
Thus, there are real limits to how much the speech to noise involving words, phrases, sentences, or discourse, were more
ratio can be improved acoustically by the hearing aids in likely to be efficacious than analytic approaches focusing on
many everyday circumstances. sublexical features of speech.
To recap, there is considerable evidence that older adults Since the publication of this systematic review in 2005,
listening to amplified speech require a better than normal two developments regarding the efficacy and effectiveness
speech to noise ratio to achieve normal or near-normal of auditory training in older adults with hearing aids are
speech understanding. However, hearing aids alone have noteworthy. First, Sweetow and Sabes (2006) published data
been unable to deliver the required improvements in speech from a multicenter clinical evaluation of a cognitive-based
to noise ratio. Hearing aids do seem to provide significant top-down auditory-training system referred to as Listening
and long-term benefits, but aided performance often remains and Auditory Communication Enhancement (Sweetow &
less than ideal, especially in the presence of competing Henderson-Sabes 2004). The short-term benefits of the
speech. This, in turn, seems to lead to the discouraging top-down approach pursued by Listening and Auditory
statistics cited at the beginning of this article about market Communication Enhancement in older adults were encour-
penetration, hearing aid usage, and hearing aid satisfaction aging. Second, a series of articles were published describing
among the elderly. the development, evaluation, and efficacy of our automated
Given the foregoing, we decided to explore ways to word-based approach to auditory training (Burk et al. 2006;
improve the ability of older adults to understand amplified Burk & Humes 2007, 2008).
speech for a given speech to noise ratio (SNR), rather than Through a series of laboratory studies evaluating this
attempt to improve the acoustical SNR. (Of course, there is word-based auditory training protocol (Burk et al. 2006; Burk
no reason that these two strategies, one focused on the & Humes 2007, 2008), we have learned the following: (1) OHI
listener and the other on the hearing aid, cannot work in could improve their open-set recognition of words in noise
concert.) Our focus was placed on a novel approach to from about 30 to 40% correct before training to 80 to 85%
intervention; specifically, a word-based auditory training correct after training; (2) training generalized to other talkers
method with all training conducted in noise backgrounds. saying the same trained words, but only slight improvements (7
Some of the unique features of this novel approach included to 10%) occurred for new words, whether spoken by the talkers
that (1) it was word based, which placed the focus on heard during training or other talkers; (3) improvements from
meaningful speech rather than on sublexical sounds of training, although diminished somewhat by time, were retained
speech, such as individual phonemes; (2) the training over periods as long as 6 months (maximum retention interval
HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627 615
TABLE 1. Sample size, gender, test ear and hearing loss for the participants in all subgroups and training protocols in this study.
Age, yr HFPTA, dB HL
Group Protocol N Gender M SD Test ear M SD
YNH 1 9 7F, 2M 20.3 3.1 9R, 0L 6.3 3.8
YNH 2 11 7F, 4M 20.8 3.1 11R, 0L 4.5 3.7
OHI 1 10 3F, 7M 70.2 6.8 8R, 2L 31.0 9.6
OHI 2 6 1F, 5M 72.8 7.6 6R, 0L 26.4 14.1
N, sample size; gender: M, male; F, female; age: M, mean; SD, standard deviation; R, right; L, left (ear used in training/testing); HFPTA, high-frequency pure-tone average (1000, 2000, and
4000 Hz) for the participants in each group; YNH, young adults with normal hearing; OHI, older adults with impaired hearing.
examined to date); (4) similar gains were observed when the as 150, we explored the expansion of set size by a factor of 4
feedback was either entirely orthographic (displaying correct to 600 words. If older adults could improve with this large set,
and incorrect responses on the computer screen) or a mix of then the use of top-down contextual processing, an ability that
orthographic and auditory (rehearing the correct and incorrect does not seem to diminish with age (Humes et al. 2007), should
words after incorrect responses), but not when the feedback help fill in the gaps for the untrained words encountered in
was eliminated entirely (Burk et al. 2006); and (5) there was everyday conversation. We also incorporated some limited
little transfer of the word-based training to the open-set training with frequently occurring phrases as a syntactic bridge
recognition of sentences in noise. The methodological details between isolated words and sentences. In addition, given
varied from study to study, but some common features in- emerging evidence regarding the special difficulties of older
cluded the use of closed-set identification of vocabularies of 50 adults with fluctuating, speech-like competition, the two-talker
to 150 words spoken by multiple talkers in steady-state noise version of International Collegium for Rehabilitative Audiol-
during training and the assessment of training effectiveness ogy (ICRA) noise (Dreschler et al. 2001) was used as the
with these same words, as well as additional words and competition in this experiment. Finally, the words comprising
sentences spoken by the same talkers used in training and by the sentences used to assess generalization of training this time
novel talkers. In addition, for all experiments with OHI, the had much more overlap (50 to 80%) with the training vocab-
spectra of the speech and noise were shaped and the stimuli ulary.
delivered via earphones to ensure sufficient audibility of the
long-term spectrum of the speech stimulus through at least PARTICIPANTS AND METHODS
4000 Hz.
We have interpreted this general pattern of findings, con- Participants
firmed several times now, to suggest that the training process is There were two primary groups of participants in this study:
primarily lexical in nature and serves to reinforce the link young adults with normal hearing (YNH) and OHI. Each of
between the degraded encoding of the acoustic input and the these two groups was further divided into two subgroups based
intact phonological representation of that input in the listener’s on the particular version of the training protocol each received.
lexicon. This link may have been weakened through years of The differences in the two training protocols are described
gradual hearing loss or through some other aspect of aging. below and are referred to here simply as protocols 1 and 2.
Although training with multiple talkers enhances generaliza- Table 1 summarizes the ages, gender, and test ear for the
tion of the training to other talkers, the lexical nature of the participants in all subgroups and also provides pure-tone
training limits the generalization of trained words to other averages for the OHI subgroups. Independent-sample t tests
novel words. This seemed to be the primary reason for the showed that there were no significant differences (p ⬎ 0.1) in
failure of the word-based training to generalize to sentences. In age between the two YNH or between the two OHI subgroups.
retrospect, we realized that there was very little overlap (about The mean pure-tone averages for the two OHI subgroups did
7 to 9%) between the words used in training and the words not differ significantly (p ⬎ 0.1). Figure 1 provides the mean
comprising the sentences used to assess generalization after air conduction pure-tone thresholds for the test ear of the two
training. When only those words in the sentences that had been OHI subgroups. In total, there were 20 YNH and 16 OHI
used as training words were examined, much larger improve- participants in this study.
ments in performance were observed as a result of training
(Burk & Humes 2008). Stimuli and Materials
Given the foregoing, our most recent work in the area of There were a total of four types of speech materials used in the
word-based auditory training is described in this report and has evaluation and training portions of this study: (1) frequent words;
focused on the most frequently occurring words in spoken (2) frequent phrases; (3) re-recordings of selected sentences from
(American) English. The rationale for this emphasis is that, if the Veterans Administration Sentence Test (VAST; Bell & Wil-
the training is word specific, then the focus should be placed on son 2001); and (4) the Auditec recording of Central Institute for
words used most frequently in everyday conversation. A the Deaf (CID) Everyday Sentences (Davis & Silverman 1978).
relatively small number of words, 400 to 600, represent 80 to Because only the last of these sets of materials were previously
90% of the words used most frequently in spoken conversation developed and commercially available, the other three sets of
(French et al. 1931; Godfrey et al. 1992). Having demonstrated materials are described in detail here.
that older adults showed significant improvements in the The frequent words, frequent phrases, and modified VAST
recognition of spoken words in noise for sets of words as large sentences were recorded by the same four talkers (A, B, C, and
616 HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627
the talkers. Otherwise, the word was tabulated as incorrect remainder for a total of 102 unique words from which the
for that listener. Words were then partitioned into six phrases were derived, with all 102 words being very frequent in
different groups to reflect their general difficulty for the occurrence in isolation. Pilot testing of the phrases with five
listeners: (1) words tabulated as incorrect for all five YNH listeners showed mean open-set recognition of the words
listeners (most difficult); (2) words tabulated as incorrect for comprising the phrases to be 95.4% averaged across all four
four of the five listeners; (3) words tabulated as incorrect for talkers at an SNR of ⫺3 dB.
three of the five listeners; (4) words tabulated as incorrect The modified VAST materials represented a subset of the
for two of the five listeners; (5) words tabulated as incorrect sentences developed with keywords having high frequencies of
for one of the five listeners; and (6) words tabulated as occurrence. The VAST sentences were developed with three
incorrect for zero of the five listeners (easiest). Of course, keywords per sentence, and the keywords were distributed such
the actual number of words falling into each of these that one occurred near the beginning of the sentence, one in the
categories will vary with the SNR. The 0-dB SNR, however, middle of the sentence, and one near the end of the sentence
yielded a reasonable distribution of words across these six (Bell & Wilson 2001). A total of four sets of 120 sentences
categories of difficulty. The percentages of words falling were developed, with word frequency (high or low) and
into each of these six categories were 5, 10, 18, 17, 25, and neighborhood density (sparse or dense) of the keywords
25% progressing from the most difficult to the easiest varying across subsets. From each of the two subsets compris-
categories. A total of 147 of the 600 words fell into the ing keywords with high frequency of occurrence, the sentences
easiest category (i.e., all five listeners correctly recognized were screened by the first author for semantic redundancy. This
the word when spoken by at least three of the four talkers). was necessary because, to meet the constraints of word
These 147 words were combined with 53 arbitrarily selected frequency, neighborhood density, and keyword sentence posi-
words of the 149 from the next easiest category to form a tion, some of the sentences in the VAST corpus do not have the
subset of 200 relatively easy to recognize words. These semantic redundancy common to many other sentence materi-
words subsequently received less training during the training als. In the end, 25 sentences (having a total of 75 keywords)
protocol than the remaining 400, generally more difficult words. It were selected from each of the two high-frequency subsets
should be noted that the 200 easiest words, as defined here for the based on the semantic redundancy of the sentence. Sentences
analysis of these pilot data, were not simply the 200 most frequent with high-semantic redundancy, such as “The point of the knife
words among this set of 600 most frequent words. When all of the is too sharp” or “The model wore a plain black dress”
words are high in word frequency, word frequency alone is not (keywords in italics), were retained, whereas others with less
sufficient to predict performance. redundancy or “semantic connectedness” among words (e.g.,
The frequent phrases were obtained from the compilation of “The gang had to live in a small flat”) were not recorded. This
contemporary (post 1980) written and oral British and Amer- judgment regarding the redundancy among the words compris-
ican English published by Biber et al. (1999). For purposes of ing each VAST sentence was made by the first author and was
this project, the focus was placed on the analysis of conversa- entirely subjective in nature. Pilot open-set recognition testing
tional sources by these authors. These conversational samples with five YNH listeners for these sentence materials showed a
were obtained from nearly 4 million words produced by 495 mean percent-correct score of 86.6% averaged across all four
speakers of British English and almost 2.5 million words talkers at an SNR of ⫺5 dB.
produced by 491 speakers of American English, with roughly Lists A, B, C, and D of the Auditec recording of the CID
equal distributions of the British and American talkers regard- Everyday Sentences by a male talker were also used in this
ing talker gender and age decades up to 60 years old. A unique study. These sentence materials were chosen because of several
feature of the Biber et al. analysis is the tabulation of the constraints imposed during their development (Davis & Silver-
frequency of occurrence of “lexical bundles”; recurrent se- man 1978), including the use of words with a high frequency of
quences of words, such as “I don’t” or “do you want.” The occurrence in English. Each list comprised 10 sentences with a
authors note that three-word and four-word bundles are ex- total of 50 keywords that are scored. There are approximately
tremely common in conversational English. For example, two short (two to four words), two long (10 to 12 words), and
three-word bundles occur more than 80,000 times per million six medium-length (five to nine words) sentences in each list.
words in conversation and four-word bundles more than 8500 Further, the sentence form varies in a list such that six are
times. This represents nearly 30% of the words in conversation declarative, two are imperative, and two are interrogative (one
that occur in recurrent lexical bundles or phrases. In developing with falling intonation and one with rising intonation).
these materials for this project, we selected the most frequently ICRA noise (Dreschler et al. 2001) was used as the
occurring four-word and five-word lexical bundles from Biber background noise for all testing and training sessions. In
et al. and, in some cases, modified these to form a more particular, this study made use of the two-talker noise-vocoded
complete phrase by adding frequently occurring words to the competition (Track 6 on the ICRA noise CD) in which one
beginning or end of the bundle. For example, “going to have a” talker is a man and the other a woman. From the 10-min
is among the most frequently occurring four-word lexical recording of this noise on the CD, various noise segments were
bundles in Biber et al. To this bundle, the word “family,” itself randomly selected (at zero crossings) for presentation during
a frequently occurring word, was appended to the end to form this study with the length of the segments adjusted based on the
a five-word phrase: “going to have a family.” In the end, a total particular speech materials used (further details below). This
of 94 frequently occurring phrases, 36 four-word and 58 particular fluctuating speech-like, but unintelligible, noise was
five-word phrases, were selected for use in this study. Eighteen selected for use in this study because this is exactly the type of
frequently occurring words comprised roughly 70% of the competition in which OHI listeners have considerable diffi-
words in the phrases with 84 additional words comprising the culty in understanding speech.
618 HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627
Fig. 3. The 1⁄3-octave band levels measured in the 2-cm3 coupler for the
steady-state calibration noise shaped to match the spectrum of the two-talker
International Collegium for Rehabilitative Audiology noise.
For the YNH, the headphone buffer was increased to the the phrase-based training, closed sets of 11 or 12 phrases were
maximum setting of 27-dB attenuation, which resulted in a displayed on the computer screen at a time and the listener’s
presentation level for all speech materials and talkers of 87.7 task was to select the phrase heard. Otherwise, the training for
dB SPL. For the OHI, calculations were made based on the phrases was identical to that for words.
1⁄3-octave band levels in Figure 3, an assumed overall SPL for
talker for those materials, and do better in post-training RESULTS AND DISCUSSION
assessment because of this, rather than because of the Of the 20 YNH, nine completed protocol 1, and 11
intervening training. completed protocol 2. Of the 16 OHI, 10 completed protocol 1
As noted, two different protocols were followed for the (participants 3 to 9, 11, 12, and 14), and six completed protocol
training sessions. For protocol 1, an eight-session training 2 (participants 1, 2, 10, 13, 15, and 16). Two separate
cycle was devised. In each of these eight training sessions, between-subject General Linear Model analyses were per-
each about 75 to 90 min duration and administered on formed, one for each group of participants, to examine the
separate days, eight blocks of 50 lexical items were pre- effects of training protocol on the training outcome. No
sented, for a total of 400 lexical items, with 100 presented significant effects (F[1,12] ⫽ 0.5 for young adults and F[1,14] ⫽
by each of the four talkers. These are the 400 items that were 0.1 for older adults; p ⬎ 0.1) of training protocol were observed
found in pilot testing to be the 400 more difficult words, as in either group for any of the post-training performance
noted earlier. This was followed by two blocks of 50 words measures. As a result, data have been combined for both
from the set of 200 easiest items identified in pilot testing protocols in all subsequent analyses.
with 25% of these 100 stimulus items spoken by each of the In this study, of the 16 older adults, one in protocol 1
four talkers. Finally, the training session for each day (participant 8) and two in protocol 2 (participants 13 and 15)
concluded with the presentation of four sets of 11 to 12 were unable to complete the entire protocol because of sched-
phrases with approximately 25% of each set produced by uling conflicts or health issues that arose. For one of these
each of the four talkers. Because the training cycle was individuals (participant 15), only the last two training sessions
composed of eight daily sessions, at the end of a training were missed, but for the other two, one completed 75% of the
cycle there had been two repetitions of the 1600-word protocol (participant 13), and the other only 50% of the
stimuli (400 words ⫻ four talkers) that were found to be protocol (participant 8). To decide whether the post-training
more difficult in pilot testing, one repetition of the 800-word data from these three participants should be included in the
stimuli (200 words ⫻ four talkers) that were found to be group analyses of the post-training performance, three other
easiest in pilot testing, and one repetition of the 376 phrase older adults from the same protocols who had the closest
stimuli. Protocol 1 was designed so that each participant pretraining baseline performance to the three older adults with
would complete three eight-session training cycles between partial training were selected for comparison purposes. There
pretraining baseline and post-training evaluation. This total are a total of five post-training performance measures: CID
of 24 training sessions was typically completed at a rate of Everyday Sentences 1 (Lists A and B, same as baseline), CID
three sessions per week for a total of 8 weeks of training, but Everyday Sentences 2 (Lists C and D), 200 randomly selected
some participants preferred a slower rate and required about frequent-word stimuli (the same randomly selected stimulus set
12 weeks for training. for all listeners with 50 unique words spoken by each of the
For protocol 2, six-session training cycles were imple- four talkers), 200 modified VAST sentences (same 50 sen-
mented. In a given 6-day training cycle, the first four training tences spoken by each of the four talkers), and 376 frequent
sessions were devoted to 400 of the 1600 more difficult word phrases (the same 94 phrases spoken by each of the four
stimuli (100 of the 400 corresponding lexical items) with the talkers). When comparing pretraining baseline measures with
same 400 stimuli (100 lexical items) each of the 4 days. This post-training performance measures, there is a direct compar-
ison available for all measures except CID Everyday Sentences
4-day block was followed by a day devoted to training on 400
2. For the analysis of post-training benefits, we have assumed
of the 800 easier word stimuli (or 100 of the 200 easier lexical
that the baseline score for CID Everyday Sentences 1 (Lists A
items) with a subsequent day devoted to training with half of
and B) is representative of the baseline for CID Everyday
the phrases (188 of the 376 phrase stimuli, 25% by each talker,
Sentences 2 (Lists C and D), although this has not been
or 47 of the 94 lexical representations of the phrases). This
confirmed directly by us or, to the best of our knowledge, by
six-session training cycle was repeated four times with each
others. When comparisons were made between the mean
cycle covering a new 25% of the stimulus corpus. During the training benefit received for each of these five performance
training sessions involving phrases at the end of the second and measures for the subgroup of older adults with abbreviated
third training cycles, brief refresher sessions (2 to 6 blocks of training protocols and a baseline-matched subgroup of older
50 stimuli) were added for the word stimuli from earlier cycles. adults with complete training protocols, only one of the
This again resulted in a total of 24 training sessions and, based differences in training benefit exceeded 2% between these
on the participant’s preferences, typically required a total of 8 two subgroups. This was for the sample of 200 frequent
to 12 weeks for completion. words, with the complete training group experiencing a
In the end, the amount of stimulus repetition during 21.7% improvement (46.3% baseline to 68% post-training)
training, including the refresher blocks for the smaller- in the open-set recognition of these items after training
subset protocol, was roughly equivalent for both protocols. whereas the abbreviated training group experienced only a
For those completing either protocol, there were six repeti- 13.3% improvement (47.0% baseline to 60.3% post-train-
tions of the 1600 more difficult word stimuli (or 24 ing). Again, other than this difference, the post-training
repetitions of the 400 corresponding lexical items), three improvements were within 2% for these two subgroups.
repetitions of the 800 easier word stimuli (or 12 repetitions Given that the training sessions primarily involve word-
of the 200 corresponding lexical items), and three repeti- based training, it is perhaps not too surprising that the
tions of the 376 stimulus phrases (or 12 repetitions of the 94 word-based post-training assessment was the one that re-
corresponding lexical phrases). vealed the largest differences between the two subgroups of
HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627 621
Fig. 6. Individual improvements from pretraining baseline to post-training evaluation, training benefit, on for sets of speech materials: (1) CID 1 Everyday
Sentences (top left); (2) CID 2 Everyday Sentences (top right); (3) modified VAST sentences (bottom left); and (4) frequent phrases (bottom right). The solid
horizontal line in each panel again illustrates the 95% critical difference in RAU for each measure. RAU, rationalized arcsine units; OHI, older adults with
impaired hearing; VAST, Veterans Administration Sentence Test; CID, Central Institute for the Deaf.
encountered during training, this demonstrates that about 75% two versions of the CID Everyday Sentences were strongly
of the older adults were able to generalize word-based training correlated (r ⫽ 0.87, p ⬍ 0.001), the regression analysis for
with a set of four talkers to novel sentences produced by a these materials was conducted only for the CID 1 set of
novel talker. materials for which both pre- and post-training scores were
For the modified VAST sentences and the frequent phrases, available. The predictor variables examined included the high-
shown in the bottom panel of Figure 6, 12 of 16 older adults frequency average hearing loss (mean hearing loss at 1000,
exhibited significant improvements in the performance on each 2000, and 4000 Hz in the test ear), the age, and the pretraining
measure. Both of these sets of materials were spoken by the score for the same materials. The last measure was included
same four talkers that the participants heard in the training, but because we have observed frequently in previous training
only the frequent phrases were actually used in the training. studies, as have others, that the largest gains are often observed
Thus, these results provide an evidence of training benefits for in those with the most to gain (Burk et al. 2006; Burk & Humes
trained materials (frequent phrases) and a generalization to new 2007, 2008).
materials (modified VAST sentences). The only significant regression solution that emerged from
Closer inspection of the individual data in Figure 6 reveals these four regression analyses was the one with training benefit
that four older adults (participants 6, 8, 9, and 10) failed to for the CID 1 materials as the dependent variable. Two
show significant improvements on three of the four perfor- predictor variables accounted for 61.6% of the variance in CID
mance measures included in this figure. We wondered whether 1 improvement: (1) pretraining baseline score on the CID 1
these four participants shared some common factors that might materials (43.5% of the variance) and (2) age (18.1% of the
be underlying the limited benefits experienced, and to explore variance). The standardized regression weights for these two
this further, we conducted a series of multiple-regression variables were both negative, indicating that the higher the
analyses for various measures of post-training benefit (Figures pretraining baseline score and the older the participant, the
5 and 6). Because the two measures of training benefit for the smaller the post-training benefit for the CID Everyday Sen-
HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627 623
CONCLUSIONS This work was supported, in part, by a research grant from the National
Institute on Aging (R01 AG008293) (to L.E.H.).
In general, this study, like our earlier studies in this same
area, supports the efficacy of a word-based approach to Address for correspondence: Larry E. Humes, Department of Speech and
auditory training in OHI. Overall, however, when the results of Hearing Sciences; Indiana University, Bloomington, IN 47405-7002.
E-mail: humes@indiana.edu.
the present study are compared with our earlier studies using
similar word-based training approaches (Burk et al. 2006; Burk Received October 27, 2008; accepted May 20, 2009.
624 HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627
APPENDIX
Bell, T.S. & Wilson, R.W. (2001). Sentence materials controlling word with automatic reductions of low-frequency gain. J Speech Hear Res,
usage and confusability. J Am Acad Audiol, 12, 514 –522. 40, 666 – 685.
Bentler, R. A. (2005). Effectiveness of directional microphones and noise Humes, L. E., Christensen, L. A., Bess, F. H., et al. (1999). A comparison
reduction schemes in hearing aids: A systematic review of evidence. of the aided performance and benefit provided by a linear and a
J Am Acad Audiol, 16, 477– 488. two-channel wide-dynamic-range-compression hearing aid. J Speech
Biber, D., Johansson, S., Leech, G., et al. (1999). Longman Grammar of Lang Hear Res, 42, 65–79.
Spoken and Written English. Harlow, Essex: Pearson Education Ltd. Humes, L. E., Garner, C. B., Wilson, D. L., et al. (2001). Hearing-aid
Burk, M. H., & Humes, L. E. (2007). Effects of training on speech- outcome measures following one month of hearing aid use by the
recognition performance in noise using lexically hard words. J Speech elderly. J Speech Lang Hear Res, 44, 469 – 486.
Lang Hear Res, 50, 25– 40. Humes, L. E., Humes, L. E., & Wilson, D. L. (2004). A comparison of
Burk, M. H., & Humes, L. E. (2008). Effects of long-term training on aided single-channel linear amplification and two-channel wide-dynamic-
speech-recognition performance in noise in older adults. J Speech Lang range-compression amplification by means of an independent-group
Hear Res, 51, 759 –771. design. Am J Audiol, 13, 39 –53.
Burk, M. H., Humes, L. E., Amos, N. E., et al. (2006). Effect of training Humes, L. E., Lee, J. H., & Coughlin, M. P. (2006). Auditory measures of
on word-recognition performance in noise for young normal-hearing selective and divided attention in young and older adults using single-
and older hearing-impaired listeners. Ear Hear, 27, 263–278. talker competition. J Acoust Soc Am, 120, 2926 –2937.
Cord, M. T., Surr, R. K., Walden, B. E., et al. (2004). Relationship between Humes, L. E., Wilson, D. L., Barlow, N. N., et al. (2002a). Measures of
laboratory measures of directional advantage and everyday success with hearing-aid benefit following 1 or 2 years of hearing-aid use by older
directional microphone hearing aids. J Am Acad Audiol, 15, 353–364. adults. J Speech Lang Hear Res, 45, 772–782.
Davis, H., & Silverman, S. R. (1978). Hearing and Deafness (4th ed.). Humes, L. E., Wilson, D. L., Barlow, N. N., et al. (2002b). Longitudinal
London: Holt, Rinehart and Winston. changes in hearing-aid satisfaction and usage in the elderly over a period
Dreschler, W. A., Verschuure, H., Ludvigsen, C., et al. (2001). Interna- of one or two years after hearing-aid delivery. Ear Hear, 23, 428 – 437.
tional Collegium for Rehabilitative Audiology (ICRA) noises: Artificial ISO (2000). ISO-7029, Acoustics-Statistical Distribution of Hearing
noise signals with speech-like spectral and temporal properties for Thresholds as a Function of Age. Basel, Switzerland: International
hearing instrument assessment. Audiology, 40, 148 –157. Standards Organization.
Foo, C., Rudner, M., Ronnberg, J., et al. (2007). Recognition of speech in Jin, S.-H., & Nelson, P. B. (2006). Speech perception in gated noise: The
noise with new hearing instrument compression release settings requires effects of temporal resolution. J Acoust Soc Am, 119, 3097–3108.
explicit cognitive storage and processing capacity. J Am Acad Audiol, Kochkin, S. (1993a). MarkeTrak III: Why 20 million in U.S. don’t use
18, 553–566. hearing aids for their hearing loss. Part I. Hear J, 46, 20 –27.
French, N. R., Carter, C. W., Jr., & Koenig, W., Jr. (1931). The words and Kochkin, S. (1993b). MarkeTrak III: Why 20 million in U.S. don’t use
sounds of telephone conversations. Bell Syst Tech J, 9, 290 –324. hearing aids for their hearing loss. Part II. Hear J, 46, 26 –31.
Gantz, B. J., Turner, C., & Gfeller, K. E. (2006). Acoustic plus electric Kochkin, S. (1993c). MarkeTrak III: Why 20 million in U.S. don’t use
speech processing: Preliminary results of a multicenter clinical trial of
hearing aids for their hearing loss. Part III. Hear J, 46, 36 –37.
the Iowa/Nucleus Hybrid implant. Audiol Neurootol, 11 (Suppl 1),
Kochkin, S. (2000). MarkeTrak V: Consumer satisfaction revisited. Hear J,
63– 68.
53, 38 –55.
George, E. L. J., Festen, J. M., & Houtgast, T. (2006). Factors affecting
Kochkin, S. (2005). MarkeTrak VII: Hearing loss population tops 31
masking release for speech in modulated noise for normal-hearing and
million. Hear Rev, 12, 16 –29.
hearing-impaired listeners. J Acoust Soc Am, 120, 2295–2311.
Larson, V. D., Williams, D. W., Henderson, W. G., et al. (2000). Efficacy
George, E. L. J., Zekveld, A. A., Kramer, S. E., et al. (2007). Auditory and
of 3 commonly used hearing aid circuits: A crossover trial. NIDCD/VA
nonauditory factors affecting speech reception in noise by older listen-
Hearing Aid Clinical Trial Group. J Am Med Assoc, 284, 1806 –1813.
ers. J Acoust Soc Am, 121, 2362–2375.
Lee, C.J. (2003). Evidence-based selection of word frequency lists. Journal
Godfrey, J. J., Holliman, E. C., & McDaniel, J. (1992). SWITCHBOARD:
Telephone speech corpus for research and development. Proc IEEE Int of Speech-Language Pathology and Audiology, 27, 170 –173.
Conf Acoust Speech Sig Proc, 517–520. Lee, L. W., & Humes, L. E. (1993). Evaluating a speech-reception
Hetzel, L., & Smith, A. (2001). The 65 Years and Over Population: 2000 threshold model for hearing-impaired listeners. J Acoust Soc Am, 93,
(pp, 1– 8). Washington, DC: U.S. Department of Commerce, Economics 2879 –2885.
and Statistics Administration. Lindfield, K. C., Wingfield, A., & Goodglass, H. (1999). The role of
Humes, L. E. (1991). Understanding the speech-understanding problems of prosody in the mental lexicon. Brain Lang, 68, 312–317.
the hearing impaired. J Am Acad Audiol, 2, 59 –70. Lunner, T. (2003). Cognitive function in relation to hearing aid use. Int J
Humes, L. E. (2002). Factors underlying the speech-recognition perfor- Audiol, 42 (Suppl 1), S49 –S58.
mance of elderly hearing-aid wearers. J Acoust Soc Am, 112, 1112– Lunner T & Sundewall-Thorén E. (2007). Interactions between cognition,
1132. compression, and listening conditions: effects on speech-in-noise per-
Humes, L. E. (2005). Do “auditory processing” tests measure auditory formance in a two-channel hearing aid. J Am Acad Audiol, 18, 539 –552.
processing in the elderly? Ear Hear, 26, 109 –119. Nordrum, S., Erler, S., Garstecki, D., et al. (2006). Comparison of
Humes, L. E. (2007). The contributions of audibility and cognitive factors performance on the hearing in noise test using directional microphones
to the benefit provided by amplified speech to older adults. J Am Acad and digital noise reduction algorithms. Am J Audiol, 15, 81–91.
Audiol, 18, 590 – 603. Owens, E., Benedict, M., & Schubert, E. D. (1972). Consonant phonemic
Humes, L. E. (2008). Issues in the assessment of auditory processing in errors associated with pure-tone configurations and certain kinds of
older adults. In A. T. Cacace & D. J. McFarland (Eds). Current hearing impairment. J Speech Hear Res, 15, 308 –322.
Controversies in Central Auditory Processing Disorder (CAPD) (pp, Pichora-Fuller, K. M., & Singh, G. (2006). Effects of age on auditory and
121–150). San Diego: Plural Publishing. cognitive processing: Implications for hearing aid fitting and audiologic
Humes, L. E., Ahlstrom, J. B., Bratt, G. W., et al. (2009). Studies of rehabilitation. Trends Amplif, 10, 29 –59.
hearing aid outcome measures in older adults: A comparison of Ronnberg, J., Rudner, M., Foo, C., et al. (2008). Cognition counts: A
technologies and an examination of individual differences. Sem Hear- working memory system for ease of language understanding (ELU). Int
ing, 30, 112–128. J Audiol, 47 (Suppl 2), S171–S177.
Humes, L. E., Barlow, N. N., Garner, C. B., et al. (2000). Prescribed Schoenborn, C. A., & Marano, M. (1988). Current estimates from the
clinician-fit versus as-worn coupler gain in a group of elderly hearing National Health Interview Survey. Vital Health Stat 10, 1–233.
aid wearers. J Speech Lang Hear Res, 43, 879 – 892. Simpson, A., Hersbach, A. A., & McDermott, H. J. (2005). Improvements
Humes, L. E., Burk, M. H., Coughlin, M. P., et al. (2007). Auditory speech in speech perception with an experimental nonlinear frequency com-
recognition and visual text recognition in younger and older adults: pression hearing device. Int J Audiol, 44, 281–292.
Similarities and differences between modalities and the effects of Smeeth, L., Fletcher, A. E., Ng, E. S., et al. (2002). Reduced hearing
presentation rate. J Speech Lang Hear Res, 50, 283–303. ownership, and use of hearing aids in elderly people in the UK—The
Humes, L. E., Christensen, L. A., Bess, F. H., et al. (1997). A comparison MRC trial of the assessment and management of older people in the
of the benefit provided by a well-fit linear hearing aid and instruments community: A cross-sectional survey. Lancet, 359, 1466 –1470.
HUMES ET AL. / EAR & HEARING, VOL. 30, NO. 5, 613–627 627
Studebaker, G. A. (1985). A “rationalized” arcsine transform. J Speech Ward, J. A., Lord, S. R., Williams, P., et al. (1993). Hearing impairment
Hear Res, 28, 455– 462. and hearing aid use in women over 65 years of age: Cross-sectional
Sweetow, R., & Henderson-Sabes, J. (2004). The case for LACE: study of women in a large urban community. Med J Aust, 159, 382–384.
Listening and auditory communication enhancement training. Hear Zeno, S. M., Ivens, S. H., Millard, R. T., et al. (1995). The Educator’s
J, 57, 32– 40. Word Frequency Guide. New York, NY: Touchstone Applied Science
Sweetow, R., & Palmer, C. (2005). Efficacy of individual auditory training Associates.
in adults: A systematic review of the evidence. J Am Acad Audiol, 16,
498 –508.
Sweetow, R. W., & Sabes, J. (2006). The need for and development of an
adaptive listening and communication enhancement (LACE) program.
J Am Acad Audiol, 17, 538 –558. REFERENCE NOTE
Turner, C. W., & Hurtig, R. R. (1999). Proportional frequency compression for 1. Humes, L. E., & Dubno, J. R. (in press). Factors affecting speech
listeners with sensorineural hearing loss. J Acoust Soc Am, 106, 877– 886. understanding in older adults. In S. Gordon-Salant, R. D. Frisina, A. N.
Walden, B., Surr, R., Cord, M., et al. (2004). Predicting hearing aid micro- Popper, & R. R. Fay (Eds). The Aging Auditory System: Perceptual
phone preference in everyday listening. J Am Acad Audiol, 15, 365–396. Characterization and Neural Bases of Presbycusis. New York: Springer.