Beruflich Dokumente
Kultur Dokumente
RECOGNITION ACCURACY
Tim Barry
North Coast Simulation
Dayton, Ohio
Tom Solz
John Reising
Dave Williamson
Pilot-Vehicle Interface Technology Branch
Wright-Patterson AFB, Ohio
ABSTRACT
Two studies were performed to test the ability of
three different automated speech recognition devices
working in parallel, along with an Enhanced Majority Rules
(EMR) software algorithm, to obtain a combined speech
recognition accuracy better than the accuracies produced by
each of the individual systems done. The first experiment,
using a rather simple, robust vocabulary, compared the
recognition accuracy of the three individual systems with the
accuracy obtained by combining the data from all three
systems using the EMR algorithm. The second experiment
made the same comparison, but a more difficult, non-robust
vocabulary was used during testing. Results from both
experiments revealed a signficant increase in speech
recognition accuracy obtained by combining the recognition
data from all three systems in the EMR algorithm when
compared with two of the three speech recognition systems.
The third system, a newer-generation recognition device,
performed as well as all three used in concert. The
implications of using intelligent software and multiple
speech recognition devices to improve speech recognition
accuracy are discussed.
OaTECTIVES
INTRODUCTION
The objectives of the two experiments were 1) to
gauge the recognition accuracies of the three different
speech recognition systems using both "easy" and "hard"
vocabularies and 2) to determine whether the Enhanced
Majority Rules (EMR) algorithm, combining raw data from
all three systems, could increase recogrution accuracy above
and beyond the accuracy produced by the individual systems
alone.
EXPERIMENT 1
667
-1-
RS-232
Zero
One
Two
Three
Four
Five
Six
Seven
I Eight
I Niner
Point
Clear
Enter
I Hundred
I Thousand
Frequency
Channel
Range
I Mirmative
1 Negative
668
Yes
Two or more
II
Votan
TI
97.0
97.2
rl-r
EMR
Recognition System
669
--
RESULTS
Planned orthogonal comparisons were run using
SPSS to contrast the EMR algorithm with each of the three
individual speech recognition systems. The results indicated
that when using the three speech recognition systems in
combination, AOA was sigruficantly improved over the
recognition accuracy produced from the TI (F (1,ll) =
27.16, p<.OOO) arid Votan (F (1,ll) = 38.61, p<.OOO)
systems (Figure 4). As was the case in Experiment 1, the
EMR algorithm failed to provide better recognition accuracy
than the ITT system. A test for a gender effect again
revealed no performance differences between males and
females.
EXPERIMENT 2
The objectives of Experiment 2 were to validate the
results of Experiment 1 and to test the recognition
performance of the three systems and the EMR algorithm
using a relatively diilicult vocabulary.
Method
Zero
One
Two
Three
Four
Emriment 2 Vocabularv
I Fourteen Nineteen I Seventy
]Fifteen
IThirty
] Eighty
I Ninetv
I Seven I Sixteen I FOIW
I Eight 1 Seventeen I Fifly 1 On
I Nine I Eighteen I Sixty [ Off
I Five
I Six
Votan
TI
EMR
RecognitionSystem
Figure 4. Results of Experiment 2
DISCUSSION
In both experiments, the collective use of raw data
from the three speech recognition systems in the EMR
algorithm resulted in a better mean AOA than the accuracy
obtained by the: two individual older-generation systems. As
was expected, the use of a difficult vocabulary in Experiment
2 resulted in an overall decrease in accuracy of these systems
compared with the results of Experiment 1. The EMR
algorithm, however, showed a slight increase in mean
accuracy with the hard vocabulary.
670
REFERENCES
Bany, T., Liggett, K., Williamson, D. and Reising, J.
(1992). Enhanced Recognition Accuracy with the
Simultaneous Use of Three Automated Speech Recognition
Systems. In Proceedings of the Human Factors Society 36th
Annual Meeting (pp. 288-292). Santa Monica, CA: Human
Factors Society.
671