Beruflich Dokumente
Kultur Dokumente
which emotions were typically ex- play in the production of emotion- fects also suggest that characteristic
perienced (Bachorowski & Braaten, al speech. acoustic differences between voices
1994). play a role in perceptual evalua-
We observed similar outcomes tions of emotion from speech.
in an unpublished study that used PERCEPTION OF
a more standard emotion-induc- VOCAL EMOTION
tion paradigm in which naive par-
TOWARD A BROADER
ticipants described the thoughts Tests of listeners’ abilities to
FRAMEWORK
and feelings evoked by emotion- infer emotion from speech are crit-
eliciting slides. Notably, efforts to ical for evaluating the perceptual
link both source- and filter-related importance of acoustic cues shown A number of constraints have
acoustic cues with discrete emo- to be important from a production impeded the development of a de-
tions were largely unsuccessful. perspective, and help to inform re- tailed account of vocal-emotion-
Instead, the overall pattern of re- search aimed at developing an related phenomena. For instance,
sults indicated that values of acoustic typology of emotional speech is complex, both in the
acoustic parameters were associ- speech. The standard perception number of potentially relevant
ated with nonspecific arousal and, paradigm is to have listeners acoustic cues related to emotional
to a lesser extent, emotional va- choose which one of several emo- expression and in the multiplicity
lence. Again, differences in emo- tion words best characterizes lin- of other factors that influence the
tional intensity mediated the rela- guistically neutral utterances speech signal at any moment in
tionships between acoustic made by actors attempting to por- time. More pragmatically, accurate
measures and emotional states. tray various emotions (e.g., and detailed acoustic analysis is
Although the expression and Leinonen et al., 1997; Scherer, time-consuming. From a method-
perception of emotion are salient Banse, & Wallbott, 1998). Listeners ological standpoint, the small num-
aspects of human vocal communi- are usually able to perceive the in- ber of participants typically stud-
cation, researchers have yet to fully tended emotions at rates signifi- ied and the reliance on acted
characterize the ways in which cantly better than those expected portrayals have limited the gener-
speech acoustics provide cues to by chance. This general success in ality of findings. Paradigms that in-
emotional states. The most parsi- identifying emotions is typically volve collecting speech samples
monious interpretation of produc- interpreted to indicate that listen- during the controlled induction of
tion-related data is that speech ers associate particular patterns of emotional states best balance the
acoustics provide an external cue acoustic cues with various discrete need for methodological rigor and
to the level of nonspecific arousal emotional states. Evidence for real-world validity.
associated with emotional process- cross-cultural similarities in both Although investigators have
es. Less reliable differentiations are perceptual accuracy and error pat- typically sought to identify invari-
found when researchers look for terns (Scherer et al., 1998) further ant patterns of acoustic cues for
associations between acoustic mea- suggests that the ability to infer various discrete emotional experi-
sures and either emotional valence emotion from speech is a funda- ences, this strategy may be prob-
or discrete emotion categories. mental component of human vocal lematic for a number of reasons.
Moreover, potentially important communication. For instance, this tactic generally
individual differences, including In light of these findings, it is also fails to consider the talker–listener
the identity of the talker and emo- important to note that error rates relationship and the “intended”
tional intensity, are routinely found are also often quite high. A hint impact of vocal signals on the lis-
to mediate vocal expression of about the basis of detection failures tener’s affective states. Some cues,
emotion. As Scherer (1986) has comes from the fact that listeners especially those associated with the
pointed out, there is an apparent are more accurate in inferring emo- rate of vocal-fold vibration, are
contradiction between the difficul- tion from particular voices. readily modifiable. They can be
ty in finding acoustic differentia- Furthermore, for any given actor, used, for example, to signal com-
tion of emotional states and the listeners typically perceive some municative intent or be recruited
comparative ease with which lis- emotions more accurately than oth- for the purposes of affective per-
teners are able to judge emotions ers. Although it is likely that some suasion. Thus, treating these cues
from speech. Resolving this contra- emotions may simply be more diffi- as honest readouts of emotional
diction will likely involve an ex- cult to infer from voice than others, states ignores their other potential
plicit understanding of the role that and that actors vary in the quality of functions in emotion-related com-
individual difference variables their emotion portrayals, these ef- munication.
Incorporating a more talker- na. In that most studies have ar- Acknowledgments—Work on this arti-
centered (i.e., idiographic) per- guably examined affect rather than cle was completed while the author was
spective may also help advance emotion, it may have been unrea- generously hosted as a Visiting Scholar
by the Department of Psychology at
our understanding of emotional sonable to expect that distinct Cornell University. Funds in support of
speech. Evaluations of emotional acoustic patterns could be identi- this work came from National Science
state are necessarily made against fied. Instead, there is remarkable Foundation (POWRE) and National
Institute of Mental Health (B/START)
an acoustic backdrop of individu- consistency in support of the no- awards, and from Vanderbilt University.
ally distinctive voice characteris- tion that the acoustic features of Michael J. Owren provided valuable
tics, and yet differences among “emotional” speech are best de- comments on an earlier version of this
talkers are usually treated as unin- manuscript, and our collaborative work
scribed using dimensions of non- led to some of the ideas presented here.
teresting variability in vocal-emo- specific arousal and affective
tion research. However, everyday valence, and that most vocal pro-
experience suggests that more ac- ductions index affective rather Notes
curate and detailed perceptual than emotional experience.
judgments of emotional state can The expression and perception 1. Address correspondence to Jo-
be made for familiar than for unfa- of emotional states in speech Anne Bachorowski, Department of
miliar talkers. For example, dis- acoustics are fundamental aspects Psychology, Wilson Hall, Vanderbilt
criminations between related emo- of human communication. In fact, University, Nashville, TN 37240; e-
tions, such as amusement and joy, mail: j.a.bachorowski@vanderbilt.edu.
disturbances in either of these com- 2. Preliminary results from work
are probably more accurate for munication components can con- being conducted in other laboratories
speech samples from a close friend tribute to profound deficits in so- demonstrate that both standard emo-
than those from a more casual ac- cial relationships. By its very tion-induction paradigms and playful,
quaintance. Suggestive empirical nature, research in vocal expression gamelike paradigms are successful for
support for the importance of talk- eliciting speech samples that can be
and perception of emotion is richly used to study vocal expression of emo-
er characteristics comes from stud- interdisciplinary—a circumstance tion. Some investigators using these
ies indicating that acoustic differ- that gives rise to both its inherent kinds of strategies include Arvid
ences among talkers exert a complexities and its considerable Kappas (arvid@psy.ulaval.ca), Gary
powerful influence on cognitive intellectual appeal. As a result of Katz (gary.katz@csun.edu), and Tom
operations such as linguistic pro- Johnstone in Klaus Scherer ’s lab
improved digital processing tech- (johnstone@fapse.unige.ch).
cessing and memory (e.g., Palmeri, niques as well as advances in the
Goldinger, & Pisoni, 1993). Thus, related disciplines of speech sci-
more detailed characterizations of ence, cognitive science, and References
the acoustic features of emotional acoustic primatology, findings ob-
Bachorowski, J.-A., & Braaten, E.B. (1994).
speech might be found by examin- tained in the coming years should Emotional intensity: Measurement and theo-
ing fluctuations in acoustic cues prove especially informative for retical implications. Personality and Individual
against comparatively more stable Differences, 17, 191–199.
our understanding of emotional Bachorowski, J.-A., & Owren, M.J. (1995). Vocal
but individually distinctive talker expression through the vocal expression of emotion: Acoustic properties of
characteristics (see Bachorowski & speech are associated with emotional intensity
channel. and context. Psychological Science, 6, 219–224.
Owren, 1998). Bachorowski, J.-A., & Owren, M.J. (1998). Acoustic
cues to gender and talker identity are present in a
Research in vocal-emotion phe- short vowel segment produced in running speech.
nomena might also benefit from a Manuscript submitted for publication.
Banse, R., & Scherer, K.R. (1996). Acoustic profiles
reinterpretation of findings based in vocal emotion expression. Journal of
on Russell and Feldman Barrett’s Recommended Reading
Personality and Social Psychology, 70, 614–636.
Kent, R.D. (1997). The speech sciences. San Diego:
(1998) distinction between affect Singular Publishing.
and emotion. In their account, af- Bachorowski, J.-A., & Owren, M.J. Leinonen, L., Hiltunen, T., Linnankoski, I., &
Laakso, M.-L. (1997). Expression of emotional-
fect is always present and is best (1995). (See References) motivational connotations with a one-word
described by bipolar dimensions of Kent, R.D. (1997). (See References) utterance. Journal of the Acoustical Society of
Murray, I.R., & Arnott, J.L. (1993). America, 102, 1853–1863.
arousal and valence. In contrast, Toward the simulation of emo- Palmeri, T.J., Goldinger, S.D., & Pisoni, D.B. (1993).
prototypical emotion episodes tion in synthetic speech: A Episodic encoding of voice attributes and
recognition memory for spoken words. Journal
happen more rarely and are associ- review of the literature on of Experimental Psychology, 19, 309–328.
ated with identifiable neurophysio- human vocal emotion. Journal of Pittam, J., & Scherer, K.R. (1993). Vocal expression
the Acoustical Society of America, and communication of emotion. In M. Lewis &
logical, behavioral, and cognitive J.M. Haviland (Eds.), Handbook of emotions (pp.
93, 1097–1108.
processes. This distinction certain- Pittam, J., & Scherer, K.R. (1993). (See
185–197). New York: Guilford Press.
Russell, J.A., & Feldman Barrett, L. (1998). Affect
ly sheds new light on vocal pro- References) and prototypical emotional episodes. Manuscript
duction and perception phenome- submitted for publication.
Scherer, K.R. (1986). Vocal affect expression: A Emotion: Theory, research, and experience: Vol. 4. across languages and cultures. Manuscript sub-
review and model for future research. The measurement of emotions (pp. 233–259). New mitted for publication.
Psychological Bulletin, 99, 143–165. York: Academic Press. Scherer, K.R., Banse, R., Wallbott, H.G., & Goldbeck,
Scherer, K.R. (1989). Vocal measurement of emo- Scherer, K.R., Banse, R., & Wallbott, H.G. (1998). T. (1991). Vocal cues in emotion encoding and
tion. In R. Plutchik & H. Kellerman (Eds.), Emotion inferences from vocal expression correlate decoding. Motivation and Emotion, 15, 123–148.