Beruflich Dokumente
Kultur Dokumente
V. CONCLUSIONS
In this paper, analysis and synthesis methods of
emotional voice for man-machine natural interface was
developed. First, the emotional voice was analyzed using
time-frequency representation of speech and similarity
matrix. Then, based on the result of emotional analysis, a
voice with neutral emotion was transformed to synthesize
the particular emotional voice using time-frequency
modifications.
In the simulations, five types of emotion were analyzed
using 50 samples of speech signals and 81.02(%) of
average discrimination rate was achieved. Further, the
synthesized emotional voice was subjectively evaluated. It
is confirmed that the emotional voice was naturally
generated by the proposed time-frequency based approach.
REFERENCES
[1] Y. Kitahara and Y. Tohkura, "Prosodic Control to Express Emotion for
Man-Machine Speech Interaction", IEICE Trans., Vol.E75-A, No.2,
pp.155-163, 1992.
[2] K. Hirose, N. Takahashi, H. Fujisaki and 0. Sumio, "Representation
of Intention and Emotion of Speakers with Fundamental Frequency
Contours of Speech", Technical Report ofIEICE, HC94-41, pp.33-40,
1994-09.
[3] H. Kawanami and K. Hirose, "Considerations on the Prosodic
Features of Utterances with Attitudes and Emotions", Technical
Report of IEICE, SP97-67, pp.73-80, 1997- 11.
[4] Y. Tone, A. Ogihara and H. Shibata, "HMM Based Emotion
Discrimination for Speech Dialogue System", Technical Report of
IEICE, HC2000-22, pp.47-53, 2000-06.
[5] T. Moriyama, H. Saito and S. Ozawa, "Evaluation of the Relationship
between Emotional Concepts and Emotional Parameters on Speech",
(C) [6]
IEICE Trans., Vol. J82-D2, No.4, pp.703-711, 1999.
M. Sigenaga, "Features of Emotionally Uttered Speech Revealed by
Fig. 4. Example of time-frequency representation of Discriminant Analysis", IEICE Trans., Vol. J83-A, No.6, pp.726-735,
synthesized emotional voice (anger). (a) Neutral voice, (b) 2000.
Emotional voice, (c) Synthesized emotional voice. [7] S. Wada, H. Yagi and H. Inaba: "Effective Calculation of Finite Frame
Operator for The Multiple Short-Time Fourier Transform", Proc.
IEEE-SP Int. Symp. on Time-Frequency and Time-Scale Analysis,
pp.205-208, 1998.
[8] R.E. Crochiere and L.R. Rabiner: Multirate Digital Signal Processing,
Prentice-Hall, 1983.