Beruflich Dokumente
Kultur Dokumente
Course schedule
Lecture 1: voice production
Lecture 2: voice structure
Workshop 1: voice analysis
Workshop 2: speech analysis
Lecture 3: Voice & Psychology
Workshop 3: voice synthesis
Course Assessment
Exercise (100%, 1000 words)
produce a spectrogram of your voice, analyse/
report values and discuss their relevance.
See instructions for more details.
Language:
Open communication system that uses a set of written, gestural, or spoken
symbols that refer to people, objects or ideas.
Open-ended system of communication in which the grammatical structure allows
information of great cognitive complexity to be passed from one individual (the
speaker) to another (the listener).
Natural Language:
Spoken or signed language (as opposed to written languages, computer
programming languages). The ASL, Spanish, English & French are natural
languages.
Linguistics:
The study of human language.
Speech:
Human spoken language (as opposed to sign language).
Voice
the sounds made by a person using the vocal folds for talking,
singing, screaming or crying
The voice results from the act of phonation = the use of the
laryngeal system to generate an audible source of acoustic
energy
The term voice refers to the form and to the quality of the vocal
signal rather than to its content.
Bioacoustics
Psychoacoustics
Psychoacoustics:
How animals produce sounds, the physical structure of these sounds, how
animals perceive them, what their function is and how they evolved.
Textbooks:
The Evolution of Communication, Hauser 1996
The Principles of Animal Communication, Bradbury & Vehrencamp, 1998
Animal Signals, Maynard Smith & Harper, 2003.
Study of the relations between the sound stimuli and their auditory
perception in terms of hearing sensations.
These relationships are not simple and linear.
Different people will hear the different things when they listen to the
same sound.
Textbooks:
Speech Physiology, Speech perception, and Acoustic Phonetics,
Lieberman & Blumstein, 1988
Introduction to Acoustics:
What is sound?"
Vibration as perceived by the sense of hearing (Wikipedia Psychoacoustics definition)"
A disturbance of the equilibrium of density (or pressure of a gas,
liquid or solid) (Titze - Physics definition)"
A local pressure disturbance in a continuous medium that contains
frequencies in the range of 20 to 20,000Hz (the audible
range) (Titze, a compromise between physics and psychophysics)"
Small variations in air pressure that occur very rapidly one after
another (Ladefoged)"
Vibrating
source
Pressure
space
The crests correspond to the high pressure points and the troughs
correspond to the low pressure points.
Waveform / Oscillogram
Sound waves can be represented as the temporal variation of sound
pressure at a fixed point in space - for example the membrane of
a microphone.
When we record a sound - we record (analogically or digitally) this
temporal variation.
Periodic Sounds
Most sounds are generated by oscillators (strings, vocal folds,
resonators, etc)
Why oscillators?
Period
The period of a sound wave is the the duration of
an oscillation cycle
Can be measured as the time between two peaks.
Frequency
The frequency of a sound is the number of air
pressure oscillation cycles per second. It is
the multiplicative inverse (or reciprocal) of the
period: F = 1/T
T = 0.74 ms,
F = 1/0.0074 = 133Hz
0.74 ms
7.5 ms
Wavelength
One single oscillatory cycle per second corresponds to 1 Hz. This is not audible.
125 oscillations (the fundamental frequency in male voice), is 125HZ.
200 oscillations (the fundamental frequency in female voice), is 200Hz
2000 oscillations (some bird calls), is 2000Hz,
15000 oscillations (some bats calls), is 15000Hz etc
Wavelength
dB(SPL)
Spectrogram
0
0
Time (s)
Amplitude (dB)
0.5
Time (s)
Spectrum
0.5
Waveform
Complex sounds
Pure tones are single frequency tones with no harmonic content (no
overtones). This corresponds to a sine wave.
Frequency (kHz)
1.5 kHz
Frequency (kHz)
0
dB
Time (s)
0.5
Time (s)
0.5
Examples of pure tones: whistles, scops owl hoots, most electronic beeps.
Harmonics
F0
Time (s)
The pitch
What is the Pitch of a voice?
The pitch is the perceived height of a voice (Titze)
It is mainly determined by the fundamental frequency of the sound.
60Hz
140Hz
very low
male
(early morning)
female
White noise:
- spectral envelope
formants ?
Formants
Time (s)
0.5
- intensity contour
- periodicity
pure tone?
F0+harmonics ?
F0 contour ?
noise ?
0.5
Summary
A vocal sound can be
described in terms of
its:
child
Time (s)
500Hz
0.5
Noise
200Hz
Time (s)
0.5
the pharynx
The mouth
The nasal cavity
Speaker with an
anesthetized vocal
tract
Speaker with a
normal vocal tract
Glottal wave
filtered by a
uniform tube
Glottal wave
filtered by a non
uniform, changing
vocal tract