Beruflich Dokumente
Kultur Dokumente
Surroundings
andrew j. horsburgh, BSc Hons
robert davis, BSc Hons
A Feature Presentation!
This presentation will cover the basic principles
of spatial audio, surround sound and production
techniques.
Focusing on the spatial audio format
Ambisonics, specifically the Production and
reproduction of Ambisonic soundfields.
Buzz-word Bingo
Many artists, producers and engineers is
having the ability to make things an exact
acoustical replica of a space.
Buzz-words like 'immersion', 'depth', 'realism'
often used to describe a situation.
Widely adopted current technologies are not
able to reproduce the audible characteristics
that ensure 'immersion' every time.
Localisation of Sounds
Lord Rayleigh's 'Duplex theory'
defined the localisation method
that the human auditory system
uses to identify sound locations.
These are Interaural Time
Differences (ITD) and Interaural
Level Differences (ILD). Minute
changes in time of arrival and
pressure allow for precise
localisation of sound between the
two ears. Placement of speakers
is related to these principles.
Stereophony
Current standards use two discrete
audio files with a numerically applied
position, implying stereo-field width - but
not featuring the essential delay time or
amplitude differences.
Extending beyond stereo to surround
formats such as Dolby 5.1 and THX This method of encoding audio does not
give true impressions of space or
contain information that can reproduce a
realistic environment that can fool the
listeners.
Why Ambisonics?
Advantages:
Isotropic - Speakers are all treated 'equally'
Mono, Stereo, Quad, 5.1 etc mix compatible
Stable phantom images between each pair
of speakers
Improving resolution does not mean
increasing speaker count by x^2!
Recording is independent to reproduction
system
Ambisonic Birth
Development of the format began during the
1970s, led by mathematician M. Gerzon. The
creation of a omni-directional, isotropic
wavefront recording and reproduction format
was acknowledged by many to be ahead of its
time - until now, 40 years later.
With viable increase of technologies labelled
'3D' and 'immersive' - adoption of Ambisonics
can deliver an audio experience like no other.
That's no Moon
Soundfield Microphones
Reproduction and recording of three
dimensional soundfields is easy, provided you
have chosen the correct set of microphone
patterns.
Omni
F/B Fig 8
L/R Fig 8
CoreSound Microphones
Several companies have created a single-unit
microphone that captures the information
similar to the Omni + Fig8 + Fig8 + Fig8
configuration. One of these is CoreSound's
TetraMic.
Overview
Encoding (Panning)
Decoding (Playback/Monitoring)
Effects
2D (Horizontal) systems
3D (With height) systems
2D Definitions
Polar/Cartesian
co-ordinates
Radial distance (r)
Azimuth ()
Measured anti-clockwise
from due front
Unit circle (r = 1)
Encoding
Directional encoding of virtual sound sources
(Panning)
Mono source input -> Ambisonic sound field
Applies to multi-track mixing techniques, live
performance or sound design
Simulates microphone pick-up patterns
W
W
X
X
Y
Y
X
X
W
W
Y
Y
+
0.471
+
0.667
0.000
Basic Decode
+
0.586
+
0.586
=
0.000
Max-rE Decode
+
0.707
+
0.500
=
0.000
In-Phase Decode
+
0.283
0.400
+
0.000
0.400
0.000
Basic
Max-rE
In-Phase
Basic
Max-rE
In-Phase
Basic
|rV| = 1.000
|rE| = 0.667
Max-rE
|rV| = 0.707
|rE| = 0.707
In-Phase
|rV| = 0.500
|rE| = 0.667
Basic
|rV| = 1.000
|rE| = 0.800
Max-rE
|rV| = 0.866
|rE| = 0.866
In-Phase
|rV| = 0.667
|rE| = 0.800
Basic
|rV| = 1.000
|rE| = 0.857
Max-rE
|rV| = 0.924
|rE| = 0.924
In-Phase
|rV| = 0.750
|rE| = 0.857
3D Definitions
Spherical/Cartesian
co-ordinates
Radial distance (r)
Azimuth ()
Measured anti-clockwise
from due front
Elevation ()
Measured up or down
from horizontal
Unit sphere (r = 1)
Spherical Harmonics
m = degree/mode , n = order
An Array Standard
Now we have plug-ins that work with easily
accessible software.
What now?
Create an array, calibrate the array and then
produce in Ambisonics!
Second Order
Loudness
The Loudness war needs
no introduction.
Hyper-compression is a
problem that has resulted in
several industry wide
recommendations from
professional bodies.
European Broadcast
Union R-128
ITU-R BS 2054.2
LUFS!
R-128 specifies the use of Loudness Unit Full
Scale in broadcasting.
Material is normalised to -23dBFS +/- 1dB,
conforming with calibrated metering of ITU BS
1770.
Not only is the recommendation in the box, but
calibrated at the speakers.
Research Project
A main research project is under way to
compare the qualitative aspect of Ambisonics
against other 'surround' formats.
Technical accuracy has been shown a benefit
of Ambisonics but data relating to listening tests
are small and controlled.
Production Tools
Using the DAW 'Reaper', guides are available
using existing plug-in suites, one of the more
popular is Dr Wiggins's 'WigWare'.
PC exclusive suites are available (Dave
Malhams B-Dec), with Linux exclusive
AmbDec.
Delay Plug-In
Allows delayed
sounds to be
scattered
throughout the
sound field
To create abstract
spatial effects
Or studio delay
Audio Demo
In the demo today we are using a custom Third
Order Ambisonic Decoder attached to 8
speakers.
Using Reaper to host the plug-ins, there is a
mixture of up-mixed monophonic to Ambisonic
material and First Order CoreSound recordings.
Conclusions
In this talk we have covered;
Elementary description of auditory perception
Ambisonic functions (FOA, HOA)
Encoding and Decoding Process
Issues associated with Ambisonics
Production Standard Guidelines