Beruflich Dokumente
Kultur Dokumente
3.4
Audio Recording
MICHAEL STARLING
JOHN EMMETT
INTRODUCTION
Anyone who has been away from the field of audio
recording for a number of years may be surprised by
the range of technologies that are discussed in this
chapter. That is not to say that the process of sound
recording has changed fundamentally, but the specific
recording equipment has changed and, in many cases,
even the physical media might appear to have vanished. In large part, audio has followed text into
becoming a generic Information Technology (IT) data
application.
For much of the 20th century, analog magnetic tape
was the broadcasters principal medium for audio
recording. In the 1980s and 1990s, the industry
focused on the standardization and implementation of
digital recording techniques using magnetic and optical media. Recent years have seen the near-completion
of the migration to digital systems, including solidstate storage with no moving parts and the continuing
refinement of hard-disk-based storage technologies.
This chapter covers a wide range of recording technologies and devices both past and present. It includes
material originally published in previous editions of
the NAB Engineering Handbook1 on both analog and
digital recording, followed by new sections that discuss the migration to IT-based recording.2 In-depth
coverage of computer-based audio recording systems
1
Mike Starling, Audio Recording Systems, NAB Engineering
Handbook, 9th edition, National Association of Broadcasters, 1999,
pp. 321340.
2
John Emmett, Sound Recording, Broadcast Engineers Reference
Book, Focal Press, 2004, pp. 599607. Used with permission.
4-
may be found in Chapter 3.6, Radio Station Automation, Networks, and Storage.
HISTORY
Until about 130 years ago, all sound was live sound.
Despite the developments of mechanical recording, it
was only with the harnessing and commercial use of
electricity that amplification and broadcasting could
be developed, and that subsequently led to a wider
need to store and reproduce sound waves. It also
offered the tools that were needed to develop ever
more sophisticated recording systems.
Mechanical Recording
Audio recording preceded and helped fuel the introduction of broadcasting. The earliest recorded audio
was Thomas Edisons 1877 mechanical cylindrical phonograph employing a constant velocity vertical recording groove. The phonographs cylindrical media
mandated that each recording be a master and stymied mass production.
In 1887, Emile Berliner patented a successful system of sound recording on flat discs. The first discs
were made of glass, later zinc, and eventually plastic.
An acoustic horn was used to collect the sound,
which was converted into vibrations recorded in a
spiral groove that was etched into the disc. For replay,
the disc rotated on a turntable while a pickup arm
held a needle that read the grooves in the record. The
vibrations in the needle produced the sound that was
455
Magnetic Recording
Danish telephone engineer Valdemar Poulsen demonstrated a magnetic wire recorder as early as 1898, as
shown in Figure 3.4-1. It was 30 years later that German
456
TABLE 3.4-1
The Most Common International Broadcast
Tape Number (IBTN) Scheme Sound Recording
Format Codes
Code
Material
16T
17T
33L
35T
45D
78D
A01
A02
A04
A08
A16
A32
AI1
AI2
AI3
AIX
AS2
AT2
CCA
CDA
CDD
CDR
D24
D32
D48
DA2
DAT
DCC
DD2
DP2
457
Material
H8A
LAQ
MDA
NAB
SVA
WAX
open-reel recorders, or duplicators, and then physically transferred to the cartridge for distribution. For
archival or restoration purposes, the tape can be
extracted for replay on an open reel recorder, and this
is to be preferred for transfer, as the tape path control
in cart players is necessarily poor. The format is still in
use by some broadcasters, but for quality and reliability reasons, there has been a steady migration to digital storage such as the MiniDisc and, later, hard disk
and solid-state storage.
In the background is a compact cassette, which typically recorded 45 minutes per side stereo at 1-7/8 ps.
These cassettes were initially developed by Sony and
Philips and introduced for consumer recording in
1962, but from the late 1960s, and with the use of
noise-reduction techniques, they became increasingly
popular as rugged interview recorders for professional users. More recently, they were superseded by
MiniDisc and solid-state recorders.
For many years, the workhorse of broadcast sound
recording was the 1/4-inch open-reel recorder, of
which an example is shown in Figure 3.4-4.
In dealing with legacy analog tape program material that has to be played back, one first needs to establish the tape speed and track layout of the original
recording so a suitable playback machine can be
found. The record characteristics for the tape should
also be determined if possible, although this may be
more difficult to establish. High-frequency and (sometimes) low-frequency equalization is applied during
analog tape recording to compensate for nonuniform
response in the head-tape system. Equalization has to
be applied during playback to achieve a flat overall
frequency response. The characteristic was standardized in the United States according to NAB-published
criteria and by CCIR and IEC in Europe.
Analog 1/4-inch tape recorder. Mounting a recorder that would take 10.5-inch spools into a
19-inch rack was always a challenge, and the Revox
PR99 was one machine with this capability.
FIGURE 3.4-4
FIGURE 3.4-3 The main analog tape media.
458
4-
NAB
Time Constants
Bass
Boost
High
Roll-Off
Bass
Boost
High
Roll-Off
15 ips
Studio
none
35 s
3180 s
50 s
7-1/2 ips
Studio
none
70 s
3180 s
50 s
7-1/2 ips
Studio carts
none
50 s
none
50 s
3-3/4 ips
Reporter & home
3180 s
90 s
3180 s
90 s
1-7/8 ips
Logging & home
3180 s
120 s
3180 s
90 s
1-7/8 ips
Fe-cassette
3180 s
120 s
1-7/8 ips
Cr-cassette
3180 s
70 s
Tape Speed
Use
Table 3.4-2 lists the most common recording equalization characteristics for 1/4-inch tapes. The break
frequencies of the filters are described by the time constants of simple RC low-pass networks. If a replay
machine with playback settings matching the tape
cannot be found, the correction equalization can be
quite easily applied, post replay, inside a digital audio
workstation.
The 1/4-inch analog tapes recorded with center
time code or sync tracks for video synchronization
require a special replay machine that may be increasingly difficult to find.
459
Maximum Relative
Humidity
50C
39%
40C
47%
30C
60%
20C
79%
10C
100%
amounts of debris to interfere with transport functioning. Recommended humidity and temperature conditions for storage are shown in Table 3.4.3.
460
FIGURE 3.4-5
signal with amplitude on the order of one quantization step. The signal value crosses back and forth
across the threshold, resulting in a square wave signal
from the quantizer. Dither suppresses such quantization error. Dither is a low-amplitude analog noise
added to the input analog signal (similarly, digital
dither must be employed in the context of digital computation when rounding occurs).
When dither is added to a signal with amplitude on
the order of a quantization step, the result is dutycycle modulation that preserves the information of the
original signal. The average value of the quantized
signal can move continuously between two steps;
4-
461
FIGURE 3.4-6 Block diagram of the recording (a) and reproduction (b) sections of a linear PCM system. (From Pohlmann, Principles of Digital Audio.)
462
4-
Compact Disc
The compact disc was developed to store up to 74
minutes of stereo digital audio program material of
16-bit PCM data sampled at 44.1 kHz. The total user
capacity is over 650 MB. In addition, for successful
storage on a nonperfect medium, error correction, synchronization, modulation, and subcoding were
required. The CD was originally conceived as a distribution medium to replace vinyl records, but its high
quality of uncompressed digital audio firmly established it as the medium of choice for playback of prerecorded music in the professional domain. Later
variants were introduced for recording and playing
back both data and audio.
Although the audio CD has now to some extent
been superseded by more recent optical developments
such as the DVD family which have come out of it
and, for professional purposes, the streaming format
used for recording has been largely replaced by filebased audio, a study of the design process is common
to all optical storage systems and provides a good
insight into data storage in general.
Compact Disc Physical Design
The diameter of a compact disc is 120 mm, its center
hole diameter is 15 mm, and its thickness is 1.2 mm.
Data are recorded in an area 35.5 mm wide. It is
bounded by a lead-in area and a lead-out area, which
contain nonaudio subcode data used to control the
players operation. The disc is constructed with a
transparent polycarbonate substrate. Data are represented by pits that are impressed on the top of the substrate. The pit surface is covered with a thin metal
(typically aluminum) layer 50100 nm thick, and a
plastic layer 1030 m thick. A label 5 m thick is
printed on top. Disc physical characteristics are shown
in Figure 3.4-8.
Pits are configured in a continuous spiral from the
inner circumference to the outer. The pit construction
4-
463
FIGURE 3.4-9 Typical subcode contents of the P and Q channels. (From Pohlmann, The
Compact Disc.)
464
4-
FIGURE 3.4-10 Channel bits as represented by the pit structure. (From Pohlmann, Prin-
length from 3.0543.560 m, depending on pit track linear velocity. Each pit edge, whether leading or trailing,
is a 1 and all increments in between, whether inside
or outside a pit, are 0s, as shown in Figure 3.4-10.
The start of a frame is marked with a 24-bit synchronization pattern, plus three merging bits. The total
number of channel bits per frame after encoding is
588, composed of 24 synchronization bits, 336 (12 2
14) data bits, 112 (4 2 14) error correction bits, 14
subcode bits, and 102 (34 3) merging bits.
Data Readout
CD pickups use an aluminum gallium arsenide
(AlGaAs) semiconductor laser generating laser light
with a 780 nm wavelength, which was the most economical type to be developed during the late 1970s.
Developments of the CD use shorter wavelength
lasers in order to record smaller pits and hence denser
data. The beam passes though the substrate, is focused
on the metalized pit surface, and is reflected back.
Because the disc data surface is physically separated
from the reading side of the substrate, dust and surface damage on the substrate do not lie in the focal
plane of the reading laser beam; hence, their effect is
minimized. The polycarbonate substrate has a refractive index of 1.55; because of the bending of the beam
from the change in refractive index, thickness of the
substrate, and the numerical aperture (0.45) of the
laser pickups lens, the size of the laser spot is reduced
from approximately 0.8 mm on the disc surface to
approximately 1.7 m at the pit surface. The laser spot
on the data surface is an Airy function with a bright
central spot and successively darker rings, and spot
dimensions are quoted as half-power levels.
When viewed from the lasers perspective, the pits
appear as bumps with height between 0.110.13 mm.
This dimension is slightly less than the laser beams
wavelength in polycarbonate of 500 nm. The height of
4-
465
FIGURE 3.4-12
466
Block diagram of a CD player with digital filtering. (From Pohlmann, The Compact Disc.)
4-
Recordable CD-R
With a CD-R (or CD-WO) write-once optical disc
recorder, the user may record data until the disc capacity is filled. Recorded CD-R discs are playable on conventional CD players. A block diagram of a CD-R
recorder is shown in Figure 3.4-13. An encoder circuit
accepts an input PCM signal and performs CIRC error
correction encoding, EFM modulating, and other coding and directs the data stream to the recorder. The
recorder accepts audio data and records up to 74 minutes in real time. In addition to audio data, a complete
subcode table is written in the disc TOC, and appropriate flags are placed across the playing surface.
Write-once media are manufactured similarly to conventional playback-only discs. As with regular CDs,
they employ a substrate, reflective layer, and protective
top layer. Sandwiched between the substrate and reflective layer, however, is a recording layer composed of an
organic dye. Together with the reflective layer, it provides a typical in-groove reflectivity of 70% or more.
Unlike playback-only CDs, a pregrooved spiral track is
used to guide the recording laser along the spiral track;
this greatly simplifies recorder hardware design and
ensures disc compatibility. Shelf life of the media is said
to be 10 years or more at 25C and 65% relative humidity. However, the dye used in these discs is vulnerable
to sunlight; thus, discs should not be exposed to bright
sun over a long period.
The CD-R format is defined in the Orange Book Standard authored by Philips and Sony. In CD recorders
adhering to the Orange Book I Standard, a disc must be
recorded in one passstart-stop recording is not permitted. In recorders adhering to the Orange Book II
Standard, recording may be stopped and started. In
many players, tracks may be recorded at different
times and replayed, but because the disc lacks the final
FIGURE 3.4-13 Block diagram of a CD-R recorder. (From Pohlmann, The Compact Disc.)
4-
467
468
4-
Audio.)
4-
469
FIGURE 3.4-16 DAT track configuration. (From Pohlmann, Principles of Digital Audio.)
470
4-
recorder.
471
472
FIGURE 3.4-18
early 1990s.
playback (b).
Solid-State Recorders
An obvious solution to possible mechanical vulnerability of portable digital recorders is to record directly
to a solid-state memory. The memory itself may be
removable or fixed, and in order to keep the power
consumption down and the recording time up, bit rate
reduction may be used. Examples of solid-state
recorders are shown in Figures 3.5-15 and 3.5-16 in
Chapter 3.5.
Probably the most important consideration is how
quickly and conveniently the recording can be
dumped to a computer or sent back to the home station. This is an important function so that the recorder
can be quickly reused.
Another consideration if bit rate reduction is used is
to what extent post processing (which might be as
simple as equalization) will compromise the audio
quality. The most efficient bit rate reduction systems
such as MP3 are efficient only because the artifacts are
evenly distributed across the threshold of our hearing.
A simple equalization change can quite easily uncover
those artifacts in a particular and audible area of the
audio spectrum.
4-
Computer Disks
Hard-disk-based storage now forms the heart of most
modern broadcast facilities with digital audio workstations (DAW) and audio servers. Some of the implications of this revolution are discussed later in this
chapter, and the systems and equipment are covered
in depth in Chapter 3.6.
See http://www.ixml.info/.
473
474
Workflow
Therefore, we now need to think of broadcast sound
recording as a part of general workflow, and in this context the audio could usefully accompany other essence
such as text and pictures. These essence items are all
linked together by the metadata, which takes the place
of the information once carried on the label and package of a disc or tape recording, although in the digital
world the metadata by itself possesses a much greater
potential power than a label ever did.
For the overall workflow process to be a success,
several presumptions are made about the sound
recording. The highest source quality needs to be preserved as far along the workflow chain as possible in
order to preserve the possibilities for later processing
or for future (and possibly lucrative) applications of
the recorded material. The technical quality of the
recording must be adequate to survive the numerous
signal processing and editing procedures that may be
applied at any stage along the workflow route. This
does not mean that current sound recording practices
need to be perfect, but it does mean that other items in
the signal chain must produce a greater level of
impairments than the recording. This has implications
for any bit rate reduction that is being considered.
In fact, the days of needing bit-rate-reduced recordings for the convenience or economic use of IT applications are long over, although low bit rate
contribution of interviews and similar recordings over
telephone circuits is likely to remain a prime use of bit
rate reduction for many years to come, especially in
areas without widespread Internet access.
doubt about the technical efficiency of a digital recording process. Digital recording has enabled big and
helpful changes in audio production, mainly as a
result of the transparency factor. For instance, the
recording of two-channel stereo in the form of M
and S (mid-side or mono-stereo) channels was not
considered using analog equipment, because any
recording artifacts such as noise on the M channel
would appear as a coherent center image in the reproduced sound field. A and B (left and right channel)
analog recording resulted in a much more diffusely
reproduced field of any recording noise; therefore, this
method became the standard for two-channel stereo
recording. M and S recording can prove quite useful as
the apparent width of the stereo image can be varied
just by adjusting the S channel level.
Perhaps this is a good point to step back from the
actual details of the recording technology and ask
whether we have actually experienced some kind of
digital audio revolution.
475
Principles
A microphone channel produces streaming audio, and
a real-time output data stream will be required in many
stages along the signal route to feed loudspeakers or
headphones. Any streaming format can be thought of
as one with the capability for audio data delivery in
real time with a low and controlled latency. Streaming
audio cannot be slowed down or speeded up, and it
cannot usually be interrupted without undesirable
consequences. It can start up immediately when the
signal is available, and it can go on streaming indefinitely. As it exists only in real time, it effectively carries its own timing information with it. While this
could apply to any analog audio signal, when adapted
to digital forms, there arises a need to declare the original sampling frequency at the very least in the associated metadata.
A file, on the other hand, never existed in the analog world, although a finished physical recording was
not a dissimilar concept. A file has to wait until a
definable portion of the streaming program is available to be packaged. The size of any file is limited and
needs to be declared, as do all the conditions necessary to rebuild a streaming output (that is, replay it).
Once this metadata information has been gathered
and stored in a header chunk, tightly coupled with the
audio essence (bare data samples), the file is fully
formed and only then can it be handled in the same
way as any data file.
In a typical recording production chain, it might at
first seem easy to see where both streaming and files
sit. This is not necessarily true, as for instance microphone signals might stream into a recorder via a mixer
of some kind. The recorder, however, may then record
the signal as a series of tiny files, and even on a digital
broadcasting system the signal will be formed of packets, which may be viewed as files of a tiny size. These
packets must be delivered at the output as a streaming
format. There is therefore some crossover between
streaming and files, and the fundamental penalty for
any use of intermediate files is signal latency (delay)
between the input and the output streams.
File formats, on the other hand, are fairly welldefined and pre-agreed arrangements for storing and
exchanging data of any kind at unspecified speeds.
For audio, any file format must, at the very minimum,
contain some form of header containing the information necessary to accurately rebuild the audio stream
from which the files were initially built. Summing up,
therefore, file formats and streaming formats are inextricably linked, and in some cases such as in those data
formats used for packet transmission, the division
between streaming and file may be blurred.
476
See http://www.iec.ch.
Available at http://www.ebu.ch/
4-
477
dBu: The level of an analog signal referred to 0.775 Volt RMS. This
voltage level produces 1 milliwatt of power into a load of 600
ohms.
dBv: The level of an analog signal referred to 1.0 Volt RMS.
Headroom: The maximum signal level beyond MPL before unacceptable distortion is produced, or sometimes it is quoted as the
same level above AL.
LFE: Low Frequency Effects
Mean AC Voltage: The average rectified value, as read on a moving
coil meter. (See also RMS.)
ML: Measurement Level. A level of analog signal chosen to be low
enough not to cause problems with crosstalk, etc., but to be high
enough to be useful for continuous tone technical measurements.
MPL: Maximum Permitted Level. The point at which analog transmitter signal limiters are set to operate. Normally 8 or 9 dB
above AL.
Pink Noise: Random or quasi-random noise with equal energy per
octave of bandwidth. (White noise has equal energy per linear
bandwidth.)
RMS: Root Mean Square volts. The value of a DC voltage which
would produce the same average power as the AC signal
quoted. (Not the same as the mean voltage value, except for sine
waves.)
SPL: Sound Pressure Level
Resources
SUMMARY
Now that ever more economical IT system components have caught upwith and exceed most audio
requirements, it does seem at last that some sort of
golden age of audio recording has arrived. Much as
the look, ergonomics, and even the smell of a classic
audio tape recorder may be missed by some audio
engineers, there is little doubt that the recording quality, flexibility, accessibility, and economics of IT-based
audio recording are all vastly superior.
The technology is now in place for brilliant audio
recordings to be made and we need to explore the
opportunities and possibilities that this allows. The
transparency of the modern digital recording actually
makes it more important than ever to get the microphone and positioning just right, with equal care
being taken in composing and focusing on the content.
Glossary
AL: Alignment Level. A reference level, usually recorded as a
header recording, and used to check the alignment of recorded
signals, and signal paths.
DAW: Digital audio workstation
dB: Decibel. A logarithmic scale for power, but for audio, it is usually understood in signal voltage terms. One decibel is quoted as
being the smallest gain change the human ear can distinguish
under good conditions.
dBA: A weighted sound pressure level
dBC: C weighted sound pressure level
dBm: The level of an analog signal referred to 1 mW across a 600
ohm load. (dBm refers to power, this relates to a voltage of
0.775V.)
9
478
The organization that sets the legally recognized standards for radiocommunications is the ITU-R (formerly known as the CCIR),
based in Geneva. (http://www.itu.int)
Standards for the electrotechnical equipment used in broadcasting,
including program meters and interconnection, are the responsibility of the IEC. However, in this case, the actual standards produced are often published in parallel by national standards
organizations. (http://www.iec.ch)
These two international organizations draw their expertise and
technical inputs from a number of specific industry groups
which themselves produce their own standards and recommendations documents.
The Audio Engineering Society covers many key areas of audio
standards and practices (http://www.aes.org)
The Society of Motion Picture and Television Engineers covers a
wide area of interest including the motion picture industry.
(http://www.smpte.org)
Bibliography
Alten, Stanley R., Audio in Media, Fourth Edition, Wadsworth Publishing Company, Belmont, California,1994.
Audio Engineering Society, AES Recommended Practice for Digital
Audio Engineering-Serial Transmission Format for Linearly
Represented Digital Audio Data, Journal of the Audio Engineering
Society, vol. 33, no. 12, December, 1985.
Carasso, M.G., Peck, J.B.H., Sinjou, J.P., The Compact Disc Digital
Audio System, Philips Technical Review, vol. 40, no. 6, 1982.
EBU (European Broadcasting Union). Specification of the Digital
Audio Interface. EBU Doc. Tech. 3250.
IEC (International Electrotechnical Commission). Draft Standard
for a Digital Audio Interface. IEC Report TC 84/WG11, November, 1986.
Jorgensen, Finn, The Complete Handbook of Magnetic Recording, 4th
Edition, McGraw-Hill, 1996.
Mee, Dennis C. & Daniel, Eric D., editors, Magnetic Recording Technology, Second Edition, McGraw-Hill, 1996.
Pohlmann, Ken C., Principles of Digital Audio, Third Edition,
McGraw-Hill, 1995.
Pohlmann, Ken C. The Compact Disc, Second Edition. AR Editions,
Madison, WI, 1992.
Pohlmann, Ken C., Principles of Digital Audio, Second Edition.
Howard W. Sams and Co., Carmel, IN, 1989.
Pohlmann, Ken C., editor, Advanced Digital Audio. Howard W. Sams
and Co., Carmel, IN, 1991.
Tremaine, Howard M., Audio Cyclopedia, Howard W. Sams, Inc., now
out of print but still available via specialty bookstores.
Watkinson, John, RDAT, Focal Press, Oxford, 1991.
4-