Beruflich Dokumente
Kultur Dokumente
Early Reflections Unpleasantly large Late reflection Early reflections are those more or less under the 10 msec mark. late (specular) reflections create a problem with perception, more on that later. The example here is egregious.
A point to recall:
Because high frequencies decay faster than low frequencies (even on a cold, dry day in the desert):
If you measure the early arrival frequency response, it will show a different frequency balance than that of the entire tail If you compare the early and late responses, the difference will be even bigger. Were used to listening to things that way, too, because its what we grow up with.
And a loudspeaker
Radiation patterns of loudspeakers are quite different at different frequencies
Typically, there is little directivity at bass frequencies As frequency goes up, there is more directivity. Many (consumer) speakers have fairly narrow high-frequency radiation patterns
There is a first-arrival
There may be a delayed reflection of a first arrival.
So what do we correct?
Good question
The only time a zero is not a room storage issue is when the loudspeaker has a zero at that frequency
So fix it, already!
Once more, with feeling: adding more energy to the room while its storing energy at that frequency is not a solution!
NOW WHAT?
Ear Continued
The first-arrival provides a very strong localization effect binaurally.
This localization applies to anything that is correlated at the two ears, including with ITD range delays. Signals that are not correlated at the two ears are not localized, and are, rather, heard as envelopment
Remember: Specular reflections are correlated at the two ears. The diffuse tail is not.
Some rooms are far, far, far from satisfactorily diffuse, hence flutter echo and like problems. This is not an easy problem to fix.
In the diffuse tail, bass hangs over much more strongly than high frequencies, both initially (due to loudspeaker radiation pattern) and more so later, due to lossy transmission and reflection of sound.
Diffuse perception
Signals that are not correlated (either by waveform at low frequencies or envelope at high frequencies) at the two ears are heard as diffuse or surrounding. This means that we hear the diffuse response of the room as a different (set of) auditory objects than the direct sounds. We are USED to the diffuse sounds being heavily colored in timbre.
Low frequencies
We live, day in and day out, in environments that provide a huge variation in the low-frequency environment.
Were used to it Nonetheless, huge excursions, especially peaks, are very annoying. Again, remember the rule dont add energy if theres already too much stored.
Whatever you do, dont try to completely invert the system, i.e. correct both phase and magnitude.
Why not?
First, what are you inverting? Pressure? Volume velocity? Some of each? Does it relate to what your head/ear does in the soundfield? (Hint: NO ) Second, if you try to invert phase, youll introduce pre-echo unless your fit and inversion are good to 60dB.
Even if it was when you did it, it wont be when you exhale and change the humidity in front of your head.
4. Try to cancel, to some extent, that single first later reflection, but only at low frequencies.
3.
4.
Relative or Flat
Flat costs more for equipment Flat requires more CPU if done accurately Flat doesnt fix imaging as well, unless relative is also added, in which case you need even more CPU Relative is cheaper, both in equipment and CPU Relative corrects the most obvious defects.
Conclusions:
At low frequencies, correct the overall room response At high frequencies, correct the first arrival Always, obviously, correct gain and delay between channels Relative correction between channels does more perceptually than the same amount of CPU applied to flattening the system analytically. Too much correction is bad Long-window corrections at high frequencies cause the dentist drill experience, because the system will be equalized to provide way, way too much correction at high frequencies for the first-arrival signal.
The Break
Well do door prizes after the break Please take a 15 minute stretch.
Sequence of operations
Generate probe signals Measure delays Measure gains Measure frequency response Identify first reflection (delays are measured from one set of captures, the rest are measured from a second set of captures)
Probe generation
Synthesized in Frequency (Discrete Fourier) domain: Magnitude the same at all frequencies * Phase is continuous across frequency including at pi and zero Extent of time spread is limited by phase change, no window necessary DFT values at a negative and positive frequencies are complex conjugates to generate real signal Transformed to time domain using inverse complex FFT Imaginary part of complex time domain signal is zero Real part of complex time domain signal is the probe signal
*but see next slide
Probe Generation
Time Domain
Autocorrelation
Spectrum
Unwrapped phase
silence between probes (for room to settle) extra marker probe at the end to detect timing glitches in audio capture/playback LS/RS could be LR/RR Can also do 7-channel or other arrangement, using same method
Probe Autocorrelation
Gain, Freq response, etc. measurements (this happens for each channel separately)
N takes are used for wide probe in case sporadic room noises interfere
Gain is derived from the 800-2000Hz average of power spectrum coefficients Only the first N (128) samples of the impulse response are used
Then, for each channel, throw away outliers and average the rest
Finally, normalize all gains relative to the channel with the highest/lowest gain
1. Power spectrum of captured signal 2. Power spectrum of captured signal complex-divided by FFT of probe 3. First 400 samples of IFFT ( FFT ( capture ) / FFT ( probe ) )
Deconvolution by way of division in the frequency domain Then, for each channel, throw away outliers and average the rest Finally, if relative response correction is specified, normalize all responses relative to the average of all channels
Separate correction filters are computed for low vs high frequencies Each filter assumes that the part it doesnt do is flat Durbin LPC is used to obtain all-zero inverse filters (normally Durbin LPC is used to obtain all-pole direct filters) transition between the low high is done in log(power spectrum) domain Low- and high-freq correction filters are convolved to obtain final filter final filter is then (not shown) normalized for unity avg gain 800-2000Hz
Reflection correction filter has a trivial numerator (1) Denominator uses (upside down) the coefficients of a specially crafted M-tap symmetric low-pass FIR positioned at a distance determined by reflection delay. I.e., it recursively subtracts LP-filtered version of echo.
Questions ?