|
Audio
Processing & HD Radio
Frank
Foti, Omnia Audio
Cleveland, Ohio
Hello, HD
Radio!
We live in exciting times. Think about it…
communications and computing technology have changed the
broadcast industry in ways unimaginable just a few short years
ago. While the PC and associated networks dramatically
transformed how radio programs are produced, new transmission
technology resulted, within just five short years, in DSB,
HDTV, and now HD Radio. The last time broadcasting saw
innovation with similar significance was nearly a half-century
ago, when color television and FM Stereo were introduced.
We're lucky to have the experience of working with these
exciting new technologies - it really is exhilarating!
Is HD Radio
what we, as broadcasters, need to take us into the digital
era? Some may be skeptical, but in our experience the
technology works and is ready for the world. Is it perfect?
No. All engineering involves trade-offs, and digital radio is
no exception. I'll bet that when FM stereo was introduced, it
prompted a bit of nay-saying, too - some justified: audio
bandwidth was reduced to 15kHz, and broadcast engineers were
soon to encounter the infernal beast known as multipath.
Yet we did our
best to learn and understand the medium, and we have improved
the performance of the FM Stereo system over time. Certainly
the same will be true with HD Radio. We, together, have the
opportunity to take this technology and make it sound the best
possible. Certainly there are aural limitations imposed by the
FM Stereo system but we learned to live with them, and we will
certainly have some aural challenges with HD Radio.
Nevertheless, there are ways to work around those challenges
and unlock the benefits of this system.
Over the past
few years, as development and testing of the HD Radio system
have progressed, audio processing has been one of the system's
key components. Early on it became apparent that dynamics
processing would figure in both the aural and technical
performance aspects of the transmission system. This has also
been the case regarding signal processing for other bit
rate-reduced audio services, like Internet audio streaming.
Opening up a dialog about audio processing for HD Radio
requires more thought and insight than anyone imagined. If you
thought it would be easy to just "pencil in" your
favorite flavor of processor and be done with it, then I ask
you to think again! There are some important issues that must
be thought out for digital radio's benefits to be fully
realized.
Fortunately,
we at "Omnia HQ" are in a pretty good position to
bring some understanding to the new topics that accompany HD
Radio. We've been doing FM processing for a decade, and our
sister company, Telos Systems, has been doing codecs even
longer. We are both heavily involved in leading edge DSP
development, and our development teams include many
professional broadcasters. Working together, Omnia engineers
learned how to optimize audio to deal with the limitations
imposed by codecs and other bit-reduced environments -
knowledge we found to be essential when tweaking audio
processing for maximum HD Radio performance. (As a side note,
this cross-pollination works the other way, too - the latest
Telos codec product has Omnia audio processing inside.) We
also had access to a fully functional HD Radio transmission
system, with which we logged a lot of lab time listening and
experimenting, so it's possible that we are a bit closer to
understanding the processing demands of the new HD Radio
system than the pioneers of FM Stereo processing were!
The major
principle to keep in mind as we embark on our discussion is
that all transmission mediums are not the same. Sure,
this sounds obvious - but how many people do you know who
thought an old FM Stereo limiter would suffice to process a
narrow-band Internet audio service, only to find that coding
artifacts seemed to be enhanced rather than suppressed? The
same principle applies to HD Radio: a processor designed
specifically for the new medium will be needed, and thought
must be given to the complete transmission system. So, let's
get started.
Differences
Between HD Radio And FM Analog
Before considering processing for HD Radio, let's consider
the technical differences between the two mediums. The most
obvious difference is that HD Radio has a wider audio
bandwidth, extending to 20kHz, compared to FM Analog's maximum
15kHz.
To achieve HD
Radio's 20kHz audio response, your audio processor must
support this bandwidth. This means that the processor needs to
have a base sampling rate of 44.1kHz or above. (This is
according to the Nyquist principle, which states that the
sampling frequency must be at least two times the maximum
operating frequency.) A processor designed for analog FM
utilizing only 32kHz sampling will not provide the full audio
response. HD Radio was developed with the goal of providing
CD-like audio; why then should we limit audio bandwidth on the
digital channel to anything below 20kHz?
Another
significant difference is that HD Radio does not use any form
of emphasis in the audio path, whereas FM Analog does: 75µs
in the USA and 50µs in Europe. The shape of the 75µs HF
curve has a 2.2kHz breakpoint and a 17dB boost at 15kHz. As a
long-time designer of processor algorithms, I can tell you
that this is the number one challenge in producing clean audio
from an analog FM channel; current processors must carefully
manage the high-frequency gain generated by pre-emphasis.
Thankfully, the HD Radio system has a flat response. Without
the HF boost, much less of the HF spectrum is driven deep into
the final limiters, so familiar FM processing side effects,
such as intermodulation and harmonic distortion are greatly
reduced. HD Radio processors can sound much smoother and
cleaner than their FM counterparts at a given processing
level.
Not everyone
may perceive this as a benefit. Pre-emphasis boost creates
what some industry insiders refer to as the "sounds like
radio" effect. Advanced FM Analog processors have a
special distortion control function in their final limiter to
reduce the worst of the audible effects of HF clipping, and
the sonic result is that certain "radio sound" that
we are used to hearing. I expect when some program directors
first hear HD Radio, they will be unhappy because it won't
sound like what they have grown used to. No matter how much
processing you use to try to fix this "problem," the
aesthetic will remain obstinately different. (I suppose we
could implement some kind of HF clipping distortion simulator,
without actually doing the clipping, but let's agree not to go
there…)
Differences
Between HD Radio And AM Analog
The biggest difference between HD Radio and AM Analog is
frequency response. To implement HD Radio for AM, the audio
bandwidth of the analog channel must be further restricted to
5kHz, as compared to the 10kHz NRSC spectrum that has been in
effect. This is necessary to eliminate any interference from
the analog channel into the digital spectra. This does further
reduce the fidelity for the analog channel. The HD Radio
system is designed to provide a 12 kHz response on the AM
band.
At the time of
this writing, development of HD Radio for AM is still ongoing,
so for now we'll concentrate on the FM system.
Processing For HD Radio: The Landscape Has Changed!
The challenges for an HD Radio audio processor are much
different than for an FM Analog processor. In the FM Analog
system, the processor must:
-
Provide
precision peak control to guard against over-modulation,
-
Manage the
pre-emphasis boost to avoid audible distortion generated
by the processing,
-
Offer a
brick-wall filter to protect the 19kHz pilot signal of the
multiplex stereo system.
A processor
for HD Radio has a completely different set of requirements.
The most important issue is in dealing with data reduced
audio. An HD Radio processor must be able to manage the audio
spectrum in an efficient manner, just as the processor for FM
Analog must manage the pre-emphasis boost. The processor needs
to be thought of as a partner with the audio encoder,
analogous to the way an FM Analog processor works together
with the stereo multiplex encoder.
We also need
to understand what needs to be done to manage the audio
spectrum so that the least amount of coding artifacts are
created. It is possible to predict what spectral conditions
will exist that can generate audible artifacts due to coding,
and dynamic algorithms in the processor can offset these
conditions and, in many cases, remove unwanted artifacts,
especially at higher bit rates (such as the 96kbps rate used
for HD Radio). When properly designed and implemented, the
audio processor can improve the efficiency of the encoder.
While peak
control is required in order to keep the modulation input from
exceeding the full-scale headroom limit of the system, an
aggressive clipping function is not required - and actually
becomes a problem for the codec process.
The End Of
Clipping?
Precision peak control can be achieved using numerous
methods. Probably the most common is the hard limiter, or peak
clipper. Most audio processors designed over the last twenty
years also employ some form of distortion masking as a tool to
suppress the Total Harmonic Distortion (THD) that is created
by the clipper. This makes it possible to utilize more of the
clipping function, which translates directly to more perceived
loudness.
Employing a
clipper as a peak limiter in a HD Radio system will work, but
there are sonic penalties to be paid. Any clipping process
yields harmonics of the fundamental source signal, and even
with distortion masking some second order harmonic content
will remain. This adds to the audio spectrum and aggravates
the encoder, which in turn spawns additional sonic artifacts.
This is particularly noticeable in the high-frequency range -
where most codec artifacts exist - and is very noticeable with
certain program material; therefore, another form of peak
limiter is needed.
Enter the
Look-Ahead Limiter
There is another form of peak limiter that is the perfect
companion for the HD Radio application: the Look-Ahead
Limiter. This limiter provides excellent peak control, and it
does so with very little - or no - added harmonic content.
Here's how a
look-ahead processor operates: the processor calculates the
peak level of a signal over a specified period of time and the
audio is delayed by a like amount. Then, as the control signal
is applied to the audio gain function, the peak is reduced at
the precise time that the control signal reaches the maximum
control level so that the crest of the peak is reduced without
truncation. This is how hard clipping is avoided. Figure 1
illustrates this process:

Figure 1: Operation of a Look-Ahead processor (simplified)
As the
illustration shows, peak control is achieved without creating
any harmonic distortion. If the diagrams were expanded to show
detailed sine waves, you would see no peak truncation during
the processing period.
Unfortunately,
there is no free lunch. A simple wide-band look-ahead
processor will not create harmonic distortion, but will
generate Intermodulation Distortion (IMD). This type of
distortion has a different type of sound to it, sort of a
"busy" or "mushy" quality that can be just
as annoying as harmonic distortion.
Innovative
Algorithm Design
THD is very hard to eliminate and will cause audio to
contain out of spectrum components (although some of those can
be removed through precise filtering.) But all is not lost,
because in the case of IMD we can design our processor to
"know" what signals will cause those distortions and
when they will occur. Then we can take advantage of the
look-ahead calculation time to add ancillary control signals
that monitor and remove IMD. We can have our cake and eat it:
precise peak control and very little THD and IMD distortion.
Great care
must also be taken in designing the attack and release time
constants. Even more so than with an FM Analog processor,
these constants must be optimized to provide a transient feel,
for the most natural sound. We've discovered that the best
results are achieved when attack and release times increase as
frequency decreases. In the case of a look-ahead processing
system, this means we will also require different processing
delays for each audio band.
Transmission
System Considerations: Blend-To-Analog Accuracy
Integrating an audio processor into the HD Radio
transmission path would appear to be straightforward, and in
concept it is - but there are a few new items that need
consideration.
First, there
is the important issue regarding the blend-to-analog mode in
the receiver. This function is designed to switch the receiver
to the analog audio path whenever the digital signal is lost
for an extended period of time. Even though the HD Radio
system has designed time-diversity into the transmit/receive
path (so the audio from both the digital and analog signals
arrive at the same time), audio spectrum and phase
relationships must be similar on both the digital and analog
transmission paths.
Should there
exist significant phase differences across the audio spectrums
of the digital and analog signals, the blend-to-analog action
will not be heard as a smooth transition, and the audible jump
is likely to be jarring to listeners. Therefore it is
imperative that the two audio processors employed for the HD
Radio and FM Analog paths have the same, or very close to the
same, phase relationships across the audio spectrum.
One way to
ensure that the phase relationships are maintained between the
digital and analog channels is to apply a single processor
that provides outputs for each of the required channels. A
system of this nature would integrate dedicated final limiting
functions for the HD Radio and FM Analog channels. Figure 2
illustrates such a system.

Figure 2: Combination FM Analog and HD Radio Processor
An
important consequence of the required integration is that the
processor will probably need to be located at the transmitter
facility; the dedicated outputs of two transmission channels
would require double the STL occupancy in order to be
studio-located. Modern processors provide excellent remote
operation capabilities, so remote location should not be as
much of a concern.
Level
Normalization Between The HD Radio and FM Analog Channels
A critical component to the HD Radio transmission system
is the reference audio levels of the digital and analog
signals in the receiver. We want to create audio levels that
are perceived to be relatively the same in average volume when
compared to each other, since the listener experience could be
adversely affected if the audio level abruptly changes when
the receiver switches between the digital and analog signals.
How can this be normalized, and what are the operating levels
required to accomplish this?
The
FM/AM Analog path transmits a maximum peak deviation of
+/-75kHz for FM, and double the RF carrier level for AM. These
levels correlate to 100% modulation. It is accepted that 100%
modulation will create a specified level within a receiver.
For the sake of discussion, let's assume that 100% modulation
will create a peak level of 0dBu in a receiver. For this given
modulation level, an average level will be derived based upon
the amount of processing employed. The RMS level will average
a larger value when more processing is used.
Most
FM and AM stations employ enough processing that the RMS level
is usually within a few dB, whether or not the processing is
set for light or aggressive operation. Present generations of
audio processors do a very fine job of maintaining a
normalized RMS level. Basically, the sonic difference between
light and aggressive processing is the perceived density -
anywhere from "packed up and thick" to "open
and airy." For this discussion, our concern is not the
aural texture but the perceived average level of the analog
path.
One
of the advantages to the digital channel is the ability to
offer wider dynamic range, which allows you to use less
processing. This would reduce the RMS average level and sound
perceptibly quieter compared to the FM/AM Analog path. So,
again, the dilemma concerns what level the HD Radio signal
should be set at to sound comparable to the more heavily
processed analog signal.
To
help determine the optimum operating level for the HD Radio
processing system, the following subjective test was derived.
Two dedicated processors, one for FM/AM Analog and one for HD
Radio, were set side by side and aurally compared. The
FM/AM-Analog processor was set for a relative reference level
of 0dBu. Think of this as the given 100% modulation level that
would feed a transmitter. Since all pre-emphasis/de-emphasis
and peak control are handled in the processor, the output
level can be set to a known reference. Likewise the HD Radio
processor can be set up in the same fashion. Since both units
will provide absolute, precise peak control, output levels can
be set to maximize the dynamic range of their respective
transmission channels.
The
FM/AM Analog processor was set for heavy, aggressive
processing, whereas the HD Radio system was set for light
processing. Once set up, the processing parameters were not
further adjusted during the testing. Fourteen audio clips
including voice, voice-over and music were recorded with the
digital processor (serving as the reference) held at 0dB and
the output of the analog processor recorded in 1 dB steps from
0 to -9dB.
Thirty-one
subjects from iBiquity's offices in Maryland and New Jersey
and from Omnia's office in Ohio participated in the test. The
result of this evaluation determined that the perceptual
difference between a heavily processed analog and lightly
processed digital transmission is 3.57 dB.
In
the transmission system there are two parameters that can't be
changed: first, the FM/AM-Analog is set to 100% peak
modulation, and second, the HD Radio channel is limited to
0dBfs peak operation. Therefore, to insure smooth blending
between the analog and digital sources, the offset to
normalize these levels will have to be incorporated in the
receiver.
Based
upon the results of the subjective test, it is recommended
that to normalize the audio levels between the HD Radio and
FM/AM Analog signal paths, a 5.0 dB relational difference in
level needs to be implemented. This number, for FM operation,
is derived from the 3.57 dB determined through subjective
evaluation and approximate 1.5 dB of pad to allow broadcaster
flexibility. To ensure proper blending, every HD Radio
receiver, independent of manufacture, will be required to have
the same relative offset.
Automotive
receiver manufacturers currently match the levels of various
sources, including CD players, AM & FM radio and DVDs so
that there are minimal level disparities between the devices.
For HD Radio the offset can be done in one of two ways -
either the receiver manufacturer can choose to increase the
level of the digital by 4.5 dB or decrease the level of the
analog by 4.5 dB.
STL
System
HD Radio offers broader audio bandwidth of 20kHz,
requiring an STL link capable of supporting this response with
a sampling rate of 44.1kHz or 48kHz. The HD Radio system
operates at 44.1kHz, but 48kHz is the professional audio
standard. The importance of linear phase relationships between
the HD Radio and FM Analog systems has already been mentioned
- and this is best accomplished by using audio processors that
employ a common design and sampling rate! Even though time
alignment between two different sample-rate systems is
possible, it adds needless complication and potential trouble
to the overall system.
So
again, since just about every facility has an STL link that
can carry only one linear stereo audio pair, the most logical
location for audio processing is the transmitter site. We need
two specialized audio signals for transmission: The 15kHz
emphasized signal for the FM Analog channel (which can in the
form of discrete Left/Right or MPX), and the 20kHz/flat signal
for HD Radio. The single STL link delivers 20kHz audio to the
transmitter site, where it can be routed to a combination
processor as described earlier, or to multiple units.
If
your desire is to locate the processing at the studio, you
will need two linear stereo audio channels in your link. This
would be possible via a multi-channel T-1 STL, but remember
that the link must be linear and uncompressed.
Latency
Issues
Throughput delay is a given with the HD Radio system. The
latency of the system, end-to-end, is almost seven seconds.
The days of monitoring off-the-air are over! This will require
alternative methods for talent monitors, as well as IFB feeds
for remote broadcasts. A simple method for creating an
air-like monitor for talent would be to insert an older
processor into the control room monitor path. This will allow
disc jockeys to maintain the same processed sonic texture in
their headphones.
The
bigger challenge lies in providing remote talent cuing for
live events such as remotes, traffic reports, etc, since
off-air feeds are no longer feasible. Possible alternatives
are the use of the SCA channel, RPU, or ISDN/POTS codecs with
low IFB delay.
System
Layout
Presently, there are two methods of implementing the RF
plant for HD Radio: low-level and high-level.
Low-level
refers to the method of mixing or diplexing the FM Analog and
HD Radio RF signals together at the lower power level, which
can be found at the output of the exciters. This requires the
use of a single transmitter, but it must have sufficient RF
response and group delay specifications to pass both the FM
Analog and HD Radio signals without degradation. High-level is
the method of diplexing the output of two dedicated
transmitters together and passing those onto the antenna. This
method puts less stringent requirements on the FM Analog
transmitter, but it will consume more power, as there will be
an insertion loss in RF power to the FM Analog transmitter. If
you choose to use your existing FM Analog transmitter, be
certain that it contains enough RF power headroom to replenish
the insertion loss that will occur in the diplexer.
With
regards to audio processing, it doesn't matter whether you
employ the high-level or low-level method; all of the data and
timing issues are the same.
The
intended goal for successful HD Radio deployment should be
that the existing FM Analog channel is allowed to operate as
it presently does, while the HD Radio signal is seamlessly
added into the transmission path.
Many
FM transmission systems that operate using one of two
connection methods to the input of the FM exciter: composite
multiplex (MPX) or AES digital, where the stereo encoding is
performed in the exciter. This capability should not be
altered with the addition of the HD Radio path. It is
understood that there are critical issues concerning data
timing and rate conversion, as well as synchronization of the
diversity delay between the FM Analog and HD Radio audio
channels. Insuring that these signals are properly deployed is
key to the operation of the entire HD Radio system.

Figure 3.1: Sample schematics for Low-Level (left) and
High-Level
Combining Systems (courtesy of Broadcast Electronics)

Figure 3.2: Another example of Low-Level (left) and
High-Level (right)
Combining Systems (courtesy of Harris Broadcast)
Please
note that the system diagrams presented here are provided as
examples only; since the rollout of HD Radio, both Harris and
Broadcast Electronics have further refined their designs.
Please contact your transmitter manufacturer of choice for
further information regarding system design, implementation,
and setup. As this system continues to evolve, further
enhancements are sure to occur.
Processing
As A Tool, Not A Weapon!
All too often, processing is pushed to a level where it's
thought of as a "weapon." HD Radio will require us
to think of it more as a subtle tool. Due to the processor's
ability to enhance or improve the efficiency of the audio
encoder, it will act as more of a partner to the transmission
system. Processing will actually improve the intelligibility
of the perceived audio. (Listening to a low bit rate Internet
stream that has a well-tuned audio processor applied to it
will verify this.)
Processing
for effect is still possible - make no mistake! Creating the
appearance of that larger-than-life, big "phat"
sound is most certainly possible. But that synthetic
"smash-mouth" sound characteristic of many current
FM stations will be far less possible.
OK…Let's
Jam!
Now that we've taken care of the preliminary info, we can
discuss the processing in more detail. As we saw, processing
for HD Radio can be done in one of two ways: using a combined
processor, where a single system will manage both the HD Radio
and FM Analog signals, or using a dedicated processor for each
channel. If the latter is chosen, make sure that both units
"agree" with one another regarding phase
relationships. You don't want problems with the
blend-to-analog mode.
The
combined processing system will create a similar sound on both
channels. This system will employ its multiband sections as
common functions to the HD and Analog channels. The only
difference is in the final limiter sections. Decisions about
processing setup will doubtless be dominated by the analog
channel until more HD radios have filled the market. Thus, the
recommendation is to set your processing for the analog
channel, and then fine tune the HD channel by setting the
final mixer and limiter for the desired amount of density.
Since
processing for the analog channel will be set up to take
preemphasis into account, the final spectral mix will be
representative for that of an emphasized signal. Therefore, it
will be important to perform a separate final mix for the HD
channel, so that the spectrum is properly balanced with
respect to the analog channel.
With
the combined system, this is where differential adjustments
can be made for the HD channel. Be careful to avoid widely
different EQ among the bands, as this can cause the frequency
range that contains the most gain to dominate the amount of
look-ahead limiting.
For
example, if the output of Band 2 is set so that it has a +3dB
gain, as compared to all other bands, than it will dominate
the action of the final limiter due to its louder level. Also,
be careful not to over-emphasize the high frequencies, as they
can agitate the audio encoder and be heard as HF coding
artifacts.
The
last adjustment will be setting the relative offset in HD
output level so that the two channels appear normalized. This
is done with the output level adjustment.
Using
separate processors for the two channels provides a range of
possibilities. Again, the processors must be consistent with
each other regarding phase relations, especially in the
respective crossover networks. It would not be a good idea to
retain that +20 year old two-band processor for the analog
channel and install a new multiband unit for the HD side. That
type of arrangement will surely cause massive phase errors
whenever the blend-to-analog function occurs!
In
this type of configuration, the analog channel can be set up
as usual and the HD channel can be adjusted for whatever
texture is desired. If the processing goal for the HD channel
is to sound considerably more "open" and
"airy" as compared to the analog channel, this is
the method to use. Care should be taken to avoid
over-enhancing high frequencies.
If
you need to jam on the HD channel, you can, but with a few
noticeable differences. Since the final limiters will not
contain clippers but look-ahead limiters, there will be a
textural difference in how low frequencies will appear.
Clippers add a "phatness" to the audio due to the
harmonics they generate. Most processors offer some type of
Bass-EQ or bass enhancement. You may need to adjust those
functions a bit differently for the HD channel.
Also,
while the HD processor does not contain a clipper, the final
look-ahead limiter will create its own sonic artifacts if
overdriven. When a look-ahead limiter is driven too hard, you
will get added IMD, rather than THD. As with anything
processing-related, a little goes a long way.
Digital
Loudness Wars?
To understand the nature of future loudness wars, we need
to look deeper into how the HD Radio receiver operates. When
selecting an HD Radio station, an internal buffer inside the
receiver must fill before the digital audio is routed to the
speakers. Depending upon the receiver design, this buffer may
take a few seconds. So, when switching between stations, the
analog signal will be the first to be heard and the digital
audio will follow, when the buffer has been filled in the
receiver.
Because
of the length of time it takes for the digital signal to be
heard after a station swtich, it will be quite difficult for a
listener to remember the loudness level of one HD Radio
station compared to the next. The ear normally has a retention
memory for only a few milliseconds. So, if loudness is still
the key desire for your station, it will still be possible to
create that "button-push" illusion of being louder
than the next guy, but once the HD channel switches in, you
can relax the processing. Create the loudness illusion using
the analog channel, and take advantage of the fidelity
possible with the HD channel.
Also
note - you can't gain extra loudness via over-modulation
anymore. This will not be technically possible because there
is simply no more to be had after you've reached digital
full-scale. So the days of over-modulation will cease with
digital transmission; trying to exceed full-scale will only
result in drastic and annoying distortion.
Further
On The Horizon…
HD Radio continues to evolve, and there will be much more
to learn. Nevertheless, we are confident that today's Omnia
processing and the HD Radio system will work in concert to
provide an experience for listeners that will keep radio
interesting and relevant in a time when the competition for
people's attention is overwhelming.
Top
|