 |
|
Enhancing The Digital Path: Digital Multiplex (D-MPX)
Connectivity
By
Scott Martin, Nautel, Nova Scotia, Canada
and Frank Foti, Omnia Audio, Cleveland, Ohio
April, 2004
ABSTRACT
The all-digital transmission path is
quite common in today’s FM radio broadcast facility: from
content source on through to the modulation and RF generation
stages. With the above system, stereo generation (MPX) must
be done in the exciter, when it should be performed in the
audio processor. Present systems, while they work
electronically, pose problems with regards to modulation
overshoots due to sample rate converters and the connection
method of using AES/EBU. This has been researched and
documented with numerous exciters and audio processors. This
problem is not limited to just one type of configuration or
specific product.
This paper offers an in-depth look at these problems, and
states their cause. Discussion will reveal a new concept for
high performance MPX generation and interconnectivity to an
audio processor. In addition, with the advent of HD Radio a
new concept for a high performance digital path is
presented.
OVERVIEW
The rollout
of HD Radio presents a whole new level of challenge for
digital FM exciters and related audio technology. What is
the point of a clear, digital transmission if the source is
“dirty”? Modulation performance of digital exciters for the
analog, or conventional channel, continues to under-whelm
broadcasters. We have found that using AES/EBU between the
output of an FM audio processor and input to a digital
exciter can cause modulation overshoots. This is not a
unique problem to one specific processor or exciter.
The fundamental problem with AES/EBU connectivity is that
the audio is in separate left and right audio channels,
forcing the exciter to perform the multiplex stereo
generator function. When ancillary operations such as sample
rate conversion, additional low pass filtering, and exciter
based limiting, are added to the system, overshoots occur.
To avoid this, the audio processor and stereo generator need
to operate together and the composite multiplex
(MPX) signal connected directly to the modulator stage of
the exciter. Because this has not been possible, the
audio processor has had to generate the composite signal and
then convert the signal to analog in order to
be passed on to the analog MPX input on the exciter.
This paper describes, in detail, a proposed interconnection
of an audio processor and digital FM exciter. The
methodology and protocol is open source, and could easily be
standardized for the broadcast industry.
The importance is precision modulation peak control and
spectral management of the mpx signal. If precise
peak control is not achieved, loudness will be lost. The
present AES/EBU interconnection method can generate
peak overshoots have been measured as significant as +120%
modulation. This relates to a loudness loss of
approximately 1.5dB!
STATE
OF THE ART
Before we talk about new concepts for
enhancing the audio path, we should review what most would
consider today’s state-of-the-art in terms of transporting
audio information from the studio to the transmitter.
We should also discuss the problems that still exist in this
path and justify the rational for further improvements.
Clearly, it is possible today to achieve a fully digital
audio path from the studio to the transmitter. Our analysis
of the audio path will make the assumptions as described
below.
Studio-Transmitter Links (STL's): There are a wide variety of STLs available today, some RF
based and some not. To achieve best audio path performance
it is essential that the STL meet two criteria. It must
transport audio digitally and it must not compress the 2
audio data as some do in order to operate within a narrow
channel bandwidth.
The reason for digital transport is primarily due to the
desire to eliminate ADC and DAC transitions and the
associated distortion. Compression of audio data,
particularly if the compression ratio is high, will degrade
the audio data because it cannot be perfectly recovered at
the STL receiver.
Additionally, it has been proven, through experience, that
passing processed audio through a codec, creates
added audible distortion. This occurs due to the effects of
added harmonic products, caused by processing/clipping,
being passed through a codec. Even if the audio processor is
placed after the coded
STL, aural anomalies can occur when data reduced audio is
passed through heavy gain control, as is
performed within a processor for broadcast.
Component Interconnect: It has already been implied
that transporting audio digitally is state-of-the-art. The
standard protocol is AES/EBU and is regarded as perfectly
adequate provided that reasonable sample size and rates are
used. While some may argue that 32kHz sampling rate is
sufficient, there can be little debate that higher is
better. There seems to be some unknown reason why 32kHz
sampling rate is used in broadcast paths. It has been
discussed and demonstrated that 48kHz sampling is far
superior in performance. The importance is not so much the
added spectrum that 48kHz sampling provides, but that 32kHz
sampling makes it too easy to cause aliasing distortion in
specific path functions. This is clearly demonstrated by any
DSP based audio processor designed before 1997, and
extensively detailed by one of the authors in
a NAB presentation (2).
With 32kHz sampling, there is also the problem of filter
delay. Considering that conventional FM Stereo
broadcasting requires 15 kHz of audio bandwidth, this leaves
only 1 kHz of guard band spectrum before the
Nyquist point. To facilitate this, a filter of very large
magnitude must be employed in order to suppress all
energy by at least 96 dB at the Nyquist, or aliasing occurs.
This can be done digitally using a finite impulse
response filter (FIR). The only drawback is that it will
require many ‘taps’ within the filter to achieve this
level of stopband rejection. The significance of the ‘taps’
is that for every two taps in the filter, it requires
one sample to perform its duty. For a 15 kHz FIR filter of
this magnitude, it will need 101 taps. This in turn
results in 50 required samples which equates to 1.56
milliseconds of propagation delay through the filter.
Start adding up the number of 15kHz filters employed in a
32kHz sampled transmission path, and those alone
will create enough time delay to disorient on-air
announcers.
The use of at least 48kHz sampling as the AES/EBU
input rate in the exciter insures the best sonic performance
of the signal being applied. In addition, any input that
needed to be converted up from 32kHz sampling would not
create any overshoot component in the modulator. All of
which are benefits to the broadcaster.
If we now apply these assumptions to the studio transmitter
audio path we end up with something like
that shown in Figure 1.

Whether the path includes HD-R is irrelevant for the purpose
of this paper because it is a proprietary system
with defined data transport. It is treated as a parallel
system that does not affect the transport of non HD-R
audio (other than the need to apply a delay).
The placement of the audio processor is shown to be at the
transmitter site although it could just as easily be placed
at the studio. While it may be easier to access the audio
processor at the studio end, it will be show that from a
sonic performance point of view, the closer it is to the
digital exciter the better the overall sonic performance.
With a modern remote control system, there isn’t much to be
gained by having the audio processor at the studio.
The audio path shown in figure 1 should perform very well.
Assuming that the components before the audio
processor are good quality, there should only be minor
degradation of the audio data as it passes through the
sample rate converters in the STL’s and audio processor
input (an explanation of this will follow shortly). The
audio processor will modify the audio data because that is
what it is supposed to do. The audio processor ‘processes’
the audio so it will pass through the transmission path
(modulation -> RF -> demodulation) and end up with a rich,
pleasing sound to the listener.
Since the audio processor works hard to process the audio it
would be beneficial if there were no additional
sources of degradation after the audio leaves the audio
processor. In fact, as we will see there are many
opportunities for degradation, all inside the digital
exciter. Let’s take a closer look.
Digital Exciter
These are latest entry to the digital path. Capable of
incredible modulation performance, the digital exciter
offers two forms of signal input, analog composite (MPX) for
non-digital transmission, and AES/EBU.
The figure below shows both paths in a typical digital
exciter.

The analog MPX input must be passed through an
analog-to-digital converter (ADC) before it can be
digitally modulated. The sampling rate of the ADC must be
relatively high due to the wide bandwidth of
the MPX. Consider for a moment the audio spectrum of FM is
99kHz, any analog spectrum that is connected to
a digital exciter must provide, at the least, a sampling
rate of 200kHz or higher. Once digitized, no other
processing is required before modulation.
The AES/EBU path is quite a bit more complicated than the
MPX path because it must condition the audio
signal in preparation for stereo generation and herein lies
the potential for audio degradation. Some of the
blocks, such as LPF and pre-emphasis, can be bypassd if the
particular function is performed in an audio
processor. Let’s examine each in detail.
Sample Rate Converters (SRC): Consider for a moment
the signal that is arriving at the AES/EBU
input of the exciter. It might be a different sampling rate
than the exciter is expecting. If so, a sample rate
converter is employed to make the proper transition. This
item can pose problems as the digital filter in the
rate converter can generate overshoots to the already tight
peak controlled audio data that is being adjusted.
The following example illustrates how a synchronous SRC
would work, however interfaces between
equipment are not synchronous and therefore require
asynchronous SRCs which are functionally similar but
considerably more complex. In order to synchronously change
48kHz sampling to 32kHz sampling, the
conversion is accomplished by scaling up, or interpolating
the original sampling rate, usually by a
factor of ten. Then, at the 10x rate of 480kHz, filtering
the signal with a low pass filter that is set to the Nyquist
of the new desired sampling rate. This filter is required to
‘smooth’ out the 10x rate. If it was not used, aliasing
products would result. Finally, the signal is scaled down or
decimated by the factor needed, in this case ÷15, to
achieve the new rate of 32kHz. Figure-3 is a block diagram
of a SRC. While this sounds quite simple, and
basically it is, there are a few issues to consider. Of main
interest is the interpolation filter.

All audio processors, both analog and digital apply some
form of overshoot control to the output filtering
section. In most designs, this function is a form of
integrated protection clipper working around the final
low pass filter to obtain control.
In each case the overshoot component can be calculated as a
product of what is known as the ’Gibbs Phenomenon’1, which
states that an overshoot will occur at one-third the cut-off
frequency of any low pass filter whenever a non-linear
waveform is passed through it. In the case of broadcasting,
the non-linear waveform would be that of a clipped waveform.
Knowing that the audio bandwidth used in FM Stereo is 15kHz,
overshoot components will begin with any nonlinear waveform
above 5kHz. In this example, this would affect any signal
above 5kHz that was clipped.
Should the slope of the previously described upsampled
interpolation filter appear greater than the slope of the
final filter in the audio processor, then output overshoots
may result in the sample rate conversion process!
Unfortunately, these overshoots are generated after the
processing unit. To remove them would require another
limiting device.
This does not necessarily indicate that all sample rate
converters will cause overshoots. But in most cases the
filtering used in the sample rate converter will be of a
large magnitude in the bandstop rejection area. In all
probability it will be an FIR filter with at least 96dB
rejection in the stop band.
Of interest will be the direction of rate conversion.
Should the host sampling rate be lower in value, than
the transformed rate, chances of overshoot are small. This
happens due to the up-sampled filter being set to a
broader spectrum than the spectrum of the host signal.
Potential problems may arise when transforming a
larger sampling value to a lower rate, as described in the
example of converting 48kHz to 32kHz sampling.
Then the details of the above description apply.
Integrated Limiter: Some digital exciters provide a
baseband limiter to eliminate potential overshoot problems.
The use of a limiter can have benefits of preventing
modulation error caused by components between the audio
processor and exciter (including the exciter) but it can
also undo the processing done by the audio processor by
further changing the audio.
The integrated limiter used in exciters is incorporated as
part of the stereo generator. Different types of limiters
will add different effects to the audio. A hard limiter or
clipper is the least desirable, it induces harmonic
distortion and aliasing distortion (a.k.a. digital grunge).
A time delay, look-ahead limiter controls peaks without
a harmonic induced clipping function. Waveforms are
controlled with little or no harmonic distortion (T.H.D.)
components, but will produce a larger intermodulation (I.M.)
level.
Technically, this style of limiter will operate sufficiently
when controlling overshoot peaks, or as an additional
limiter to the audio processing. Sonically this type of
limiter will produce a “busier” sound. It will sound more
like a limiter that is operating in “heavy” levels of
compression. That is the result of the added I.M. In the
audio processing realm, adding more I.M. to an already
processed signal is usually not desired!
Though not used in any digital exciters to date, the
cleanest type of limiting is composite clipping, as long
as the limiting algorithm provides linear filtering that
will maintain clean spectra in the 19kHz pilot, and SCA
regions. The key ingredient is the linear filtering method
applied to the algorithm. Composite clipping
produces far less audible I.M. products as does a delay
limiter, and it will yield cleaner sound for the same
amount of limiting/clipping used.
Preemphasis: The exciter has the option of adding the
required emphasis. The optimum setup of the
transmission system would be where the emphasis is generated
once in the audio processor, and that
emphasized, processed signal is coupled directly to the
exciter.
Broadcast audio processors employ preemphasis within their
system architecture. Since emphasized audio must
also fit within the imposed modulation limits, the processor
employs specialized high frequency control sections that
provide both the emphasized boost and control of the high
frequency energy. In this manner, efficient high levels of
modulation are easily obtained since the processor is
designed and set to limit any tradeoffs resulting from
preemphasis and high frequency limiting requirements.
Basically, these two sections work in concert with one
another to allow preemphasis to be employed, and yet control
the emphasized energy content.
Interpolation: Before stereo generation, there is one
last bit of processing that must be carried out. Just as
the analog MPX signal needs a high sampling rate due to its
wide bandwidth, the digital L/R path requires a
translation to a higher sampling rate (interpolation) before
it is passed into the stereo generator because it will come
out of the stereo generator as MPX. An interpolator is a
digital signal processing function that inserts zero value
samples in the audio stream (three per input sample for 4X
increase in sampling rate) and then applies a filter to
recreate the input signal at the higher sampling rate.
Audio
Path Improvements
There is no question that the audio path shown in figure 1
should perform very well, but can it be made to work
even better? Yes, it can.
To summarize the preceding issues we see that placing the
audio processor close to the exciter is best in order
to eliminate sample rate conversions other than that in the
digital exciter. We also see that superior performance is
achieved if the audio processor does all of the audio
processing including filtering, limiting and pre-emphasis
because it is the best equipment to use for precise control
the modulation depth.
Consider also that the audio signal at the output of the
audio processor only requires stereo generation which
is a function that is standard in audio processors. The
reason for not using the stereo generator is that it
requires converting the MPX signal to analog for transfer to
the digital exciter where it will then be digitized again.
The process of converting to analog and back to digital
introduces unwanted noise. To transfer the audio and AES/EBU
requires that it pass through a multitude of processing in
the digital exciter in order to get it to the stereo
generator. All of which can degrade the modulation
efficiency, and sonic performance.
To further illustrate the issue of using AES/EBU between the
audio processor and exciter, consider the
following analysis performed by Omnia. It evaluates the
performance of Omnia equipment with a digital exciter whose
input sample rate is 32kHz. The tests were run using a
several different output sample rates from the audio
processor. In addition to analyzing the performance of the
AES/EBU connection at various sample rates, the analysis is
extended to compare the digital connection to a traditional
analog MPX connection.
FM
Modulation Analysis Using the AES/EBU Transmission Path
Between an Audio Processor and Digital Exciter
Introduction: A hot topic over the past few years has
been the issue of interfacing audio processing and
digital exciters in the FM transmission plant. The issue, as
discussed, has been centered around which sampling
rate works best for AES/EBU interfacing, and what additional
anomalies transpire in this configuration. This analysis
reflects some recent observations made in one of the
author’s test facility regarding FM modulation control when
using a digital exciter, and modulated via the AES/EBU path.
For comparison’s sake, observations were also made using the
conventional composite multiplex (MPX) input to the exciter.
Following is an analysis that reveals modulation performance
using a popular Digital FM Exciter. There have been quite a
few questions from customers who want to understand which
interface method is more desirable for broadcast usage…AES/EBU
or the conventional MPX input. The findings are based upon
tests that were done using an Omnia.6 audio processor, as
well as, the Optimod 8400 system. It must be pointed out
that these tests were done using the two units for
comparative reasons only, not to compare the performance of
one against the other! The objective was to show the
consistency of our findings, and that it didn’t matter which
processor was used. As it will be revealed, they both
perform relatively the same, within 1% of each other.
Test Setup: The test setup was quite basic. The goal
was to observe FM modulation when the audio
processor was connected directly to the FM Exciter using the
AES/EBU input. The Exciter was configured
to operate into a dummy load, which also was provided an RF
tap for the modulation monitor. The monitor used was the
popular Belar Wizard System, which included the PC
monitoring function so we could grab histograms for usage in
this report.
The test setup was as follows:
1. CD player connected directly to processor. Aggressive
program material of rock music chosen.
2. Processor operating with AES output.
3. Aggressive preset <CHR> chosen
4. Main Clipper set to +3.0dB. This is quite extreme!
5. Output sample rates tested at 32kfs, 44.1kfs, and 48kfs.
6. Exciter operating into RF Dummy load.
7. Tap off of RF output connected directly into Belar Wizard
Modulation Monitor.
8. Measurements made using Belar Remote PC S/W.
While this setup does not exactly replicate a radio station
facility, it does provide the basis to determine if the
audio processor and exciter are capable of generating
well-controlled peak levels under modulation. To make the
tests equal in rigor, the exact same segment of program
material was used throughout testing. This removes any
possible inaccuracies in the results. The audio sample was
from the Talking Heads CD, “Stop Making Sense.” NOTE: All
testing was performed with the Digital Exciter’s internal
limiter set to OFF, and the MPX testing was done without
using any composite processing in either the Omnia.6 or
8400.
Test Results: The results were revealing, and for a
number of reasons.
1. When using the AES/EBU path, there does still seem to be
some overshoot present. The good news is that it is not at
the wild levels that were observed with older generation
exciters. The overshoot component appears to
be somewhere between 2% - 4% of modulation. This was
observed on BOTH the Omnia.6 and Optimod 8400 processors.
2. The output sampling rate does seem to effect the
characteristics of peak control. When operating the audio
processors in the 32kfs mode, overshoot appears to occur a
bit more than when set to 44.1kfs or 48kfs. This was
observed in both processing systems. Of note, is that the
overshoot component level did not really change when using
the lower rate, it just occurred a bit more often.
3. Observed was a new anomaly that bears discussion. The
Digital Exciter itself seems to have a bobble in modulation
of approximately 1%. This was confirmed when it could not be
set for total modulation to a precise level using a 400Hz
tone. The test engineers tried to set the modulation
reference via a tone using both Omnia.6 and the Optimod
8400. Neither unit was capable of being set for a precise
modulation level of 100% on the Belar
Wizard. There was always a bobble of a few tenths of a
percent.
This last item may help explain the modulation uncertainty
that exists under certain program conditions. Observed were
moments where there is a modulation spike of a few percent.
This bobble that was uncovered would help explain why this
happens from time to time. In tests, this was observed on
both the Omnia.6 and 8400 processor, which leaves one to
believe that there’s a systemic issue within the Exciter.
Consider that this bobble can create an uncertainty of +/-1%
that results in an overall 2% modulation error factor. When
you add this to the minor overshoot anomalies of Sample Rate
Conversion, it becomes a bit clearer as to what is
happening.
Following are graphics that depict the modulation
performance of both the Omnia.6 and Optimod 8400. These
display performance at the three popular sampling rates of:
32kfs, 44.1kfs, and 48kfs.






It needs to be pointed out that these screen shots show the
PC display of the Belar Wizard software after they
have been magnified to a considerable extent. Any peak
uncertainties are only over a magnitude of a few
percent, which does not adversely affect coverage, loudness,
or interference due to FM deviation. As the screen shots
indicate, the level of overshoot is reduced when the
sampling rate is increased.
There is an explanation as to why there appears to be a bit
more grass-like spikes on the Omnia. It has to do with the
low pass filtering in the AES/EBU path, which employs a
15kHz low pass filter that does not truncate the passband to
a zero stopband level at 16kHz. The rolloff slope of our
filter is a bit more gentle, and thus there is a negligible
amount of spectrum that exists in the 16kHz to 17.5kHz area.
It is this spectra that is causing the slight amount of
overshoot in the sample rate conversion filter of the
Digital Exciter. The 8400 employs a base sampling rate of
32kfs, and thus must adhere to this tighter filter
performance at 16kHz.
The question can be asked why not follow the same criteria?
It was a design decision, and judgment, that
employing a tighter filter slope of this magnitude will
degenerate the audio quality. Research and testing, during
the design phase revealed this. Considering that the
magnitude of overshoot is not of any additional significance
as compared to the 8400, it was decided to remain with the
existing design. While the amount of modulation uncertainty
appears a bit more often in the Omnia.6, it is not, in any
way, a deterrent to the modulation performance. As the
screen shots indicate, there is modulation uncertainty with
BOTH processing systems. Had the performance of one system
varied considerably over the other, then, there would be a
reason for concern. But based upon these findings, the
observed differences appear to be moot.
Now to contrast the above results, the same tests were run
using the MPX input on the Digital Exciter. Notice that the
performance of both systems appears to be relatively the
same, and there is still some negligible overshoot. It is
believed that this is due to the +/-1% modulation bobble as
described earlier.


It is quite clear that modulation uncertainty exists under
the present AES/EBU connectivity method. History has
shown that this was never the case when the MPX output
signal of an audio processor was connected to
the MPX input on an analog exciter. Precision peak control
that was achieved in the audio processor was passed on
through to the exciter, and that would be the performance of
the deviation in the modulator section.
The same performance that was achievable with the older
generation analog exciter must be capable in the
newer digital devices.
Adding HD Radio to the Mix
The addition of HD Radio creates some interesting
propositions. At present, there’s the need to employ two
separate exciters: one for the conventional (analog) signal,
and one for HD Radio. Seems a bit cumbersome
to the thinking here.
Now is the time to design a single exciter solution, and at
the same time, finally, address the modulation
overshoot issue that does exist for the conventional
channel.
Digital MPX (D-MPX)
In the discussion section about the digital exciter,
numerous options were explained about the interfacing
possibilities of the audio processor to the exciter. All of
them revolve around the usage of the AES/EBU input
protocol. In that configuration the audio data arrives in
Left/Right format and requires the exciter to perform
the MPX generation.
Question: Why can’t the digital audio processor, which
already has the MPX encoder inside, be able to connect
its digitally generated baseband signal directly to the
digital modulator of the exciter? This would be analogous to
the analog composite input on any exciter. It is of interest
to the authors why any of the digital exciters available
today do not provide, or propose an application like this.
It provides the best possible coupling to the exciter, and
the performance benefits are significant. Imagine having the
power of a complete digital processing system and integrated
stereo generator that is directly connected to a digital
modulator. Now we’re talking about super efficient
modulation capability. Zero overshoots due to added
emphasis, coding, or sample rate converters. That
would be real power!

Fortunately, a solution is on the horizon. Nautel limited is
adding this feature to their latest FM exciter, the M50.
Nautel and Omnia have agreed on a format for passing a
digital MPX data stream serially using RJ45/CAT-5. The M50
exciter is Nautels first exciter to support hybrid and all
digital HD-R. While the data format is not standard it is
based on common serial protocol supported by a variety of
CODECs and DSPs. Essentially, the interface is synchronous
with the exciter providing a clock to the audio processor
and the audio processor providing a data clock, data and
frame sync on separate signals. All signals are differential
LVDS. The sampling rate of this interface is approximately
372kHz which is almost 7 times the bandwidth of the MPX
signal (without SCAs).
Naturally, this type of configuration would require
installing the processing at the transmitter facility, since
transporting a digital composite signal of this speed and
size would be cost prohibitive. Locating the audio
processing at the transmitter is not a problem, as all
current generation digital processors provide some form
of computer control via modem, or network.
Conclusion
The total digital transmission path is capable of providing
outstanding performance results. To achieve this, audio
processing must be inserted at the.transmitter site, and a
“flat” input should be used on the digital exciter. If a STL
system is employed, a linear system would be preferable, but
a high bitrate coded system is acceptable as long as the
dynamics processing occurs after the coding.
The goal here is to digitally achieve the same technical
model as the accepted method done in analog:
Interconnection of the mpx signal from an audio processor to
the modulator stage of the exciter. History has proven that
this method yields superior peak control results, and allows
the use of ancillary functions like
composite processing.
Additionally, with HD Radio now a reality, there is reason
to offer a complete digital excitation system that
operates 100% in the digital domain, yet it provides input
capabilities for any type of input, be it AES/EBU,
or mpx.
As long as the systems design engineer in a broadcast
facility is aware of these critical issues, there is no
reason why an all digital broadcast facility can not exist
today and not provide exceptional quality broadcasting.
As technological development continues in a manner where
products can provide faster and more powerful
digital delivery methods, the concerns shared here will
continue to dissipate until they are rendered meaningless.
REFERENCES
[1] Baher, H. Analog & Digital Signal Processing, J.
Wiley & Sons, 1990
[2] Foti, F. Critical Issues And Considerations For An
All Digital Transmission Path, NAB Convention, 1998
|
|