Beruflich Dokumente
Kultur Dokumente
Ken Evans
rt.kevans@gmail.com
Digital Clock and Data Recovery for High-Speed Serial Data Links 1
Typical 6 Gbps SATA / SAS PMA Layer
• Adaptive RX equalization
at 6.0 Gbps.
• Digital clock and SS-
modulated data recovery.
• 1:10 ser/des to
accomodate 8b/10b
coding in PCS layer.
• SSCG MPLL serving
shared RX/TX frequency
synthesizer.
• 1b transmitter de-
emphasis, programmable
to -6 dB, -3.5 dB
• Link scaling for legacy
modes (1.5, 3.0 Gbps).
Digital Clock and Data Recovery for High-Speed Serial Data Links 2
Typical 5 Gbps PCI-E or USB PMA Layer
• Adaptive RX equalization
for 5.0 Gbps USB, none
for PCI-E.
• Digital clock and SS-
modulated data recovery.
• 1:10 ser/des to
accomodate 8b/10b
coding in PCS layer.
• SSCG MPLL serving
shared RX/TX frequency
synthesizer.
• 1b transmitter de-
emphasis, programmable
to -6 dB, -3.5 dB
• Link scaling for legacy
PCI-E (2.5 Gbps).
Digital Clock and Data Recovery for High-Speed Serial Data Links 3
Combining Macros into Multilane SERDES
ASIC
Digital Clock and Data Recovery for High-Speed Serial Data Links 4
Notes on SERDES IP Macros
– Multiple applications / standards can be satisfied
with universal macros – PMAs are almost identical.
– Signaling rates can be dialed in by scalable link
design.
– PCS layer functions tend to be universal, i.e.
scrambling, 8b/10b coding, BIST, loopback modes.
– RX / TX equalization optional depending on
channel medium.
• SATA 3.0, USB 3.0 require RX EQ, TX de-emphasis.
• PCI-E 2.0 does not require RX EQ up to 34” FR4.
– Mostly digital implementations mandatory at this
point.
• Multi-lane solutions demand minimal thermal / area footprint.
• Quick process migrations require maximum portability.
• Macros must be digitally programmable to accommodate
specific application specs and usage customizability.
Digital Clock and Data Recovery for High-Speed Serial Data Links 5
Relevant CDR Blocks
• Digital PLL (DPLL)
recovers digital phase
error to drive loop into
lock.
• Varying degrees of data
deserialization
embedded within CDR.
• Digital-to-phase
conversion bridges digital
and analog domains.
• Frequency synthesizer
required to produce clock
which is ideally matched
in frequency with RX
data.
• Loop dynamics are
governed by both linear
and non-linear behavior.
Digital Clock and Data Recovery for High-Speed Serial Data Links 6
Basic DPLL Structure
Block-level Arrangement
• DPLL phase adjustment
accomplished by rotating
the phase of clock
produced by VCO.
• DPLL frequency
adjustment accomplished
by adjusting rate of
phase rotation.
• DPC is the DPLL’s
“VCO”, but does not
System Representation integrate.
• Simplistically, phase
error sampler is viewed
as a slicer. All remaining
elements are linear.
• Non-linear loop dynamics
difficult to analyze in
closed form….
Digital Clock and Data Recovery for High-Speed Serial Data Links 7
Basic !!PD Mechanics
• !!PD 2x oversamples
data by 00(I) and 900(Q)
clock phases.
• In locked state, I clock
samples data at the eye
center and Q clock
samples zero-crossings.
• Early and late phase
errors produced by
XORing sliced data and
edge samples.
• No error produced when
no transition present –
important feature!
• Early, late, and no-
transition states are all
mutually exclusive.
• !!PD never produces
stable output – causes
“dither” jitter at lock point.
Digital Clock and Data Recovery for High-Speed Serial Data Links 8
!!PD Behavior in Presence of Jitter
• Clock and data jitter
causes !!PD to have a
“stochastically” linear
range.
• At any given phase error,
there are odds “P” that
the edge clock will
sample Dk, and “1-P” that
it will sample Dk-1.
• “Linear” range of !!PD is
the probability
distribution of edge
samples over phase
error.
• K!!pd is the slope of the
distribution at zero phase
error.
• Jitter is a combination of
rj and dj components.
Digital Clock and Data Recovery for High-Speed Serial Data Links 9
!!PD Behavior in Presence of Jitter (cont’d)
Digital Clock and Data Recovery for High-Speed Serial Data Links 10
Three PRBS-31 RX Test Subjects
Digital Clock and Data Recovery for High-Speed Serial Data Links 11
Simulated !!PD Transfer Curves
Digital Clock and Data Recovery for High-Speed Serial Data Links 12
Effect of Sampler Offsets
Digital Clock and Data Recovery for High-Speed Serial Data Links 13
Typical Error Generator Implementation
• CDR loop cannot process error symbols at baud rate – digital filter cannot move that
fast.
• Half-rate I/Q clocks from DPCs perform 1:2 data and edge deserialization.
• Symbols are aligned and deserialized 2:10, and half-rate clock is divided by 5.
• Array of unit !!PDs are used to generate 10b error sequence, which is eventually
decimated into a signed, 2b error signal moving at 1/10th the baud rate.
Digital Clock and Data Recovery for High-Speed Serial Data Links 14
Sampling and Deserialization
Digital Clock and Data Recovery for High-Speed Serial Data Links 15
2T Sampling and Alignment Timing
Digital Clock and Data Recovery for High-Speed Serial Data Links 16
Deserialization Timing (Data)
Digital Clock and Data Recovery for High-Speed Serial Data Links 17
Deserialization Timing (Edge)
Digital Clock and Data Recovery for High-Speed Serial Data Links 18
10b !!PD Implementation
Digital Clock and Data Recovery for High-Speed Serial Data Links 19
10b !!PD Timing
Digital Clock and Data Recovery for High-Speed Serial Data Links 20
Error Decimation by Majority Voting
• Majority voting used to
summarize 5b error
signal.
• Democratic process is
inherently non-linear,
must be simulated to
determine effect on
DPLL loop gain.
• MUX matrix maintains a
“tally” of early, late, and
no-transition bits within
the 5b error word.
• Might require inter-stage
retiming depending on
MUX implementation.
Digital Clock and Data Recovery for High-Speed Serial Data Links 21
Majority Voting Simulation
Digital Clock and Data Recovery for High-Speed Serial Data Links 22
DPLL Loop Filter Design
Digital Clock and Data Recovery for High-Speed Serial Data Links 23
DPLL Loop Filter Design (cont’d 1)
• 2nd order filter provides both phase and frequency integration.
• Phase integration path accumulates phase differences between
sampling clocks and the RX data stream.
• Frequency integration path accumulates frequency differences
between analog VCO and RX data stream. Allows DPLL to track
static errors due to host and receiver crystal frequency mismatches
(+/-300 ppm), and time-varying errors due to spread-spectrum clock
modulation (0 to -5000 ppm).
• Gain implemented by bit shifting, i.e.
• (+1) x 22 is ’01’ ‘0100’ (+4)
• (-1) x 22 is ’11’ ’1100’ (-4)
• Attenuation implemented by truncating dither bits, i.e.
• (+4) x 2-2 is ’0100’ ’01’ (+1)
• (-4) x 2-2 is ‘1100’ ’11’ (-1)
Digital Clock and Data Recovery for High-Speed Serial Data Links 24
DPLL Loop Filter Design (cont’d 2)
• Phase path has gain phug = 2{3,4,5,6} x 2-6.
• Frequency path has gain frug = 2{0,1,2,3} x 2-12.
• Frequency integrator requirements:
• Must accumulate frequency error until integrator saturates: adder must never
overflow or underflow.
• Frequency error can be either positive or negative polarity: adder must perform
signed (2’s complement) arithmetic.
• Phase integrator requirements:
• DPC has full-scale rotation range of 3600: adder must overflow and/or underflow
to rotate DPC beyond this range.
• DPC maps unsigned binary code to phase value, adder must be unsigned as
well.
Digital Clock and Data Recovery for High-Speed Serial Data Links 25
DPLL Frequency Tracking
• RX data stream is generally SS-
modulated by host-side TX.
• RX clock is asynchronously SS-
modulated by MPLL unit.
• Worst-case frequency difference
occurs when modulations are 1800
apart – 5000 ppm.
• Including static frequency
differences, DPLL must elastically
track up to +300 ppm to -5300 ppm
over time.
• Frequency integrator must have
sufficient bits to accommodate, with
integrator loss (2-6), decimation
factor (10), and DPC resolution (2-9).
• Frequency resolution is the ppm
value that the DPLL can change
recovered clock frequency per
update time.
Digital Clock and Data Recovery for High-Speed Serial Data Links 26
DPLL Frequency Tracking (cont’d)
• DPLL step response
simulations performed with
and without static frequency
offset.
• 9b DPC resolution yields
512 samples per UI.
• 514 samples are allocated
between each retiming
interval.
• Toffset = 106 x (514/512 – 1)
= 3900 ppm.
• Slope of accumulation
proportional to amount of
offset tracked by DPLL.
• Slope of accumulation will
time-vary according to RX
data spread-spectrum
modulation.
Digital Clock and Data Recovery for High-Speed Serial Data Links 27
CDR Small-Signal Model (z-1)
Digital Clock and Data Recovery for High-Speed Serial Data Links 28
CDR Small-Signal Model (s)
• For operating frequencies much lower than the update rate of the DPLL, Euler
difference equation can be used to transform z-domain to s-domain.
• Loop gain L(s) has the same form as that of a typical charge-pump APLL, but with an
added exponential term that models the digital latency of the DPLL.
• In the simplistic case where latency is omitted, the DPLL tracking response has
identical form with that of the APLL counterpart.
• Loop bandwidth and transfer peaking can both be designed through this
transformation.
Digital Clock and Data Recovery for High-Speed Serial Data Links 29
CDR Frequency Response
• DPLL bandwith and peaking response is sensitive to CDR update rate and jitter.
• For fixed σj, Tupd, bandwidth can be extended and peaking suppressed by increasing
phase update gain (phug).
• Serial data link jitter should be empirically observed, and phug and frug values should
then be chosen to yield desired bandwidth and peaking.
Digital Clock and Data Recovery for High-Speed Serial Data Links 30
CDR Frequency Response (cont’d)
σj = 0.01UI σj = 0.03UI σj = 0.05UI
Tupd = 2ns Tupd = 2ns Tupd = 2ns
phug frug BW peaking phug frug BW peaking phug frug BW peaking
(MHz) (dB) (MHz) (dB) (MHz) (dB)
Digital Clock and Data Recovery for High-Speed Serial Data Links 31
Effect of Loop Latency
σrj = 0.03UI σrj = 0.03UI
phug = 2-3 phug = 1
frug = 2-12 frug = 2-12
• Low loop bandwidth settings are relatively insensitive to increases in digital latency.
• High loop bandwidth settings see a severe degradation in phase margin.
• Bandwidth and peaking values shift – be careful!
• Lower phase margin causes the loop to have higher quality factor – bad for jitter transfer.
• Limit cycle jitter worsens due to lengthened phase correction time.
Digital Clock and Data Recovery for High-Speed Serial Data Links 32
Jitterless CDR Behavior
• A jitterless CDR does not
obey linear loop dynamics
at all.
• A 5b step in ϕdata causes
the DPLL to slew towards
the lock point.
• The lock point is
characterized by a 1 lsb
prbs dithering sequence,
which is known as “limit
cycle” behavior.
• Dithering is caused by the
full-scale meta-stable
output of the !!PD, and
characterized by σ!!pd.
• Dither magnitude will grow
beyond 1 lsb if loop latency
increases – another reason
to retime sparingly.
Digital Clock and Data Recovery for High-Speed Serial Data Links 33
Jittered CDR Behavior
Digital Clock and Data Recovery for High-Speed Serial Data Links 34
CDR Jitter Tolerance
• Jitter Tolerance (JT) testing
is a frequently-used
SERDES compliance test.
• Measure of sinusoidal jitter
tracking capability.
• Compliant CDRs do not
penetrate the mask
definition.
• Mask typically has a corner
frequency of rb / 1667.
• Low-jitter environments can
meet the mask specification
over a variety of loop
settings, as shown.
Digital Clock and Data Recovery for High-Speed Serial Data Links 35
CDR Jitter Tolerance (cont’d)
• As jitter increases, the lower bandwidth settings start approaching the mask
boundary. For high jitter conditions, some settings cause mask violations.
• For a given level of RX jitter, the DPLL requires relatively low bandwidth settings to
maintain good phase margin and jitter transfer performance. But Jitter tolerance
requires higher bandwidth settings….
• Conclusion: jitter transfer and jitter tolerance are conflicting design goals and must be
balanced.
Digital Clock and Data Recovery for High-Speed Serial Data Links 36
References
1. J. Lee et al., “Analysis and Modeling of Bang-Bang Clock and Data Recovery
Circuits”, IEEE Journal of Solid-State Circuits, Vol. 39, No. 9, September 2004, pp.
1571-1580.
2. J. Sonntag et al., “A Digital Clock and Data Recovery Architecture for Multi-Gigabit/s
Binary Links”, IEEE Journal of Solid-State Circuits, Vol. 41, No. 8, August 2006, pp.
1867-1875.
3. P. K. Hanumolu, et al., “A Wide-Tracking Range Clock and Data Recovery Circuit”,
IEEE Journal of Solid-State Circuits, Vol. 43, No. 2, February 2008, pp. 425-439.
Digital Clock and Data Recovery for High-Speed Serial Data Links 37