Beruflich Dokumente
Kultur Dokumente
where SNRu (pe , Bu ) and SNRo (pe , Bo ) are the signal-to-noise Fig. 2. Channel response h = [0.0949, 0.2539, 0.1552, 0.0793, 0.0435,
ratios (SNRs) needed by a CUA and a BOA, respectively, to 0.0356, 0.0220] with L = 7 of a 20 backplane channel carrying 10 Gb/s
achieve a BER equal to pe with identical receiver processing, data [19].
and Bu and Bo are the resolutions of the CUA and the BOA,
respectively. This metric quantifies the reduction in the channel link receiver. In particular, we propose two channel-dependent
SNR necessary in order to achieve a given BER pe due to the parameters to quantify these benefits: 1) m-clustering value,
use of a BOA. The ADC shaping gain is analogous to a coding and 2) the threshold non-uniformity metric ht . Furthermore, we
gain when evaluating links with error control coding. It was show that the BER improvement is greater than 106 when m
shown in [12] and [13] that SG(1012 , 3, 3) ranged from 2.5 dB 5 and ht 0.8 for a family of channels. Second, we present the
to more than 30 dB for highly dispersive channels. We note that design of a 4 GS/s, 4-bit BOA IC in a 90 nm CMOS process that
a BOA employs representation levels that are dependent on sig- includes a single-core, multiple-output passive DAC to enable
nal statistics and BER, and hence are typically non-uniformly a variable-threshold and variable-resolution ADC configura-
spaced within the FSR of the ADC. The work in [12] and [13] tion and verify the aforementioned analysis. Measured results
also showed that the non-uniform BOA representation levels demonstrate that a 3-bit BOA has lower SNR requirement than
are significantly different from and superior to the non-uniform a 4-bit CUA, thereby supporting the claims in [12], [13], and
(also signal statistics-dependent) representation levels obtained [16] in the presence of non-idealities, such as finite sampling
from the well-known Lloyd-Max (LM) quantization algorithm bandwidth and metastability. In the process, we demonstrate
[14], [15]. This is because the LM algorithm minimizes the conclusively that ENOB is not the best metric when designing
SQNR, which is also a fidelity metric. In [16], it was shown ADCs for serial links. Third, we propose architectures to im-
that BOA can relax the component specifications of ADCs. In plement the gradient descent algorithm in [12] and [13] which
particular, BOA can achieve the same or even better BER while is employed to compute the representation levels iteratively.
it has less stringent metastability and preamplifier bandwidth In particular, architectures for the quantization level update
requirements on the ADC comparators. unit (QL-UD) and the individual representation level update
A power optimized ADC-based 10 Gb/s serial link receiver in (RL-UD) block are proposed.
65 nm CMOS was designed in [4] using a low-gain analog and The rest of this paper is organized as follows. Section II re-
mixed-mode pre-equalizer in conjunction with non-uniform views the theory behind the BOA. Section III discusses how to
representation levels for the ADC. The work in [4] and [17] maximize the benefits of a BOA receiver over a CUA receiver.
proposes to merge slicers whose thresholds are similar into In Section IV, the design of a 4 GS/s, 4-bit BOA IC in a 90 nm
one for loop-unrolled decision feedback equalizers (DFEs) and CMOS process is described. Both stand-alone ADC and link
adjusts a pseudo-BER metric (voltage margin) to minimize measurement results are summarized in Section V, and conclu-
BER, which in effect emulates a BOA followed by a DFE. sions are drawn in Section VI.
However, this technique is applicable only to loop-unrolled
DFEs, and has to rely on a continuous-time linear equalizer to
II. BOA R EVIEW
cancel the precursor ISI. In contrast, our solution employs the
true system BER to adjust the ADC thresholds, and is applica- In this section, the concept of the BOA is reviewed. Fig. 1(a)
ble to ADC-based receivers with different equalizer designs. depicts a typical ADC-based serial link, where the ADC is fol-
Second, as mentioned in [17], their procedure to determine lowed by an equalizer prior to detection. When considering an
the optimal threshold placement is not suitable for online cali- equivalent discrete-time, symbol-spaced, time-invariant chan-
bration. In contrast, our solution employs a standard gradient nel corrupted by additive white Gaussian noise (AWGN), and
descent algorithm to compute the thresholds iteratively, and 2-PAM modulation, the channel output at time n is given by
is able to adapt the ADC thresholds online. More recently,
Son et al. [18] proposed a power efficient equalizing receiver
L1
xc [n] = xc (nT ) = h[i]b[n i] + v[n]
front-end that includes a two-step adaptive BER-minimizing i=0
equalizer algorithm. These works demonstrate that the use of
information-based metrics such as the BER are indeed quite where b[n] {1} is the transmitted sequence, h[n] is
effective in reducing link component power in serial links. the equivalent discrete time channel impulse response (see
In [13], BOA benefits under various channels, modulation Fig. 2 for an example) with memory L, and v[n] is AWGN
and equalization techniques were studied and presented. This with variance v2 . At the receiver, the processor estimates the
work augments the results presented in [13] with the following transmitted symbols from quantized channel outputs through
three new contributions. First, we study, through analysis and the ADC. A subsequent slicer determines the transmitted bit.
simulations, the benefits of the BOA over the CUA in a serial A subsequent slicer determines the transmitted bit. Fig. 2 shows
LIN et al.: A STUDY OF BER-OPTIMAL ADC-BASED RECEIVER FOR SERIAL LINKS 695
B. An Illustrative Example
A BOA [12] exploits signal statistics to maximize the prob-
ability of correctly detecting the transmitted bits. To pro-
vide insight into the operation of a BOA, we consider an
ADC followed by a memoryless symbol-by-symbol maximum
likelihood (ML) detector (ADC-ML) receiver, as shown in
Fig. 3(a) and provide an example to illustrate the point.
First, we define the set of quantization thresholds for the
CUA and the BOA.
Definition 1: The vectors tu = [tu,1, tu,2, . . . ,tu,M ], ru = [ru,1,
ru,2 , . . . , ru,M+1 ], and the set I u = {Iu,1 , Iu,2 , . . . , Iu,M+1 }
are the thresholds, output representation levels, and interval set
of a log2 (M + 1)-bit CUA, where Iu,1 = (, tu,1 ], Iu,k =
[tu,k1 , tu,k ] for k = 2, . . . , M , Iu,M+1 = [tu,M , +), and the
CUA output x[n] = ru,k if xc (nT ) Iu,k for k = 1, . . . ,
M + 1. Similarly, the vectors to = [to,1 , to,2 , . . . , to,N ], ro =
Fig. 3. An illustrative example: (a) the block diagram of an ADC-ML [ro,1 , ro,2 , . . . , ro,N +1 ], and the set I o = {Io,1 , Io,2 , . . . , Io,N +1}
receiver, (b) the conditional pdf of the channel output given b[n] = 1,
(c) the conditional pdf of the channel output given b[n] = +1, (d) BOAs are the thresholds, output representation levels, and interval set
quantization thresholds (inverted triangles in yellow) and uniform quantization of a log2 (N + 1)-bit BOA, where Io,1 = (, to,1 ], Io,k =
thresholds (dashed lines in red) for channel h = [0.08, 0.07, 0.1, 0.04] when [to,k1 , to,k ] for k = 2, . . . , N , Io,N +1 = [to,N , +), and the
SNR = 36 dB, and (e) the simulated BERR(SNR, 4, 3), peo (SNR, 3) and
peu (SNR, 4) versus SNR plot.
BOA output x[n] = ro,k if xc (nT ) Io,k for k = 1, . . . , N +1.
Consider a 4-tap channel with impulse response h = [0.08,
an example of the channel response h for a 20
backplane 0.07, 0.1, 0.04]. The conditional probability density functions
channel carrying 10 Gb/s data [19]. (pdfs) P (xc [n]|b[n] = 1) and P (xc [n]|b[n] = +1) corre-
sponding to the marginal pdf of the channel output conditioned
on the bit b[n] being either 1 or +1 at time n, are illustrated
A. Comparison Metric
in Fig. 3(b) and (c), respectively. In a CUA, the thresh-
Comparison of a BOA and a CUA requires an appropriate olds are set uniformly within the ADCs FSR. Assuming
metric to be defined. The ADC shaping gain SG(pe , Bu , Bo ) that the FSR is [0.3, +0.3], a 4-bit CUA will have its
in (1) is one such metric. In applications where it is difficult thresholds at tu = [0.26005, 0.2290, 0.18575, 0.14875,
to measure the underlying circuit and environmental noise, 0.1145, 0.0743, 0.03715, 0] [Fig. 3(d)]. In contrast, the
however it is easy to maintain consistent operational conditions thresholds in a BOA are positioned at the crossover points of the
between two experiments, comparing the ratio of the resulting two conditional pdfs. For this example, the BOAs thresholds
bit-error-rates, or BER ratio may be of interest. Similarly, are found to be to = [0.11, 0.08, 0.03, 0] [see Fig. 3(d)]
when two systems are being compared, and only one of the as there are 7 crossover points. Thus, the BOA illustrated
two experiences an exponential decay in BER with SNR as here is a 3-bit ADC. Fig. 3(e) shows that the BOA achieves
shown in Fig. 3(e), it may not be possible to measure a shaping a 1 bit reduction in the ADC resolution while achieving
gain in SNR (i.e., the additional SNR required to achieve BERR(40, 4, 3) 108 and SG(103 , 4, 3) 8 dB.
the same BER) given a target BER, however at a given SNR, In order to compute to , we need the following definition of
the ratio of the measured BERs may be readily measurable. noise-free channel outputs:
696 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 63, NO. 5, MAY 2016
Definition 2: The -set of a channel h = [h0 , h1 , . . . , hL1 ] III. ACHIEVABLE BER I MPROVEMENT VIA BOA
is an ordered set defined as = {+ }, where both + =
2L1 2L1
In this section, we study through analysis and simulations
{+l }l=1 and = {l }l=1 are ordered sets of noise-free how to maximize the benefits of the BOA over the CUA. Note:
channel outputs conditioned on the transmitted symbol b[n] a BOA receiver always achieves the same if not better BER
taking the value +1 and 1, respectively. The , + and as compared to a CUA receiver, given the same number of
sets have elements in ascending order. bits, channel and noise power, because a CUA is a special
In general, the N thresholds of a BOA for an ADC-ML re- case of the BOA. An important question to ask is under
ceiver can be obtained [12] as the N solutions for the unknowns what conditions are the benefits offered by a BOA over a
{to,i } (i = 1, . . . , N ) to the following equation: CUA substantial. Specifically, we wish to determine channel
2
L1 conditions under which BERR(SNR, Bu , Bo ) is say at least
L+1
10 . BERR(SNR, Bu , Bo ) is empirically observed to depend
2 N to,i ; +
l , n strongly on the difference between the CUAs and the BOAs
l=1
2
L1 thresholds, the number of adjacent noise-free channel outputs
with opposing signs, channel SNR and the ADC resolution.
= 2L+1 N to,i ;
l , n , (i = 1, . . . , N ) (2)
l=1
We therefore discuss the relationship between these factors
on BERR(SNR, Bu , Bo ). We restrict our analysis to channels
2 2
where N (x; , n ) = (1/ 2n2 )e((x) )/2n ) , N 2L 1, with memory L < 7 in order to enable the derivation of useful
l and
+
l (1 l 2
L1
) are the 2L1 noise-free channel out- insights analytically. Note that the performance of BOA for
puts (see Definition 2). In the example shown in Fig. 3, h = channels with large memory L > 7 has been studied in [13].
[0.08, 0.07, 0.1, 0.04], L = 4, {+
8 In this section, for tractability of analysis, we assume that the
l }l=1 = {0.09, 0.01, 0.05,
ADC (BOA or CUA) is followed by a memoryless symbol-by-
0.07, 0.13, 0.15, 0.21, 0.29} and {
8
l }l=1 = {0.29, 0.21, symbol ML decoder and that BPSK signaling is used over a
0.15, 0.13, 0.07, 0.05, 0.01, 0.09}.
known channel with impulse response h. Thus, dropping the
time index n, and employing the notation Xc and Xu to
C. BOA With a Linear Equalizer (LE) represent the random variables (RVs) corresponding to xc (nT )
and x[n], respectively, for a CUA, we have
Consider a BOA followed by a K-tap linear equalizer (LE)
with taps w = [w0 , w1 , . . . , wK1 ], such that the equalizer out- P (Xu = ru,k |b) = P (Xc Iu,k |b), (k = 1, . . . , M + 1)
put yeq [n] = K1k=0 wk x[n k]. In a BOA, the representation
levels r o = {ro,1 , ro,2 , . . . , ro,N +1 } and the thresholds to are where Xu {ru,1 , ru,2 , . . . , ru,M+1 }, ru,k is the kth CUA
chosen to minimize the link BER. Obtaining a closed form representation level (see Definition 1). Then, the memoryless
expression for the BOAs representation levels in the presence ML decision rule for a CUA is given by
of channel ISI and a LE is in general intractable. Therefore, u |b=+1)
+1, if PP (X (Xu |b=1) > 1
the gradient descent algorithm [12] is employed to compute the b =
representation levels iteratively as follows: 1, otherwise.
takes odd values only, and m > N at low SNR scenario while
Fig. 5. An illustrative example for do,k , k and du,k .
m = N at high SNR scenario.
Definition 4: The threshold non-uniformity metric ht of a
log2 (N + 1)-bit BOA is a measure of the difference between
to and tu , and is defined as
(N +1)
1 2
to,i to,i1 to,i to,i1
ht = N +1 log2
log2 2 ymax ymax
i=2
to,1 + ymax to,1 + ymax
+ log2 (6)
ymax ymax
where [ymax , ymax ] is the ADC FSR, and only the non-
positive BOA thresholds are used since the BOAs thresholds Fig. 6. Comparison among peo from MC simulation (peo(sim) ), peo esti-
are symmetric about the origin. Note: 0 ht 1, and the mated using (10) (peo(10) ), peu from MC simulation (peu(sim) ), and peu
estimated using (13) (peu(13) ), for channels (a) h = [0.09, 0.1, 0.08, 0.05]
larger the value of ht the closer are the BOAs thresholds to when Bu = 6 and Bo = 3, and (b) h = [0.08, 0.07, 0.1, 0.04] when Bu = 4
those of the CUA. and Bo = 3, respectively.
Fig. 4(a) shows an example when m = 3 for a 4-tap channel
(thus there are 24 = 16 elements in ), and Fig. 4(b) illustrates
Definition 7: Let du,k = min(k tu,k1 , tu,k k ) be the
two examples when ht = 0.7564 and ht = 1, respectively.
minimum distance of the kth CUAs dominant noise-free chan-
Algorithm 1 can be employed to obtain m and ht for a specific
nel output k from the boundaries of the kth interval Iu,k . Then,
channel.
the minimum CUA distance du,min = min1k(M+1) (du,k ).
Fig. 5 shows an example of the marginal pdf of the channel
Algorithm 1 Algorithm to obtain m-clustered value and ht output, illustrating the corresponding do,k , k and du,k .
for an ADC-ML receiver. In the Appendix, we show that BERR(SNR, Bu , Bo ) is
given by
1) Initialize the channel h and the SNR, calculate noise d2 d2
do,min o,min2n2 u,min
variance v2 based on h and the SNR.
e , if du,min > 0
2L1
2du,min d2
2) Define the main cursor of h, calculate + = {+
i }l=1 BERR(SNR, Bu , Bo ) do,min e 2n2 ,
o,min
2L1
if du,min < 0
and = { i }l=1 , respectively, and obtain the ordered
2 n d2o,min
set = { +
}. do,min e 2n2 ,
if d = 0.
8 n u,min
3) Count the number of transitions in , which is the (7)
m-clustered value. Equation (7) indicates that BERR(SNR, Bu , Bo ) increases
4) Obtain t0 using equation (2). with d2o,min or (d2o,min d2u,min ). Furthermore, BERR(SNR,
5) Calculate ht using equation (6).
Bu , Bo ) can be predicted given the channel h (thus , + ,
d2o,min and d2u,min ) and SNR.
Definition 5: Let do,k = min (|to,k |) be the mini- MC simulations of link BER were run with 108 (for SNR
mum distance of the kth BOA threshold to,k (k = 1, . . . , N ) 36 dB) or 1012 (for SNR > 36 dB) samples and for SNR
to the nearest noise-free channel output . Then, the minimum ranging from 18 dB to 40 dB for channels h = [0.09, 0.1, 0.08,
BOA distance do,min = min1kN (do,k ). 0.05] and h = [0.08, 0.07, 0.1, 0.04], respectively. Fig. 6 in-
Definition 6: Each of the (M + 1) intervals Iu,k with k = dicates that the analytical expressions (10) and (13) can predict
1, . . . , M + 1, in a CUA has a dominant noise-free channel the results of the MC simulations to within an order-of-
output k given by magnitude, and thus can be employed to estimate the peo and
peu . Furthermore, as expected, the expressions (10) and (13)
|b=+1)
P (X =r
N x; l , n dx , if P (Xuu =ru,k become more accurate at high SNRs. Finally, it can be seen in
arg max
u,k |b=1)
>1
l Iu,k Fig. 6 that 3-bit BOA achieves a shaping gain of 2 dB (8 dB)
k = over a 6-bit (4-bit) CUA at a BER of 105 (103 ). Fig. 7 shows
arg+ max N x; l , n dx , otherwise.
+ that BERR can also be predicted via (7) to within an order-of-
l + Iu,k magnitude of MC simulations, and that it increases with SNR.
698 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 63, NO. 5, MAY 2016
Fig. 7. Comparison between BERR(SNR, Bu , Bo ) from MC simulation Fig. 9. BERR(SNR, Bx , Bx ) vs. Bx of (a) an ADC-ML receiver, when the
(BERRsim ), and BERR(SNR, Bu , Bo ) estimated using (7) (BERR(7) ) for channel impulse response is h = [0.09, 0.1, 0.08, 0.04] and SNR = 36 dB,
channels (a) h = [0.09, 0.1, 0.08, 0.05] when Bu = 6 and Bo = 3, and and (b) an ADC-LE receiver, when the channel impulse response is equal to the
(b) h = [0.08, 0.07, 0.1, 0.04] when Bu = 4 and Bo = 3, respectively. FR-4 channel (see Fig. 2) and SNR = 36 dB.
Fig. 13. Architectures of: (a) the QL-UD unit, and (b) the ith RL-UD unit.
Fig. 11. Block diagram of the BOA chip.
analog input with the threshold voltages simultaneously and
produces comparison results in the form of a thermometer
code. Thus, a binary encoder following the comparator array
is needed for the ease of back-end digital processing.
C. Comparator Design
to suppress bubble errors. The Gray code is finally converted
The comparator consists of a preamplifier and 3 cascaded to binary code by XOR gates, for the purpose of further DSP
latches, and Fig. 14 illustrates the schematics of the pream- processing in the subsequent blocks. In this chip, pipelined D
plifier and latches. The preamplifier subtracts the analog input flip-flops (DFFs) are used between adjacent logic gates in the
from the threshold and provides polarity of the comparison encoder to guarantee 4 Gb/s operation.
result. The cascaded latches amplify the preamplifier output to
reduce the occurrence of meta-stability. The preamplifier design V. M EASUREMENT R ESULTS
is shown in Fig. 14, which is widely employed for high-speed
ADCs [7], [8]. The first latch is a current mode latch [7], which This section summarizes the measurement results of both the
is composed of an input differential pair (M1 , M2 ) and a cross- stand-alone ADC chip and the ADC-based receiver. A 4-bit
coupled regenerative latch pair (M3 , M4 ) sharing the same 4 GS/s ADC chip is fabricated in a 90 nm low power (LP)
resistive load, RD . When CLK is high, the circuit is in tracking CMOS technology with an active area of 0.33 mm2 and tested
mode with low gain and large bandwidth. When the CLK is in a chip-on-board assembly. The chips micrograph is shown
low, the circuit shifts to the regenerative mode, and the sampled in Fig. 15(a).
signal from the tracking mode is amplified and delivered to First, we ensure the standalone ADC performance is sufficient
the next stage. The second and third latches do not consume to support link operation. Since the ADC chip includes an 8-bit
static power once they fully regenerate, and are referred to DAC for variable-threshold and variable-resolution ADC con-
as dynamic latches [22], [23]. In this design, only the first figuration, we utilize it to configure the ADCs threshold volt-
latch uses a current mode latch because it is the most critical ages for calibration. The measured DNL and INL characteristics
to guarantee accurate comparison results. For example, if the before and after calibration are shown in Fig. 16(a) and (b),
first latch does not have a large enough bandwidth to follow respectively. The DNL and INL are reduced from 1.3 LSB
the updated polarity of the preamplifier output, the comparator and 4 LSB to 0.04 LSB and 0.14 LSB, respectively, after cal-
may generate an incorrect output, regardless of how well the ibration, indicating the effectiveness of the calibration process.
following latches perform. It should be noted that the cascaded Fig. 16(c) illustrates that the ADC can achieve up to 3.4 bits
latches operate in a pipelined manner for speed consideration. of ENOB at near-Nyquist rate input frequency of 1.9375 GHz.
In particular, when the preceding latch is working in a tracking The figure of merit (FOM) of the ADC is 1.42 pJ/conv.step
mode, the subsequent latch is working in the regeneration at 4 GS/s excluding clock buffers, which is comparable to the
mode, and vice versa. state-of-art using a similar technology [8], [9].
Fig. 15(b) shows the block diagram of the link test set-up,
which mainly consists of a BER tester (BERT), channel board,
D. Encoder Design
ADC PCB board, and FPGA board. The BERT provides 4 Gb/s
A Gray encoder is used because it is more compact and faster synchronous data and clock, with the data passing through
than a summing encoder [8]. Since less pipelining is required a 20-inch FR4 channel before entering the ADC board. The
for multi-gigahertz operation, the Gray encoder consumes less ADC chip quantizes the incoming analog signal and its outputs
power than a summing encoder. There are three steps to en- are fed into the FPGA board. The back-end DSP units in the
coding. First, the thermometer code is converted to a 1-of-N FPGA then perform equalization and optimal ADC represen-
code. And then, the 1-of-N code is converted to a Gray code tation levels search. Finally, the updated representation levels
LIN et al.: A STUDY OF BER-OPTIMAL ADC-BASED RECEIVER FOR SERIAL LINKS 701
Fig. 19. Measured BER vs. sampling phase using a 20-inch channel when TX
amplitude is 180 mVppd and the FSR of the CUA is 100 mVppd .
Fig. 17. Measured ADC output: (a) eye diagram, and (b) histogram for a
20-inch FR4 channel at 4 Gb/s when TX amplitude is 180 mVppd and ADC
FSR is 100 mVppd . TABLE I
P ERFORMANCE S UMMARY OF THE ADC-BASED R ECEIVERS
TABLE II
P ERFORMANCE C OMPARISON W ITH S TATE - OF -A RT ADC-BASED
Fig. 18. ENOB and BER measurements: (a) ENOB vs. input frequency, and
R ECEIVERS AND A NALOG R ECEIVERS IN CMOS
(b) BER vs. TX amplitude at 4 Gb/s when the FSR of the CUA is 100 mVppd .
achieving more than 2 higher energy efficiency compared where peo,k is the BER contribution from the kth -transition.
with [2], [3], and [5]. Taking channel loss into account, our so- Specifically, peo,k includes the BER contribution from all the
lution achieves higher (better) figure-of-merit FOM2 (proposed peaks N (x; + l , n ) with l ( N (x; l , n ) with l
+ +
l
in [24]) than [2] and [3]. Implemented in a more advanced tech- ) to the interval [to,k , to,k ) and the BER contribution from
nology and combining several low power circuit techniques, [4] all the peaks N (x;
l , n ) with l ( N (x; l , n ) with
+
achieves a better energy efficiency than this work. However, [4] l ) to the interval [to,k , to,k ) if the memoryless ML
+ + r
only showed measured BER of 107 although an extrapolated decision for the interval [tlo,k , to,k ) is 1 ( +1), where tlo,k and
BER of 1015 was reported. A 2.3-bit (5 comparators) BOA is tro,k are defined as follows
sufficient if the target BER is relaxed from 1012 to 107 based
on simulations, which translates to about 2/7 power savings in , if k = 1
l
the ADC. As a result, the efficiency is improved to 13.0 pJ/bit to,k = to,k1 +to,k
from 15.2 pJ/bit. Furthermore, it should be noted that the , if 1 < k N
2
to,k +to,k+1
power consumption of a flash ADC is mostly determined by , if 1 k < N
its sampling rate and the process technology. Compared to [4], tro,k = 2
+, if k = m.
our ADC has higher sampling rate (4 GS/s vs. 2.5 GS/s in [4])
while being implemented in a slower technology (90 nm LP vs.
65 nm in [4]). Therefore, it is expected that our solution will Note: if the decision for the interval [tlo,k , to,k ) is +1 (or 1)
achieve comparable or better energy efficiency if the sampling then the decision for the interval [to,k , tro,k ) is 1 (or +1). At
rate and process technology were identical. On the other hand, high SNR, this contribution is well-approximated by
Table II shows that energy efficiency and FOM2 of ADC-based
do,k
receivers need to be further improved compared with analog peo,k 2(L1) Q (9)
n
receivers [25][27].
A key outcome of this work is the demonstration of
information-based metric benefits, such as the BER, in reducing where do,k (see Definition 5) is the minimum distance of the
the ADC precision requirements, and the identification of con- kth BOA threshold to the nearest noise-free channel output .
ditions that maximize the BER improvement offered by a BOA Substituting (9) into (8), employing the
high SNR approxi-
mation for the Q-function (Q(y) (1/y 2)ey /2 , for y >
2
receiver over a CUA receiver under the same condition. Al- max(ak )
though we demonstrate BOA concept via a flash ADC, BOA is 0), and the approximation k eak e k , we get
applicable to different ADC architectures in principle, because
N
BOA adjusts the ADC thresholds but does not change the ADC N
(L1)
do,k
architecture. However, the power savings when designing BOA peo (SNR, Bo ) = peo,k 2 Q
n
using other ADC architectures will not be as great as with k=1
k=1
d
2
flash ADCs. In particular, for flash ADC every bit decrease N
n 12 o,k
(L1)
= 2
e n
in resolution almost halves the size of the ADC core circuitry 2d
k=1 o,k
and the power. In contrast, for a SAR, pipelined, or sigma- d 2
(L1) n 1 o,min
delta ADC, the die size and power will decrease linearly with a 2 e 2 n . (10)
decrease in resolution. 2do,min
max(ak )
keak e k , and the relationship Q(y) = 1 Q(y) [9] S. Sheikhaei, S. Mirabbasi, and A. Ivanov, A 43 mW single-channel
for y < 0, we get 4 GS/s 4-bit flash ADC in 0.18 m CMOS, in Proc. CICC IEEE Custom
Integr. Circuits Conf., Sep. 2007, pp. 333336.
[10] C.-C. Yeh and J. Barry, Adaptive minimum bit-error rate equalization for
peu (SNR, Bu ) binary signaling, IEEE Trans. Commun., vol. 48, no. 7, pp. 12261235,
M+1
M+1
du,k
du,min
Jul. 2000.
L L
= peu,k 2 Q 2 Q [11] E.-H. Chen, W. Leven, N. Warke, A. Joy, S. Hubbins, A. Amerasekera,
n n and C.-K. Yang, Adaptation of CDR and full scale range of ADC-
k=1 k=1 2 based SerDes receiver, in Proc. Symp. VLSI Circuits, Jun. 2009,
L
d
1 u,min pp. 1213.
2
n
e 2 n , if du,min > 0 [12] M. Lu, N. Shanbhag, and A. Singer, BER-optimal analog-to-digital con-
2d u,min 2 verters for communication links, in Proc. 2010 IEEE Int. Symp. Circuits
1 du,min
= 2L 1 + n Syst. (ISCAS), May 2010, pp. 10291032.
e 2 n , if du,min < 0
2du,min [13] R. Narasimha, M. Lu, N. Shanbhag, and A. Singer, Ber-optimal analog-
2(L+1) , to-digital converters for communication links, IEEE Trans. Signal
if du,min = 0. Process., vol. 60, no. 7, pp. 36833691, 2012.
(13) [14] S. Lloyd, Least squares quantization in PCM, IEEE Trans. Inf. Theory,
vol. IT-28, no. 2, pp. 129137, Mar. 1982.
Therefore, from (10) and (13), we obtain [15] J. Max, Quantizing for minimum distortion, IRE Trans. Inf. Theory,
vol. 6, no. 1, pp. 712, Mar. 1960.
BERR(SNR, Bu , Bo ) [16] Y. Lin, A. Xu, N. Shanbhag, and A. Singer, Energy-efficient high-speed
links using ber-optimal ADCs, in Proc. IEEE Electr. Design Adv. Packag.
peu (SNR, Bu ) Syst. Symp. (EDAPS), 2011, pp. 14.
= [17] J. Kim, E.-H. Chen, J. Ren, B. Leibowitz, P. Satarzadeh, J. Zerbe, and
peo (SNR, Bo )
d2 2 C.-K. Yang, Equalizer design and performance trade-offs in ADC-based
o,min du,min
serial links, IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 58, no. 9,
do,min
e 2n2
, if du,min > 0
2du,min d2 d2
pp. 20962107, 2011.
[18] S. Son, H.-S. Kim, M.-J. Park, K. Kim, E.-H. Chen, B. Leibowitz, and
o,min u,min
do,min 2 (14) J. Kim, A 2.3-mW, 5-Gb/s low-power decision-feedback equalizer receiver
2du,min e (1 + ), if du,min < 0
2n
front-end and its two-step, minimum bit-error-rate adaptation algo-
d 2
rithm, IEEE J. Solid-State Circuits, vol. 48, no. 11, pp. 26932704,
do,min e 2n2 ,
o,min
if du,min = 0 Nov. 2013.
8 n [19] IEEE P802.3ap task force channel model material. [Online].
2 2
Available: http://www.ieee802.org/3/ap/public/channel_model/index.
where = ( 2du,min /n )edu,min /2n . Note: do,min du,min html#Backplane
max(ak ) [20] C.-M. Hsu, M. Straayer, and M. Perrott, A low-noise wide-BW 3.6-GHz
and do,min 0. Applying the approximation k eak e k digital fractional-N frequency synthesizer with a noise-shaping time-to-
digital converter and quantization noise cancellation, IEEE J. Solid-State
to the case when du,min < 0, (14) can be further simplified to (7). Circuits, vol. 43, no. 12, pp. 27762786, Dec. 2008.
[21] Stratix II GX Edition transceiver signal integrity development FPGA
ACKNOWLEDGMENT board, Altera Corp., 2003. [Online]. Available: http://www.altera.com/
products/devkits/altera/kit-signal_integrity_s2gx.html
The authors acknowledge discussions with Hyeon-Min Bae [22] G. Al-Rawi, A new offset measurement and cancellation technique for
dynamic latches, in Proc. IEEE Int. Symp. Circuits Syst. (ISCAS 2002),
(KAIST) and Pavan Hanumolu (UIUC). vol. 5, pp. V-149V-152.
[23] J. He, S. Zhan, D. Chen, and R. Geiger, Analyses of static and dynamic
R EFERENCES random offset voltages in dynamic comparators, IEEE Trans. Circuits
[1] H.-M. Bae, J. Ashbrook, J. Park, N. Shanbhag, A. Singer, and S. Chopra, Syst. I, Reg. Papers, vol. 56, no. 5, pp. 911919, 2009.
An MLSE receiver for electronic dispersion compensation of OC-192 [24] S. Saxena, R. Nandwana, and P. Hanumolu, A 5 Gb/s 3.2 mW/Gb/s 28 dB
fiber links, IEEE J. Solid-State Circuits, vol. 41, no. 11, pp. 25412554, loss-compensating pulse-width modulated voltage-mode transmitter,
Nov. 2006. in Proc. IEEE Custom Integr. Circuits Conf. (CICC), 2013, pp. 14.
[2] J. Cao, B. Zhang, U. Singh, D. Cui, A. Vasani, A. Garg, W. Zhang, [25] M. Ramezani, M. Abdalla, A. Shoval, M. van Ierssel, A. Rezayee,
N. Kocaman, D. Pi, B. Raghavan, H. Pan, I. Fujimori, and A. Momtaz, A. McLaren, C. Holdenried, J. Pham, E. So, D. Cassan, and S. Sadr,
A 500 mW digitally calibrated AFE in 65 nm CMOS for 10 Gb/s serial An 8.4 mW/Gb/s 4-lane 48 Gb/s multi-standard-compliant transceiver in
links over backplane and multimode fiber, in Proc. IEEE Int. Solid-State 40 nm digital CMOS technology, in Proc. IEEE Int. Solid-State Circuits
Circuits Conf. Dig. Tech. Papers (ISSCC), Feb. 2009, pp. 370371a. Conf. Dig. Tech. Papers (ISSCC), 2011, pp. 352354.
[3] B. Abiri, A. Sheikholeslami, H. Tamura, and M. Kibune, A 5 Gb/s [26] F. Zhong, S. Quan, W. Liu, P. Aziz, T. Jing, J. Dong, C. Desai, H. Gao,
adaptive DFE for 2 blind ADC-based CDR in 65 nm CMOS, in Proc. M. Garcia, G. Hom, T. Huynh, H. Kimura, R. Kothari, L. Li, C. Liu,
IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers (ISSCC), Feb. 2011, S. Lowrie, K. Ling, A. Malipatil, R. Narayan, T. Prokop, C. Palusa,
pp. 436438. A. Rajashekara, A. Sinha, C. Zhong, and E. Zhang, A 1.062514.025 Gb/s
[4] E.-H. Chen, R. Yousry, and C.-K. Yang, Power optimized ADC- multi-media transceiver with full-rate source-series-terminated transmit
based serial link receiver, IEEE J. Solid-State Circuits, vol. 47, no. 4, driver and floating-tap decision-feedback equalizer in 40 nm CMOS,
pp. 938951, Apr. 2012. IEEE J. Solid-State Circuits, vol. 46, no. 12, pp. 31263139, 2011.
[5] O. Agazzi, D. Crivelli, M. Hueda, H. Carrer, G. Luna, A. Nazemi, [27] J. Poulton, R. Palmer, A. Fuller, T. Greer, J. Eyles, W. Dally, and
C. Grace, B. Kobeissy, C. Abidin, M. Kazemi, M. Kargar, C. Marquez, M. Horowitz, A 14-mW 6.25-Gb/s Transceiver in 90-nm CMOS, IEEE
S. Ramprasad, F. Bollo, V. Posse, S. Wang, G. Asmanis, G. Eaton, J. Solid-State Circuits, vol. 42, no. 12, pp. 27452757, 2007.
N. Swenson, T. Lindsay, and P. Voois, A 90 nm CMOS DSP MLSD
transceiver with integrated AFE for electronic dispersion compensation Yingyan Lin (S10) received the B.S. and M.S. de-
of multi-mode optical fibers at 10 Gb/s, in Proc. IEEE Int. Solid-State grees in electrical engineering from Huazhong Uni-
Circuits Conf. Dig. Tech. Papers (ISSCC), Feb. 2008, pp. 232609. versity of Science and Technology, Wuhan, China.
[6] M. Harwood, N. Warke, R. Simpson, T. Leslie, A. Amerasekera, S. Batty, Since August 2012, she has been pursuing the
D. Colman, E. Carr, V. Gopinathan, S. Hubbins, P. Hunt, A. Joy,
P. Khandelwal, B. Killips, T. Krause, S. Lytollis, A. Pickering, M. Saxton, Ph.D. degree in electrical and computer engineering
D. Sebastio, G. Swanson, A. Szczepanek, T. Ward, J. Williams, at the University of Illinois at Urbana-Champaign,
R. Williams, and T. Willwerth, A 12.5 Gb/s SerDes in 65 nm CMOS IL, USA.
using a baud-rate ADC with digital receiver equalization and clock recov- From 2007 to 2009, she worked at Wuhan Asian
ery, in Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, 2007, Microelectronics CO., LTD., China, where she led
pp. 436591. projects developing high-speed interface circuit IPs
[7] R. Plassche, Integrated Analog-to-Digital and Digital-to-Analog for TOSHIBA Microelectronics Corp. From 2009
Converters. Dordrecht, The Netherlands: Kluwer Academic, 1994. to 2011, she was a visiting scholar at the University of Illinois at Urbana-
[8] S. Park, Y. Palaskas, A. Ravi, R. Bishop, and M. Flynn, A 3.5 GS/s 5-b Champaign under the supervision of Prof. Naresh Shanbhag. Her research
flash ADC in 90 nm CMOS, in Proc. CICC IEEE Custom Integr. Circuits interests are in the design of high-speed mixed signal circuits and low power
Conf., Sep. 2006, pp. 489492. error resilient integrated circuits for machine learning.
704 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 63, NO. 5, MAY 2016
Min-Sun Keel (S12M16) received the B.S. and Elyse Rosenbaum (F11) received the B.S. degree
M.S. degrees in electronics engineering from Korea (with distinction) from Cornell University, Ithaca,
University, Seoul, Korea, in 2000 and 2002, respec- NY, USA, in 1984, the M.S. degree from Stanford
tively, and the Ph.D. degree in electrical and com- University, Stanford, CA, USA, in 1985, and the
puter engineering from the University of Illinois at Ph.D. degree from the University of California,
Urbana-Champaign, Urbana, IL, USA, in 2015. Berkeley, CA, USA, in 1992. All of these degrees
From 2002, he is working for Samsung Electron- were in electrical engineering. From 1984 through
ics as an analog IC designer for CMOS image sen- 1987, she was a Member of Technical Staff at AT&T
sors. His research interests include energy-efficient Bell Laboratories, Holmdel, NJ, USA. She is cur-
high-speed I/Os, mixed-signal ICs with system- rently a Professor in the Department of Electrical and
awareness and low-noise sensor readout circuits. Computer Engineering at the University of Illinois at
Urbana-Champaign, IL, USA.
Dr. Rosenbaums present research interests include design, testing, modeling
Adam Faust received the B.S. degree in computer and simulation of on-chip ESD protection circuits, system-level ESD reliability,
engineering and the M.S. and Ph.D. degrees in elec- latch-up, design of high-speed I/O circuits, and charged device model ESD
trical and computer engineering from the University (CDM). She has authored or coauthored well over 100 technical papers. She has
of Illinois at Urbana-Champaign, IL, USA, in 2006, presented tutorials on reliability physics at the International Reliability Physics
2011, and 2012 respectively. Symposium, the EOS/ESD Symposium, and the RFIC Symposium. She was
While a student, Dr. Faust was the recipient of the the keynote lecturer at the 2004 Taiwan ESD Conference, and has given invited
Micron Technology Foundation Scholar, ECE Dis- lectures at many universities and industrial laboratories. From 2001 through
tinguished Fellowship, and Texas Instruments/GRC 2011 she was an editor for IEEE T RANSACTIONS ON D EVICE AND M ATE -
Fellowship awards. In 2012, he joined Advanced De-
RIALS R ELIABILITY . She is currently an editor for IEEE T RANSACTIONS ON
sign, Portland Technology Development, at the Intel
E LECTRON D EVICES . Dr. Rosenbaum is the technical program chair for the
Corporation, Hillsboro, OR, USA, as an Analog En-
2016 International Reliability Physics Symposium.
gineer where he works on the development of I/O circuits in advanced CMOS
processes technologies. Dr. Rosenbaum has been a visiting professor at Katholieke University,
Leaven, Belgium, and National Chiao-Tung University, Hsinchu, Taiwan. She
has been the recipient of a Best Student Paper Award from the IEDM, an Out-
standing Paper Award from the EOS/ESD Symposium, a Technical Excellence
Aolin Xu received the B.S. degree from Beijing Award from the SRC, an NSF CAREER award, an IBM Faculty Award, and a
University of Posts and Telecommunications, China, UIUC Bliss Faculty Scholar Award.
in 2007, and M.S. degree in 2010 from Tsinghua
University, China, all in electrical engineering. He
is a Ph.D. student in the Department of Electrical
and Computer Engineering, University of Illinois at Andrew C. Singer (F09) received the S.B., S.M.,
Urbana-Champaign, IL, USA. His research interest and Ph.D. degrees, all in electrical engineering and
is information and communication theory. computer science, from the Massachusetts Institute
of Technology, Cambridge, MA, USA. From 1996
to 1998, he was a Research Scientist at Sanders, a
Lockheed Martin Company, Manchester, NH, USA.
Since 1998, he has been on the faculty of the De-
partment of Electrical and Computer Engineering at
the University of Illinois at Urbana-Champaign, IL,
Naresh R. Shanbhag (F06) is the Jack Kilby Pro- USA, where he currently holds a Fox Family En-
fessor of Electrical and Computer Engineering at dowed Professorship in the Electrical and Computer
the University of Illinois at Urbana-Champaign. He Engineering department. His research interests include signal processing and
received his Ph.D. degree from the University of communication systems. In 2000, he cofounded Intersymbol Communications,
Minnesota (1993) in electrical engineering. From Inc., a fabless semiconductor IC company, based in Champaign, IL, USA,
1993 to 1995, he worked at AT&T Bell Laboratories which built the worlds fastest signal-processing-enhanced receivers for 10 Gb/s
at Murray Hill where he led the design of high-speed optical communications. In 2007, Intersymbol Communications, Inc. was ac-
transceiver chip-sets for very high-speed digital sub- quired by Finisar Corporation (FNSR). He continues to work in areas related
scriber line (VDSL), before joining the University to signal processing algorithms and their potential to enhance mixed-signal
of Illinois at Urbana-Champaign in August 1995. He analog and digital circuits both in his research and as Associate Director of
has held visiting faculty appointments at the National the SRC-funded Systems On Nanoscale Information fabriCs (SONIC) center.
Taiwan University (Aug.Dec. 2007) and Stanford University (Aug.Dec. He received the National Science Foundation CAREER Award in 2000; in
2014). His research interests are in the design of energy-efficient integrated cir- 2001 he received the Xerox Faculty Research Award, and in 2002 he was
cuits and systems for communications, signal processing and machine learning. named a Willett Faculty Scholar. In 2005, he was appointed the Director of the
He has more than 200 publications in this area and holds thirteen US patents. Technology Entrepreneur Center at the University of Illinois, where he directs
He is also a co-author of the research monograph Pipelined Adaptive Digital a wide range of entrepreneurship activities in the College of Engineering. In
Filters published by Kluwer Academic Publishers in 1994. 2006 he received the IEEE J OURNAL OF S OLID S TATE C IRCUITS Best Paper
Dr. Shanbhag received the 2010 Richard Newton GSRC Industrial Impact Award for the paper entitled leftAn MLSE Receiver for Electronic Dispersion
Award, became an IEEE Fellow in 2006, received the 2006 IEEE Journal Compensation of OC-192 Fiber Links. In 2008, he received the IEEE Signal
of Solid-State Circuits Best Paper Award, the 2001 IEEE Transactions on Processing Magazine Award for the paper entitled leftTurbo Equalization.
VLSI Best Paper Award, the 1999 IEEE Leon K. Kirchmayer Best Paper In 2014, he was named as a Distinguished Lecturer of the IEEE Signal
Award, the 1999 Xerox Faculty Award, the IEEE Circuits and Systems Society Processing Society.
Distinguished Lecturership in 1997, the National Science Foundation CAREER
Award in 1996, and the 1994 Darlington Best Paper Award from the IEEE
Circuits and Systems Society. In 2000, Dr. Shanbhag co-founded and served as
the Chief Technology Officer of Intersymbol Communications, Inc., (acquired
in 2007 by Finisar Corp (NASDAQ:FNSR)) a semiconductor start-up that pro-
vided DSP-enhanced mixed-signal ICs for electronic dispersion compensation
of OC-192 optical links. Since January 2013, he is the founding Director of the
Systems On Nanoscale Information fabriCs (SONIC) Center, a 5-year multi-
university center funded by DARPA and SRC under the STARnet program.