Beruflich Dokumente
Kultur Dokumente
Erlangen
FAU University Press
2016
Bibliografische Information der Deutschen Nationalbibliothek:
Die Deutsche Nationalbibliothek verzeichnet diese Publikation in der
Deutschen Nationalbibliografie; detaillierte bibliografische Daten sind
im Internet über http://dnb.d-nb.de abrufbar.
Der vollständige Inhalt des Buchs ist als PDF über den OPUS Server
der Friedrich-Alexander-Universität Erlangen-Nürnberg abrufbar:
https://opus4.kobv.de/opus4-fau/home
I thank you all very much for your individual contributions and sup-
port. The last years were a great and pleasant experience for me. I have
enjoyed being with all of you and I hope that a lot of good years are
following.
- Jörg Fuhrmann
Abstract
The further development of the mobile communication standard to-
wards the 4th generation (4G) long term evolution (LTE) and the simul-
taneous development of technology standards create new challenges,
that have to be fulfilled, while designing power amplifiers (PAs). The
downscaling of complementary metal-oxide-semiconductor (CMOS)
integrated circuits (ICs) according to Moore’s law makes the overall
transceiver system more compact and reduces the required chip area.
Recently a fully integrated CMOS power amplifier was included in a
single-chip 3rd generation (3G) high speed packet access (HSPA) trans-
ceiver with a radio frequency digital to analog converter (RFDAC). For
further integration the RFDAC and PA can be merged to a digital PA
(DPA). The signal can be generated using polar modulation (PM) which
allows a separated consideration of amplitude and phase. The current
summing digital power amplifier (CSDPA) is one solution for fully in-
tegrated CMOS PM architectures. A CSDPA can be implemented as
switched power amplifier architecture with an inverse class-D PA that
can theoretically achieve high efficiency what makes it a promising can-
didate for fully integrated circuits.
i
Abstract
ii
Abstract
iii
Kurzfassung
Die Weiterentwicklung des Mobilkommunikationsstandards bis hin zu
der heutigen vierten Generation (4G) Long Term Evolution (LTE) und
die gleichzeitige Weiterentwicklung des Technologiestandards erzeu-
gen neue Herausforderungen bei dem Design eines Leistungsverstärk-
ers (engl. kurz PA), welche erfüllt werden müssen. Die Skalierung von
integrierten Schaltungen (engl. kurz ICs), mit komplementärer Logik
aus Metall-Oxid-Halbleitern (engl. kurz CMOS), nach dem Moore-
schen Gesetz macht den gesamten Transceiver kompakter und redu-
ziert die benötigte Chipfläche. Kürzlich wurde für die dritte Genera-
tion (3G) ein vollintegrierter CMOS-PA auf einem einzelnen Chip zu-
sammen mit einem Hochfrequenz-Digital-Analog-Umsetzer (engl.
kurz RFDAC), für einen High Speed Packet Access (HSPA)-Transceiver,
integriert. Um das Konzept weiterzuintegrieren können der RFDAC
und der PA zu einem digitalen PA (engl. kurz DPA) zusammengeführt
werden. Das Signal kann mittels Polarmodulation (PM) generiert wer-
den, welche eine getrennte Betrachtung von Amplitude und Phase er-
laubt. Der stromsummierende DPA (engl. kurz CSDPA) ist eine Lö-
sung für vollintegrierte CMOS-PM-Architekturen. Ein CSDPA kann als
geschaltete PA-Architektur mit einem inversen Klasse-D PA implemen-
tiert werden. Die theoretisch erzielbare hohe Effizienz macht ihn zu
einem vielversprechenden Kandidaten für vollintegrierte Schaltungen.
v
Kurzfassung
Der isolierte lineare PA ist für LTE Frequenzduplex (engl. kurz FDD)
Band 1 entworfen. Das Die ohne Gehäuse misst 1.88 × 0.51 mm2 und
ist direkt auf eine Leiterplatte (engl. kurz PCB) gelötet. Bei gepul-
sten Messungen werden ein maximaler Leistungswirkungsgrad (engl.
kurz PAE) von 35.2 %, ein Drain-Wirkungsgrad ηd von 39.5 %, eine Ver-
stärkung von 15.5 dB und eine maximale Ausgangsleistung Pmax von
31.7 dBm bei 1.83 GHz und einer 3.2 V Versorgungsspannung erreicht.
Die Verwendung einer digitalen Vorverzerrung (engl. kurz DPD) wird
anhand eines voll belegten LTE-15 Band 1 PUSCH 16-QAM OFDM-Si-
gnals gezeigt. Die LTE-Anforderungen für BW 1.4-20 MHz werden mit
voll belegten Band 1 PUSCH QPSK-Signalen gemessen. Das geforderte
EVM von 17.5 %, das UTRA ACLR von −33 dBc und das E-UTRA von
−30 dBc werden unter Verwendung einer DPD bei allen BWs erreicht .
vi
Kurzfassung
realisiert, der direkt mit dem digitalen Eingang (engl. kurz DFE) ver-
bunden ist. Das DFE wandelt die IQ-modulierten Signaldaten zu
einem polarmodulierten Signal um. Die modulierte Phaseninforma-
tion ist im Signal des lokalen Oszillators (engl. kurz LO) enthalten.
Die Amplitudeninformation wird in einem segmentierten 15 Bit-Feld
dekodiert. Die 10 Bits mit dem höchsten Stellenwert (engl. kurz MSBs)
sind thermometer-dekodiert, um die Monotonie zu gewährleisten. Die
5 Bits mit dem niedrigsten Stellenwert (engl. kurz LSBs) bleiben binär-
dekodiert, um die Komplexität zu reduzieren. Die Signalamplitude
und Phaseninformation werden innerhalb des DPA wieder zusammen-
geführt. Die geforderte Ausgangsleistung für LTE wird auch ohne zu-
sätzliche Verstärkung erreicht. Der Ausgang des DPAs wird mit einem
Transformator angepasst. Der Transformator dient zeitgleich als Balun,
der den differentiellen auf einen einpoligen Ausgang transformiert. In-
dem man einen inversen Klasse-D PA als Einheitszelle (engl. kurz UC)
im Zellfeld verwendet, lassen sich die Ausgänge kurzschließen und mit
dem Anpassnetzwerk am Ausgang (engl. kurz OMN) verbinden. Dies
resultiert in einer kompakten Implementierung. Das induktive Ele-
ment des Anpassnetzwerks ist im OMN vereinigt. Der Transforma-
tor ist mit einer Windung auf dem Chip und der sekundären Win-
dung im Gehäuse unterteilt. Da die sekundäre Windung des Trans-
formators in einer zusätzlichen Lage (engl. kurz RDL) innerhalb des
Gehäuses realisiert wird, können die Kupferlagen auf dem Chip für
die Primärwindung verwendet werden. Die Primärwindung ist in den
oberen, niederohmigen Metalllagen und der Aluminiumlage imple-
mentiert, um eine gute Leitfähigkeit und dadurch geringere Verluste zu
garantieren. Das resultiert in einem verbesserten Gütefaktor des OMN.
Der Ausgang des Transformators ist dann mit einer 50 Ω-Ausgangs-
last am PCB abgeschlossen. Der Mittelabgriff des Transformators ist
mit einer 2.5 V-Versorgungsspannung verbunden. Der DPA hat eine
Fläche von 0.61 × 0.5 mm2 . Die Messungen mit einem Dauerstrichsig-
nal (engl. kurz CW) zeigen ein Pmax von 31.2 dBm und ein maximales ηd
von 34.3 %. Der Dynamikbereich (engl. kurz DR) des DPAs ist 87.9 dB.
Bei der geforderten CHP von 26 dBm ist das E-UTRA ACLR für Band 7
26.9 dBc für ein LTE-Signal mit 5 MHz Bandbreite (LTE-5) und 27.4 dBc
für ein LTE-Signal mit 10 MHz Bandbreite (LTE-10). Die EVM-Anfor-
derungen wurden für alle Messungen erfüllt. Das Duplex-Rauschen
(engl. kurz DN) bei 26 dBm CHP ist −140.7 dBc/Hz für LTE-5 und
−138.3 dBc/Hz für LTE-10.
vii
Acronyms
ix
Acronyms
x
Acronyms
xi
Acronyms
xii
Symbols
xiii
Symbols
xiv
Symbols
xv
Contents
Abstract i
Kurzfassung v
1 Introduction 1
1.1 State-of-the-Art . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Theoretical Design Concepts . . . . . . . . . . . . . . . 4
1.3 DPA Theory . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.4 Combiner Techniques . . . . . . . . . . . . . . . . . . . 8
1.5 Power Amplifier Classes . . . . . . . . . . . . . . . . . . 12
1.5.1 Class-D and Inverse Class-D PA . . . . . . . . . 13
1.5.2 Class-E and Inverse Class-E PA . . . . . . . . . . 14
1.5.3 Class-F and Inverse Class-F PA . . . . . . . . . . 15
1.5.4 Losses and Output Power . . . . . . . . . . . . . 16
1.6 Linearization Concepts . . . . . . . . . . . . . . . . . . 17
1.6.1 Outphasing . . . . . . . . . . . . . . . . . . . . . 17
1.6.2 Envelope Elimination and Restoration . . . . . . 19
1.6.3 Digital Polar Transmitter . . . . . . . . . . . . . 20
1.6.4 Summing Digital Power Amplifier . . . . . . . . 21
1.7 Watt-Level Output Power . . . . . . . . . . . . . . . . . 23
1.8 Modulated Signals . . . . . . . . . . . . . . . . . . . . . 24
1.9 Summary of DPAs . . . . . . . . . . . . . . . . . . . . . 25
1.10 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . 25
2 Specifications 39
2.1 Power Amplifier Basics . . . . . . . . . . . . . . . . . . 39
2.2 LTE Signal . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.3 LTE Specification . . . . . . . . . . . . . . . . . . . . . . 44
2.3.1 Temperature . . . . . . . . . . . . . . . . . . . . 45
2.3.2 Output Power . . . . . . . . . . . . . . . . . . . 46
xvii
Contents
xviii
Contents
5 Conclusion 151
xix
Chapter 1
Introduction
Since the invention of the transistor in 1947 the industry as well as
the mobile communication have changed drastically [1]. Moore’s Law
brought the motivation of complementary metal oxide semiconduc-
tor (CMOS) technology downscaling and led to further development
of integrated circuits (ICs) [2]. Later technology scaling brought new
inventions such as the Intel® 45 nm high-k metal gate silicon technol-
ogy and the first demonstrated 32 nm logic process. Besides this tech-
nology scaling further developments of non-digital function led to an
expression called more than Moore (MtM). These developments are
silicon based technologies but they do not scale in the same way [3].
At the same time as the transistor was invented, the theoretical foun-
dations for communication were laid. To this day mobile communica-
tion systems have been developed up to the 4th generation (4G). The
2nd generation (2G) was represented by global system for mobile com-
munications (GSM) and the 3rd generation (3G) by universal mobile
telecommunications system (UMTS) [4, 5]. Today’s 4G long term evo-
lution (LTE) is already available for users on the market. Based on this
standard, internet of things (IoT) and machine-to-machine (M2M) cre-
ate another market for ICs [6, 7].
It is crucial for all mobile devices to design them for high efficiency,
and so for low power consumption, to guarantee a long operating time.
Due to the high output power that is required by the 3rd generation
partnership project (3GPP) the transmitter (TX) and here especially
the power amplifier (PA) is a high power consumer. For the overall
efficiency it is therefore of high interest to design this component care-
fully. Besides high power efficiency also fully integrated designs make
the overall design more compact. It is therefore of interest to further
1
Chapter 1. Introduction
develop existing PAs for later technology nodes and the next generation
of mobile communication [8].
In the past years, the mobile communication standard has evolved to
4G. New devices with applications that use high definition video trans-
fers and on demand services require a high data transfer rate. LTE is
currently available as state-of-the-art standard and will be further de-
veloped to LTE-Advanced. The high output power for mobile commu-
nication standards, compared to wireless local area network (WLAN)
or Bluetooth, require additional effort and care while designing a PA.
Since this block is crucial to the overall system performance, a special
focus on the PA design is mandatory.
1.1 State-of-the-Art
In recent years a summary of published CMOS PAs, that are able to
achieve watt-level output power which is required for LTE, has been
presented [9]. As mentioned before, the PA is a crucial part for the
overall efficiency in a transceiver system. Therefore, a special focus on
its design is mandatory. High peak-to-average ratio (PAR) in orthog-
onal frequency-division multiplexing (OFDM) for LTE force the PA to
operate in backoff (BO) while highest efficiency is achieved at satu-
rated output power. Additional measurements such as adjacent chan-
nel leakage power ratio (ACLR) and error vector magnitude (EVM) de-
fine the quality of linearity, that a PA has to fulfill by transmitting a
signal. This is important for low bit failure rates and spectral band co-
existence with other wireless technology standards such as Bluetooth
and WLAN.
It was shown that linear class-AB PAs can achieve 3GPP LTE speci-
fications even in latest nanometer technology nodes [10]. Due to the
fact that linear designs already theoretically suffer from poor efficiency,
switched amplifier designs such as class-D,-E,-F become very attrac-
tive [11]. On one hand these switched amplifiers have a theoretical effi-
ciency of 100 % but on the other hand it is impossible to linearly mod-
ulate the amplitude of a signal without further design improvements.
Outphasing, linear amplification with nonlinear components (LINC),
envelope elimination restoration (EER), digital polar transmitter (DPT)
or switched-mode power amplifier (SMPA) are architectures that lin-
earize these highly non-linear PAs.
2
1.1. State-of-the-Art
Recently, integrated radio frequency (RF) CMOS PAs became the fo-
cus of attention to compete with III-V heterojunction bipolar transis-
tors (HBTs) based PAs in the mobile handset market [12]. For a full
system on chip (SoC) in connection with very-large-scale integration
(VLSI) it is important to design the PA in the same technology as the
digital front-end (DFE) [13]. Transceivers have been further developed
towards all-digital transceiver architectures and architectures were pre-
sented that use an radio frequency digital to analog converter (RFDAC)
in 65 nm CMOS technology with integrated PA [14–16].
It was also already presented that it is possible to implement full
transceivers in CMOS for all kinds of wireless communication systems.
Table 1.1 shows a selection of reported publications on Bluetooth [17],
wireless personal area network (WPAN) [18], WLAN [19], LTE [20] and
LTE-Advanced [21].
For more compact design solutions the digital to analog data con-
version and amplification can be implemented by merging the digital-
to-analog converter (DAC) with the PA into a digital power amplifier
(DPA). DPA solutions are a promising candidate for compact designs
and full SoC solutions with good efficiency. Different DPA concepts
can be considered to fulfill the system requirements in the best possi-
ble way.
This chapter shows the theoretical design concepts to fulfill 3GPP re-
quirements and recently presented implementations. Section 1.2 ex-
plains the 3GPP requirements for LTE and the challenges for low CMOS
technology nodes. The theory of DPAs and their specification is ex-
plained in Section 1.3. In Section 1.4 different power combining tech-
3
Chapter 1. Introduction
block (RB) allocation of the design. Due to chain losses and peak-to-
average power ratio (PAPR) the watt-level, or 30 dBm, is one of the con-
sidered design requirements for PAs as shown in
Pout [dBm]
Pout [W ] = 10 10 · 1mW. (1.3)
Pout at a resistor load Rload can be calculated, for a sinusoidal signal
with the amplitude Vout , as
2
1 Vout
Pout = . (1.4)
2 Rload
In Table 1.3 the gate oxide thickness for the different technologies is
given by the international technology roadmap for semiconductors
(ITRS) [26]. It can be seen that in low CMOS technology node ar-
chitectures it is impossible to achieve the required output power with
the output power of the presented classes [9]. Therefore, most de-
signs use a combination of differential push-pull designs with pream-
plifier and with a power combining architecture [27]. To fulfill the
stringent required rise and fall times to reduce amplitude to phase dis-
tortion (AM-PM) distortion predriver stages can be used [28]. This
stages have to be designed carefully not to dissipate to much power.
To minimize the driver power it is better to use a small number of cas-
caded drivers with large fan-out to drive the large PA devices than more
drivers with smaller fan-out [28]. Taking these components into ac-
count this can highly drop the overall efficiency if a design [29]. Volt-
age switching classes have the additional disadvantage that while using
stacked designs two driving stages are needed to drive the inverter. A
non-overlapping clock has to be implemented between the two stages
not to loose efficiency. Inverse class-D, -E or -F PAs can simply be
stacked because no inverter is needed.
Especially for lower technology nodes the voltage that drops at the
PA is still too high for a single transistor, a stacked design is used to
5
Chapter 1. Introduction
Table 1.3: Selected ITRS Specifications for Thick Oxide and Thin Oxide
Transistors in Deep Nanometer CMOS
Technology Node Voltage Supply Tox Gate Length
[nm] [V] [nm] [nm]
Thin Oxide
65 1.2 2 53
45 1 1.5 32
28 0.95 1.1 20
Thick Oxide
65 2.5 5 250
45 1.8 3 180
28 1.8 3 180
distribute the voltage drop across more transistors [30, 31]. In Fig. 1.1 a
stack of three transistors is shown. The bottom transistor N1 is in com-
mon source (CS) mode and is driven by the signal Vrf . The transistors
N2 and N3 are implemented as common gate (CG) and biased by Vbias,1
and Vbias,2 . It has already been shown that in deep nanometer CMOS
technology LTE output power can be achieved using a triple stack de-
sign in a linear class-AB PA [10]. The bulks of the transistors are con-
nected to the source of the transistors to dynamically change the bulk
potential to distribute the voltage stress for RF signals. Furthermore, a
capacitor Cf b can be implemented that generates a feedback from the
Vdd
Cf b
N3
Vbias,2
N2
Vbias,1
N1
Vrf
6
1.3. DPA Theory
drain to further reduce the drain voltage stress that occurs at the upper
transistor N3 . With an n times higher output voltage Vout the same Pout
can be achieved using a load Rload that is n2 higher. With n being the
number of transistors used in the stack [9]. This might be especially of
interest if Rload has to be made as small as the parasitic resistance and
capacitance at the output of the stage. Then the losses can be signif-
icant enough to degrade the whole performance of the design due to
voltage drops.
ηd % 100 · PPout
dc
−Vn
INL LSB max{ Vn+1 VLSB − 1}
V −Vn,ideal
DNL LSB max{ n VLSB }
The power that is dissipated in an ideal digital design with the supply
voltage Vdd can be calculated as
2
Pin = f Cin Vdd . (1.5)
Cin is the input capacitance that has to be driven and the frequency f
is the value of how often the capacitance is loaded and unloaded [32].
Since most of the times Pin has no independent power supply it is im-
possible to measure it. For this reason the PAE and gain are usually not
provided and cannot be compared. Another approach is to describe
Pin as the total used digital input power, that take local oscillator (LO)
7
Chapter 1. Introduction
and clock generation into account. It is then spoken of the total effi-
ciency [33]. The ηd is independent of Pin , if Pdc is only measured for the
PA. On the other side, additional aspects can be considered that were
not relevant in designing a classical PA. By using controlled unit cells
(UCs) to generate the output power and no more linear devices, the
transfer characteristics of a PA change as well from continuous to dis-
crete values. The bit resolution, integral nonlinearity (INL) and differ-
ential nonlinearity (DNL) errors are values that should also be reported
to fully describe a DPA.
In Table 1.5 the bits, INL, DNL, Pout and frequency of different pre-
sented DPAs are shown. A current summing DPA (CSDPA) with a unit
class-E amplifier can operate at frequencies up to 47 GHz. Another de-
sign with 10 bit resolution and segmented unit cells achieves INL/DNL
values of 2.43/3.2 least significant bit (LSB) by using predistortion. One
has to take into account that by using predistortion the effective reso-
lutions is reduced. The good matching characteristics of CMOS capaci-
tors of voltage summing DPA (VSDPA) result in DNL values of ±0.5 LSB
and INL of ±3 LSB.
8
1.4. Combiner Techniques
I1
Vin np ns V1 Vin np ns
RL
I2
Vin np ns V2 Vin np ns
RL
(a) Block level diagram of a series trans- (b) Block level diagram of a parallel
former combiner. transformer combiner.
jBC
λ/4 C
Vin λ/4 Vin L
−jBC
L RL
−jBC λ/4
Vin RL Vin C
λ/4
jBC
(c) Block level diagram of a parallel λ/4 (d) Block level diagram of a parallel dis-
combiner. tributed LC matching.
9
Chapter 1. Introduction
10
1.4. Combiner Techniques
√ 1
Chireix ( )2 [41]
sin(2ϕ)−RL BC
1+ 14
(√ )
sin2 (ϕ)
E−1
LC 1− Qind [27]
100
80
Efficiency [%]
Qs = 15
60 Qs = 10
Qs = 5
40
20
0 0
10 101 102
Primary Quality Factor
11
Chapter 1. Introduction
100
80
Efficiency [%]
40
20
0
−30 −25 −20 −15 −10 −5 0
Output Power Backoff [dB]
It shows that due to this resistors the valley between the two maximas
drops further down, what could result in a worse overall efficiency for
signals with a high PAPR.
The efficiency of an LC combiner with different quality factors Qind
for the inductance is shown in Figure 1.5. The PER is the product of
the impedance transformation ratio and its efficiency. It can be seen
that the overall efficiency ηoa rises with the impedance ratio and with
increased losses. It is therefore important for networks with lossy on-
chip components [27].
12
1.5. Power Amplifier Classes
100
80
Efficiency [%]
60
40 Qind = 15
20 Qind = 5 Qind = 10
0
0 20 40 60 80 100
Power Enhancement Ratio
Figure 1.5: Efficiency diagram of an LC combiner.
soidal wave and vice versa. Therefore, the designs can be implemented
in series, voltage summing or parallel, current summing.
13
Chapter 1. Introduction
Vdd
C L
Vin Vds
Rload
put transformer network that makes the design more compact. Addi-
tionally the parallel capacitor allows the absorption of the device par-
asitics into the network. Zero-voltage switching (ZVS) can be used but
the efficiency decreases due to losses in the tank [46]. To avoid these
losses zero-current switching (ZCS) can be implemented [47].
Vdd
L
C L
Vds
Vin
CS Rload
14
1.5. Power Amplifier Classes
from open to short circuit. If there is still charge stored in this transis-
tor capacitance it will be dissipated through the switch. As in class-D,
ZVS can be used to compensate this degradation because it drives the
voltage to zero before the transistor is conductive [48].
Vdd
Harmonic Resonator
L
C
Vds
Vin
L C Rload
It is well known that by only isolating the third harmonic the effi-
ciency improves from 78 %, for class-B amplifier, to 88 %. Theoretically
the class-F amplifier can achieve 100 % efficiency but only with a infi-
nite number of harmonics [2]. As in class-E designs the drain source
voltage Vds also depends on the lumped elements and exceeds the sup-
ply voltage Vdd . Instead of filtering the odd harmonics, an inverse class-
F cancels the even harmonics [50]. Therefore, the implementation and
efficiency tradeoff is the same as mentioned above.
15
Chapter 1. Introduction
Class-F PAs are cited to have much better waveform metrics than
class-E but are said to be unrealizable in the strong switching case [48].
Therefore, a hybrid class-EF design was implemented that has the ben-
efits of class-E, such as integration of the transistor parasitic capaci-
tance, exact switching time-domain solutions and ZVS operation com-
bined with inverse class-F to improve the waveforms and thereby the
performance of the overall design [44].
Table 1.7 shows the output power equations and the drain voltage
stresses that occur for the different SMPAs. It can be seen that the high-
est output power can be achieved by an inverse class-E PA that is eight
times higher compared to the standard class-D design. The push-pull
architectures of class-D and its inverse achieve the same output power
as class-F and inverse class-F with infinite harmonics cancellation. By
using the push-pull design the power for class-D increases four times.
The highest voltage stress occurs for class-E. Compared to its inverse
class-E design it achieves three times less output power and even gen-
erates a higher voltage stress for the transistor. Class-D generates the
16
1.6. Linearization Concepts
lowest drain to source voltage stress because the voltage at the input is
a generated rectangular signal of the supply voltage.
V2
¹ normalized Pout to Rload
dd
2
² push-pull π82 ≈ 0.811 ³ push-pull π8 ≈ 1.234
81
⁴ all harmonics, for 3rd[ harmonic 128
( ≈ 0.633
)]
⁵ all harmonics
√ ⁶ 2π 2 − arctan 2
π π
π2
⁷1 + 4 +1
1.6.1 Outphasing
Outphasing was invented by Chireix and is a well known concept that
is used for IQ modulated signals. In recent years it is also cited as LINC.
In Figure 1.9 the concept of outphasing is shown. Two phase delayed
signals, consist of ϕ1 , ϕ2 and their constant amplitudes a1 and a2 . The
two signals are independently amplified in their path and than com-
bined at the output.
17
Chapter 1. Introduction
a1 sin(ωt + ϕ1 )
a2 sin(ωt + ϕ2 )
18
1.6. Linearization Concepts
Amplitude Detector
a(t)
a(t)ϕ(t)
Limiter
Figure 1.10: Block level diagram of EER and hybrid EER without limiter.
19
Chapter 1. Introduction
Some years ago it was said that most high level PAs use a class-S am-
plifier as envelope modulator and that they achieve in practice a high
efficiency over a wide dynamic range but the BW is limited to 10 MHz
in IC implementations. Therefore, for wideband applications class-G
and split band modulators were suggested [11]. The oversampling ratio
of the PA has to be high enough to achieve signal accuracy which limits
the achievable signal BW [61]. It was stated that for a BW higher than
10 MHz it is not possible that such a design can be implemented [39].
The limiter in the RF path is a problem for some wide dynamic range
OFDM signals [60]. Due to BW limitations it can be difficult to real-
ize it for higher BW applications like LTE-Advanced that requires up to
100 MHz with carrier-aggregation (CA).
In order to reduce the stringent RF and envelope BW requirements,
hybrid structures were proposed [22]. In a hybrid structure the limiter
is removed thus that the RF input signal that arrives at the input of the
PA still contains the envelope a(t) and phase ϕ(t) modulation. An ad-
ditional advantages to lower BW requirements are higher gain and so
better efficiency and lower sensitivity to mismatch. On the other side,
an amplitude modulation technique has to be implemented in the PA
what can make the design more complex. In later years different imple-
mentations were presented. For higher efficiency over more BW using
an all-pass network [62]. The envelope amplifier consists of an op-amp
as voltage source and a buck converter with inductor as current source
that is controlled by a feedback path with hysteresis comparator [63].
This design was later further developed for the hybrid EER architec-
ture. Conventional digital predistortion can be used to linearize the
PA [22]. To align the RF and envelope paths an adaptive time align-
ment can be implemented [64]. Problems in the spectral domain arise
due to frequency response in the amplitude path that generates errors
in the frequency domain or envelope distortion that occurring within
a drain modulated PA [65].
20
1.6. Linearization Concepts
DPWM
A(t)
ϕ(t)
21
Chapter 1. Introduction
Binary Input
A(t)
ϕ(t)
PM
For SDPAs there are two possible ways of implementations. Firstly, VS-
DPAs that collect the voltages that are generated by unit cells. One
design architecture that became popular in recent years is the class-D
like switched capacitor power amplifier (SCPA) [32]. Therefore, for low
breakdown voltage technology nodes the design has the same inverter
based disadvantages for high required output power that is achieved for
a supply voltage Vdd and an output resistance Rload . The design uses the
good capacitor matching abilities to sweep the output power according
to
2 ( n )2 Vdd
2
Pout = 2 . (1.8)
π N Rload
Due to the switching, high-order harmonics are generated in the spec-
tral output that have to be filtered by the matching network [32]. A
transformer based PA with class-E/F operation was proposed to over-
come class-D PA losses in the parasitic capacitance at higher frequen-
cies and so improve the efficiency. In addition a duty cycle tuner was
implemented that provides a selection for the linearity and efficiency
tradeoff [44]. Secondly, the sum of currents can be done using Kirch-
hoff’s law. CSDPAs have the advantage that for low voltage technolo-
gies higher output powers can be achieved by shorting the output of
the different cells. Therefore, no complex power combining network is
needed as for VSDPA [66].
22
1.7. Watt-Level Output Power
2016/
22 nm
Year / Technology Node
24
1.9. Summary of DPAs
found that are implemented as EER and outphasing that are presented
for LTE 20 MHz BW that achieve the required ACLR for evolved univer-
sal terrestrial radio access (E-UTRA). Outphasing achieved very good
E-UTRA ACLR values with the use of digital predistortion (DPD) for a
64-QAM modulated signal [58]. EER can fulfill the linearity require-
ments for E-UTRA and universal terrestrial radio access (UTRA) with-
out a predistortion technique [73] and shows backward compatibility
to 3G. DPT was presented for a wideband code division multiple ac-
cess (WCDMA) signal that achieved the UTRA ACLR values at lower
frequency. The channel power (CHP) of CSDPA is below the required
3GPP. Furthermore it uses adaptive digital predistortion (ADP) to fulfill
the requirements.
1.10 Motivation
Watt-level output power can be achieved by stacking transistors in deep
nanometer technology nodes. CSDPA is a promising approach to pro-
vide this output power since the drain of the transistors can be com-
bined without any further power combining concept. The currents can
be summed and provided to the load. This avoids a design with many
transformers to combine power what might lead to less efficiency due
to transformer losses [74]. Nevertheless, to alleviate the voltage stress
25
26
Chapter 1. Introduction
Table 1.8: Comparison of Output Power for Different Implemented Architectures
Concept Class Node Pout [dBm] Supply [V] Combiner Stages Stack Ref.
Outphasing Class-D 45 nm 31.5 2.4 Series 4 - [69]
Outphasing Class-D 32 nm 25.3 2 Series 2 2 [28]
DPT inverse Class-D 150 nm 31 - Guanella Reverse 1 2 [67]
EER Class-E 180 nm 27.8 1.4 LC Matching 1 3 [29]
VSDPA Class-D 90 nm 25.2 - LC Matching 1 2 [24]
VSDPA Class-E/F 65 nm 25.6 1.8 Series 2 2 [44]
VSDPA Class-D 90 nm 27 - Series Figure 8 4 2 [71]
CSDPA Class-E 130 nm 25.2 2.5 LC Matching 1 2 [35]
Table 1.9: Comparison of Different Implemented CMOS DPA Implementations for LTE
Concept Modulation BW Pout f E-UTRA(1) UTRA(1) Efficiency DPD Ref.
[MHz] [dBm](2) [GHz] [dBc] [dBc](3) [%]
(7)
Outphasing LTE 20 22.8 2.4 -50 - 21(6) DPD [58]
(8) (5)
EER LTE 20 25.6 1.95 -33 -41.1 32.2 No [73]
(4)
DPT WCDMA 5 24 0.75 - -33 26.5 No [67]
(7) (4)
CSDPA WiMAX 5 15.3 1.9 - -46 22 ADP [35]
(1) ACLR (2) Pout is the in-band power for the stated modulation
(3) WCDMA signal is used for UTRA ACLR (4) ηoa (5) PAE (6) ηd (7) 64-QAM OFDM
(8) 16-QAM OFDM
1.10. Motivation
for one stack the design can be build as a differential design which re-
sults in a gained factor for Vout by two. To combine these two path a
transformer is needed. This transforms the output impedance that is
seen by the stack with 1 : n [27]. This transformation also results in an
increased Pout . Inverse class-D can be used as UC. It generates a simi-
lar Vds as inverse class-E but has the advantage of merging the resonant
components into the transformer.
27
Bibliography
[1] B. Hoefflinger, Chips 2020: A guide to the future of nanoelectronics,
ser. The Frontiers Collection. Springer, 2012.
[3] G. Zhang and A. van Roosmalen, More than Moore: Creating high
value micro/nanoelectronics systems. Springer US, 2010.
29
Bibliography
30
Bibliography
[17] S.-W. Lee, K.-Y. Lee, E. Song, Y.-J. Jung, H. Jeong, J.-M. Kim, H.-J.
Lim, J.-W. Lee, J. Park, K. Lee, S.-I. Chae, D.-K. Jeong, and W. Kim,
“A single-chip 2.4 GHz direct-conversion CMOS transceiver with
GFSK modem for Bluetooth application,” in VLSI Circuits, 2001.
Digest of Technical Papers. 2001 Symposium on, pp. 245–246, June
2001.
[18] Y.-I. Kwon, S.-G. Park, T.-J. Park, K.-S. Cho, and H.-Y. Lee, “An ultra
low-power CMOS transceiver using various low-power techniques
for LR-WPAN applications,” Circuits and Systems I: Regular Papers,
IEEE Transactions on, vol. 59, no. 2, pp. 324–336, Feb 2012.
31
Bibliography
[44] H. Wang and H. Hashemi, “A 0.5-6 GHz 25.6 dBm fully integrated
digital power amplifier in 65 nm CMOS,” pp. 409–412, June 2014.
34
Bibliography
[49] M.-W. Lee, S.-H. Kam, and Y.-H. Jeong, “A highly efficient dual-
band inverse class-E power amplifier with double CRLH-TLs for
LTE and WCDMA applications,” in Microwave Conference Proceed-
ings (APMC), 2011 Asia-Pacific, pp. 514–517, Dec 2011.
[52] T. Mury and V. Fusco, “Analysis of the effect of finite d.c. blocking
capacitance and finite d.c. feed inductance on the performance of
inverse class-E amplifiers,” Circuits, Devices and Systems, IEE Pro-
ceedings -, vol. 153, no. 2, pp. 129–135, April 2006.
[56] J. Yao and S. Long, “Power amplifier selection for LINC applica-
tions,” Circuits and Systems II: Express Briefs, IEEE Transactions
on, vol. 53, no. 8, pp. 763–767, Aug 2006.
35
Bibliography
[62] J.-H. Chen, K. U-yen, and J. Kenney, “An envelope elimination and
restoration power amplifier using a CMOS dynamic power supply
circuit,” in Microwave Symposium Digest, 2004 IEEE MTT-S Inter-
national, vol. 3, pp. 1519–1522, June 2004.
36
Bibliography
37
Bibliography
38
Chapter 2
Specifications
For the design of a transmitter system there are several requirements
given by 3GPP regarding to output power and linearity requirements [1].
The design of a PA is a crucial part for the overall system performance
since it has a direct impact on these requirements. In case of the DPA
it is the stage where the conversion from the digital to the analog do-
main is done and therefore the first stage where nonlinearities and so
amplitude to amplitude distortion (AM-AM) and AM-PM occur. The
requirements can be divided into signal performance requirements, e.g.
linearity and power and design performance, such as area, efficiency
and reliability. The output power is the power that a TX chain has to
deliver to the antenna. The linearity is divided into in-band require-
ments that is defined by the EVM and the maximum out of band emis-
sion that is defined by the ACLR. For the linearity it is also of interest
if a linearization technique is used and how much impact it has on the
performance in terms of linearity, area and efficiency.
39
Chapter 2. Specifications
Vn − Vn,ideal ∑
n
IN Ln = = DN Lk . (2.6)
VLSB
k=1
41
Chapter 2. Specifications
Q Q Q Q
I I I I
Figure 2.1: Diagram of the signal modulation using BPSK (a), QPSK (b),
16-QAM (c) or 64-QAM (d).
Q
Magnitude
Error Error Vector
Phase of Error Vector
Measured
Signal
Phase
Error
Ideal Signal
42
2.2. LTE Signal
Resource Subcarrier
Block Spacing = 15 kHz
Frequency
1 Slot UE UE UE UE UE UE
0.5=ms
1 Subframe UE UE UE UE UE UE
=
1 ms
UE UE UE UE UE UE
UE UE UE UE UE UE
Time
Figure 2.3: Diagram of the OFDM time-frequency multiplexing.
43
Chapter 2. Specifications
Frequency
Harmonic RX/TX
DPA Notch Filter Filter Switch Coupler
Figure 2.5: Block diagram of the TX chain from the DPA to the antenna.
44
2.3. LTE Specification
The last stage before the signal arrives at the antenna is a directional
coupler that allows measurements of the emitted power.
In Table 2.1 the characterizations of the different components are
shown. The post PA attenuation is an estimated value based on ex-
perience of 0.1 dB. The notch filter is especially needed to protect the
WLAN band and is normally characterized in the range of -20 to +85 °C.
The filter should be designed for band 40 and 41. Therefore, it would be
specified from 2300 MHz to 2690 MHz. Notice that band 38 is already
included in band 41. But since the attenuation for this filter is very high
it can not be used. For the low pass (LP) a filter for band 38,40 and 41
has to be used. Usually the center frequency should be around 2.5 GHz
and the pass band range should be ± 200 MHz. The temperature range
has to be between -40 and +85 °C. An example for a LP is provided be-
low. Since the notch filter and the LP are next to each other one might
consider to merge the two components. For the antenna switch module
(ASM) the values for all three bands are the same as well as for the cou-
pler and the antenna connector. After summing up the typical and the
worst case room (WCR) the range for the transition loss is from 3.83 dB
to 4.00 dB. To meet the maximum output power given in Table 2.3 with
a DPA that has maximum 27 dBm output power a loss of only 2.56 dB
for the notch and LP filter together is acceptable.
2.3.1 Temperature
In the following a selection of requirements, that are given by 3GPP,
are presented. All the specifications, that are stated for the UE, have to
be fulfilled in the temperature range from -10 to +55 C°.
45
Chapter 2. Specifications
For the values between the minimum and maximum output power
the output power is define in between certain tolerance boundaries.
The tolerance values for the specific output power at the antenna are
given in Table 2.4.
46
2.3. LTE Specification
2.3.4 VSWR
The 3GPP specifications assume a load of 50 Ω at the antenna con-
nector. In reality due to antenna mismatch this impedance can vary.
Therefore, at the antenna port a voltage standing wave ratio (VSWR)
of 3:1 is required for performance specification. For robustness the PA
should be designed for a VSWR of 10:1 at the antenna port.
47
Chapter 2. Specifications
The VSWR calculated for the PA, is considering the values given in Ta-
ble 2.7. Considering the VSWR for robustness and worst case scenarios
the PA has to be specified for a VSWR from 1.50 − 2.03 : 1.
48
2.3. LTE Specification
needs to be designed for the frequency range 2.3-2.7 GHz. The duplex
distance between TX and RX are 120 MHz for band 7. Notice that the
lower band of WLAN operates at 2.4 GHz. This has to be taken into
account for coexistence measurements.
RB
E-UTRA Band
Figure 2.6: Transmitter RF spectrum.
The boundary between E-UTRA OOB and the spurious emission do-
main depends on the BW. Table 2.10 shows the BW of the OOB bound-
ary. Its range is from 2.8 MHz for 1.4 MHz CHBW to 25 MHz for 20 MHz.
49
Chapter 2. Specifications
The ACLR requirements for the E-UTRA channel inside of foob are
shown in Figure 2.7 for E-UTRA and UTRA. It can be seen that foob
depends on the CHBW but the UTRA channels are fixed.
∆fOOB E-UTRA Channel
In Table 2.11 the general requirements for E-UTRA ACLR and UTRA
ACLR are given. The minimum E-UTRA ACLR, first UTRA ACLR and
second UTRA ACLR are for all BW 30, 33, 36 dB.
50
2.3. LTE Specification
RB
0 dBm
-5 dBm
-10 dBm
-15 dBm
-20 dBm
-25 dBm
-30 dBm
-30 -20 -10 0 10 20 30
MHz MHz MHz MHz MHz MHz MHz
Figure 2.8: The general E-UTRA spectrum emission mask for 20 MHz.
The spurious emission limits are given in Table 2.12. They are defined
for frequencies above and below OOB.
51
Chapter 2. Specifications
(CCDF) [8]. To further specify the signal the PAPR, CF and CF that
contains 99.9% can be calculated using CCDF [8]. The first violation is
stated for any spectral requirement due to power clipping.
Without Clipping
Output Amplitude
Classical Clipping
Input Amplitude
In Table 2.13 the results for an LTE physical uplink shared channel
(PUSCH) TC signals with a length of 10 slots and a 16-QAM symbol
modulation are shown. Depending on the BW the PAPR of the signals
is in the range of 7.84 dB to 8.78 dB. The CF range for 100 % is in between
2.47 and 2.75 and for 99.9 % the lower boundary is 2.08 and the upper
2.10. Due to the pseudo random generation the results might differ a
little bit of the worst case scenarios. It can be seen that for a undistorted
signal and the given TCs the power clipping has to be less then 5 dB to
fulfill the spectral requirements.
52
2.3. LTE Specification
Figure 2.10 contents the simulations results for the signal with 20 MHz
BW. For power clipping the signal behavior of the upper and lower
channel are identical. Both channels degrade in the same manner and
violate the specifications at the same time for all leakage ratios. It can
be seen that in this simulation, if the signal clips at more than 6 dB be-
low Pmax , E-UTRA ACLR and UTRA ACLR requirements are violated.
The in-band is more robust against clipping and does even not violate
the EVM requirements at a clipping of 8 dB.
100 100
E-UTRA ACLR [dBc]
90 Lower Channel 18
80 Upper Channel 16
70 14
EVM [%]
60 12
50 10
40 8
30 6
20 4
10 2
0 0
0 1 2 3 4 5 6 7 8 0 1 2 3 4 5 6 7 8
Power Reduction [dB] Power Reduction [dB]
(c) UTRA ACLR 2 OOB Emission (d) EVM
Figure 2.10: Simulation of E-UTRA ACLR and UTRA ACLR 1/2 OOB
emissions and EVM.
2.3.9 Resolution
For the later design implementation the resolution for the DAC is one
key consideration. This resolution has an impact on the design of the
53
Chapter 2. Specifications
Decoder as well as on the transistors in the UC. The signal to noise ratio
(SNR) is given as
SN R = 6.02N + 1.76. (2.13)
This is for a sinusoidal signal and a resolution of N bits. For any other
signal, the SNR can be calculated using the CF, an additional factor and
the N bit resolution. In this case the SNR can be calculated as
( )
Amax
SN R = 6.02N + 4.77 − 20 log . (2.14)
Arms
Considering Table 2.13 and 2 dB power clipping then an average CF for
an LTE can be considered as 2. To achieve the required ACLR of 36 dBc
a minimum resolution of 7 bits is needed as can be calculated by
36 − 4.77 + 6
≈ 7 ≤ N. (2.15)
6.02
Equation 2.14 can be extended to equation 2.16 where a oversampling
factor, calculated by the sampling frequency fsample and BW, is consid-
ered [9]. This oversampling factor can reduce the total number of bits
if the SNR is the limiting factor.
( ) ( )
Amax fsample
SN R = 6.02N + 4.77 − 20 log + 10 log (2.16)
Arms 2BW
Figure 2.11 shows the needed resolution considering the required dy-
Output Power [dBm]
20
−20
−60
100 101 102 103 104 105
Code [Unit Cells]
54
2.3. LTE Specification
55
Bibliography
[1] LTE evolved universal terrestrial radio access (E-UTRA) user equip-
ment (UE) radio transmission and reception, (3GPP TS 36.101 ver-
sion 11.6.0 Release 11) , International Technology Roadmap for
Semiconductors (ITRS) Std., 2013.
57
Bibliography
58
Chapter 3
Linear Power Amplifier
The CMOS technology is well known for its outstanding characteris-
tics in digital circuits. Nevertheless, already a few decades ago the first
investigations were made to evaluate the potential of metal oxide semi-
conductor (MOS) technologies in analog circuits [1]. For high integra-
tion and full SoC solutions it is necessary that both the analog and dig-
ital part can be implemented on the same chip. In the last years, it
was shown that a PA can be built in common standard CMOS technol-
ogy nodes of 90 nm, 65 nm, 45 nm and 28 nm [2–5]. Furthermore, it
was shown that despite of low breakdown voltages, CMOS devices can
accomplish high output voltages, that are required for cellular devices,
using stacked designs [6]. To overcome the limitation of the upper tran-
sistor and to fully exploit the voltage range of the bottom transistors a
feedback path can be implemented for a PA self biasing solution. This
increases the reliability by allowing larger signal swings before encoun-
tering hot carrier degradation [7].
59
Chapter 3. Linear Power Amplifier
3.1 Fundamentals
Speaking about linear PAs it is referred to class-A, -B, -AB and -C PA. In
Figure 3.1 the basic structure of the linear PA is shown. The bias current
Idc of the transistor flows from the supply through the inductance L to
the ground. This defines its bias setting and thereby its characteristics.
The modulated RF signal is generated at the drain of the transistor. It is
blocked by the inductance and flows through the capacitance C to the
output. The difference between the different classes is the bias point
at which the PA operates and so the shape of this output signal.
Vdd
Idc
L
C
Vin Ropt
60
3.1. Fundamentals
Vdd is the supply voltage, Vd,sat the knee voltage and Imax the maxi-
mum current at the saturation point. The drain efficiency ηd also de-
pends on these parameters and can be calculated as
Vdd − Vd,sat ϕ − sin(ϕ)
ηd = . (3.2)
Vdd 4(sin( ϕ2 − ϕ2 cos( ϕ2 ))
The bias current Idc is an essential indication of the operating mode
and so the class defintion of the PA. It therefore also depends on the
conduction angle ϕ as well as Imax and can be calculated as
∫ϕ/2
1 Imax ϕ
In = (cos(ϕ) − cos( )) cos (nα)dα. (3.4)
π 1 − cos( ϕ2 ) 2
−ϕ/2
61
Chapter 3. Linear Power Amplifier
1 1
Output Power / Vdd
0.8 0.8
Efficiency [%]
0.6 0.6
0.4 0.4
0.2 0.2
0 π 0
0 ←C ← BA → 2π
Conduction Angle [rad]
In Figure 3.3 the bias amplitude, the fundamental and the harmonics
for class-A to class-C are shown over the conduction angle. It can be
seen that with a decreasing conduction angle the bias current decreases
faster than the fundamental what results in a higher efficiency. The
harmonics of the PA are plotted up to the third order. A class-A PA has
no harmonics. Their values increase from this region until they reach
their maximum in the class-C area, where they then decrease again
with the fundamental. Here the system is most efficient but has no
more output power.
62
3.1. Fundamentals
0.6
Amplitude 0.4
DC
Fundamental
0.2 1st harmonic
2nd harmonic
3rd harmonic
0
0 ←C π ← BA → 2π
Conduction Angle [rad]
Figure 3.3: DC, fundamental and n-th order harmonic plot of a linear
PA depending on the conduction angle ϕ.
0.4
0.3
Vgs = 0.5 V
Ids [A]
63
Chapter 3. Linear Power Amplifier
TX PA FE
Pout,P A Pout,ant
64
3.3. Circuit Design
n-1 depends on the settings at the gate and the threshold voltage of the
next transistor n in the stack and can be calculated to
65
66
N 3p Irf,p
Off-Chip
Γin = Z0 Γgm
N 2p
Cof f
RFin
Off-Chip
N 1p
Balun Cm,p Cm,s
N 1n
N 2n
N 3n Irf,n
Cf b
Idc,P A
Figure 3.6: Schematic of triple stack PA design with an IMN for a differential RF input signal applied at the
gates of the bottom transistors and an OMN for load matching. On-chip ESD protection circuit,
supply generation and off-chip signal conversion with matching capacitors.
3.3. Circuit Design
To decouple the high current flow Irf that occurs at 1 W output power
a capacitor Cdec was placed between the center tap of the transformer
and the ground supply. Therefore, the current has an RF short loop on
the chip, with the transformer inductor L, for the frequency f that can
be calculated as
1
f= √ . (3.10)
2π (LT,out /2)Cdec
Figure 3.7 depicts the equivalent transistor level schematic of the
triple stack for DC characterizations on the PCB. Due to the test board
interconnections the transistor N 3 has a DC short at the gate and drain
of the upper transistor N 3 and can be thereby considered as a diode-
connected metal oxide semiconductor field effect transistor (MOSFET).
The bias voltage Vg1 for the bottom transistor N 1 can be either directly
connected to a supply or can be provided by the on-chip replica stage.
The gate voltage Vg2 for the transistor N 2 is also provided by the bias
stage.
Vdd
Vd3
Vg3
N3
Vd2
Vg2
N2
Vreplica
Vd1
Vg1
N1
Vpcb
Figure 3.7: Triple stack of the class-AB PA with shorted gate drain and
two voltage inputsfor the gate of the bottom transistor.
Since the diode-connected transistor is always in saturation with
Vds = Vgs > Vgs − Vth , the triple stack has a switch on voltage at Vth that
can be expressed by the Shichman-Hodges transistor equations [14] as
67
Chapter 3. Linear Power Amplifier
√
Ids
Vdiode,N 3 = Vth + 1
( W
) . (3.11)
2 µn C ox L (1 + λVth )
The transistor N 2 works in common gate and transistor N 1 in com-
mon source mode. The I-V characteristic of this triple stack is a mixture
of different behaving transistors. The drain current for transistor N 1
and N 2 is according to the equations of the linear and saturation re-
gion. The supply settings for the triple stack can be set at the gate of
N 2 and N 1. For N 1 there are two supply connections possible. One
has a direct connection to the PCB and is therefor regulated by a sup-
ply generator. The other is an on-chip bias generation that consists a
replica stage.
Vd3 , Vg3
6 Vd2 , Vs3
DC Voltage [V]
Vg2
4 Vs2
Vg1
Vss
2
0
3 3.5 4 4.5 5 5.5 6
Vdd,PA [V]
68
3.5. Silicon Implementation
the bottom transistor also remains the same. Vds and Vgd of N2 are the
only voltages that are increased. Therefore, this is the reason for the
breakdown of the transistor stack as it is shown in the following DC
breakdown measurement.
Figure 3.10: Test board with SMA connector, balun, matching compo-
nents and supply connectors.
70
3.7. DC Characterization
∑
K ∑
Q
z(n) = akq x(n − q)|x(n − q)|k−1 . (3.12)
k=1 q=0
Signal Signal
Generator DUT Analyzer
DPD
Figure 3.11: Block diagram of the measurement setup for the DUT.
3.7 DC Characterization
In Figure 3.12 the I-V characteristics of the DUT with replica supply at
the bottom transistor can be seen. In the replica stage a current, that
is provided through the PCB, is converted into a voltage. The drain
current Idd,P A is plotted over the drain voltage Vdd,P A . Additionally, the
replica current Ireplica and thus the bias point of the transistor stack was
71
Chapter 3. Linear Power Amplifier
0.3
Ireplica 150 µA
Supply Current [A]
200 µA
0.2 250 µA
300 µA
350 µA
400 µA
0.1
450 µA
500 µA
550 µA
0
0 0.5 1 1.5 2 2.5 3
Supply Voltage [V]
500 mV
0.3 550 mV
600 mV
0.2 650 mV
0.1
0
0 0.5 1 1.5 2 2.5 3
Supply Voltage [V]
72
3.8. DC Breakdown Measurements
0.4
0.3
0.2
Before
0.1
Breakdown
0 After
0 1 2 3 4 5 6
Supply Voltage [V]
73
Chapter 3. Linear Power Amplifier
40
Output Power [dBm],
Pout
30 PAE
Efficiency [%]
ηd
20
10
0
−10
−20 −10 0 10 20 30
Input Power [dBm]
In Figure 3.16 the efficiency is shown over output power. The plot
shows typical poor efficiency characteristics in the linear region com-
pared to saturation. Since for modulated signals the PA is mostly driven
in BO mode the maximum efficiency is a first order comparison. An-
other important fact is the efficiency at the operating mode, that is
roughly at 6 dB BO for LTE. For other modulated signals it is also in-
teresting to see the impact of small or large PAPR on the performance.
Between the output power range from 20 to 30 dBm a linear increase of
2.5 %/1 dB can be obtained. An assumption of the efficiency degrada-
tion at higher Pout is due to the clipping of the signal that generates an
unwanted distortion what leads to a higher Pdc .
74
3.9. Single Tone Measurements
40 PAE
ηd
Efficiency [%]
30
20
10
0
−10 −5 0 5 10 15 20 25 30
Output Power [dBm]
18
0
S21 Magnitude [dB]
16
S21 Phase [◦ ]
14 −5
12
−10
10
8 −15
−25 −20 −15 −10 −5 0 5 10 15
Input Power [dBm]
Figure 3.17: Measurements of the S21 -parameter over Pin for AM-AM
and AM-PM characterization.
75
Chapter 3. Linear Power Amplifier
15
Gain [dB]
10
−10 0 10 20 30
Output Power [dBm]
Figure 3.18: Pulsed measurement of gain over Pout .
76
3.10. LTE Measurements
31.5
30.5
30
Vdd 2.9 V
29.5 Vdd 3.0 V
Vdd 3.1 V
29
1.6 1.65 1.7 1.75 1.8 1.85 1.9 1.95 2
Frequency [Hz] ·109
Table 3.2: Selected 3GPP LTE Output Power and Linearity Require-
ments
Requirement Value Unit
Pout at antenna 23 dBm
MPR 1-2 dB
EVM QPSK 17.5 %
16-QAM 12.5 %
ACLR E-UTRA 30 dBc
UTRA 33 dBc
77
Chapter 3. Linear Power Amplifier
usage can be seen in the spectral output. Without DPD the spectral
emission requirement, especially in the upper band, is violated. Fur-
thermore, show the upper and lower band different ACLR values. With
DPD and memory polynomial cancellation the upper and lower band
become more similar and it can be seen that the PA upper bands be-
come less. Note the the resolution bandwidth (RBW) was 30 kHz for
the whole measurement range. The required LTE mask was adapted.
The emissions might relax when measuring with 1 MHz RBW as speci-
fied.
0
Power [dBm]
−10
−20
−30
DPD OFF
−40 DPD ON + Memory
1.93 1.94 1.95 1.96 1.97
Frequency [GHz] ·109
Figure 3.20: Measured output spectrum with LTE mask of a fully al-
located band 1 LTE-15 16-QAM OFDM PUSCH test signal
with and without DPD.
−10
DPD OFF
−20 DPD ON + Memory
−40
−50
16 18 20 22 24 26
Output Power [dBm]
seen, that with DPD at 26 dBm Pout the CF is up to 17 dB. Since the re-
quired Pout was reached at this point the device was no further stressed
to avoid degradation or even gate oxide breakdowns.
18
DPD OFF
Crest Factor [dB]
16 DPD ON + Memory
14
12
10
8
16 18 20 22 24 26
Output Power [dBm]
Output Power
In Figure 3.23 Pout is shown. For a fully allocated QPSK modulated sig-
nal an MPR of 1 dB is allowed. The subsequent measurements are all
done at this output power level. It can be seen that without DPD the
output power is met in almost all cases. With DPD Pout has to be re-
duced not to exceed the stress of the bottom transistor, as described
before. Compared to the measurements without DPD, these measure-
ments were taken at 0.2 dB less output power.
27.5
DPD OFF
Output Power [dBm]
27 DPD ON + Memory
26.5
26
25.5
25
LTE-1.4 LTE-3 LTE-5 LTE-10 LTE-15LTE-20
Bandwidth [MHz]
Figure 3.23: Pout measurement for band 1 LTE-1 to LTE-20 QPSK OFDM
PUSCH with full allocation.
EVM
In Figure 3.24 the EVM of the QPSK modulated signals is shown over
the LTE BWs. The in-band requirements are achieved with and without
the use of DPD. For the LTE-20 signal it still has a margin of 10 % to the
limit of 17.5 %. In this case a crest factor reduction (CFR) algorithm can
be included to increase Pout , on the cost of EVM. This measure improves
the overall performance of the design [21].
80
3.10. LTE Measurements
10
0
LTE-1.4 LTE-3 LTE-5 LTE-10 LTE-15LTE-20
Bandwidth [MHz]
DPD OFF p
DPD ON n
−20 DPD ON p
DPD ON + Memory n
−30 DPD ON + Memory p
−40
81
Chapter 3. Linear Power Amplifier
With DPD the upper and lower spectrum become more equal and
the required ACLR values are now met up to LTE signal with 10 MHz
bandwidth (LTE-10). If additionally a memory cancellation is added,
the ACLR values are met up to LTE-20. It can be seen that the non-
linearity and the memory effects strongly depend on the BW.
DPD OFF p
DPD ON n
−20 DPD ON p
DPD ON + Memory n
−30 DPD ON + Memory p
−40
Drain Efficiency
The ηd can be seen in Figure 3.27. Regardless of the BW and usage of
24
DPD OFF
Drain Efficiency [%]
23 DPD ON + Memory
22
21
20
82
3.11. Comparison
3.11 Comparison
Table 3.3 compares this design with other published PAs. It is imple-
mented in the smallest technology node in 28 nm and one of the de-
signs with the highest saturated output power. Compared to the design
with the same saturated output power both achieve the power with the
same supply voltage. It only needs half of the area but operates at a
lower frequency and uses a triple stack. The PAE is in the in the middle
of what was presented. It has 7 % less efficiency than the highest but
8.5 % more than the lowest. The gain is comparable to other 1 stage
designs but only has approximately half of the gain compared to the
designs with predriver. All designs have the IMN and OMN on-chip.
83
84
85
Bibliography
[8] LTE evolved universal terrestrial radio access (E-UTRA) user equip-
ment (UE) radio transmission and reception, (3GPP TS 36.101 ver-
sion 11.6.0 Release 11) , International Technology Roadmap for
Semiconductors (ITRS) Std., 2013.
[16] R. Baker, CMOS: Circuit design, layout, and simulation, ser. IEEE
Press Series on Microelectronic Systems. Wiley, 2011.
86
Bibliography
88
Chapter 4
Digital Power Amplifier
Figure 4.1 shows the block diagram of a TX chain with DAC and PA. The
digital signal is processed in the DFE. It is then up-sampled to the de-
sired output frequency and converted from digital to analog. The ana-
log signal is then amplified by a PA. For further integration the design
can be made more compact by merging the DAC and PA. This results in
a DAC that is capable of delivering the required output power and spec-
tral requirements that are necessary to fulfill the 3GPP requirements for
LTE.
DSP DAC PA
Q
This chapter discusses the design of the DPA and its on-chip imple-
mentation. Several theoretical error sources that might lead to distor-
tions are approached. The design is further analyzed using transient
and harmonic balance simulations. 3GPP requirements are tested us-
ing on-chip generated LTE test signals.
89
Chapter 4. Digital Power Amplifier
Decoder
Matching
ϕ =( )
−1 Q(t)
DFE tan I(t) Rload
In Figure 4.3 the block diagram of the CSDPA is shown. The decoder
connects the amplitude that is generated in the DFE with the cell field.
Between the decoder and the cell field an interface is placed that syn-
chronizes the single-ended clocked data of the decoder with the LO
of the analog domain. The LO is distributed to 32 drivers. This re-
duces the load that has to be driven by a single buffer. Each buffer
drives 32 UCs inside every row. To avoid timing mismatches between
the buffers special care has to be taken in the layout. The parasitic re-
sistance and capacitance have to be small enough to make the timing
difference negligible to the clock period. The UC can be individually
en- or disabled by the decoder and a logic block inside each UC that is
done by the signal EN.
Figure 4.4 shows the enable logic inside each UC. Depending on the
binary to thermometer decoding the AND gate in each UC can transmit
or block the LO. The bottom transistor is place directly after the AND
90
4.1. DPA Design
Driver and UC
Logic and CS
LO
EN
gate. The positive upper part P is immediately changed with the low
phase of the LO. The differential path N is changed half an LO cycle
later to ensure the signal change at the time when the negative LO is
low. The transmission gate (TG) ensures this half-cycle switch.
EN
TG N
Figure 4.4: Block diagram of the LO signal enabling inside each UC.
Vdd
Lchoke Lchoke
C
Ropt
v1 v2
is1 is2
Vin V̄in
92
4.1. DPA Design
the same time the currents can be summed up and the voltage at the
output increases proportionally.
Voltage/Current
Normalized
V
I
1
T /2 T 3T /2
Normalized Switching Time
93
Chapter 4. Digital Power Amplifier
package. This saves metal area on chip, that can be used for the primary
winding and so improve its quality factor. The primary and secondary
winding is coupled by the factor k.
Vdd
L/2 L/2
k
Vbias,1 Vbias,1
LOp LOn
94
4.2. Theoretical Error Sources
∑
N
IN = N I¯ + ¯ n.
Iϵ (4.5)
n=1
95
Chapter 4. Digital Power Amplifier
R1 R2
V0
Cin V1 Cin V2
Resonant Tank
ZT 1 ZT 2
RS RS
LM
Cin I1 Cin I1 + I2
Csub Csub CM RL
96
4.2. Theoretical Error Sources
current switches, so the currents are added and not the voltages. Each
of this UC has a different output impedance Z that it has to
drive. The different driving cells are controlled by the decoder
and are switching thereby the combination of UC. At the output
of the inverter stages a series resonance circuit is placed to generate
the wanted output sinusoid. The currents are combined according to
Kirchoff’s law.
2
Normalized
Magnitude
1
δt
0
0 1 2 3 4 5 6
Time
Figure 4.10: Overlap of two timing delayed rectangular signals.
0, for 2n
2f + δt < |t| < (2n+1)
2f
1, for (2n+1)
≤ |t| < (2n+1)
+ δt
2f 2f
frect (t) = (4.6)
and 2(n+1)
< |t| ≤ 2(n+1)
+ δt
2f 2f
2, for (2n+1)
+ δt ≤ |t| ≤ 2(n+1)
− δt.
2f 2f
97
Chapter 4. Digital Power Amplifier
Tank Connection
Vbias,1
Rpar i
Vin
Cpar
98
4.4. Matrix Controlling
upper transistor is biased with Vbias,1 and has to be chosen wide enough
to provide sufficient current to the bottom transistor.
Figure 4.12 shows the distortion of the rectangular current waveform
in the stack because of increased transistor width. The bottom transis-
tor is swept between a nominal value and 1/20 of it. It can be seen that
with an increasing number of conducting transistors the amplitude is
more distorted and the phase is shifted [6].
·10−2
3
Current [A]
2 T=1
T=15/20
1 T=10/20
T=5/20
0 T=1/20
T/2 T
Period
Figure 4.12: Simulation of AM-AM and AM-PM distortion due to stack
resistance.
¯ + ϵn )
In = I(1 (4.9)
99
Chapter 4. Digital Power Amplifier
I is the mean value of all current sources and ϵn is the error of each
UC n that produces the current In . So the INL and DNL of the DPA can
be calculated. The switching schemes give the sequence how to switch
on or off these UC.
The switching schemes shown before is called Type A and the one il-
lustrated in Figure 4.16 Type B. As well as the Type A this also compen-
sates linear and symmetrical errors. In contrary to Type B the switching
scheme jumps around the middle. For linear and symmetrical errors
around the middle this can be compensated with every second step.
15 11 7 3 1 5 9 13 14 10 6 2 4 8 12 16
Figure 4.16: 1-dimensional hierarchical symmetrical scheme Type B.
100
4.4. Matrix Controlling
5 1 3 7 8 4 2 6
13 9 11 15 16 12 10 14
21 17 19 23 24 20 18 22
29 25 27 31 32 28 26 30
Figure 4.18 shows the scheme of the random walk (RW) [9]. The idea
of this scheme is to cancel local as well as global mismatches. Each
local block consists of a 4 × 4 matrix with UCs. The local sequence
jumps around its center in a way that it cancels best any kind of error.
P B F K
12 10 6 2
8 1 14 4
D N H M 5 15 9
0
3 7 11 13
L J A O
G C R E
101
Chapter 4. Digital Power Amplifier
1 1
Magnitude
Magnitude
0.5 0.5
0 0
N N N N
1 1
Magnitude
Magnitude
0.5 0.5
0 0
N N N N
Figure 4.19: Diagram of different gradients that represent the UCs dif-
ference across the matrix.
102
4.4. Matrix Controlling
Linear, quadratic and joint gradients can be used to model the error [8].
In Figure 4.19a a linear gradient is shown. In Figure 4.19b the linear gra-
dient is rotated so that the peak is placed in the corner. Figure 4.19c has
a symmetrical gradient with a peak in the center and Figure 4.19c has
a decentralized bump. For these different gradients the transfer line
of the DAC compared to an ideal DAC changes. The INL and DNL can
now be calculated and compared. Since the DNL value does not change
as discussed before only the INL is considered.
Figure 4.20 shows the resulting transfer function for RC scheme that
switches the cell fields of Figure 4.19. Figure 4.20a is the result of a cell
1 1
Normalized Amplitude
Normalized Amplitude
Ideal DAC Ideal DAC
0.8 DAC 0.8 DAC
0.6 0.6
0.4 0.4
0.2 0.2
0 0
1 512 1,024 1 512 1,024
Code [Unit Cells] Code [Unit Cells]
(a) Linear Gradient (b) Rotate Linear Gradient
1 1
Normalized Amplitude
Normalized Amplitude
103
Chapter 4. Digital Power Amplifier
field with linear gradient. It can be seen that with every second UC
that is switched active the DAC is equal to an ideal DAC. This kind of
switching scheme is intended for linear or symmetrical gradients that
are perpendicular to the switching scheme. If the gradient is turned by
45° the scheme cannot compensate anymore the difference that results
in an INL. For a symmetrical gradient the same conclusion for the linear
gradient can be made. For a gradient with decentralized bump the INL
is shifted away from half code.
Figure 4.21 shows the transfer functions of the DAC for the RW switch-
ing scheme. The same gradients are used as seen before. Compared to
1 1
Normalized Amplitude
1 1
Normalized Amplitude
Normalized Amplitude
104
4.5. Theoretical Error Cancellation
the RC scheme for this one no preference for any gradient can be seen.
The transfer functions look for all the gradients similar, what makes
this transfer function more independent of the gradient and easier to
predict. The main characteristic and difference to RC is that an overall
INL is distributed to smaller deviations around the ideal transfer func-
tion.
1
2
Magnitude
0
0
−1
−2 4
1 2 3 4 2 −2
5
Row Column
105
Chapter 4. Digital Power Amplifier
∑ ∑
N /2 M
IN Lmax = ϵmax = ϵn,m
( n=1 m=1 )
ϵa N
= M 1 + 2 + ... +
2 2
( )
ϵa N + 2 N
= M . (4.10)
2 2 4
Assuming the same fixed error ϵn as before the worst case after one
sequence is that all cells above the average are at the maximum dis-
tance in a local matrix and all below are close to the average value. The
maximum INL can be described as
(M ×N )
∑
IN Lmax = ϵmax = max ϵn,x
[ n=1
ϵa M M NM
= 4 + 8 + ... +
2 4 4 2 4
( ) ]
M M N M
− − 5 − ... − −3
4 4 2 4
ϵa 3N
= M . (4.11)
2 32
M is the number of columns of global matrices, N the number of rows
and x is the local selection of a UC. Considering only a static mismatch,
then the DNL error is for both schemes the same. For dynamic errors
such might occur due to voltage drop or short channel effects the DNL
changes cannot be considered equal because it is the sum of static and
dynamic errors in different conditions.
For a 32 × 32 matrix and a linear gradient, which has a difference of
-0.155 LSB to 0.155 LSB, the resulting INL is displayed in Figure 4.23. It
can be seen that for this linear gradient the RW scheme has a higher
INL compared to the RC scheme, which cancels the error after every
second cell.
106
4.5. Theoretical Error Cancellation
0.5 RC
RW
0
−1
0.5
In Figure 4.25 the phase distortion for both switching schemes is plot-
ted. The phases are normalized to the first phase. The overall phase
distortion is for all switched on cells the same.
107
Chapter 4. Digital Power Amplifier
·10−2
1.5
Phase [◦ ] 1
0.5
RC Phase
0 RW Phase
0 200 400 600 800 1,000
Code [Unit Cells]
4.6 Decoder
For CSDAC different architectures to weight the digital to analog con-
version were discussed. There exists three architectures, the binary,
the unary and an architecture were both are combined, the segmented
architecture. The advantages of the binary architecture are a simple
implementation because no decoder is needed and every cell has to
be double the size of the previous. This results in a simple and power
saving architecture. The disadvantages are that monotonicity is not
ensured and glitches can occur. This disadvantages are covered by the
unary architecture. Since the cells are weighted by switching on and off
a sequence of unary cells the monotonicity is implicit. But for higher bit
resolutions the decoding becomes more and more complex. Therefore,
the segmented design can be chosen. For the LSBs a simpler and more
efficient design can be chosen. For the most significant bits (MSBs) a
more complex design might be required to fulfill the linearity require-
ments [13]. The decoder has to work in an required frequency range of
2.3-2.7 GHz to meet LTE requirements as written in a previous chap-
ter. The basic requirements of the decoder are that it works reliably
at this frequencies and guarantees a synchronous output of the LO as
well as the decoded data. Additionally, the decoder has to be able to
drive the UCs inside the matrix. This can be achieved by using driver
cells after the output ports. The decoder is the last fully digital design
in our power amplifier digital to analog converter (PADAC). The whole
108
4.6. Decoder
digital part is realized with low voltage transistors (LVT) which work at
a specified supply voltage of 1.1 V.
4.6.1 Motivation
The theoretical concept of matrix controlling was described in the pre-
vious section. For further on-chip investigations the two different con-
trolling schemes were implemented in the decoder. The decoder was
build using the digitally automated design flow. This has the benefit
that the layout can be done automated and extracted for timing anal-
ysis. These timing analysis can be done over PVT and are therefore
faster compared to analog manual characterizations.
CLK
Dec_ON
All_n
Register
Register
Dec_Mode_Sel Cell
Logic
Thermo 10b Col
Row
Binary 5b Bin
109
Chapter 4. Digital Power Amplifier
and delivered to the output. The binary 10 bit thermometer data is de-
coded according to one of the two activated decoding selections. Ac-
cording to these modulations the UCs can be activated inside the cell
field. Row activates the row, Col the column and Cell a block of cells.
If an entire block of cells is activated this can be switched on by All_n.
The differential LO and inverse local oscillator (LOX) generated signal
contain the PM.
To ensure that later the data input into each UC cell is reliable a half
clock cycle delay of the clock to the data is generated at the output of
the decoder. Furthermore, the input data is first synchronized with a
register to be able to define specific requirements to the previous stage.
The internal data are again synchronized at the output to ensure that
all data wire synchronously go out of the decoder into the UCs and to
be able to specify a defined clock delay between the data and the clock.
For RW four cores, with each consisting of 256 UCs, are needed to get
the required 1024 UCs. To alleviate the driver requirements for each
output port, especially for the column, the decoder is designed in a T-
shape. Therefore, the cell field is divided into two 8 × 8 matrices. The
output ports of the decoder and their later position in the Layout can
be seen in Figure 4.27.
110
4.7. Cell Field Layout
111
Chapter 4. Digital Power Amplifier
Metal
Height
Length
Width
For copper it is defined for relibility that the current density should
not exceed 106 A/cm2 or 10−2 A/um−2 in the temperature range from
−55 to 125 °C [15, 16]. Table 4.1 shows the current density defined for
copper.
112
4.8. DPA Simulation
113
Chapter 4. Digital Power Amplifier
1 LO +
LO −
Vout
Voltage [V]
0.5
0
−0.5
−1
2 3 4 5 6 7 8 9 10
Time [ns]
114
4.8. DPA Simulation
code word of 5 Bit to 10 Bit and reverse. A UC, that is en- and disabled
during this period, represents the basic behavior of all UCs. When the
cell is active the LO signal is passed and drives the gate of the bottom
transistor which contributes to the amplification of the output signal’s
amplitude Vout . The complementary differential LOX signal path is en-
abled half an LO period later.
−20
−30
−40
−50
10 20 30 40 50 60
Code [Unit Cells]
Figure 4.32: Simulation of Pout over five binary bits and the transitions
to the first two UCs.
Figure 4.33 shows Pout , ηd and ηoa over the thermometer cells for the
both switching schemes RC and RW. By only considering the ther-
mometer cells, the DPA has a DR of Pout from -20.2 dBm to 32.4 dBm.
Taking the output power of the binary bits, shown in the plot above,
into account this results in a total DR of 82.3 dB. The ηd and ηoa is in the
range of 0.01 % to 46.8 % and 0.01 % to 45.5 %, respectively.
115
Chapter 4. Digital Power Amplifier
Efficiency [%]
Pout RC
20
Pout RW
ηd RC
0 ηd RW
ηoa RC
−20 ηoa RW
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.34 shows the efficiencies, seen in the figure above, over Pout .
The maximum ηd is 46.8 % at full input code. At 3 dB BO the efficiency
decreases to 32.5 % and at 6 dB BO to 22.5 %.
50
ηd RC
40 ηd RW
Efficiency [%]
ηoa RC
30
ηoa RW
20
10
0
−10 0 10 20 30
Output Power [dBm]
116
4.8. DPA Simulation
·10−2
6
0
10 20 30 40 50 60
Code [Unit Cells]
Figure 4.35: Simulation of Vout over code.
Figure 4.36 shows Vout over the 10 bit thermometer code. The maxi-
mum output voltage that is achieved in the simulation is 13.2 V. It can
be seen that the transfer characteristics are identical for RC and RW.
15
Vout RC
Output Voltage [V]
12.5 Vout RW
10
7.5
5
2.5
0
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.36: Simulation of Pout , ηd and ηoa over code.
117
Chapter 4. Digital Power Amplifier
Phase [°] −5
−10
−15
−20
10 20 30 40 50 60
Code [Unit Cells]
Figure 4.37: Simulation of the phase over code for the five binary bits.
In Figure 4.38 the phase of the output signal is simulated and com-
pared for RC and RW. It can be seen that the maximum difference of
output phase of the RW signal is approximately 0.28° higher than the
phase generated by RC.
Phase RC
4 Phase RW
Phase [°]
118
4.9. Variable LO Load
drain capacitance Cgd and gate source capacitance Cgs of the pMOS
and nMOS transistors. In case that the cell is disabled some charges
are stored the drain capacitance at the knot between the two bottom
transistor. According to C = Q/V the charge of the capacitance Cgs
that has to be recharged by the LO is changed. This results in a variable
row capacitance that depends on the number of enabled cells.
Vdd
Vdd
S
Cgd
LO Cgd
Cgs
S
Figure 4.40 shows the voltage difference that occurs at the output due
to the changed load of the LO. The maximum difference is 0.013 V. It
·10−2
1
Voltage [V]
0.5
0
0 200 400 600 800 1,000
Code [Unit Cells]
119
Chapter 4. Digital Power Amplifier
can be seen that this difference has no significant impact on the overall
output voltage.
Figure 4.41 shows the phase over code for RW and RC. By changing
the gate connection of the LO to the bottom nMOS transistor the phase
difference can be reduced.
4 Phase RC
Phase RW
Phase [°]
−2
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.41: Simulation of phase over code for the ten thermometer
bits.
4.11 CW Measurements
In this chapter different measurements to investigate the DPA perfor-
mance are done. The design was implemented for the LTE FDD band
7 and the TDD bands 38, 40 and 41. First the core board with the chip
120
4.11. CW Measurements
121
Chapter 4. Digital Power Amplifier
the chip bump. For all measurements a supply voltage of 2.5 V is set.
As default setting RC is set.
Figure 4.44 depicts the measurement setup for the DUT. The test sig-
nal is generated directly on chip and then amplified by the integrated
DPA. The output of the PCB is then connected to a load tuner to be
able to match the impedance to 50 Ω. The output is then connected
to either a power meter or a spectral analyzer depending on the mea-
surements. For CW measurements the power meter is used because of
higher accuracy.
Spectrum Analyzer
Γ Power Meter
Figure 4.44: Block diagram of the measurement setup for the DUT.
122
4.11. CW Measurements
−30
−40
−50
−60
5 10 15 20 25 30
Code [Unit Cells]
Figure 4.45: Measurements of Pout for the binary cells at the center fre-
quency of LTE band 40.
Figure 4.46 presents the drain efficiency ηd over the input code of
the five binary bits. It can be seen that in this range the efficiency is
·10−3
1
Efficiency [%]
0.5
0
10 15 20 25 305
Code [Unit Cells]
Figure 4.46: Measurements of ηd for the binary cells at the center fre-
quency of LTE band 40.
123
Chapter 4. Digital Power Amplifier
35
Output Power [dBm],
30
Efficiency [%]
20
10
0 Pout RC
−10 ηd RC
Pout RW
−20 ηd RW
−30
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.47: Measurements of Pout and ηd at the center frequency of LTE
band 40.
Figure 4.48 displays ηd over Pout for the ten MSBs. The maximum ηd
is 34.3 % at 31.0 dBm. At 3 dB BO the efficiency decreases to 24.0 % and
40
ηd RC
30 ηd RW
Efficiency [%]
20
10
0
−20 −10 0 10 20 30
Output Power [dBm]
Figure 4.48: Measurements of ηd over Pout at the center frequency of
LTE band 40.
124
4.11. CW Measurements
20
−20
−40
2
Voltage [V]
1.5
1
0.5 Vout
0 Vout Ideal
5 10 15 20 25 30
Code [Unit Cells]
Figure 4.50: Measurements of Vout for the binary cells at the center fre-
quency of LTE band 40.
125
Chapter 4. Digital Power Amplifier
0.5
INL [Binary LSB]
−0.5
5 10 15 20 25 30
Code [Unit Cells]
Figure 4.52 shows the DNL of the thermometer decoded cells over
input code. The maximum DNL is −0.64 LSB at the transition to the
thermometer code. It can be seen that the most influence is given by
the change from the first to the second binary cell and from the binary
0.2
DNL [Binary LSB]
0
−0.2
−0.4
−0.6
5 10 15 20 25 30
Code [Unit Cells]
126
4.11. CW Measurements
10
Voltage [V]
7.5
5
Vout RC
2.5 Vout RW
Vout Ideal
0
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.53: Diagram of AM-AM for center frequency of LTE band 40.
Figure 4.54 shows the INL over code of the thermometer bits for RC
and RW. Consistently to the plot before both scheme show the same
characteristic behavior. It can be seen that the maximum INL is 283 LSB
for RC and 290 LSB for RW.
300
INL [Thermo LSB]
200
100
INL RC
0 INL RW
0 200 400 600 800 1,000
Code [Unit Cells]
127
Chapter 4. Digital Power Amplifier
Figure 4.55 shows the DNL of the thermometer bits for RC and RW.
The maximum DNL is 1.30 LSB at input code 1 for RC and 1.31 LSB at
input code 3 for RW. Consider that the smaller variations in the plot
could be caused by measurement inaccuracy.
2
DNL RC
DNL [Thermo LSB]
DNL RW
1
−1
−2
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.56 shows the AM-AM for RC over the entire 15 bits. Com-
pared to the measurements with static binary bits, the non-monotonic
behavior of the binary cells distorts the output transfer function. For
future use it might be considered to only use the binary cells at low Vout .
12.5
10
Voltage [V]
7.5
2.5
0
0 0.5 1 1.5 2 2.5 3
Code [Unit Cells] ·104
Figure 4.56: Diagram of AM-AM for center frequency of LTE band 40.
128
4.11. CW Measurements
Figure 4.57 shows the INL of the full code for RC. It can be seen that
the maximum INL is 9134 LSB. The basic behavior of the plot is similar
compared to the plot that only used thermometer decoded cells but the
transitions between become more random.
·104
1
0.8
INL [LSB]
0.6
0.4
0.2
0
0 0.5 1 1.5 2 2.5 3
Code [Unit Cells] ·104
Figure 4.58 shows the DNL of the full code for RC. The maximum DNL
is 18.51 LSB at the input code 31954 and the minimum is -26.79 LSB at
31743. The tendency, when only the thermometer decoded cells were
considered, cannot be seen anymore.
20
10
DNL [LSB]
−10
−20
−30
0 0.5 1 1.5 2 2.5 3
Code [Unit Cells] ·104
129
Chapter 4. Digital Power Amplifier
RE RC
FE RC
40
RE RW
Phase [°]
FE RW
20
0
200 400 600 800 1,000
Code [Unit Cells]
Figure 4.60 shows the supply voltage over code. It can be seen that
with increasing the input code the supply voltage Vdd drops. This re-
sults in a lower supply for the DPA and a decrease of the Pout .
Vdd RC
Supply Voltage [V]
2.48 Vdd RW
2.46
2.44
2.42
2.4
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.60: Measurements of Vdd for the center frequency of LTE band
40 for RC.
130
4.11. CW Measurements
38
Output Power [dBm],
36
Efficiency[%]
34 Pout
ηd
32
30
2.30 2.35 2.40
Frequency [GHz]
36
Efficiency[%]
34
32
30
28 Pout
26 ηd
1.6 1.8 2 2.2 2.4 2.6
Supply Voltage [V]
131
Chapter 4. Digital Power Amplifier
40 33
Output Power [dBm]
30
Output Power [dBm]
32
20
31
10
0 30
800 850 900 950
Code [Unit Cells]
1,000
−10
Pout Sim. RC
−20 Pout Meas. RC
−30
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.64 shows ηd for the simulation and measurement of RC. The
efficiency has a difference, at the maximum measured output power, of
5.34 %. Assuming the same Pout for measurements and the same con-
stant difference of the efficiency, as at Pmax , the efficiency at 32.4 dBm
would be 41.5 %. In BO the difference of ηd decreases to 3.46 % at
29 dBm and 2.53 % at 26 dBm. At 20 dBm the difference is 1 % and de-
creases further with reduced Pout .
132
4.12. Simulations vs. Measurements
50
ηd Sim. RC
40 ηd Meas. RC
Efficiency [%]
30
20
10
0
−20 −10 0 10 20 30
Output Power [dBm]
Figure 4.65 shows Vout for the simulation and measurement of RC.
The difference of the Vout and especially at full code is beneficial for
the design. The voltage difference at full code is 2 V. This reduces the
maximum voltage stress of each transistor and therefore increases the
reliability of the whole design.
15
Output Voltage [V]
12.5
10
7.5
5
2.5 Vout Sim. RC
Vout Meas. RC
0
0 200 400 600 800 1,000
Code [Unit Cells]
Figure 4.66 shows ϕ for the simulation and measurement. The phase
difference of the measurements is around 48°. For a non parasitic sim-
ulation the maximum deviation is 4.4°. After reaching the maximum
both plots decrease monotonously. Further investigations of the high
discrepancy between simulation and measurement have to be done.
133
Chapter 4. Digital Power Amplifier
Additionally, the major contributor for the high phase difference of the
design has to be found.
Phase Sim.
40 Phase Meas.
Phase [°]
30
20
10
0
200 400 600 800 1,000
Code [Unit Cells]
4.13.1 ACLR
Figure 4.67 shows the ACLR values for RC and RW over Pout . At a CHP
of 26.5 dBm the ACLR for RC is -27.5 dBc and -26.7 dBc for RW. At the
required CHP of 26 dBm the ACLR of RC has a difference of 2.0 dB
to the required 30 dBc. For RW the difference is 2.2 dB. The ACLR
requirements are met at a CHP of 24.5 dBm for RC and 24.0 dBm for
RW.
134
4.13. LTE Measurements
−20
ACLR1,l RC
ACLR1,u RC
ACLR [dBc]
−30 ACLR1,l RW
ACLR1,u RW
ACLR limit
−40
−50
0 5 10 15 20 25
Output Power [dBm]
Figure 4.67: ACLR measurements of an LTE-5 PUSCH signal over Pout .
Figure 4.68 depicts the ACLR values for RC and RW over Pout . At
an increased CHP of 26.5 dBm the ACLR is -26.4 dBc for RC. At the
required CHP of 26 dBm the ACLR of RC misses 3.1 dB to the required
30 dBc. The required 30 dBc are met at 22.5 dBm. Compared to LTE-5
the ACLR is 5 dB higher. The ACLR requirements are never met for RW.
Further investigation have to be done to explain this behavior.
−20
ACLR1,l RC
ACLR1,u RC
ACLR [dBc]
−30 ACLR1,l RW
ACLR1,u RW
ACLR limit
−40
−50
0 5 10 15 20 25
Output Power [dBm]
Figure 4.68: ACLR measurements of an LTE-10 PUSCH signal over Pout .
4.13.2 EVM
Figure 4.69 shows the EVM for RC and RW over Pout . In both cases
the EVM values are below the required 17.5 %. At the increased CHP of
135
Chapter 4. Digital Power Amplifier
26.5 dBm the EVM for RC is 7.6 % and for RW 8.2 %. At the required
CHP of 26 dBm the EVM for RC is 7.9 % and for RW 8.0 %. For both
switching schemes the EVM is decreasing until 20 dBm. As for the
ACLR values, both switching schemes have a similar behavior.
20
15
EVM [%]
10
EVM RC
5 EVM RW
EVM limit
0
0 5 10 15 20 25
Output Power [dBm]
Figure 4.70 shows the EVM for RC and RW over Pout . In both cases
the EVM values are below the required 17.5 %.At the increased output
power, the EVM for RC is 8.3 % and 9.3% for RW, respectively. At the
increased CHP of 26.5 dBm the EVM for RC is 8.3 % and for RW 9.2 %.
20
15
EVM [%]
10
EVM RC
5 EVM RW
EVM limit
0
0 5 10 15 20 25
Output Power [dBm]
136
4.13. LTE Measurements
At the required CHP of 26 dBm the EVM for RC is 8.2 % and for RW
9.2 %. For both switching schemes the EVMremains constant with a
difference of 1 %.
ηd RC
20 ηd RW
Efficiency [%]
15
10
5
0
0 5 10 15 20 25
Output Power [dBm]
Figure 4.72 shows the efficiency over Pout for LTE-10 signal modulated
with RC and RW. The efficiency increases with increased Pout . At the
maximum required CHP of 26 dBm the efficiency is 18.7 %. At 3 dB and
6 dB BO the efficiency drops to 13.6 % and 9.4 %. At 27.5 dBm the effi-
ciency increases up to 23.9 %. For lower Pout the efficiency drops fur-
ther. It is below 5 % at 15.5 dBm. As in LTE-5, ηd has the same behavior
for both switching schemes.
137
Chapter 4. Digital Power Amplifier
25
ηd RC
20 ηd RW
Efficiency [%]
15
10
5
0
0 5 10 15 20 25
Output Power [dBm]
4.13.4 Spectrum
Figure 4.73 shows the setup for the DN measurements. The DN is mea-
sured in the receive band of the same channel to show how much noise
is produced by the DPA. The setup consists the DUT, which receives
the digitally modulated test signal from the interface and converts it
to an analog output signal. The DUT is terminated by a load tuner
that matches the output to a 50 Ω load. The first measurement is done
138
4.13. LTE Measurements
ignoring the notch filter. The spectral analyzer measures the in-band
power as well as the ACLR and EVM. The second measurement includes
the notch filter with 90 MHz BW to attenuate the signal and noise. So
the noise floor and the spectral emissions can be measured. The RBW
is 100 kHz. Afterward, both measurements are combined. For mea-
surements band 7 LTE-5/LTE-10 OFDM QPSK PUSCH are used.
Spectrum LTE-5
Figure 4.74 depicts the normalized output spectrum of an LTE-5 sig-
nal at different CHPs. Compared to the CHPs in BO the figure shows
that at the required CHP of 26 dBm the OOB emissions increase and
so decrease the E-UTRA ACLR values, as shown before. The noise floor
above and below 100 MHz of the center frequency do not change signifi-
cantly, which is important for DN measurements. By further increasing
Pout to 29 dBm, the clipping of the signal can be seen in an increased
spectral emission. This increases the noise level inside the notch filter’s
BW. It can be seen that the noise floor is increase by the distortions and
would be even higher without the notch filter. Spurs produced in the
DFE create images of the signal at multiples of ±78 MHz.
CHP = 29 dBm
−60 CHP = 26 dBm
Relative Spectral
Power [dBc/Hz]
−120
−140
2.3 2.535 2.75
Frequency [GHz]
Figure 4.75 illustrates the LTE-5 signal at different CHPs for RW. Even
in BO at 17 dBm the signal has a highly increased noise level inside the
90 MHz filter BW. The noise next to the signal band only decreases
slightly resulting in a slowly improving ACLR value, as shown before.
139
Chapter 4. Digital Power Amplifier
The noise increases in BO, above the notch filter’s BW, which results in
a worse DN.
CHP = 29 dBm
−60 CHP = 26 dBm
Relative Spectral
Power [dBc/Hz]
DN
−120
−140
2.3 2.535 2.75
Frequency [GHz]
Spectrum LTE-10
Figure 4.76 depicts the normalized output spectrum of an LTE-10 sig-
nal at different CHPs. At the required Pout of 26 dBm the OOB emission
increase significantly compared to Pout in BO although the noise floor
CHP = 27 dBm
−60 CHP = 26 dBm
Relative Spectral
Power [dBc/Hz]
−120
−140
2.3 2.535 2.75
Frequency [GHz]
140
4.13. LTE Measurements
above and below 100 MHz of the center frequency is not increased and
so does not decrease the noise to CHP ratio in the duplex receive band
noise. At 27 dBm CHP the signal begins to clip and to be significantly
distorted. This results in broader and increased noise level around the
channel. Compared to the LTE-5 spectrum LTE-10 has a broader in-
creased noise next to the CHBW.
Figure 4.77 illustrates the spectral measurements for LTE-10 RW. As
already seen in LTE-10 this scheme increases the noise level signifi-
cantly compared to RC. In OOB the noise decreases in between the
BO range from 26 dBm to 22 dBm and so the ACLR values. But it can
be seen that the noise does not decrease any more in the range from
22 dBm to 17 dBm and so do the ACLR values stay constant. Outside
OOB the noise increase in BO which might be to two different noise
sources. This might be due to a higher impact of frequency modula-
tion (FM). Especially in this frequency range FM is significant, as will
be seen in the next section. The increased noise level in BO also de-
grades the DN.
CHP = 27 dBm
−60 CHP = 26 dBm
Relative Spectral
Power [dBc/Hz]
−120
−140
2.3 2.535 2.75
Frequency [GHz]
CW only
−60
AM only
Relative Spectral
Power [dBc/Hz]
−80 FM only
Composite
−100
DN
−120
−140
−160
2.3 2.535 2.75
Frequency [GHz]
Table 4.2 summarizes the values of the duplex noise at 120 MHz dis-
tance for the aforementioned modulations. As seen from the spectrum
Table 4.2: Measured Duplex Receive Band Noise for CW, AM, FM and
Composite
Measurement Duplex Noise
CW only -155.8 dBc/Hz
AM only -139.5 dBc/Hz
Composite -140.1 dBc/Hz
142
4.13. LTE Measurements
Duplex Noise
Figure 4.79 depicts the DN over Pout for a LTE-5 QPSK OFDM PUSCH
test signal. At the CHP of 25.74 dBm the DN is -140.7 dBc/Hz for RC
and -136.7 dBc/Hz for RW. In BO the noise can be slightly improved for
RC to 1.41.3 dBc/Hz. By further reducing Pout the noise level does not
decrease and so the CHP to noise ratio increases again.
−120
Duplex Noise [dBc/Hz]
NF RC
NF RW
−130
−140
−150
16 18 20 22 24 26 28
Output Power [dBm]
Figure 4.80 shows the DN over Pout for a LTE-10 QPSK OFDM PUSCH
test signal. At the CHP of 25.80 dBm the DN is -138.3 dBc/Hz for RC and
-133.8 dBc/Hz for RW. In 3 dB BO the dynamic biasing (DB) is 0.2 dB
better. With decreased CHP the DN starts to increase as already seen
for LTE-5.
143
Chapter 4. Digital Power Amplifier
−120
−140
−150
16 18 20 22 24 26
Output Power [dBm]
The second and third harmonic distortions can be seen at 4.7 GHz and
7.05 GHz, respectively. For a RBW of 30 kHz the noise level is constant
around -65 dBm. RC and RW show the same noise behavior over the
entire spectrum.
RC
Spectral Power [dBm]
0
RW
2nd
−20
3rd
−40
−60
0 1 2 3 4 5 6 7 8
Frequency [GHz]
Figure 4.82 shows the full span spectrum over 8 GHz of a fully allo-
cated LTE-10 band 40 signal, modulated by RC and RW at the required
CHP of 26 dBm. The second and third harmonic of the fundamental
signal can be seen at 4.7 GHz and 7.05 GHz. For a RBW of 30 kHz the
noise level is constant around -65 dBm. It can be seen that for RC the
144
4.14. Failure Causes
0 RC
Spectral Power [dBm] RW
−20
2nd
3rd
−40
−60
0 1 2 3 4 5 6 7 8
Frequency [GHz]
4.14.1 Simulation
Figure 4.83 shows the transient simulation of the current that flows
inside the decoder during its operation. The root mean square value of
the current settles around 6.35 mA. This current is distributed and does
therefore not flow over the entire supply line what results in an unequal
voltage drop distribution. It can be taken nevertheless as a first order
approximation to explain the resulting effects.
145
Chapter 4. Digital Power Amplifier
·10−3
8
6
Current [A]
4
2
0
−2
0 2 4 6 8 10 12 14 16 18 20
Time [ns]
Figure 4.83: Simulation of the current in the digital supply path to the
decoder.
It can be seen in Figure 4.84 that the voltage drops from the ideally
1.1 V to 850 mV. This causes that the transistors are not fully switched on
anymore. The voltage drops due to parasitics at the supply path. The
simulations were done with a resistance and inductance at the supply
source. The values of the parasitics were calculated for the furthest
point in the decoder.
1 0.9
0.8
Voltage [V]
Voltage [V]
0.7
0.6
0.5 0.5
0.4
0.3
V CLK 0.2 V CLK
0 V CLK at FF 0.1 V CLK at FF
0
0 5 10 15 20 5 6 7 8
Time [ns] Time [ns]
Figure 4.84: Simulation of the voltage drop inside the decoder due to
parasitics at the supply path.
146
4.14. Failure Causes
4.14.2 Measurement
It is not possible to do an independent current consumption measure-
ments for the digital part of the DPA. Since the digital part has no in-
dependent supply, but uses the same supply as the DFE, only the total
current consumption can be measured. The current is measured with
enabled and disabled decoder. The difference of both measurements
represents the current that is used by the decoder. In case that the de-
coder is disabled the current consumption is around 7 mA less. Com-
pared to the simulated 6.35 mA this value can be used as a first order
value to explain the effects.
Figure 4.85 shows Pout of the DPA with different low-dropout regu-
lator (LDO) voltages for the RC modulated decoder. Due to the sym-
metric design of the decoder the voltage drop can be seen linear. At
input code 192 the voltage drop in the upper part of the decoder is so
high that the transistors cannot be switch on or off anymore. There-
fore, Pout stays 32 cells the same until the next row is active in the lower
part of the decoder that is closer to the supply.
Output Power [dBm]
20
0
LDO 1.2 V
LDO 1.3 V
−20 LDO 1.4 V
200 400 600 800 1,000
Thermo Code [Unit Cells]
Figure 4.85: Measurement of Pout over code for different LDO voltages.
147
Chapter 4. Digital Power Amplifier
0
LDO 1.2 V
LDO 1.3 V
−20 LDO 1.4 V
100 101 102 103
Thermo Code [Unit Cells]
Figure 4.87 shows Vout for the MSBs of the design. It can be seen that
even for an LDO voltage of 1.3 V at the maximum output UCs are not
switched on.
11.5
Output Voltage [V]
11
10.5
10
9.5 LDO 1.2 V
LDO 1.3 V
9 LDO 1.4 V
800 850 900 950 1,000
Thermo Code [Unit Cells]
Figure 4.87: Measurement of Vout over code for different LDO voltages.
148
Bibliography
[1] D. Chowdhury, L. Ye, E. Alon, and A. Niknejad, “An Efficient
Mixed-Signal 2.4-GHz Polar Power Amplifier in 65-nm CMOS
Technology,” Solid-State Circuits, IEEE Journal of, vol. 46, no. 8,
pp. 1796–1809, Aug 2011.
[5] W. Sansen, Analog design essentials, ser. Analog circuits and signal
processsing series. Springer, 2006, no. Bd. 1.
149
Bibliography
Circuits and Systems II: Analog and Digital Signal Processing, IEEE
Transactions on, vol. 47, no. 7, pp. 585–595, Jul 2000.
[14] W.-K. Loo, K.-S. Tan, and Y.-K. Teh, “A study and design of
CMOS H-Tree clock distribution network in system-on-chip,” in
ASIC, 2009. ASICON ’09. IEEE 8th International Conference on, pp.
411–414, Oct 2009.
150
Chapter 5
Conclusion
Two designs have been implemented and tested by using LTE test sig-
nals. First a linear class-AB PA was designed to proof that high watt-
level output power can be achieved in 28 nm CMOS technology using
a triple transistor stack.
After the proof of concept of the linear PA, a CSDPA was implemented
to merge the DAC with the PA and so become more compact. For CW
measurements the maximum Pout was 31.2 dBm and the maximum ηd
34.3 %. LTE measurements were done with a QPSK OFDM PUSCH LTE
band 7 signal for 5 and 10 MHz BW. At the required CHP of 26 dBm the
ACLR was 26.9 dBc for LTE-5 and 27.4 dBc for LTE-10. The required -
30 for E-UTRA were met at 21.7 dBm CHP and 22.0 dBm, respectively.
EVM requirements were met for all test cases. The DN at 26 dBm CHP
for LTE band 7 is -140.7 dBc/Hz for LTE-5 and -138.3 dBc/Hz for LTE-10.
151
Chapter 5. Conclusion
The fully integrated design has a total area of 0.61 mm × 0.5 mm.
152
Bibliography
[1] G. Liu, P. Haldi, T.-J. K. Liu, and A. Niknejad, “Fully integrated
CMOS power amplifier with efficiency enhancement at power back-
off,” Solid-State Circuits, IEEE Journal of, vol. 43, no. 3, pp. 600–609,
March 2008.
153
List of Figures
155
List of Figures
156
List of Figures
157
List of Figures
158
List of Tables
159