Beruflich Dokumente
Kultur Dokumente
Research Article
Abstract: This paper presents reconfigurable and hardware-efficient VLSI architecture of time domain cyclostationary-feature
detector (TCD) for spectrum sensing in the cognitive-radio wireless network. It incorporates new architecture for autocorrelator
that supports the entire range of subcarriers used by orthogonal frequency division multiplexing signals compliant to 4G LTE-
Advanced wireless network. A novel scheme of overflow/underflow protection is proposed for the coordinate rotation digital
computer engine of TCD. Additionally, hardware-efficient techniques have been introduced for the multiply-&-accumulate and
accumulator architectures of suggested TCD design. Real-world signals are captured using universal software radio peripheral
devices and are fed to its FPGA prototype. An application specific integrated circuit synthesis and post-layout simulation of the
proposed detector have been performed using 65 nm-CMOS technology and it occupies 0.32 mm2 of core area and consumes
total power of 18.5 mW at 100 MHz clock frequency. In comparison with the state-of-the-art works, the proposed detector
requires 34 and 93% lesser hardware resource and memory, respectively
Fig. 2 System-level overview for conventional architecture of TCD for 3.1 Reconfigurable architecture
spectrum sensing integrated with analogue RF section and antenna
The contemporary LTE-A wireless network uses OFDM for
N−1 communication where the PUs are allocated bandwidth based on
1
N n∑
^
D = E Y t2(α) = yt2[n, α] . (11) their demand [24]. Thereby, such allocation decides the number of
=0 subcarriers required to generate an OFDM frame for transmission.
In the recent trend, diverse PUs of the wireless network have
The expression for test statistic computation using the above different bandwidth requirements [2]. Thereby, various sizes of
equations can be formulated as [21] OFDM frames which are constructed with varying subcarriers are
used for communication. For such wireless network, the number of
^ Xt(α)2 × D + Y t(α)2 × A − 2 × Xt(α) × Y t(α) × B subcarriers allocated to construct different sizes of OFDM frames
Tς = N . (12) varies in the powers of 2, ranging from 64 to 1024. Therefore, the
A × D − B2 transceiver design for SU with CR capability becomes challenging
in such environment. The rationale behind this is the CTCD from
Under the null hypothesis, test statistic is χ22 distributed and Fig. 2, which is responsible for the cognitive activity of SU defers
therefore, the decision threshold is obtained from the inverse of to detect the PUs with varying OFDM frames. Hence, it is
cumulative distribution function of χ22 distribution. Consecutively, necessary to conceive TCD architecture with the capability of
^
the value of decision threshold T δ used for the test is given by detecting OFDM frames with variable number of subcarriers.
Therefore, we propose an autocorrelator architecture to incorporate
^ reconfigurability in TCD for detecting OFDM frames constructed
T δ = F −1
χ (2/2) 1 − Pη (13) using 64, 128, 256, 512 and 1024 subcarriers. These values are
compliant to LTE-A 4G wireless network [2, 24]. Additionally, the
which is pre-computed and fixed. In order to fix this decision suggested autocorrelator has been designed to switch among
threshold, it is necessary to have a trade-off between the different configurations on the fly. For the reconfigurable
probability of detection Pd and the probability of false alarm Pη . capability, lag factor τ of the autocorrelator must be configurable at
Thereby, an appropriate value of Pη is selected for the specific ^
runtime. Conjugate autocorrelation λ(τ) of the received signal x[n]
^
application. Eventually, the test statistic result T ς is compared with is given by
^
the threshold value T δ to get the final result from TCD. ^
λ(τ) = x[n] × x∗[n − τ] (15)
2.3 System overview and TCD architecture
from (2). It is to be noted that the autocorrelator has been designed
^
Overall system that encompasses the TCD has been depicted in to compute the value of λ(τ). To ensure the periodicity in
Fig. 2 where antenna at the receiver transforms electromagnetic autocorrelation of OFDM signal, lag factor τ must take the values
waves to RF signal. Band select filter passes the desired frequency of ±ND. However, we considered only the positive value of τ
signals and such feeble signal is boosted using low noise amplifier,
because the autocorrelation signal has only single cyclic frequency
as shown in Fig. 2. First mixer M1 then translates RF to
α = 1/Ns. Hence, the value of τ has been fixed at τ = ND for a
intermediate frequency (IF) signal using fixed-frequency local-
oscillator O1. Subsequently, IF signal is low-pass filtered to reject particular OFDM frame, as shown schematically in Fig. 1.
Therefore, the proposed architecture of reconfigurable
high-frequency components and passed along the second stage of
autocorrelator has been presented in Fig. 3 where the ‘delay
mixers MI and MQ. Tunable and channel-select frequency
memory’ module from the CTCD has been replaced with the series
synthesiser O2 translates low-pass IF signals to baseband signals of 1024 registers, each of 20 bit, and a multiplexer. Outputs of
and the alternate channel energy is removed by channel select 64th, 128th, 256th, 512th and 1024th registers are tapped and fed
filter. Thereafter, the inphase (I) and quadrature (Q) signal to multiplexer. Depending on the select line sel value, lag factor of
components are sampled and converted into digital signal using 10- this autocorrelator can be configured for τ = 64, 128, 256, 512 and
bit analogue-to-digital converters (ADCs), as shown in Fig. 2. 1024 for different OFDM frames compliant to the LTE-A wireless
These 10-bit I and Q samples are fed to the CTCD for spectrum network. Then the delayed signal x[n−τ ] is conjugated and
sensing [11]. Specifically, such input samples are fed to the
autocorrelation module which computes the conjugate
ΘT = ϝ(ϑin, N D) (17)
N−1
1
N n∑
M × Oacc = M × x[n] × y[n] , (19)
=0
N−1
1
N n∑
Omac = x[n], (20)
=0
N−1
1
N n∑
M × Omac = M × x[n] . (21)
=0
Table 1 Comparison of CORDIC architecture accuracies with the proposed overflow/underflow protection technique and
without it for the bit width of N = 10 bit
Θ sin Θ cos Θ
F pv Fxc Fxp Ecv, % E pv, % F pv Fxc Fxp Ecv, % E pv, %
76.320 0.9716 −0.0586 0.9707 106 0.09 0.2365 0.2461 0.2383 3.9 0.7
108.140 0.9503 −0.0996 0.9453 110 0.57 −0.3113 −0.3457 −0.386 11 0.8
120.360 0.8629 −0.2754 0.8632 113 0.03 −0.5054 −0.513 −0.5059 1.5 0.05
172.730 0.1265 0.082 0.1211 35 4.27 −0.9919 −0.0176 −0.9922 101 0.03
190.950 −0.1899 −0.1699 −0.1894 10 0.26 −0.9818 0.0371 −0.9804 103 0.14
Table 2 Comparison of hardware consumed by the proposed and conventional CORDIC architectures when implemented on
FPGA platform
Conventional architecture Proposed architecture Hardware loss, %
registers 700 752 6.91
logic 2060 2851 27.74
memory 2092 bits 2851 bits 26.62
Table 3 Comparison table of magnitude ratios of the proposed MAC and accumulator outputs for different SNR values such
that Oacci and Omaci ∀i = {64, 128, 256, 512, 1024} are their values for different OFDM subcarriers compliant to LTE-A wireless
network
SNR ND = 128 ND = 256 ND = 512 ND = 1024
φ1 ω1 φ2 ω2 φ3 ω3 φ4 ω4
0 dB 4.04 1.97 15.82 3.97 81.40 9.58 356.94 19.05
−6 dB 4.11 1.99 18.69 4.37 92.09 9.78 418.16 21.88
−11 dB 4.15 2.23 17.94 5.88 85.53 9.49 376.28 21.27
−14 dB 3.92 1.81 17.50 4.56 84.29 9.56 380.08 20.32
−20 dB 4.19 2.20 16.99 4.11 86.43 8.64 409.94 20.39
−26 dB 4.02 1.93 16.93 3.62 75.34 8.99 361.16 19.96
ω1 = Oacc64 /Oacc128; φ1 = Omac64 /Omac128.
ω2 = Oacc64 /Oacc256; φ2 = Omac64 /Omac256.
ω3 = Oacc64 /Oacc512; φ3 = Omac64 /Omac512.
ω4 = Oacc64 /Oacc1024; φ4 = Omac64 /Omac1024.
indicates the absence of PU. Note that the clock signal in Fig. 8b is
blurred due to its congested waveform at 1 MHz. Closer view of Fig. 8 Hardware (FPGA) implementation and testing of the proposed TCD
logic analyser output waveform is presented in Fig. 8c where the (a) FPGA test setup for the proposed TDCD detector, (b) Output waveform at 1 MHz,
DetOut signal generates logic high after the detection period indicating the absence of PU after the detection period of 5.12 ms, (c) Output
completes indicating the presence of PU. waveform indicating the presence of PU under same timing considerations, (d) Test
setup to capture real-world signals using the USRPs as OFDM transmitter and receiver
5.3 ASIC implementation of wireless network
The suggested TCD architecture has been simulated for functional critical inputs for accurate power estimation. Functionally verified
verification with appropriate test vectors using Synopsys-VCS design has been synthesised with the standard cell libraries of
engine. Simultaneously, it generates switching-activity interface- UMC 65 nm-CMOS (G-9LT-Logic Mixed-Mode 65N-SP-Low-K)
format (SAIF) file which is a record of transitions that every net in technology node, along with various real-world timing constraints,
the design undergoes during the simulation. Such file is one of the using the Synopsys-DC tool. This process has been carried out with
a supply voltage of 0.72 V under worst-case timing corner and the
synthesis report indicates that the proposed design consumes
Table 4 Scale factors or shift values (in the proposed MAC 165263 standard cells with 62 logic levels. Thereafter, the
and accumulator architectures) for different ND values of generated gate-level netlist is static timing analysed using
OFDM frame compliant to the LTE-A wireless network Synopsys-PT tool as well as functionally verified via post-
ND MAC scale factor Accumulator scale factor synthesis simulation in Synopsys-VCS environment.
64 1 (0-bit shift) 1 (0-bit shift) For the physical design process, netlist along with the six-metal
128 4 (2-bit shift) 2 (1-bit shift) layer library-exchange-format and timing library file is imported
using Cadence-SoC Encounter tool. Additional files for input/
256 16 (4-bit shift) 4 (2-bit shift)
output (I/O) pad integration and their orientation around the core
512 64 (6-bit shift) 8 (3-bit shift) have been imported along with aforementioned files. Thereafter,
1024 256 (8-bit shift) 16 (4-bit shift) core as well as die area of the chip has been floor planned to
accommodate the imported cells. Power rings and strips are added
around and across the core, respectively, for supply and ground
Table 5 Comparison of the hardware resource and power connections of cells and pads. These imported cells are placed on
consumed by the proposed TCD with the reported work [11], the floor-planned core area where they are signal and power routed
when implemented using FPGA platform as well as clock tree synthesised. Subsequently, post-route static-
Proposed CTCD [11] FCD [11] AFD [11] timing analysis is performed iteratively until the design is free from
registers 993 1291 8802 1060 violations. Then, we have added core as well as I/O leaf cells and
logic 5498 8593 16,591 6779 finally performed static timing analysis (STA) for the timing
closure. It indicates that the proposed design can operate with a
memory, bits 20,480 327,680 405,674 327,680
maximum clock frequency up to 217.43 MHz. The physical
power, mW 20.26 34.99 61.29 35.29 verifications like design rule check, layout versus schematic as