Beruflich Dokumente
Kultur Dokumente
C. N.Marimuthu1, P. Thangaraj2,
1
HOD – E C E, Maharaja Engineering College, Avinashi, Anna University,
2
Professor & Head, CT Dept, Kongu Engineering College, Perundurai, Anna University,
Tamil nadu , India
muthu_me2005@yahoo.co.in1 & ctpt@kongu.ac.in2
Abstract
There are different entities that one would like to Dynamic power dissipation which is the major part of
optimize when designing a VLSI circuit. These entities total power dissipation is due to the charging and
can often not be optimized simultaneously, only discharging capacitance in the circuit. The golden
improve one entity at the expense of one or more others formula for calculation of dynamic power dissipation is
The design of an efficient integrated circuit in terms of Pd = CLV2f. Power reduction can be achieved by various
power, area, and speed simultaneously, has become a manners. They are reduction of output Capacitance CL,
very challenging problem. Power dissipation is reduction of power supply voltage V, reduction of
recognized as a critical parameter in modern VLSI design switching activity and clock frequency f.
field. In Very Large Scale Integration, Low power VLSI
design is necessary to meet MOORE’S law and to Fast multipliers are essential parts of digital signal
produce consumer electronics with more back up and less processing systems. The speed of multiply operation is of
weight. Multiplication occurs frequently in finite impulse great importance in digital signal processing as well as in
response filters, fast Fourier transforms, discrete cosine the general purpose processors today, especially since the
transforms, convolution, and other important DSP and media processing took off. In the past multiplication was
multimedia kernels. The objective of a good multiplier is generally implemented via a sequence of addition,
to provide a physically compact, good speed and low subtraction, and shift operations. Multiplication can be
power consuming chip. To save significant power considered as a series of repeated additions. The number
consumption of a VLSI design, it is a good direction to to be added is the multiplicand, the number of times that
reduce its dynamic power that is the major part of total it is added is the multiplier, and the result is the product.
power dissipation. In this paper, we propose a high speed Each step of addition generates a partial product. In most
low-power multiplier adopting the new SPST computers, the operand usually contains the same number
implementing approach. This multiplier is designed by of bits. When the operands are interpreted as integers,
equipping the Spurious Power Suppression Technique the product is generally twice the length of operands in
(SPST) on a modified Booth encoder which is controlled order to preserve the information content. This repeated
by a detection unit using an AND gate. The modified addition method that is suggested by the arithmetic
booth encoder will reduce the number of partial products definition is slow that it is almost always replaced by an
generated by a factor of 2. The SPST adder will avoid the algorithm that makes use of positional representation. It
unwanted addition and thus minimize the switching is possible to decompose multipliers into two parts. The
power dissipation. The proposed high speed low power first part is dedicated to the generation of partial products,
multiplier can attain 30% speed improvement and 22% and the second one collects and adds them.
power reduction in the modified booth encoder when
compared with the conventional array multipliers. The basic multiplication principle is two fold i.e,
evaluation of partial products and accumulation of the
Keywords: array multiplier, booth encoder, low power, shifted partial products. It is performed by the successive
spurious power suppression technique additions of the columns of the shifted partial product
matrix. The ‘multiplier’ is successfully shifted and gates
the appropriate bit of the ‘multiplicand’. The delayed,
1. Introduction gated instance of the multiplicand must all be in the same
Power dissipation is recognized as a critical parameter in column of the shifted partial product matrix. They are
modern VLSI design field. To satisfy MOORE’S law and then added to form the product bit for the particular form.
to produce consumer electronics goods with more backup Multiplication is therefore a multi operand operation. To
and less weight, low power VLSI design is necessary. extend the multiplication to both signed and unsigned
31
ICGST-PDCS, Volume 8, Issue 1, December 2008
numbers, a convenient number system would be the to the incoming partial product, PPi to generate the
representation of numbers in two’s complement format. outgoing partial product PP(i+1), if Yi =1. If Yi=0, PPi is
passed vertically downward unchanged. The worst case
The remainder of this paper is arranged as follows signal propagation delay path is from the upper right
section (2) briefly reviews the techniques for corner of the array to the high order product bit output at
implementing multiplier, section (3) discuss the spurious the bottom left corner of the array. Assuming that there
power suppression technique, section (4) discuss the are two gate delays from the inputs to the outputs of a full
proposed spurious power suppression technique, section adder block. Since the array multiplier is having low
(5) introduces the modified booth encoder, section (6) speed and consumes more power many low power
explains the proposed low power high performance multiplier design techniques are available now. The
multiplier, section (7) shows the simulation results and design techniques such as Glitching power minimization
discussion. Finally a conclusion is given in the section (8). by selective gate freezing[2], Fast power efficient circuit
lock switch-off schemes[3], power-aware scalable
2. Techniques for implementing multipliers pipelined Booth multiplier[4], partially guarded
Binary multiplication of positive operands can be computation (PGC) [5], High performance low power left
implemented in a combinational two dimensional logic to right array multiplier design[6] are existing works that
array [1]. A 4X4 array multiplier is shown in figure 1. reduce the dynamic power consumption by minimizing
The functions of M0, M1, M2, and M4 are also shown in the switched capacitance. Glitching power minimization
figure 1. X3X2X1X0 is the 4 bit multiplicand and by selective gate freezing replaces some existing gates
Y3Y2Y1Y0 is the 4 bit multiplier. The main component in with functionally equivalent ones that can be frozen by
each cell is a full adder. The AND gate in each cell asserting a control signal. It can only achieve savings of
determines whether a multiplicand bit, Xj is added to the 6.3% in total power dissipation since it operates in the
incoming partial product bit based on the value of the layout environment which is tightly restricted. The Fast
multiplier bit Yi. Each row adds the multiplicand power efficient circuit block switch-off scheme proposes
(appropriately shifted) a double switch circuit block switch scheme capable of
32
ICGST-PDCS, Volume 8, Issue 1, December 2008
reducing power dissipation during down time by technique on multipliers for high speed and low power
shortening the settling time after reactivation. The purposes is present.
drawbacks of this scheme are the necessity for two 3. Spurious power suppression technique
independent virtual rails and the necessity for two The former SPST has been discussed in [7] and [8].
additional transistors for switching each cell. A power- figure 2 shows the five cases of a 16-bit addition in
aware scalable pipelined Booth multiplier design presents which the spurious switching activities occur. The 1st
a multiplier using Dynamic Range Detection (DRD) unit. case illustrates a transient state in which the spurious
is used to select the input operand with a smaller transitions of carry signals occur in the MSP though the
effective dynamic range to yield the Booth codes. There final result of the MSP are unchanged. The 2nd and the
are three separate Wallace trees for the 4X4, 8X8, and 3rd cases describe the situations of one negative operand
16X16 multiplications. This will certainly induce area adding another positive operand without and with carry
and capacitance penalties. Under a 0.13µm CMOS from LSP, respectively. Moreover, the 4th and the 5th
technology, this design can obtain a 20% power cases respectively demonstrate the addition of two
reduction over the conventional multipliers. But there is a negative operands without and with carry-in from LSP.
44% n area overhead. Partially guarded computation In those cases, the results of the MSP are predictable
(PGC) divides the arithmetic units, e.g., adders, and Therefore the computations in the MSP are useless and
multipliers, into two parts, and turns off the unused part can be neglected. The data are separated into the Most
to minimize the power consumption. The reported results Significant Part (MSP) and the Least Significant Part
show that the PGC can reduce power consumption by (LSP).To know whether the MSP affects the computation
10% to 40% in an array multiplier but with 30% to 36% results or not. We need a detection logic unit to detect the
area overhead. In all the above techniques there is no effective ranges of the inputs. The Boolean logical
improvement in speed. So in this paper by applying an equations shown below express the behavioral principles
advanced version of former spurious power suppression of the detection logic unit in the MSP circuits of the
SPST-based adder/subtractor:
33
ICGST-PDCS, Volume 8, Issue 1, December 2008
AMSP = A[15:8]; BMSP = B[15:8] ; (1) which can determine whether the input data of MSP
should be latched or not. Moreover, we add three 1-bit
Aand = A[15] A[14] A[8]; (2)
to control the assertion of the close, sign, and Carr-ctrl
Band = B[15] B[14] B[8]; (3) signals in order to further decrease the glitch signals
occurred in the cascaded circuits which are usually
Anor = ; (4) adopted in VLSI architectures designed for video coding
Bnor = ; (5)
Close = ; (6)
34
ICGST-PDCS, Volume 8, Issue 1, December 2008
denotes the earliest required time of all the inputs. This seriously concerned, this implementing approach enables
will filter out the glitch signals as well as to keep the an extremely high flexibility on adjusting the data
computation results correct. The restriction that Φ must asserting time of the SPST equipped multipliers.
be greater than ψ to guarantee the registers from latching Therefore the proposed advanced SPST can benefit
the wrong values of control usually decreases the overall multipliers on both high speed and low power features.
speed of the applied designs. This issue should be noticed
in high-end applications which demands both high speed 5. Modified Booth Encoder
and low power requirements. To solve this problem we In order to achieve high-speed multiplication,
adopt the other implementing approach of control signal multiplication algorithms using parallel counters, such as
assertion circuit using AND gate. the modified Booth algorithm has been proposed, and
some multipliers based on the algorithms have been
implemented for practical use. This type of multiplier
operates much faster than an array multiplier for longer
operands because its computation time is proportional to
the logarithm of the word length of operands.
35
ICGST-PDCS, Volume 8, Issue 1, December 2008
Table 1
Block Re - coded digit Operation on X
000 0 0X
001 +1 +1 X
010 +1 +1 X
011 +2 +2 X
100 -2 -2 X
101 -1 -1 X
110 -1 -1 X
figure 9. SPST equipped modified Booth encoder
111 0 0X
figure 8. Illustration of multiplication using modified Booth encoding figure 11. Proposed high performance low power equipped multiplier
36
ICGST-PDCS, Volume 8, Issue 1, December 2008
9. References
[1] G. Lakshmi Narayanan and B. Venkataramani
“Optimization Techniques for FPGA-Based Wave
Pipelined DSP Blocks” IEEE Trans. Very Large
Scale Integr. (VLSI) Syst., vol.13. no 7 . pp 783-
792, July 2005
[2] L.Benini, G.D.Micheli, A.Macii, E.Macii, M. Poncino,
and R.Scarsi, “Glitching power minimization by
selective gate freezing,” IEEE Trans. Very Large
figure 12 Simulation results for array multiplier Scale Integr. (VLSI) Syst., vol. 8, no. 3, pp.287–297,
June 2000.
[3] S.Henzler, G.Georgakos, J. Berthold, and D. Schmitt-
Landsiedel, “Fast power-efficient circuit-block
switch-off scheme,” Electron. Lett.,vol. 40, no. 2, pp.
103–104, Jan. 2004.
[4] H. Lee, “A power-aware scalable pipelined Booth
multiplier,” in Proc.IEEE Int. SOC Conf., 2004, pp.
123–126.
[5] J. Choi, J. Jeon, and K. Choi, “Power minimization of
functional units by partially guarded computation,” in
Proc. IEEE Int. Symp. Low PowerElectron. Des.,
2000, pp. 131–136.
[6] Z. Huang and M. D. Ercegovac, “High-performance
low-power left-toright array multiplier design,” IEEE
figure 13 Simulation results for proposed multiplier
Trans. Comput., vol. 54, no. 3, pp. 272–283, Mar.
The comparison of synthesis report for two multipliers is 2005.
given in Table 2. [7] K. H. Chen, K. C. Chao, J. I. Guo, J. S. Wang, and Y.
Table 2 S. Chu, “An efficient spurious power suppression
Multiplier type Array multiplier Proposed technique (SPST) and its applications on MPEG-4
AVC/H.264 transform coding design,” in Proc. IEEE
multiplier Int. Symp. Low Power Electron. Des., 2005, pp. 155–
Vendor Xilinx Xilinx 160.
[8] K. H. Chen, Y. M. Chen, and Y. S. Chu, “A versatile
Device and Spartan 2 Spartan 2 multimedia functional unit design using the spurious
power suppression technique,” in Proc. IEEE Asian
family
Solid-State Circuits Conf., 2006, pp. 111–114.
Estimate delay 21.068 ns 14.886ns [9] Zhijun Huang,”High level optimization techniques for
low power multiplier design” University of California,
Power 44mW 35mW
los angels, 2003.
dissipation
37
ICGST-PDCS, Volume 8, Issue 1, December 2008
38