Beruflich Dokumente
Kultur Dokumente
1.INTRODUCTION
Generally, direct implementation of a K-tap FIR filter requires K multiply-and-accumulate
(MAC) blocks, basically, which are complex in logic and resource usage. To resolve this issue, we
first present DA, which is a multiplier-less architecture. An alternative to computing the
multiplication is to decompose the MAC operations into a series of lookup table (LUT) accesses
and summations. This approach is termed distributed arithmetic (DA), a bit serial method of
computing the inner product of two vectors with a fixed number of cycles, as in ref[1].
Distributed Arithmetic structure is used to increase the resource usage while pipeline
structure is also used to increase the system speed. In addition, the divided LUT method is also used
to decrease the required memory units. However, according to Distributed Arithmetic, we can make
a Look-Up-Table (LUT) to conserve the MAC values and callout the values according to the input
data if necessary. Therefore, LUT can be created to take the place of MAC units so as to save the
hardware resources, as in ref[2].
This project provide the principles of Distributed Arithmetic, and introduce it into the FIR
filters design, which save considerable MAC blocks to decrease the circuit scale, meanwhile,
divided LUT method is used to decrease the required memory units and pipeline structure is also
used to increase the system speed.
Where, y-
Output respons
𝐴k -Constant filter coefficients
𝑥k - Input data
Let xk N-bits scaled two’s complement number and it can be represented as,
Where 𝑏k0 is the sign bit and by substituting (2) in (1), get the expression as,
Rearranging the summation based on power terms and then grouping the sum of the products, the
final reformulation is given by
first sequence of bits starting from LSB , started accessing the LUT, at
the same time next sequence of bits from , also started accessing LUT,
here the LUT’s are called in parallel and the corresponding output from both the LUT’s are summed
and shifted in order to get the partial product computation. Finally these values will get added with
another set of sequence of bits i.e. (from and ) and the output will be stored in the accumulator,
which is not shown here.
Instead of waiting for the completion of the first iteration all the corresponding next
sequences will get processed at the same time at the cost of improved throughput rate. In order to
improve the performance further the conventional shift /add accumulation method are replaced by
a conditional signed carry save accumulator unit as shown in the below figure,
REFERENCE
[1] G.Pavithra, Ms.R.Kousalya “Design and Verification of High Performance VLSI
Architecture for Reconfigurable FIR Filter Using Pipelined Distributed Arithmetic,”
International Journal of Pure and Applied Mathematics Vol. 119, ISSN:1311-8080.
[2] M. Keerthi, Vasujadevi Midasala(2012) “FPGA Implementation of Distributed Arithmetic
for FIR Filter,” International Journal of Engineering & Technology (IJERT) Vol. 1 Issue 9,
ISSN:2278-0181.
[3] A.Antonion, Digital Filters: Analysis, Design, and Applications, McGraw-Hill, New York,
1993.