Beruflich Dokumente
Kultur Dokumente
Abstract
This paper proposes an efficient architecture for 2D DWT. The
proposed architecture includes a transform module, a RAM
module and a multiplexer. In transform module, polyphase
decomposition and coefficient folding technique is applied to the
decimation filters of stages 1 and 2 respectively. The advantages
of the proposed architecture are the 100% hardware utilization,
fast computing time, regular data flow and low complexity.
Because of the regular structure, the proposed architecture can be
easily be scaled with the filter length and 2D DWT level. VLSI
architecture for the 2-D DWT is implemented using FPGA using
Verilog HDL.
Keywords:
Discrete
Wavelet
architectures, image compression
Transform,
VLSI
1. Introduction
(1)
i1 =0 i2 =0
K1 K1
1063-8210 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
(2)
K1 K1
(3)
RAM
N/2*N/2
i1 =0 i2 =0
LLLL,LL
M
U
K 1 K 1
J1
TRANSFORM
MODULE
LLLH,LH
LLHL,HL
(2n1 i1 )(2n2 i2 )
(4)
LLHH,HH
INPUT
IMAGE
i1 =0 i2 =0
ROM
DWT
COEFFICIENTS
Transform Module
Stage 1
Stage 2
Polyphase
Decomposition
Technique
Co-Efficient
Folding
Technique
3. Proposed System
The proposed system focuses to implement an efficient
architecture for the two-dimensional discrete wavelet
transform (2-D DWT). The advantages of the proposed
architecture are 100% hardware utilization, fast computing
time than that of parallel filter architecture, regular data
flow, and low control complexity, making this architecture
suitable for JPEG-2000.
1063-8210 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
H(Z)
y (m)
CLK SW IN
0
0 x(0)
1
1 x(1)
2
0 x(2)
ODD
a1x(0)
EVEN
OUT
a0 x(1)
a0 x(1)+a1x(0)
a1x(2)+a3x(0)
a0x(3)+a1x(2)+
3
4
1
0
x(3)
x(4)
5
6
1
0
x(5)
x(6)
7
8
1
0
x(7)
x(8)
x(9)
a0x(3)+a2x(1) a2x(1)+a3x(0)
a1x(4)+a3x(2)
a0x(5)+a1x(4)+
a0x(5)+a2x(3) a2x(3)+a3x(2)
a1x(6)+a3x(4)
a0x(7)+a1x(6)+
a0x(7)+a2x(5) a2x(5)+a3x(4)
a1x(8)+a3x(6)
a0x(9)+a1x(8)+
a0x(9)+a2x(7) a2x(7)+a3x(6)
1063-8210 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Methods
Stage 1 Stage2
Original Design
T/2
T/2
A/2
A/2
T/2
A/2
A/2
T/2
Stage 1
A
T
a
t
a
t/2
a/2
t
a
t/2
a/2
t
Stage 2
A
T
2a
t/2
2a
t/4
a
t/2
a
t/2
2a
t/4
Total
Area
3a
3a
3a/2
2a
5a/2
Total Stage2
T
Idle
t
at
t/2
at/2
t
at/2
t/2
0
t
3at/2
A-Area: T-Time
Table 3. Data flow of decimation filter employing coefficient
folding technique
CLK SW IN
PE1
PE0
0
0 X(0) a3X(0)
a1X(0)
a2X(1)+
1
1 X(1) a3X(0) a0X(1)+ a1X(0)
2
OUT
a0X(1)+a1X(0)
X(2) a3X(2)
a1X(2)+ a2X(1)
X(4) a3X(4)
X(6) a3X(6)
X(8) a3X(8)
+a3X(0)
Figure 6. Schematic diagram of coefficient folding technique
a1X(4)+ a2X(3)
+a3X(2)
a1X(6)+ a2X(5)
+a3X(4)
a1X(6)+ a2X(5)
+a3X(4)
1063-8210 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
References
Hardware Implementation
Device
Utilization
Summary
IOBS
LUT
LOAD
Stage1
Polyphase
Decomposition
13%
2%
20%
TIMING REPORT
2.926ns
2.926ns
MIN PERIOD
DELAY
OFFSET
MEMORY USAGE
150752KB
TOTAL
Stage 2
Coefficient
Folding
20%
1%
28%
2.725ns
2.725ns
153376KB
5. Conclusion
Many 2- D DWT architectures have been proposed to meet
the requirements of real time processing. However, the
hardware utilization of these architectures needs to be
further improved. Therefore, in this paper an efficient
architecture for the 2-D DWT. The proposed architecture
1063-8210 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.