Beruflich Dokumente
Kultur Dokumente
DOI: 10.2478/s11772−013−0071−0
In the multi−view video coding, both temporal and inter−view redundancies can be exploited by using standard block−based
motion estimation (BBME) technique. In this paper, an extensive review of BBME algorithms proposed within the last three
decades is presented. Algorithms are divided into five categories: 1) based on the search position number reduction;
2) multiresolution; 3) based on the simplification of matching criterion; 4) fast full search; 5) computation−aware. Algo−
rithms are compared in terms of their efficiency and computational complexity.
ged in the cross (+) pattern around the centre. Initial step the eight SPs surrounding the window centre, the search in
size is d/4, where d is the maximum displacement. The step the second step will be performed only for eight neighbour−
size is halved only when the minimum distortion is at the ing points of the minimum and then stopped. Otherwise, the
centre of the pattern or at the boundary of the search win− search goes exactly as for TSS.
dow. When the step size is equal to 1 – eight points around Four step search (4SS) [10] uses more compact search
the centre are checked. pattern than TSS with the initial step size set to d/4 [(Fig. 2
In Ref. 7, Koga et al. proposed simple and regular three (a)]. The step size is maintained constant through steps 1 to
step search (TSS) which utilizes the square search pattern 3. Similarly to NTSS, the algorithm incorporates a half−way
with eight SPs around the centre at each step [(Fig. 1 (b)]. stop technique. If in the second step, the minimum BDM
The initial step size is d/2 and is halved in the subsequent point is centre located, the search goes to the final step
steps. When d equals 7, the number of steps is 3 and the where step size is reduced to 1.
number of used SPs is 25. For larger SRs, TSS can be easily Less known TSS−based schemes include: fast three step se−
extended to n steps with the number of SPs equal to arch (FTSS) [11], adaptive centre non−linear three step search
1 + 8[log2(d + 1)]. (ACNTSS) [12] and efficient three step search (E3SS) [13].
Several modifications of the original TSS scheme have One−at−a−time search (OTS) algorithm [14] performs
been proposed. In Ref. 8, the set of thresholds is introduced search first in the horizontal direction using three adjacent
to determine the motion activity of a block and decide SPs [(Fig. 2(b)]. When the minimum BDM value is found at
whether to perform the next step of TSS or to finish. the central point, the search direction changes to vertical and
New three step search (NTSS) [9], exploits the cen− the search is continued up to the moment when the minimum
tre−biased characteristic of the MV distribution in most BDM value is located at the centre of the search pattern.
real−world video sequences. Thus, eight extra SPs around Orthogonal search algorithm (OSA) [15] starts with
the centre are checked in the first step [(Fig. 1(c)]. Addition− three horizontally arranged SPs: one at the centre and two in
ally, the algorithm incorporates a half−way stop technique in d/2 distance [(Fig 2(c)]. Next, two vertically arranged SPs
order to fast identify and estimate the motion of stationary are checked around the minimum BDM point, the step size
and quasi−stationary blocks. If the minimum BDM value in is halved and the procedure is repeated until the step size
the first step occurs at the search window centre, the search equals 1. For d = 7, the required number of SPs equals 13;
stops. If the minimum BDM point in the first step is one of for larger displacements: 1 + 4[log2(d + 1)].
Fig. 2. Example search path of: (a) 4SS, (b) OTS, and (c) OSA algorithm.
Fig. 3. Example search path of: (a) CSA, (b) DS, and (c) CZS algorithm.
Cross search algorithm (CSA) [16] [(Fig. 3(a)], first Several more complex DS−based schemes have been
checks the zero motion point (ZMP) and compares the proposed. Cross−diamond search (CDS) [22] incorporates
BDM value with a predefined threshold to decide whether the cross search pattern at the first step to utilize cross−cen−
the block is stationary or not. The search continues only in tre−biased MVs distribution observed in many real−world
the second case. In the subsequent steps, four SPs arranged video sequences. Small−cross−diamond search (SCDS) [23]
in a (×) cross shape are checked and the step size is halved and kite−cross−diamond search (KCDS) [24] are designed
starting form d/2. When the step size reaches the length of mainly for the small−motion activity video sequences such
one, additional step is carried out with a (+) cross search pat− as videoconferencing and utilize the small cross and kite
tern if the minimum BDM point of the previous step is search patterns combined with a half−way stop technique to
either the centre, upper left or lower right SP. Otherwise, (×) save more computation.
cross search pattern is used. Number of required SPs (if the Adaptive motion tracking block matching described in
stop condition is not met) is 5 + 4[log 2(d + 1)]. Ref. 25 utilizes correlation between MVs of adjacent MBs.
Parallel hierarchical one−dimensional search (PHODS) MVs of four neighbours are used to compute the search cen−
[17], is one of the first ME algorithms designed specifically tre. First, the difference between each of neighbouring MVs
for hardware implementation. The search along x and y axis and their mean is computed, and if the maximum value is
is carried out independently and pipeline interleaving the x smaller than a predefined threshold, the minimum differ−
and y axis searches is possible when the hardware realiza− ence between neighbouring MV and the mean is taken as
the search centre. Otherwise, ZMP is used as the search cen−
tion is taken into consideration. The algorithm uses the
tre. Then any fast block−matching algorithm, such as TSS or
same search pattern as OSA and the distance between SPs is
BBGDS, can be applied.
halved at each step as well. As the final solution, MV com−
Circular zonal search (CZS) [26] [Fig. 3(c)] and its later
posed form displacements found along x and y axis is taken.
modifications: modified circular zonal search (MCZS) [27]
Dynamic search−window adjustment and interlaced
and advanced diamond zonal search (ADZS) [28, 29]
search (DSWA/IS) [18], uses alternatively (×) and (+) cross
exploit spatiotemporal correlations among MVs. To deter−
shape search patterns. The step size at each step is not fixed
mine the search centre, the set of predictors is used includ−
but determined by the parameter which describes the speed ing MVs of left, top and top right neighbours and MV of the
of convergence towards the global minimum. The parame− co−located MB from the previous frame. Other predictors,
ter is calculated as the difference between the smallest and like neighbouring MVs of the co−located MB are also con−
second smallest BDM value found at the present stage of the sidered [29]. The search is performed around the search cen−
process and compared with a predefined threshold. At the tre using either circular− [26,27] or diamond−shape zones
last step, 3×3 SPs are checked around the up−to−date mini− [28,29]. Fixed or adaptive threshold is used to determine
mum BDM point. whether the up−to−date minimum BDM value can be consi−
Block−based gradient descent search (BBGDS) [19], dered as optimal.
uses 3×3 SPs square search pattern at each step. Number of Hilbert scanning search algorithm (HSSA) [30] transfers
search steps is unrestricted and the search stops when the the 2D space of the search window into a 1D space of
minimum BDM point is located at the centre. Hilbert curve and employs the binary search to locate the
Diamond search (DS) [20,21], employs two search pat− global optimal MV. Both one−candidate and multiple−candi−
terns: small diamond search pattern (SDSP) and large dia− date schemes can be used.
mond search pattern (LDSP) composed of five and nine 1D gradient descent search (1DGDS) [31] first checks
SPs, respectively [(Fig. 3(b)]. First, the search is performed ZMP and MVs of four neighbouring MBs to determine the
with LDSP until the minimum BDM point is found at the search direction and the step size. If the distance between
centre of LDSP. Then, at the last step, SDSP is applied. the initial SP and ZMP is smaller than 3 – the step size is set
Multi−resolution motion search (MRMCS) [43] algo− In Ref. 48. a block matching is performed on the binary
rithm and its modification [44] use three levels of the hierar− edge maps obtained from the reference and the current frame.
chy with 16:1 and 4:1 sub−sampling at level 2 and 1, respec− If there are too few edge pixels inside a given MB, MV of the
tively. At the coarsest level FS is performed and two best current, MB is interpolated by using neighbouring MVs.
MVs are selected as candidates for the refinement at the In Ref. 49, the convolution kernel is applied to the cur−
level 1 together with the median predictor obtained from rent and the reference frame to obtain their filtered versions.
neighbouring MVs. At the level 1 the search is performed Then, filtered and unfiltered frames are compared and if the
around these three candidates within ±2−pixel SR. The best unfiltered pixel value is larger than the filtered one, value 1
MV form the level 1 is used at the level 0 as the search cen− is assigned to the binary version of the frame. Next, any
tre for search within ±2−pixel SR. The block size is scaled at BBME algorithm, such as FS or TSS, can be applied.
each level. The fast binary pyramid ME (FBPME) algorithm [50]
In the hardware−oriented parallel multi−resolution ME combines the hierarchical approach to ME with binary
(PMRME) algorithm [45] three levels of the hierarchy are block matching. The 4−level hierarchy is used. Levels from
searched in parallel with the 4:1 sub−sampling at level 2 and 4 to 2 are constructed by the binarization of the image
1. Level 2 and 1 are centred at ZMP to make possible an through filtering, sub−sampling, and up−sampling by inter−
effective data reuse. Level 0 is centred at the motion vector polation. Then the difference between original and up−sam−
predictor (MVP) obtained as the median from MVs of spa− pled version of an image is compared with a threshold to get
tially−adjacent MBs: left, top and top right. Two MVs with a binary version. At level 1, the sub−sampled version of the
the lowest SAD from each level are taken for the detailed original frame is used. The search starts from the top level
check. with blocks of different sizes (8×8, 8×4, 4×8, and 4×4). The
The hardware−oriented multiresolution ME algorithm best MVs from each block size are propagated to the next
proposed in Ref. 46 uses three levels of hierarchy, as well. level and refined. At the level 2, four vectors from higher
FS algorithm at the level 2 is employed and local FS around levels are refined but only for the 8×8 block size. The best
multiple candidates (four best MVs and MVP) at the level 1 one is projected to level 1 and refined by using 3×3 search
within ±10−pixel horizontal and ±6−pixel vertical SR. At the window and 16×16 block size.
level 0, local FS is performed only around the best MV from To improve the ME accuracy obtained with 1−bit repre−
the previous level within ±14−pixel horizontal and ±8−pixel sentation, Erturk and Erturk proposed 2−bit transform for bi−
vertical SR. nary BBME in Ref. 51. The first and the second bit plane are
obtained through comparison of each pixel value with the lo−
cal mean and the mean plus standard deviation, respectively.
2.3. Algorithms based on simplification of matching The matching criterion is defined by using Boolean XOR and
criterion OR operators. It is possible to make XOR comparisons for
In this category of BBME algorithms, two sub−classes can each bit plane of the current and reference frame in parallel to
be distinguished. In the first one, the simplification relies on speed up the computation process.
the pixel bit precision reduction. In the second one, only In Ref. 52, the reduced bit mean absolute difference
a selected part of current and candidate block pixels is used (RBMAD) is proposed as BDM. With the bit precision
for the BDM calculation. reduction from one to seven bits, the results are close to
those obtained with full−resolution MAD. The significant
hardware resource reduction and the speed−up ratio (57%
2.3.1. Pixel bit precision reduction algorithms and 34% with 4−bit RBMAD, respectively) are reported.
In Ref. 53, this idea is extended to adaptive bit−reduced
Since the block matching process performed on the low−bit mean absolute difference (ABRMAD), where the number of
resolution images requires less computation and hardware bits for MAD calculation is selected adaptively according to
resources, the original eight−bit resolution of pixels can be the position of the first non−zero bit of the pixel with the
reduced by truncating a certain number of the least signifi− largest intensity in the current block. The performance of
cant bits or transforming the pixels into a 1−bit representa− ABRMAD is superior to RBMAD, especially for a low−bit
tion. Particularly, binary ME algorithms use an exclusive−or resolution (1 – 3 bits) cases. In Fig. 5, a comparison of pixel
(XOR) Boolean block−matching criterion instead of the bits’ selection for 8−bit MAD, 4−bit RBMAD, and 4−bit
integer criterion. ARBMAD is presented.
In Ref. 47, the binarization of each pixel of SA and the Lee et al. [54] proposed the low−resolution quantization
current MB are performed by comparison with the mean motion estimation (LRQME) algorithm. First, the mean of
pixel value of SA and MB, respectively. Then, FS is per− the current block is subtracted from each pixel value of the
formed over the bit−plane search window to select the best search window and the block itself. The result is adaptively
candidates using an adaptive threshold. Next, full−resolution quantized into a 2−bit code. Then, the search is performed on
BDM is calculated for these best candidates. The algorithm the low−bit−resolution image and a set of candidate MVs is
incorporates the starting point prediction by using MVs of selected to calculate the full 8−bit−resolution BDM, how−
neighbouring MBs and the adaptive SR adjustment. ever, with 4:1 sub−sampling to reduce the computation.
2.3.2. Block sub-sampling algorithms Pixel decimation scheme proposed in Ref. 58 is based
In order to reduce computations associated with the BDM on N−queen lattice which is obtained by placing N queens
calculation, various types of sub−sampling can be applied to on a chessboard. Only pixels at the positions occupied by
the current and reference block. In Ref. 40, every second the queens are selected for the BDM calculation. The
pixel of these blocks is selected in both horizontal and verti− scheme guarantees that at least one pixel is selected from
cal directions. each row, column, and diagonal. In Fig. 7, the fixed 4:1
In Ref. 55, Liu and Zaccarin, instead of the fixed deci− sub−sampling pattern is compared with a 4−queen pattern.
mation pattern, proposed a set of four patterns with 4:1 In Ref. 59, several decimation patterns are proposed based
sub−sampling which are altered during the search process to on the boundary approach (observation that new objects
reduce the influence of sub−sampling on the ME accuracy
(Fig. 6).
Hierarchical partial distortion search (HPDS) algorithm
[56] uses hierarchical sub−sampling with three levels of
hierarchy. At each level, the number of candidate MVs
taken into account decreases and the number of pixels taken
for the BDM calculation increases. These two parameters
can be adjusted to balance the speed and the accuracy of the
ME process.
Globally adaptive pixel−decimation (GAPD) [57]
applies Hilbert scan to convert a 2D block into a 1D Hilbert
sequence. Adaptive thresholding is employed to select pix−
els belonging to the edge and highly structured regions
which are used for the BDM calculation.
enter an MB through its boundary regions) and N−queen selection, the complexity of each sub−block is estimated by
lattices. Genetic algorithms are employed for the optimal using the gradient magnitude. Additionally, to save compu−
pattern selection. tations, the prediction of the starting level is employed,
which utilizes information about the rejection level of the
preceding block. Other modifications of SEA are: extended
2.4. Fast full-search algorithms
SEA (ESEA) [68], extended MSEA (EMSEA) [69] and
BBME algorithms which belong to this category can obtain adaptive MSEA (AdaMSEA) [70].
exactly the same results as FS with a significant reduction of
computational complexity which is achieved by an early 2.5. Computation-aware algorithms
detection of non−optimal solution and by skipping unneces−
sary computations. With an appropriate selection of the Computation−aware BBME algorithms [71] employ adap−
search starting point, most of the unnecessary calculations tively different techniques and algorithms to achieve opti−
can be avoided. mal or suboptimal results in a computation−limited and com−
In the partial distortion elimination (PDE) algorithm putation−variant environment. Previously described BBME
[60], recommended for the H.263 standard, the current algorithms do not take into account the available computa−
BDM value is gradually accumulated and the partial distor− tional resources and stop only when all necessary SPs are
tion is compared with the previously found minimum BDM checked. By this time, the available computational power
value. All candidate blocks for which the partial distortion is may be exhausted, making real−time implementation impos−
already larger than the current minimum distortion are sible. When the search procedure is interrupted to meet
skipped. Successive elimination algorithm (SEA) [61] uti− real−time requirements, obtained results may be far from the
lizes the reversed triangle inequality for the fast rejection of optimum. On the other hand, when computational resources
the most candidate blocks. Particularly, the candidate block are abundant, traditional BBME algorithms are not able to
is rejected if the absolute difference between the sums of utilize the excess efficiently. Computation−aware algo−
samples within compared blocks is larger than the up−to− rithms can fulfil real−time requirements without any signifi−
−date smallest mean absolute difference (MAD). As the ini− cant performance degradation and improve the performance
tial SP, MV of the co−located MB from the previous frame is when additional resources are available.
proposed. Tai et al. [71] proposed the frame− and block−level com−
The fast full search block−matching algorithm described putation distribution scheme. The frame−level computation
in Ref. 62, prior to calculation of the full BDM value, tests distribution bases on the information about computation
three fast matching error measures based on the reduced usage in the previous frames. The block−level computation
representation of compared blocks. The representations are allocation is proportional to the BDM value at the initial SP.
calculated as massive, vertical, and horizontal integral pro− Then, blocks are arranged in the descending order of BDM
jections of a block. If any of the fast matching criteria holds, value and one step of the algorithm (e.g., TSS) is performed
further calculations can be skipped and a candidate block for each block. After first step, blocks are sorted again and
discarded. the procedure is repeated until the resources are exhausted.
The algorithm proposed in Ref. 63 can be considered as To avoid multiple allocations and sorting, Chen et al.
the combination of PDE and SEA algorithms since the SEA [72] proposed one−pass scheme which operates in a block−
criterion is evaluated not only once for the whole block, but −by−block manner. All computational resources are divided
also for its subsampled versions (1×1, 2×2, 4×4, and 8×8). equally between frames and distributed among MBs. Each
Thus, the rejection at an earlier stage is possible. The same MB receives the same basic amount of computation and
approach is adopted in the multilevel successive elimination some extra resources according to the ratio of SAD at the
algorithm (MSEA) [64]. initial SP, which is the median of left, upper and right−upper
In Ref. 65, a PDE modification is based on the observa− MVs of neighbouring MBs, to average minimum SAD of
tion that BDM value is proportional to the gradient magni− the previously processed MBs. An adaptive search strategy
tude of the reference block. It allows for the early selection selection is employed which uses variation of neighbouring
of more significant pixels for BDM calculation, which MVs around the median to determine the motion activity
increases the probability of the fast rejection of inappropri− and selects either DS or TSS algorithm (see Fig. 8). If an
ate candidates. Similar approach is applied in Ref. 66; how− early−stop condition is not met and some resources are still
ever, the pixel selection order is determined by the arrange− available – FS algorithm is invoked. For further speed up,
ment of pixel differences calculated at ZMP according to 2:1 subsampling for BDM calculation can be applied. Simi−
their contribution into the BDM value. lar approach is adopted in the multi−path search (MPS) algo−
Fine granularity successive elimination algorithm rithm [73,74], where the initial SP is selected from the pre−
(FGSEA) [67] successively checks for up to 85 conditions diction set containing MVs of left, left−upper, upper, and
(levels) for early rejection of inappropriate candidates. Each right−upper neighbour, ZMP, and the co−located block in the
level is formed by the incremental computation of the SADs previous frame. Search strategy is selected among TSS, DS
between the parts of the candidate and the current block, and KCDS depending on the estimated block motion
from 1×1 to 16×16. To determine the order of the pixel activity and the available computation.
between the views. Next, a relation between disparity and (352×240) (“Football”, “Flower garden”, “Table tennis”,
motion is utilized to determine SR for motion estimation. and “Mobile calendar”) video sequences. The sequences
Additionally, the early stop condition is provided to limit represent different types of motion activity, from low (“Mo−
the number of RFs. The initial RF for a given block is the ther and daughter”), through moderate (“Foreman”,
same as used by the majority of neighbouring blocks. If the “Flower garden”, “Mobile calendar”), to high (“Football”,
Lagrangian cost obtained in this RF is smaller than a certain “Table tennis”). SR is set to ±7 and ±15 for QCIF and SIF
threshold, this RF is taken as the optimal and other refer− resolution, respectively. The block size is 16×16 and IPPP
ences are not searched. Obtained results are only tenths of GOP structure is used. In Table 1, motion−compensated
dB in PSNR and less than 1% in the bitrate worse than PSNR values and averaged numbers of SPs per MB for each
obtained with the scheme implemented in the joint multi− algorithm are presented. It is clear that there is no single
−view video model (JMVM) [88] software with over double algorithm which performs equally well for all the types of
speed−up. sequences. However, DS and BBGDS can be distinguished
In Ref. 89, in order to adjust SR, the motion homogene− as combining relatively good performance with low compu−
ity is measured as deviation of MVs of 4×4 blocks with tational complexity. Only for the Foreman sequence these
respect to their mean. Blocks taken for motion homogeneity algorithms are distinctly outperformed by TSS, FSS, and
estimation come from location of the corresponding MB in NTSS. Foreman is also the only sequence for which the
the previously coded view obtained by means of a global performance of TSS is not the poorest one.
disparity vector and from its eight neighbouring MBs. Then, In Ref. 90, NTSS, 4SS, DS, CDS, KCDS, and HEXBS
obtained value of the motion homogeneity is compared with algorithms are compared by using four 200−frame QCIF
two predefined thresholds to determine whether MB is (“Akiyo”, “Car phone”, “Foreman”, “Stefan”) and three
located in a homogenous, moderately homogenous, or com− 100−frame CIF (352×288) (“Flower garden”, “Mobile cal−
plex motion region, and SR is adjusted accordingly. The endar”, “Football”) video sequences. The block size is set to
method achieves almost the same results as JMVM with two 16×16 and SR to ±7 for all sequences. The comparison in
to five speed−up ratio. terms of motion−compensated PSNR and an averaged num−
ber of SPs per MB is presented in Table 2. In this compari−
3. Comparison of algorithms son, CDS and HEXBS appear as the algorithms which
maintain a relatively good performance and low computa−
In this section, selected BBME algorithms are compared on tional complexity for all the types of sequences. For high−
the basis of results published in several papers. Since the −motion activity sequences (“Stefan”, “Football”), NTSS
experiments were conducted with different video sequen− outperforms the remaining algorithms, however, at the ex−
ces, search ranges, block sizes, numbers of frames, and spa− pense of increased computational complexity. KCDS mani−
tial/temporal resolutions, the results are not always consis− fests the lowest computational complexity, but for most of
tent. However, some general conclusions can be drawn. the sequences the performance degradation is not accept−
The widest comparison of BBME algorithm belonging able. Only the videoconferencing materials such as “Akiyo”
to the first category can be found in Refs. 37, 90, and 91. In seem to be suitable for this algorithm.
Ref. 37, TSS, NTSS, 4SS, BBGDS, and DS algorithms are In Ref. 91, 2DLOG, 4SS, BBGDS, DS, CDS, and
compared by using two 100−frame QCIF (176×144) (“Fore− HEXBS algorithms are compared using ten CIF sequences:
man” and “Mother and daughter”) and four 300−frame SIF “Football”, “Coastguard”, “Akiyo”, “Foreman”, “Sean”,
Table 1. Performance and complexity comparison of FS, TSS, 4SS, NTSS, BBGDS, and DS [37].
Algorithm
Sequence
FS TSS 4SS NTSS BBGDS DS
PSNR [dB] 23.92 21.18 23.39 21.93 23.52 23.61
Flower garden
SPs 859.45 30.52 19.03 20.54 14.03 16.89
PSNR [dB] 23.08 21.82 22.27 22.02 22.08 22.24
Football
SPs 859.45 30.52 19.19 21.66 15.15 17.11
PSNR [dB] 30.65 30.16 29.83 30.23 29.73 29.83
Foreman
SPs 184.56 21.66 16.70 18.37 12.47 14.50
PSNR [dB] 22.61 22.31 22.51 22.58 22.58 22.54
Mobile calendar
SPs 859.45 30.52 15.88 18.04 10.73 12.60
PSNR [dB] 39.71 39.59 39.63 39.65 39.62 39.63
Mother and daughter
SPs 184.56 21.66 14.94 15.42 8.43 11.85
PSNR [dB] 27.49 25.82 26.42 26.26 26.25 26.61
Table tennis
SPs 859.45 30.52 18.31 20.71 12.71 15.54
“News”, “Silent”, “Stefan”, “Mobile calendar”, and “Con− For most of the sequences, 4SS exhibits similar perfor−
tainer”. The block size is set to 16×16 and SR to ±16 pixels. mance as DS and CDS or even outperforms them (“Foot−
The results are shown in Table 3. This comparison confirms ball”, “Stefan”), but requires more computational resources.
efficiency of DS and CDS algorithms. Results obtained by BBGDS is the fastest among the compared algorithms, and
both algorithms are very similar. However, CDS uses less its performance is in the most cases comparable with DS
SPs than DS. and CDS. In general, DS and CDS can be considered as the
most versatile and robust algorithms. In the case when the
Table 3. Performance and complexity comparison of FS, 2DLOG, 4SS, HEXBS, BBGDS, CDS, and DS [91].
Algorithm
Sequence
FS 2DLOG 4SS HEXBS BBGDS CDS DS
PSNR [dB] 25.84 24.91 25.06 24.66 24.71 24.83 24.88
Football
SPs 858.46 33.00 27.48 16.14 20.76 21.50 21.82
PSNR [dB] 30.04 29.40 29.76 29.79 29.81 29.88 29.88
Coastguard
SPs 868.33 33.00 26.02 13.04 13.72 15.68 16.54
PSNR [dB] 43.04 42.92 42.94 42.82 43.04 43.01 43.03
Akiyo
SPs 868.33 33.00 23.24 10.34 8.52 8.70 11.29
PSNR [dB] 32.84 31.92 32.00 31.42 32.14 32.01 32.10
Foreman
SPs 868.33 33.00 26.29 13.71 16.73 16.69 17.93
PSNR [dB] 39.46 39.31 39.33 39.30 39.45 39.41 39.42
Sean
SPs 868.33 33.00 23.44 10.48 8.92 9.15 11.66
PSNR [dB] 37.00 36.61 36.70 36.55 36.66 36.68 36.69
News
SPs 868.33 33.00 23.54 10.67 9.25 9.52 11.90
PSNR [dB] 36.24 35.85 35.88 35.56 35.56 35.57 35.66
Silent
SPs 868.33 33.00 23.76 10.96 9.98 10.14 12.46
PSNR [dB] 25.68 24.04 24.62 23.93 23.58 23.95 24.03
Stefan
SPs 868.33 33.00 25.94 13.87 15.52 16.96 17.61
PSNR [dB] 24.34 23.95 24.03 24.00 24.28 24.25 24.24
Mobile calendar
SPs 868.33 33.00 23.70 10.85 10.67 10.34 12.81
PSNR [dB] 38.45 38.45 38.45 38.45 38.45 38.45 38.45
Container
SPs 868.33 33.00 23.34 10.42 8.58 8.84 11.42
processing speed is a critical factor, BBGDS can be recom− binary block matching (BBM) [48], binary block matching
mended. Additionally, the performance of each algorithm with 1−bit transform (1BT) [49], reduced−bit mean absolute
can be improved by introducing the search centre prediction difference (RBMAD) [52], and low−resolution quantization
based on spatially and/or temporally adjacent MVs [6]. motion estimation algorithm (LRQME) [54]. Four CIF
Several multiresolution (hierarchical) algorithms are video sequences were used in the experiments (“Miss Ame−
compared in Ref. 43, where MRMCS is compared with rica”, “Car phone”, “Foreman”, “Claire”), 100 frames each
MRMC using four candidates at each level, MRST and with SR set to ±16 pixels. The results in Table 5 show that
one−level TSS algorithm. Four 300−frame CIF (“Mother and for all 1− and 2−bit criteria the PSNR drop exceeds 2 dB
daughter”, “News”, “Car phone”, “Foreman”) and three which is unacceptable value. For RBMAD criterion, 2−bit
300−frame SIF (“Flower garden”, “Football”, “Table ten− truncation does not create significant deterioration of the
nis”) video sequences were used. SR was set to ±16 pixels. search results. For ARBMAD even four bits can be safely
In Table 4, the motion−compensated PSNR value and com− truncated.
putational complexity (CC), with respect to FS, for each In Ref. 92, motion−compensated PSNR obtained with
algorithm are presented. In most cases, the computational full eight−bit pixel resolution is compared with results for
complexity of multi−resolution algorithms is higher than 1BT and 2−bit transform (2BT) [51]. Five CIF video se−
TSS, however, the PSNR values obtained with TSS are even quences were used in the experiments (“Coastguard”, “Mo−
1–2 dB lower (“Car phone”, “Foreman”, “Flower garden”, bile calendar”, “Football”, “Flower garden”, and “Bowing”)
“Table tennis”). MRST seems to be a good compromise with SR set to ±8 pixels. The results are presented in Table
between the speed and the performance achieving results 6. The performance obtained with 1BT and 2BT is similar
only tenths of dB away from FS and MRMCS. Note that with the exception of the Bowing sequence for which the
both MRST and MRMCS utilize the spatiotemporal cor− 0.8 dB difference is observed. However, the increased com−
relations between MVs. putational effort related to 2BT seems to be too high regard−
Considering the results from Table 1, where TSS is com− ing only a moderate improvement. It is characteristic that
pared with other BBME algorithms based on the search the biggest discrepancy between the results obtained with
position number reduction, it might be expected that the 8− and reduced−bit resolution picture representation is
BBGDS and DS can achieve results comparable with multi− observed for sequences with low motion activity and small
resolution algorithms with lower computational burden. amount of details such as “Claire” or “Miss America”. In
ME algorithms based on the reduced pixel bit precision general, truncating more than five least significant bits
representations of frames are dedicated for customized causes too large performance deterioration. However, from
hardware implementations to reduce the amount of compu− two to five bits truncation can be considered as an efficient
tation and the gate count. For this reason, the speed gain is way to reduce the hardware complexity and the power
highly implementation−dependent and only the efficiency in consumption of an ME unit.
terms of motion−compensated PSNR is compared. In Ref. The comparison of a few pixel decimation schemes can
53, the adaptive bit−reduced mean absolute difference be found in Refs. 57 and 59. In Ref. 57, Hilbert−scan−based
(ABRMAD) criterion is compared with full 8−bit MAD, globally adaptive pixel−decimation (HSPD) scheme using
Table 4. Performance and complexity comparison of FS, TSS, MRMC, MRST, and MRMCS [43].
Algorithm
Sequence
FS TSS MRMC MRST MRMCS
PSNR [dB] 40.10 39.64 40.03 40.05 40.09
Mother and daughter
CC [%] 100 3.2 4.6 2.1 4.7
PSNR [dB] 36.12 35.70 36.03 35.99 36.11
News
CC [%] 100 3.2 4.6 1.9 4.7
PSNR [dB] 32.37 31.28 31.96 32.19 32.30
Car phone
CC [%] 100 3.2 4.6 4.8 4.7
PSNR [dB] 32.53 30.78 32.10 32.35 32.50
Foreman
CC [%] 100 3.2 4.6 5.2 4.7
PSNR [dB] 25.26 22.49 24.65 25.11 25.25
Flower garden
CC [%] 100 3.2 4.6 4.6 4.7
PSNR [dB] 22.74 21.64 22.57 22.55 22.71
Football
CC [%] 100 3.2 4.6 3.6 4.7
PSNR [dB] 29.75 27.47 29.14 29.53 29.70
Table tennis
CC [%] 100 3.2 4.6 3.2 4.7
variable number of pixels is compared with fixed [40] and respectively. Motion−compensated PSNR and speed−up
alternating [55] 4:1 sub−sampling (Table 7). Four CIF se− ratio (SUR) values are shown in Table 8. B4Q4 and BVQ8
quences were used in the experiments (“Salesman”, are the schemes which combine boundary−based approach
“Claire”, “Football”, and “Table tennis”) with block size set with 4− and 8−queen (4Q, 8Q) scheme, respectively. The let−
to 16×16 and SR to ±8. As the complexity measure, the ters BV denote that the number of horizontal and vertical
number of pixels taken for the BDM calculation was taken. layers for the boundary−based approach is not identical.
The fixed subsampling is the most regular among the com− This comparison shows that more sophisticated subsam−
pared decimation schemes and its performance is the poor− pling schemes can achieve similar performance as N−queen
est one as well, especially for low−motion activity sequen− patterns with a higher speed−up ratio. The performance and
ces – “Claire” and “Salesman”. Alternating subsampling the speed−up ratio of BVQ8 seems to be comparable with
allows to obtain similar results as HSPD by using compara− HSPD, considering the difference in PSNR for the “Foot−
ble number of pixels, however, the second scheme is more ball” sequence when HSPD uses 31.3 pixels per block
flexible and able to maintain good performance for smaller which gives approximately 8.18−time speed−up. This type of
numbers of pixels used for distortion calculation. simplification of a matching criterion is, in general, less
In Ref. 59, boundary−based and N−queen decimation harmful for the performance than the pixel bit precision
schemes are compared with 4− and 8−queen schemes [58]. reduction.
Four video sequences were used in the experiments: “Con− The comparison of several fast full search algorithms
tainer”, “Foreman” (QCIF), “Stefan” (CIF), and “Football” can be found in Ref. 67. In the experiments, 11 QCIF and
(SIF) with the block size and SR set to 16×16 and ±16, CIF video sequences, 100−frame each were used (“Car pho−
Table 7. Performance and complexity comparison of pixel decimation patterns: fixed 4:1, alternated 4:1, and HSPD with variable number
of pixels [58].
Algorithm
Sequence
No decimation Fixed 4:1 Alternating 4:1 HSPD
PSNR [dB] 42.31 39.31 42.26 42.25 42.23 42.10 42.00
Claire
Pixels/Block 256 64 64 73.0 61.3 36.7 30.8
PSNR [dB] 23.66 23.52 23.61 23.57 23.56 23.42 23.34
Football
Pixels/Block 256 64 64 75.0 64.3 36.2 31.3
PSNR [dB] 35.43 34.56 35.33 35.37 35.36 35.18 35.16
Salesman
Pixels/Block 256 64 64 73.2 62.6 36.5 30.4
PSNR [dB] 28.56 28.42 28.49 28.49 28.48 28.40 28.30
Table tennis
Pixels/Block 256 64 64 73.4 65.7 36.5 28.6
Table 8. Performance and complexity comparison of pixel decimation patterns: 4−queen (4Q), 8−queen (8Q), B4Q4, and BVQ8 [57].
Algorithm
Sequence
No decimation 4Q 8Q B4Q4 BVQ8
PSNR [dB] 43.09 43.08 43.00 43.08 43.00
Container
SUR 1 4.0 8.1 6.4 9.2
PSNR [dB] 31.84 31.76 31.57 31.65 31.52
Foreman
SUR 1 4.0 8.1 6.4 9.2
PSNR [dB] 25.90 25.82 25.62 25.69 25.54
Stefan
SUR 1 4.0 8.1 6.4 9.2
PSNR [dB] 22.88 22.72 22.41 22.47 22.26
Football
SUR 1 4.0 8.1 6.4 9.2
ne”, “Coastguard”, “Flower garden”, “Foreman”, “Grand− provide computational scalability feature, their work is in−
mother”, “Hall monitor”, “Miss America”, “Mother and terrupted after reaching a targeted number of SPs. The
daughter”, “Stefan”, “Suzie”, and “Table tennis”). SEA results for target number of 10 and 15 SPs per MB are pre−
[61], MSEA [63,64], and FGSE [67] algorithm performan− sented in Table 10. Five CIF sequences were used for the
ces were compared. The block size and SR were set to experiments, 90 frames each: “Coastguard”, “Football”,
16×16 and ±16, respectively. “Foreman”, “Paris”, and “Stefan”, the block size and SR
Since the performance in terms of PSNR of all the algo− were set to 16×16 and ±15 pixels, respectively.
rithms is the same as FS, only the speed−up ratios are pre− Table 9. Speed−up ratios of fast full search algorithms: SEA,
sented in Table 9. It shows that the speed gain for this type MSEA, and FGSE [67].
of algorithms can be even higher than for some search posi−
tion reduction algorithms such as TSS. However, it is highly
content−dependent. For instance, speed−up of FGSE can
vary from 6.3 (“Table tennis”) to 43.7 (“Car phone”). Thus,
this type of fast BBME algorithms can be a good choice for
quality−oriented video compression systems.
The performance of computation−aware BBME algo−
rithms is usually visualized in the form of charts such as in
Fig. 9, where changes of quality, measured by MSE or
PSNR, are presented as the function of available computa−
tional resources. To simplify the comparison, the perfor−
mance of selected algorithms is presented only at selected
operating points which characterize an algorithm capability
to converge to the optimal solution. In Ref. 73, the computa−
tion−aware multi−path search (MPS) algorithm is compared
with Chen’s one−pass scheme [72] (OPS) and three search
position reduction algorithms: NTSS, DS, and HEXBS. Fig. 9. Performance changes along with available computational re−
Since except MPS and OPS the rest of the algorithms do not sources for MPS algorithm [73].
Table 10. MPS, OPS, NTSS, DS, and HEXBS performance at 10 and 15 SPs per MB [73].
Algorithm
Sequence
MPS OPS NTSS DS HEXBS FS limit
PSNR@10 SPs/MB [dB] 29.64 29.11 26.62 28.45 28.37
Coastguard 29.73
PSNR@15 SPs/MB [dB] 29.68 29.52 27.02 28.84 29.14
PSNR@10 SPs/MB [dB] 25.76 24.04 22.11 22.34 22.91
Football 27.29
PSNR@15 SPs/MB [dB] 26.32 25.34 23.71 23.31 24.25
PSNR@10 SPs/MB [dB] 33.07 32.26 31.02 31.09 31.16
Foreman 33.52
PSNR@15 SPs/MB [dB] 33.25 33.09 31.52 32.39 32.20
PSNR@10 SPs/MB [dB] 30.36 29.81 29.54 29.29 29.01
Paris 30.70
PSNR@15 SPs/MB [dB] 30.56 30.46 29.66 30.08 30.20
PSNR@10 SPs/MB [dB] 25.68 24.60 21.35 22.98 23.10
Stefan 25.89
PSNR@15 SPs/MB [dB] 25.73 25.27 21.93 23.74 23.99
For instance, KCDS is well suited for estimating motion 9. L. Reoxiang, Z. Bing, and M.L. Liou, “A new three−step
in videoconferencing materials but performs poorly for search algorithm for block motion estimation”, IEEE T. Circ.
high−motion activity sequences. TSS is simple and regular Syst. Vid. 4, 438–442 (1994).
10. L.M. Po and W.C. Ma, “A novel four−step search algorithm
but its performance is mediocre in comparison with newer
for fast block motion estimation”, IEEE T. Circ. Syst. Vid. 6,
algorithms such as DS. Multiresolution techniques are very 313–317 (1996).
attractive for applications with large frame sizes and search 11. J.N. Kim and T.S. Choi, “A fast three−step search algorithm
ranges. Algorithms based on the pixel bit precision reduc− with minimum checking points using unimodal error surface
tion are dedicated mainly to custom hardware architectures. assumption”, IEEE T. Consum. Electr. 44, 638–648 (1998).
Fast full search algorithms seem to be most useful for qual− 12. H.Y. Chung, P.Y.S. Cheung, and N.H.C. Yung, “Adaptive
ity−oriented compression systems where stable processing search centre non−linear three step search”, Proc. Int. Conf.
time is not a critical factor. Computation−aware algorithms on Image Processing, pp.191–194, Chicago, 1998.
13. X. Jingand and L.P. Chau, “An efficient three−step search al−
are most flexible but require more sophisticated hardware
gorithm for block motion estimation”, IEEE T. Multimedia
architectures. 6, 435–438 (2004).
14. R. Srinivasan and K.R. Rao, “Predictive coding based on ef−
4. Conclusions ficient motion estimation”, IEEE T. Commun. 33, 888–896
(1985).
Motion estimation usually consumes the most computation 15. A. Puri, H.M. Hang, and D. Schilling, “An efficient block−
in a video coding system. In this paper, many fast BBME −matching algorithm for motion−compensated coding”, Proc.
IEEE Int. Conf. Acoustics, Speech, and Signal Processing
algorithms belonging to different categories are character−
12, pp. 1063–1066, Dallas, 1987.
ized. Performance and computational complexity of selec−
16. M. Ghanbari, “The cross−search algorithm for motion esti−
ted algorithms is compared to facilitate the choice of an mation”, IEEE T. Commun. 38, 950–953 (1990).
appropriate algorithm to a specific application. 17. L.G. Chen, W.T. Chen, Y.S. Jehng, and T.D. Chiueh, “An ef−
ficient parallel motion estimation algorithm for digital image
Acknowledgements processing”, IEEE T. Circ. Syst. Vid. 1, 378–385 (1991).
18. L.W. Lee, J.F. Wang, J.Y. Lee, and J.D. Shie, “Dynamic
This work was supported in part by the Polish National Cen− search−window adjustment and interlaced search for block−
−matching algorithm”, IEEE T. Circ. Syst. Vid. 3, 85–87
tre for Research and Development under Grant LIDER/05/
(1993).
8/L−2/10/NCBiR/2011. 19. L.K. Liu and E. Feig, “A block−based gradient descent search
algorithm for block motion estimation in video coding",
References IEEE T. Circ. Syst. Vid. 6, 419–422 (1996).
20. S. Zhu and K.K. Ma, “A new diamond search algorithm for
1. F. Dufaux and F. Moscheni, “Motion estimation techniques fast block matching motion estimation”, J. VLSI Sig. Proc.
for digital TV – a review and a new contribution”, Proc. Syst., pp. 292–296, Singapore, 1997.
IEEE 83, 858–876 (1995). 21. J.Y. Tham, S. Ranganath, M. Ranganath, and A.a. Kas−
2. J.R. Jain and A.K. Jain, “Displacement measurement and its sim,“A novel unrestricted centre−biased diamond search al−
application in interframe image−coding” IEEE T. Commun. gorithm for block motion estimation”, IEEE T. Circ. Syst.
29, 1799–1808 (1981). Vid. 8, 369–377 (1998).
3. I.E.G. Richardson, Video Codec Design: Developing Image 22. C.H. Cheung and L.M. Po, “A novel cross−diamond search
and Video Compression Systems, edited by John Wiley & algorithm for fast block motion estimation” IEEE T. Circ.
Sons, Chichester, 2002. Syst. Vid. 12, 1168–1177 (2002).
4. J.−B. Lee and H. Kalva, The VC−1 and H.264 Video Com− 23. C.H. Cheung and L.M. Po, “A novel small−cross−diamond
pression Standards for Broadband Video Services, Springer search algorithm for fast video coding and videoconferen−
Science+Business Media, New York, 2008. cing applications”, Proc. IEEE Int. Conf. Image Processing
5. M. Flierl and B. Girod, “Multiview video compression – ex− I, pp. 681–684, Rochester, 2002.
ploiting inter−image similarities”, IEEE Signal Proc. Mag. 24. C.W. Lam, L.M. Po, and C.H. Cheung, “A novel kite−cross−
24, 66–76 (2007). −diamond search algorithm for fast block matching motion
estimation”, IEEE Int. Symp. on Circuits and Systems 3, pp.
6. Y.W. Huang, C.Y. Chen, C.H. Tsai, C.F. Shen, and L.G.
729–732, Vancouver, 2004.
Chen, “Survey on block matching motion estimation algo−
25. J.B. Xu, L.M. Po, and C.K. Cheung, “Adaptive motion track−
rithms and architectures with new results”, J. VLSI Sig. Proc.
ing block matching algorithms for video coding” IEEE T.
Syst. 42, 297–320 (2006).
Circ. Syst. Vid. 9, 1025–1029 (1999).
7. T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro,
26. A.M. Tourapis, O.C. Au, and M.L. Liou, “Fast motion esti−
“Motion−compensated interframe coding for video conferen−
mation using circular zonal search”, Proc. SPIE Symp. Vi−
cing”, Proc. Nat. Telecom. Conf., pp. G.5.3.1–G.5.3.5, New
sual Comm. and Image Processing 3653, pp. 1496–1504,
Orleans, 1981.
1999.
8. S. Kwatra, L. Chow−Ming, and W. Whyte, “An adaptive al− 27. A.M. Tourapis and O.C. Au, “Fast motion estimation using
gorithm for motion compensated color image coding”, IEEE modified circular zonal search”, Proc. IEEE Int. Symp. Cir−
T. Commun. 35(7), 747–754 (1987). cuits and Systems 4, pp. 231–234, Orlando, 1999.
64. X.Q. Gao, C.J. Duanmu, and C.R. Zou, “A multilevel succes− 79. C.H. Lin and J.L. Wu, “A lightweight genetic block−match−
sive elimination algorithm for block matching motion esti− ing algorithm for video coding”, IEEE T. Circ. Syst. Vid. 8,
mation”, IEEE T. Image Process. 9, 501–504 (2000). 386–392 (1998).
65. J.N. Kim and T.S. Choi, “A fast full−search motion−estima− 80. Y.T. Roan and P.Y. Chen, “A fuzzy search algorithm for the
tion algorithm using representative pixels and adaptive estimation of motion vectors”, IEEE T. Broadcast. 46,
matching scan”, IEEE T. Circ. Syst. Vid. 10, 1040–1048 121–127 (2000).
(2000). 81. P.Y. Chen and M. Jou, “An efficient blocking−matching al−
66. W.G. Hong and T.M. Oh, “Sorting−based partial distortion gorithm based on fuzzy reasoning”, IEEE T. Syst. Man Cy.
search algorithm for motion estimation”, Electron. Lett. 40, B31, 253–259 (2001).
113–115 (2004). 82. C.M. Kuo, C.H. Hsieh, Y.D. Jou, H.C. Lin, and P.C. Lu,
67. C. Zhu, W.S. Qi, and W. Ser, “Predictive fine granularity “Motion estimation for video compression using Kalman fil−
successive elimination for fast optimal block−matching mo− tering”, IEEE T. Broadcast. 42, 110–116 (1996).
tion estimation”, IEEE T. Image Process. 14, 213–221 83. P.L. Lai and A. Ortega, “Predictive fast motion/disparity
(2005). search for multiview video coding”, Proc. IEEE Visual Com−
68. M. Brunig and W. Niehsen, “Fast full−search block match− munications and Image Processing 6077, pp. 7709–7709,
ing”, IEEE T. Circ. Syst. Vid. 11, 241–247 (2001). San Jose, 2006.
69. T.G. Ahn, Y.H. Moon, and J.H. Kim, “Fast full−search mo− 84. J.B. Lu, C.T. Hua, J.G. Lou, and J. Li, “An effective epipolar
tion estimation based on multilevel successive elimination
geometry assisted motion estimation technique for multi−
algorithm”, IEEE T. Circ. Syst. Vid. 14, 1265–1269 (2004).
−view image and video coding”, Proc. IEEE Int. Conf. Image
70. S.W. Liu, S.D. Wei, and S.H. Lai, “Fast optimal motion esti−
Processing, pp. 1089–1092, Atlanta, 2006.
mation based on gradient−based adaptive multilevel succes−
85. Z.Y. Zhang, “Determining the epipolar geometry and its un−
sive elimination”, IEEE T. Circ. Syst. Vid. 18, 263–267
certainty: A review”, Int. J. Comput. Vision, 27, 161–195
(2008).
(1998).
71. P.L. Tai, S.Y. Huang, C.T. Liu, and J.S. Wang, “Computa−
86. Y. Kim, J. Kim, and K. Sohn, “Fast disparity and motion esti−
tion−aware scheme for software−based block motion estima−
mation for mufti−view video coding”, IEEE T. Consum.
tion”, IEEE T. Circ. Syst. Vid. 13, 901–913 (2003).
Electr. 53, 712–719 (2007).
72. C.Y. Chen Y.W. Huang, C.L. Lee, and L.G. Chen, “One−pass
computation−aware motion estimation with adaptive search 87. X.M. Li, D.B. Zhao, S.W. Ma, and W. Gao, “Fast disparity
strategy”, IEEE T. Multimedia 8, 698–706 (2006). and motion estimation based on correlations for multiview
73. M. Jakubowski and G. Pastuszak, “A new multi−path scheme video coding”, IEEE T. Consum. Electr. 54, 2037–2044
for adaptive computation−aware motion estimation”, Opto− (2008).
−Electron. Rev. 15, 118–124 (2007). 88. A. Vetro, Y. Su, H. Kimata, and A. Smolic, “Joint multiview
74. M. Jakubowski and G. Pastuszak, “An adaptive computa− video model JMVM 2.0.”, ITU−T and ISO/IEC Joint Video
tion−aware algorithm for multi−frame variable block−size Team Document JVT−U207 (http://ftp3.itu.int/av−arch/ jvt−
motion estimation in H.264/AVC”, Int. Conf. on Signal Pro− −site/2006_10_Hangzhou/JVT−U207.zip), 2006.
cessing and Multimedia Applications, pp. 122–125, Milan, 89. L.Q. Shen, G.R. Feng, Z. Liu, Z.Y. Zhang, and P. An,
2009. “Macroblock−level adaptive search range algorithm for mo−
75. Z. Yang, J.J. Bu, C. Chen, and L.J. Mo, “Configurable com− tion estimation in multiview video coding”, J. Electron. Im−
plexity−bounded motion estimation for real−time video en− aging 18, (2009).
coding”, in Advanced Concepts for Intelligent Vision Sys− 90. X.Q. Yi and N. Ling, “Rapid block−matching motion estima−
tems, 3708, pp. 555–562, Antwerp, 2005. tion using modified diamond search algorithm”, Proc. IEEE
76. G. Sorwar, M. Murshed, and L.S. Dooley, “A fully adaptive Int. Symp. Circuits and Systems, pp. 5489–5492, Kobe, 2005.
distance−dependent thresholding search (FADTS) algorithm 91. K.H. Ng, L.M. Po, and K.M. Wong, “Search patterns switch−
for performance−management motion estimation”, IEEE T. ing for motion estimation using rate of error descent”, Proc.
Circ. Syst. Vid. 17, 429–440 (2007). IEEE Int. Conf. Multimedia and Expo, pp. 1583–1586, Bar−
77. K.H.K. Chow and M.L. Liou, “Genetic motion search algo− celona, 2007.
rithm for video compression”, IEEE T. Circ. Syst. Vid. 3, 92. A. Vlachos and V. Fotopoulos, and A.N. Skodras, “Low bit
440–445 (1993). depth representation motion estimation algorithms: a com−
78. I.K. Kim and R.H. Park, “Block matching algorithm using parative study”, J. Real−Time Image Proc. 5, 141–148
a genetic algorithm”, Proc. IEEE Visual Communications (2010).
and Image Processing 2501, pp. 1545–1552, Taipei, 1995.