Sie sind auf Seite 1von 8

MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 36

ACCEPTED FROM OPEN CALL

DESIGN OF TRAFFIC SIGN DETECTION,


RECOGNITION, AND TRANSMISSION SYSTEMS FOR
SMART VEHICLES
ABDELHAMID MAMMERI AND AZZEDINE BOUKERCHE, UNIVERSITY OF OTTAWA
MOHAMMED ALMULLA, KUWAIT UNIVERSITY

ABSTRACT system, the aforementioned issues have to be


accurately addressed.
Traffic sign detection and recognition In most published TSDR systems, a two-stage
(TSDR) is an essential component of advanced sequential strategy has been adopted: the detec-
driver assistance systems (ADAS). It is mainly tion stage is followed by a recognition stage. In
designed to enhance driver safety through the the detection stage, a search for the regions of
fast acquisition and interpretation of traffic interest (ROIs), which contain the traffic sign
signs. However, such systems still suffer from the within the image, is performed. These ROIs are
inability to accurately recognize signs. Moreover, then examined by the recognition stage to cor-
the sharing of wirelessly recognized signs among rectly identify signs. We believe that a well
vehicles is not yet supported by current systems. designed system involves, rather, three to four
In some safety scenarios, vehicle-to-vehicle com- stages. Each stage requires special consideration.
munication of traffic sign information is These stages are the following: preprocessing,
required. In this article, we first address chal- detection, recognition, and tracking.
lenges and undesirable factors facing TSDR sys- The purpose of the pre-processing stage is to
tems. After that, we show how to design a TSDR enhance the input image in order to increase the
system by addressing some useful techniques likelihood of the successful delivery of ROIs to
used in each stage of the system. For each stage, subsequent stages. It includes mainly image pro-
these techniques are regrouped into different cessing and camera calibration [1]. To avoid
categories. Then, for each category, a short dense scanning of the whole image in this first
description is given followed by some concluding stage, a localization technique may be used to
remarks. Finally, the transmission of the recog- select the areas that possibly contain signs. The
nized signs is briefly investigated. extraction of such an ROI is performed in the
second stage (i.e., detection). It is accomplished
by extracting certain features from the input
INTRODUCTION image like color, shape, or gradient information.
Advanced driver assistance systems (ADAS) After that, the recognition stage receives a list of
are intelligent systems designed mainly to ROIs that possibly contain one or more signs. In
improve driving comfort and enhance road this stage, the ROIs in question will either be
safety by assisting drivers in their tasks. Some recognized as traffic signs or they will not; and
examples of such systems include the pedestri- they will be furthermore classified according to
an protection system, adaptive cruise control their categories (stop signs, speed limit signs,
system, and vehicular communication system. etc.). This will be carried out using the knowl-
One major component of ADAS is the traffic edge obtained from a set of training samples.
sign detection and recognition (TSDR) system. Finally, to enable the following of signs over
It is intended to increase driver and pedestrian time, a tracking stage was included in our strate-
safety through the quick detection and interpre- gy. Compared to the individual frame-based sys-
tation of signs. However, even with large num- tems, this stage has several advantages including
bers of published works and platforms, current the ability to decrease false detections over
systems are still far from perfect. This is mainly times, and to predict future sign positioning in
due to some challenges and issues. These chal- the image; thus, this stage feeds the detection
lenges are various in number; we classify them stage with updated candidates. Consequently,
in different levels: low-to-medium camera reso- the most common trackers used in TSDR sys-
lution; unsatisfactory road conditions; imperfect tems are the Kalman filter (KF), discussed in [2],
sign state; varied surrounding conditions; and a and the particle filter (PF), which is based on
moving vibrating car. To design a robust TSDR Bayesian inference, adopted in [3].

36 1536-1284/13/$25.00 © 2013 IEEE IEEE Wireless Communications • December 2013


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 37

In almost all published TSDR systems, the


Stages Techniques Examples
recognized sign is used by one driver without
being shared between neighboring vehicles. In
some safety scenarios, the intervehicle com- Preprocessing Camera calibration [1]
munication of sign information is required to
prevent approaching drivers from a possible Color thresholding
crash. To the best of our knowledge, such sys- Color [4–6]
tems for the detection, recognition, and trans- Color learning
mission of sign information do not exist. This
is due to the fact that the existing technology Edge extraction
has not yet reached the level of maturity need-
ed to perform inter vehicle communication Detection Template matching
perfectly. We expect in the near future that a Shape [4, 7–12, 15]
new generation of vehicles will be equipped Machine learning
with onboard units based on the standard
IEEE 802.11 interfaces and antennas. In fact, Gradient HOG
the development of IEEE 802.11p and the
frequency band allocations in the 5.8–5.9 GHz Hybrid methods [2]
bands for public safety services illustrate the
next step forward in the deployment of net- Particle-filter-based technique [3]
works of vehicles. Tracking
In this article, we show how a TSDR system Kalman-filter-based technique [2]
should be designed. We point out some appro-
priate techniques used by the four stages of this Template matching techniques [11]
system. For each stage, these techniques are Recognition
classified into different categories, as shown in Machine learning techniques [5]
Table 1. Furthermore, for each category, a short
description is given followed by a brief example Table 1. Some techniques used by each stage of TSDR systems.
from the literature in this field and some useful
remarks. Finally, the transmission of recognized
signs is briefly investigated. This is a survey type an adequate level of stabilization, or through the
of article on an important topic that is currently use of an electromechanical stabilization plat-
receiving a great deal of attention. This article is form. Another possibility is through image stabi-
meant to provide design guidelines, which shall lization algorithms. To the best of our
supply readers with useful techniques at each knowledge, almost no TSDR systems include a
stage for the building of a well structured TSDR stabilization module.
system.
SURROUNDING CONDITIONS
CHALLENGES Surrounding conditions concern mainly the fol-
lowing factors: lighting, occlusions, and objects
Several undesirable factors may influence the similar to signs. The degree of light intensity
design of an ideal TSDR. We classify them in influences detection quality. This quality depends
five different categories: a low-acquisition vision mainly on weather conditions and those found
system; an unsatisfactory road condition; a through the day where lighting conditions differ
changeable surrounding condition; a moving from night to day. Surrounding objects, such as
vibrating car; and an imperfect sign state. buildings or trees, may directly affect (by occlu-
sion) or indirectly (by their shadow) the quality
ACQUISITION VISION SYSTEM of detection. Moreover, occlusion could be par-
When a traffic sign is far away from a camera, tial or total. In the case of total occlusion, all
its ideogram appears small in the image. The detection algorithms fail.
recognition of such signs with a low-resolution
camera is a challenging problem for moving MOVING CAR
vehicles. This is due to the fact that signs not As images of signs are acquired from moving
only appear small, but noisy and blurry in the vehicles, sign detection suffers from car vibra-
image. One way to handle this problem is to use tions and motion blur. Motion blur is defined as
a high resolution camera. Many researchers use the visible streaking of rapidly moving signs in
a standard camera with a resolution that is usu- frames. It results from a change in the recorded
ally at most 640 ¥ 480 pixels. Unfortunately, this image during the recording process of a single
resolution seems insufficient, especially when frame due to fast movement. Moreover, car
signs are far away from vehicles. vibrations are mainly due to unsatisfactory road
conditions.
ROAD CONDITIONS
Camera vibrations and oscillations are an issue TRAFFIC SIGN STATE
for moving cars. In fact, the mounted TSDR sys- The physical degradation of signs can be classi-
tem suffers from car movements caused by road fied into two categories: nature-made and man-
roughness. The performance of TSDR systems made. Nature-made degradation is due to
can be improved if the vibrations introduced by ultra-violet radiation and retro-reflective materi-
a camera are corrected by an image stabilization als, which mainly influence the color sign. Man-
system. One way to decrease camera vibrations made degradation can affect shapes, colors, and
is to exploit the movement of lenses to achieve parts of the sign or the entire sign.

IEEE Wireless Communications • December 2013 37


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 38

select areas that possibly contain signs. For


instance, known positions of the road in the
STOP respective image help reduce the search space
for signs, and horizontal edges of buildings with-
in the target image guide us to focus our search
on some regions instead of the entire image.
STOP To the best of our knowledge, the majority of
Tracking
existing TSDR systems do not explicitly include
such a camera calibration step. Research work in
Preprocessing Detection Recognition Transmission
this area is required.

STOP STOP STOP


DETECTION TECHNIQUES
The main task of this stage is the generation of
Figure 1. Traffic sign detection, recognition, and transmission system stages. candidates. This stage extracts ROIs from the
image and sends them to the recognition stage,
avoiding as many background regions as possi-
TSDR STAGES ble. The key to this stage is to avoid missing
signs as much as possible. Otherwise, the recog-
A well designed TSDR system involves four nition stage will not be able to correct errors. To
stages. These stages are pre-processing, detec- design a robust detection system, many
tion, recognition and tracking. In the following approaches exist. Most of these approaches
sections, each stage of the STDR is properly achieve detection through color, shape, and gra-
addressed, and the most useful techniques are dient, or by also basing detection systems on
presented, as demonstrated in Fig. 1 and Table 1. hybrid algorithms, as demonstrated in Table 1.

PREPROCESSING TECHNIQUES DETECTION THROUGH COLOR


Color-based detection methods aim to segment
To design a robust TSDR system, a preprocess- the colors of signs in order to provide ROIs for
ing stage is required. The main advantage of this the recognition stage. Color segmentation con-
stage is that it decreases the false detection ratio sists of partitioning an image into subsets of con-
and speeds up the detection process. nected pixels that share similar color properties.
The fundamental function of the preprocess- Its aim is to intensify certain colors or edges,
ing stage is to enhance the input image and and to suppress inappropriate areas. Different
avoid dense scanning of it. This increases the color spaces are used to separate signs from the
chance of a fast and successful delivery of ROIs background. The selection of the appropriate
to the subsequent stages, which in turn mini- color space is of high importance, since it affects
mizes the number of false positive detections. the detection result. Different color spaces are
We believe that the preprocessing stage consists used in the literature, such as RGB, normalized-
mainly of operations related to the vision system RGB, HSI/HSV, and YCbCr.
(camera) and the scene taken by the camera In a more recent study [4], the authors not
(i.e., the output image/video). In fact, the camera only made a comparison between color spaces
should be calibrated and some low-level tuning for the purpose of selecting those that are supe-
such as dynamic range adjustments have to be rior, but they also presented a quantitative com-
performed. parison of several segmentation methods that
Camera calibration refers to the process of serve in sign detection. The color spaces identi-
finding extrinsic and intrinsic parameters that fied as superior in [4] are normalized-RGB, fol-
produce a given image or video [1]. Only a few lowed by HSI. Unfortunately, this study does not
works have been devoted to the calibration of consider other color spaces such as CIECAM97,
cameras. Most of them begin by computing the CIELab, YCbCr, and Luv.
intrinsic parameters, which are assumed to be After determining a suitable color space, a
constant, while the extrinsic parameters are con- search for signs within the input image was start-
tinuously updated [1]. ed using segmentation. Basically, two types of
Dynamic range adjustment (DRA) refers to the color segmentation were reported, segmentation
ratio between the largest and the smallest measur- based on color thresholding and segmentation
able light intensities. Images usually necessitate based on color learning.
some adaptive dynamic ranges to increase their
contrast. Recently, high dynamic range (HDR) Segmentation Based on Color Thresholding — Color
image-based systems have begun to expand into thresholding is the most intuitive segmentation
the field of driver assistance technologies since technique available when dealing with colors. In
they have commenced to provide high contrast fact, every pixel with a chrominance value above
images even at night. An HDR camera should be a certain threshold is marked with an appropri-
tested for sign detection application due to the ate label. After that, an ROI containing signs is
good quality of image it delivers. identified. Various authors have experimented
Besides camera calibration, reduction of the with color thresholding, especially in the 1990s.
search area inside the image helps in the process The authors in [5] present some segmentation
of building a fast detection system. This is due to algorithms based on color thresholding. Unfortu-
the fact that dense scanning of the whole image nately, when segmentation based on color
wastes time. To avoid dense scanning of the thresholding (SCT) methods were used, a num-
image, a localization technique can be used to ber of thresholds had to be adjusted, and their

38 IEEE Wireless Communications • December 2013


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 39

adjustment depends only on the image undergo- are briefly explained in the following paragraphs. After image preprocess-
ing testing. Moreover, SCT techniques are sensi- Earlier detection methods rely on edge segmen-
tive to lighting conditions, especially those tation followed by linking to obtain a contour ing, shape detection is
occurring on sunny days. representation, from which various parameters
such as perimeter and center are extracted. For
performed. This is begun
Segmentation Based on Color Learning (SCL) — The instance, the authors in [7] apply the Laplacian by extracting feature
disadvantages of SCT techniques can be alleviat- filter to the input image to extract outlines and
ed by using a well-known machine learning tech- edges. Although the application of these tech- representations of
niques such as the support vector machine niques is simple, they produce several ROIs, images for the genera-
(SVM); this provides some improvements over which burden the subsequent stages.
SCT methods [4]. An algorithm based on SVMs The purpose of HT, the second technique, is tion of maps with fea-
was presented in [4] to classify pixels using color to isolate features of a particular shape by a vot-
information. Samples of the targeted color ing procedure. The classic HT technique is con-
tures, after which ROIs
required for detection and some other colors cerned with the identification of regular curves that possibly contain
from training images were labeled and used to such as ellipses. The HT is later extended into a
train the SVM. A lookup table (LUT), in which generalized HT technique to deal with more signs are processed in
the less significant bits were reduced, was used complex shapes. The HT technique is commonly the next stages.
to improve speed; this nevertheless also main- used to detect the shape of signs. For instance,
tained quality. The authors show that segmenta- the authors in [8] combine HT with an iterative
tion based on SVM yields comparatively better process of median filtering and dilation to refine
results than some SCT segmentation methods. the sign candidate set. However, the HT tech-
Neural networks (NNs) are also used to seg- nique remains computationally expensive, espe-
ment images based on their colors. In fact, in [6] cially for large images.
NNs are used in color training to recognize The third technique, FRS, is a variant of the
signs. The basic idea is to segment the input circular HT technique. It was extended to traffic
image by using an NN in eight colors. The ROIs sign application for the yielding of a regular
are then analyzed by a second NN to determine polygon detector in [9]. This method is based on
if their shape corresponds to a TS. Unfortunate- the symmetric nature of sign shapes and the pat-
ly, NNs are slow, especially in the training phase. tern of edge orientations exhibited by equiangu-
Of the SCT and SCL methods used for detec- lar polygons. Compared to HT, this technique is
tion, SVM outperforms SCT methods. Com- fast enough to produce robust systems. As it
pared to NNs, which have multiple solutions detects shapes based on edges, the algorithm is
associated with the local minima, the advantage robust to illumination changes. However, this
of SVM is that it provides a unique solution method may fail when many edges are present in
since the optimality problem is convex [4]. How- images. For the non-frontal viewing of signs, the
ever, the main problem of this technique is its correspondent image is usually transformed
speed, which can be alleviated using the LUT through an accurate transformation (e.g., rota-
proposed in [4]. tion) before detection if the orientation of the
sign is a priori known; otherwise, the detection
DETECTION THROUGH SHAPE fails.
Besides color, shape can be effectively used to A comparative study between different seg-
detect signs, especially when color-based tech- mentation techniques is presented in [4]; in this
niques fail (e.g., changes in illumination). After work, it is pointed out that edge detection meth-
image preprocessing, shape detection is per- ods are not the most appropriate, except when
formed. This is begun by extracting feature rep- applied to achromatic decomposition (white
resentations of images for the generation of color).
maps with features, after which ROIs that possi-
bly contain signs are processed in the next stages. Shape Extraction through Gradient Features — Colors,
From the literature reviewed, image features are edges, or corners are perceivable features usual-
divided into two categories: local features such ly extracted via image filtering or using learning
as edges, and global features such as histograms techniques. Recently, gradient information
of oriented gradients (HOGs) [10]. The choice (directions and magnitudes) have been success-
of features is often coupled with the detection fully used to extract signs. This includes, among
method. Edges are found using some classical other features, HOG [10]. Unlike the HT tech-
edge detection techniques, such as the Canny fil- nique, this approach does not require prior
ter [4]; these techniques are usually followed by knowledge of the target.
edge linking techniques (e.g., Hough Transform, The HOG detector, proposed initially for
HT, matching) or distance transform matching pedestrian detection, starts by dividing the image
for the detection of TS. HOGs are found by cre- into a set of overlapping blocks. Each block is
ating histograms of gradients on images. then divided into non-overlapping cells, and for
Shape is usually extracted by using perceiv- each cell a histogram of the gradient orienta-
able features (e.g., edges), gradient features tions is performed. HOG has been successfully
(e.g., HOG), and template matching (TM); it applied to detect signs in several works (e.g.,
may also extracted using machine learning based [16]). Different sizes of blocks, cells, and strides
techniques. of the HOG descriptor are used in [15] to evalu-
ate the performance of the classification stage. It
Shape Extraction through Edge Features — Most of the is reported that cells and blocks of small sizes
shape detectors are based on two edge detec- achieve higher recall and precision rates. This is
tion/linking techniques: the HT and Fast Radial due to the finer spatial binning of the gradient
Symmetry (FRS) algorithms. These techniques orientations. HOG has proven its efficiency over

IEEE Wireless Communications • December 2013 39


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 40

HOG detector, proposed other algorithms because of the following desir- After ROI detection, object localization is
able properties: invariance to scale, local con- performed. This is accomplished by using some
initially for pedestrian trast normalization, coarse spatial sampling, and methods like simple circle fitting operation on
fine weighted gradient orientations [15]. circular signs or the bounded area space tech-
detection, starts by nique. With the first technique, once the center
dividing the image into Shape Extraction through Template Matching — Shape of the ROI is localized, the sign is cropped and
can also be extracted by using TM techniques. cross-correlation template matching is applied to
a set of overlapping TM concerns the process of matching a template find the best match in the sign database. With
blocks. Each block is to an image in the database. TM algorithms are the bounded area space technique, small objects
used to search through training samples and (e.g., red traffic lights) or large objects (e.g., red
then divided into non- determine the nearest neighbor of the test sam- cars) are excluded. For instance, the authors in
ple. However, the application of the TM [5] use a limited area with a size that varies
overlapping cells, and approach is unsuitable for real-time systems; the between 200 and 5000 pixels.
for each cell a histogram reason for this is that the technique in question Given the large number of papers found in the
requires many cross-correlation computations literature on detection systems, it is difficult to
of the gradient orienta- between the template and the ROIs for sign select one method among many as the best
tions is performed. HOG detection. option. Nevertheless, gradient feature-based algo-
TM using distance transform (DT) is widely rithms are obviously gaining comparative popular-
has been successfully employed for shape detection. DT represents the ity over other approaches, due to some desirable
distance of each pixel to the nearest edge in the properties such as scale invariance, local contrast
applied to detect signs corresponding image edge. After edge detection, normalization, coarse spatial sampling, and fine
in several works. a DT image is built. To find the shape of inter- weighted gradient orientations [15]. Further
est, a template is matched (e.g., a regular circle) research is needed to improve the quality of
against the DT image. Color-based DT was detection by reducing the number of features and
recently introduced in [11], where a DT was sep- hence speeding up the whole process.
arately computed for every color channel. It is
shown that shape detection through TM offers a TRACKING TECHNIQUES
more suitable description of signs and entails
easy training. However, it is sensitive to occlu- The most evolved TSDR systems use a tracking
sions and rotations. The main advantage of using stage to monitor signs over time. Compared to
TM is that it can be easily modified to recognize individual frame-based systems, this stage has
other objects. This approach is applicable to several advantages, which include the following
both detection and recognition stages. attributes: the reduction of false detections over
time, increasing the robustness of the whole sys-
Shape Extraction through Machine Learning Techniques — tem, and the prediction of future sign positions
Machine learning techniques are used success- in the image; this enables the reduction of the
fully to extract shapes. NNs and SVMs are usu- search space and supplying of the detection
ally used for sign detection due to their efficiency stage with updated candidates. The most com-
in detecting shapes. For instance, two NNs are mon trackers used in TSDR systems are the
trained to extract the shape and color features of Kalman filter (KF), which is discussed, for
signs in [2]. This approach is accurate but com- instance, in [2], and the particle filter (PF) elab-
putationally very expensive. However, the addi- orated on in [3]. The KF is often used for linear
tion of signs to the NN implies a new training of systems, while the PF is more robust in its appli-
the network and the manual selection of the cation for multimodal and nonlinear systems.
training samples. The investigation of SVM for The KF is used to produce a statistically opti-
shape detection is illustrated in [12], where the mal estimate of the position and scale of the sign
system contains three phases: color segmenta- in consecutive frames of an input video. Conse-
tion, shape extraction using linear SVMs, and quently, sign trackers based on the KF usually
recognition based on Gaussian-kernel SVMs. assume a uniform and constant direction motion
The suggested method is invariant to rotation, for the vehicle with a priori known velocity and
scale, and translation, and to partial occlusions. traffic sign size. An example of the adoption of
the KF tracker in the TSDR system is found in
HYBRID METHODS [2]. The same assumptions are used by the afore-
Many detection approaches follow a two-step mentioned authors; that is, the vehicle moves in
sequential strategy. That is, color or shape a straight line and exposed to a constant slope
thresholding segmentation is first performed on while maintaining an approximately constant
the input image to yield some ROIs. This step is speed. Based on these assumptions, they esti-
immediately followed by the sign detection stage mated sign position and velocity in 3D space.
applied on the generated ROIs. Unfortunately, The nonlinearity of the resulting measurement
ROIs that have been inaccurately rejected dur- model was eliminated by adopting the extended
ing segmentation cannot be recovered for fur- KF [2]. The following three limitations of the
ther processing in the detection and recognition system in [2] were identified: unrealistic assump-
stages. In this context, a parallel (joint) process- tions related to the real size of signs; the con-
ing of color and shape has thus been introduced stant speed of vehicles, which have to be known
to overcome this drawback. For instance, the a priori; and the assumption that trajectories are
authors in [2] have built a feature map of the straight.
input image based on the color and model shape In general, the PF is also widely used in
of signs. It is observed that the false rejection object tracking. Very few research works using
ratio is significantly lowered in comparison to the PF to detect signs have been found. Recent-
non-hybrid methods. ly, the authors of [3] used the sequential impor-

40 IEEE Wireless Communications • December 2013


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 41

tance sampling PF to track signs. As an observa-


tion model, the authors used shape features
extracted through a Sobel filter. The output
edge map was approximated using discrete points
at the inner and outer edges of the sign board.
The likelihood was then updated using the VS
Mahalanobis distance between the measured 50
and predicted positions of signs. The algorithms
have shown high detection rates based on the F-
measure evaluations.
Although tracking represents an important
stage in TSDR systems, this stage has not
received as much attention as other stages. Most
existing works rely mainly on KFs or PFs. The
KF is the more heavily utilized algorithm com-
pared to PF. Its application is unfortunately lim-
ited to linear systems. On the other hand, PF
proves to be more robust when applied to multi- Figure 2. Transmission scenario of traffic signs.
modal and nonlinear problems. In our opinion,
much work remains to be done on tracking signs.
NNs) are used in [5] to implement the recogni-
RECOGNITION TECHNIQUES tion stage. Each pattern is recognized through
the use of one network, since it makes the classi-
The recognition stage receives a list of ROIs fier expandable. The authors use six MLP-NNs
that possibly contains one or more signs. In this to identify ROIs as belonging to one of the six
stage, ROIs are first recognized as to whether categories of signs used in their experiments.
they are signs. They are then classified, with Two algorithms are used to train the classifiers:
regard to their content, into the appropriate cat- the Scaled-Conjugate-Gradient algorithm and
egories (stop signs, speed limit signs, etc.); this is Resilient-Back-Propagation. The authors show
done by using the knowledge obtained from a that Scaled-Conjugate-Gradient yields the best
set of training samples. As determined from our results compared to Back-Propagation and SVM.
assessment of the reviewed literature, we would Recently, a comparative study was introduced in
like to emphasize that not all research works on [15] to evaluate the performance of k-d trees,
TSDR use a separate detection/recognition step random forests, and SVMs for traffic-sign recog-
as described in this article. To design a robust nition. A random forest with 100 trees yielded
recognition stage, three steps are used: prepro- higher classification accuracy on subsets with
cessing, training, and testing. As the output of more than 500 features. The random forests
the detection sign is probably unaligned with an used in this work outperform SVMs and k-d
unknown size of candidates, some basic image trees due to their randomness and the robust-
preprocessing, such as rescaling or rotation, is ness to outliers and unbalanced data sets. Unfor-
required in the first step. The second step, which tunately, no comparison with NNs was made.
is training, determines the set of basic elements Tree classifiers are useful techniques used to
needed to represent signs. Each sign is repre- recognize signs. A random forest classifier, a
sented by a coefficient vector, which is generated kind of decision tree, is a very promising tech-
by applying a set of basic elements on the sign. nique used in sign recognition. A random forest
The third step is the testing process, in which the consists of constructing many decision trees and
best match for a sign is carried out. yields the class that is the mode of class output
Mainly, signs are recognized using template of individual trees [15]. As stated in [15], it out-
matching or machine learning techniques. The performs SVM and k-d trees in its identification
choice of the appropriate recognition algorithm of signs.
depends essentially on the training sample statis- Despite the enormous number of papers, the
tics (e.g., class distribution and size), the features recognition of signs is still not perfect. Among
used, and the output of the detection algorithm. classifiers found in the literature, the random
Template-matching-based algorithms look forest remains a powerful classifier [15]. Ulti-
for the most similar training samples that exist, mately, accurate recognition techniques are
which are stored in a database of a respective needed to increase the recognition rate.
and unknown ROI. Recall that TM is performed
in the extracted ROIs for the purpose of finding TRANSMISSION
signs. An interesting method for simultaneous
detection and recognition, based on TM, is the After the detection and recognition of a sign, the
one previously discussed: [11]. The input image latter is used only by one driver in his or her
was matched to DT images of differently shaped own vehicle. Sharing recognized sign information
sign templates, grouped in terms of similarity between vehicles is not yet supported by modern
within a tree structure. TSDR systems. To show the impact of the trans-
Machine-learning-based algorithms: As with mission stage, let us consider a particular safety
detection, learning-based methods can be effi- scenario: Fig. 2. Once a vehicle source Vs detects
ciently used for sign recognition. They are and recognizes signs through its TSDR system, it
trained to find an optimal separation between warns the driver to take appropriate actions. In
classes. These include SVM, NN, Adaboost [14], some situations, sharing signs between V s and
and others. Multi-layer perceptron NNs (MLP- neighbouring vehicles is required. Hence, the Vs

IEEE Wireless Communications • December 2013 41


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 42

To the best of our driver notifies other drivers located within its [8] W.-J. Kuo and C.-C. Lin, “Two-Stage Road Sign Detec-
tion and Recognition,” 2007 IEEE Int’l. Conf. Multime-
transmission range. The communication system
knowledge, recognition of vehicles ensures that data packets are reliably
dia and Expo, 2007, pp. 1427–30.
[9] G. Loy, “Fast Shape-Based Road Sign Detection for a
sent to all vehicles, and hence allows drivers to
and transmission sys- make appropriate decisions.
Driver Assistance System,” IEEE/RSJ Int’l. Conf. Intelli-
gent Robots and Systems (IROS, 2004), pp. 70–75.
tems of signs do not Regarding the communication aspect of [10] N. Dalal and B. Triggs, “Histograms of Oriented Gradients
for Human Detection,” CVPR, vol. 1, 2005, pp. 886–89.
TSDR, inter-vehicle communication is particu-
exist. We expect that larly challenging due to the high rate of vehicle
[11] A. Ruta, Y. Li, and X. Liu, “Real-Time Traffic Sign
Recognition from Video by Class-Specific Discriminative
the new generation of mobility and signal-weakening obstructions, such Features,” Pattern Recognition, vol. 43, no. 1, 2010,
pp. 416–30.
as buildings. To the best of our knowledge,
vehicles will be recognition and transmission systems of signs do
[12] P. Gil-Jiménez, H. Gómez-Moreno, and F. López-Fer-
reras, “Road-Sign Detection and Recognition based on
not exist. We expect that the new generation of
equipped with on-board vehicles will be equipped with onboard units
Support Vector Machines,” S. Maldonado-Bascón, S.
Lafuente-Arroyo, IEEE Trans. Intelligent Transportation
units based on the stan- based on the standard IEEE 802.11p interfaces Systems, 2007.
[13] T. Mangel, O. Klemp, and H. Hartenstein, “5.9 GHz
and antennas that work on a frequency band
dard IEEE 802.11p ranging from 5.8 to 5.9 GHz [13].
Inter-Vehicle Communication at Intersections: A Vali-
dated Non-Line-of-Sight Path-Loss and Fading Model,”
interfaces and antennas EURASIP J. Wireless Comm. and Networking 2011,

that work on a frequen-


CONCLUSION 2011, p. 182.
[14] X. Baró et al., “Traffic Sign Recognition Using Evolu-
tionary Adaboost Detection and Forest-ECOC Classifica-
In this article, TSDR systems are studied. Chal-
cy band ranging from lenges facing TSDR systems are regrouped into
tion,” IEEE Trans. Intelligent Transport Systems, vol. 10,
issue 1, 2009, pp. 113–26.
5.8 to 5.9 GHz. five different categories: camera acquisition, sur- [15] F. Zaklouta and B. Stanciulescu, “Real-Time Traffic-Sign
Recognition Using Tree Classifiers,” IEEE Trans. Intelligent
rounding conditions, moving and vibrating car, Transportation Systems, vol. 13, no. 4, 2012, pp. 1507–14.
sign state, and road conditions. A brief descrip- [16] G. Overett et al., “Creating Robust High-Throughput
tion of the problems encountered at each stage Traffic Sign Detectors Using Centre-Surround HOG
Statistics,” Machine Vision and Applications, Special
is given. We divide the TSDR into four different Issue, pp. 1–14.
stages: preprocessing, detection, recognition, and
tracking. We then describe each of these stages, BIOGRAPHIES
showing some techniques that can be used effi- ABDELHAMID MAMMERI (amammeri@uottawa.ca) is a senior
ciently. For each stage, the suggested techniques research and postdoc fellow at Diva research strategic cen-
are further classified. For example, in the detec- ter, University of Ottawa, Canada. He obtained his Master’s
degree from Universite Catholique de Louvain, Belgium,
tion stage, techniques are classified following the and his Ph.D. in electrical and computer engineering from
shape, color, and gradient information. For each the University of Sherbrooke, Canada. He is the recipient of
stage, the best techniques used in the literature the prestigious FQRNT Quebec fellowship award at the
are highlighted, followed by some concluding postdoc level in 2012. His current research interests include
multimedia and image processing applied on smart vehi-
interpretations. The transmission stage is also cles, multimedia streaming over wireless ad hoc and sensor
examined in this study. This is due to its impor- networks, multimedia green applications, and multimedia
tance in inter-vehicular communication. streaming over VANET. He has extensively published in
highly ranked international conferences and journals in his
ACKNOWLEDGMENT area. He served as a Technical Program Chair for the 3rd
ACM Symposium on Development and Analysis of Intelli-
This work was partially supported by Canada gent Vehicular Networks and Applications (DIVANet 2013),
Research Chair Programs, DIVA Strategic a Program Co-Chair of the 8th ACM Workshop on Perfor-
Research Center, Natural Sciences and Engi- mance Monitoring and Measurement of Heterogeneous
Wireless and Wired Networks (PM²HW²N 2013) and Techni-
neering Research Council of Canada (NSERC), cal Program Co-Chair of the 9th IEEE International Work-
and Fonds de Recherche du Québec, Nature et shop on Performance and Management of Wireless and
Technologie (FRQNT). Mobile Networks (P2MNET 2013). He also served as a Tech-
nical Program Committee member for several conferences,
including MIC-CAI2012, IEEE VTC 2013, and MIC-C
REFERENCES
[1] S. Marsi et al., “Video Enhancement and Dynamic AZZEDINE BOUKERCHE held a faculty position with the Univer-
Range Control of HDR Sequences for Automotive Appli- sity of North Texas, Denton, and was a senior scientist with
cations,” EURASIP J. Advances in Signal Processing, vol. the Simulation Sciences Division, Metron Corporation, San
2007, 2007, p. 9. Diego, CA. He is a full professor and holds a Canada
[2] C. Y. Fang, S. W. Chen, and C. S. Fuh, “Road Sign Research Chair position at the University of Ottawa,
Detection and Tracking,” IEEE Trans. Vehic. Tech., vol. Ontario, Canada. He is a Fellow of the Canadian Academy
52, no. 5, 2003, pp. 1329–41. of Engineering and the founding director of the PARADISE
[3] M. Meuter, A. Kummert, and S. Muller-Schneiders, “3D Research Laboratory, School of Information Technology
Traffic Sign Tracking Using a Particle Filter,” Proc. 11th and Engineering, Ottawa. Prior to this, he held a faculty
Int’l. IEEE Conf. Intelligent Transportation Sys., 2008, position at the University of North Texas, and he was a
pp. 168–73. senior scientist at the Simulation Sciences Division, Metron
[4] H. Gómez-Moreno et al., “Goal Evaluation of Segmen- Corporation, San Diego, California. He was also employed
tation Algorithms for Traffic Sign Recognition,” IEEE as a faculty member in the School of Computer Science,
Trans. Intelligent Transportation Systems, vol. 11, no. 4, McGill University, Montreal, Quebec, Canada, and taught
2010, pp. 917–30. at the Polytechnic of Montreal. He spent a year at the Jet
[5] Y. Y. Nguwi and A. Z. Kouzani, “Detection and Classifi- Propulsion Laboratories (JPL)/National Aeronautics and
cation of Road Signs in Natural Environments,” Neural Space Administration (NASA)-California Institute of Tech-
Computing & Applications, 2008, pp. 265–89. nology, Pasadena, where he contributed to a project cen-
[6] D. L. Kellmeyer and H. T. Zwahlen, “Detection of Highway tered on the specification and verification of the software
Warning Signs in Natural Video Images Using Color Mage used to control interplanetary spacecraft operated by
Processing and Neural Networks,” Proc. IEEE Int’l. Conf. JPL/NASA Laboratory. His current research interests include
Neural Networks, vol. 7, 1994, pp. 4226–31. wireless ad hoc and sensor networks, wireless networks,
[7] Y. Aoyagi and T. Asakura, “A Study on Traffic Sign mobile and pervasive computing, wireless multimedia, QoS
Recognition in Scene Image Using Genetic Algorithms service provisioning, performance evaluation and modeling
and Neural Networks,” Proc. 22nd IEEE Int’l. Conf. Ind. of large-scale distributed systems, distributed computing,
Electron., Control Instrum., Taipei, Taiwan, Aug. 1996, large-scale distributed interactive simulation, and parallel
vol. 3, pp. 1838–43. discrete-event simulation. He has published several research

42 IEEE Wireless Communications • December 2013


MAMMERI_LAYOUT_Layout 12/20/13 12:21 PM Page 43

papers in these areas. He served as a Guest Editor for the Time Application (DS-RT), Program Chair for the ACM
Journal of Parallel and Distributed Computing Special Workshop on QoS and Security for Wireless and Mobile
Issues on Routing for Mobile Ad Hoc, Wireless Communica- Networks, ACM/IFIPS Europar 2002 Conference, IEEE/SCS
tion and Mobile Computing, and Mobile Ad Hoc Network- Annual Simulation Symposium (ANNS 2002), ACM WWW
ing and Computing, ACM/Kluwer Wireless Networks, 2002, IEEE MWCN 2002, IEEE/ACM MASCOTS 2002, IEEE
ACM/Kluwer Mobile Networks Applications, and Journal of Wireless Local Networks WLN 03-04, IEEE WMAN 04-05,
Wireless Communication and Mobile Computing. He serves and ACM MSWiM 98-99, and a TPC member of numerous
as an Associate Editor of IEEE Transactions on Parallel and IEEE and ACM-sponsored conferences. He served as Vice
Distributed Systems, IEEE Transactions on Vehicular Tech- General Chair for the Third IEEE Distributed Computing for
nology, Elsevier Ad Hoc Networks, Wiley International Jour- Sensor Networks Conference in 2007, as Program Cochair
nal of Wireless Communication and Mobile Computing, for IEEE GLOBECOM 2007–2008 Symposium on Wireless
Wiley’s Security and Communication Network Journal, Else- Ad Hoc and Sensor Networks, and for the 14th IEEE ISCC
vier Pervasive and Mobile Computing Journal, IEEE Wireless 2009 Symposium on Computer and Commmunication, and
Communications, Elsevier’s Journal of Parallel and Dis- as Finance Chair for ACM Multimedia 2008. He also serves
tributed Computing, and SCS Transactions on Simulation. as a Steering Committee Chair for the ACM Modeling,
He was the recipient of the Best Research Paper Award at Analysis, and Simulation for Wireless and Mobile Systems
IEEE/ACM PADS 1997, ACM MobiWac 2006, ICC 2008, ICC Conference, the ACM Symposium on Performance Evalua-
2009, and IWCMC 2009, and the recipient of the Third tion of Wireless Ad Hoc, Sensor, and Ubiquitous Networks,
National Award for Telecommunication Software in 1999 and IEEE/ACM DS-RT.
for his work on a distributed security systems on mobile
phone operations. He has been nominated for the Best MOHAMMED ALMULLA received his B.Sc. and Ph.D. degrees in
Paper Award at the IEEE/ACM PADS 1999 and ACM MSWiM computer science from McGill University, Montreal, Cana-
2001. He is a recipient of an Ontario Early Research Excel- da, in 1986 and 1995, respectively. He is an associate pro-
lence Award (previously known as Premier of Ontario fessor in the Computer Science Department, College of
Research Excellence Award), Ontario Distinguished Computer Sciences and Engineering, Kuwait University.
Researcher Award, and Glinski Research Excellence Award. Currently, he is the director of the Graduate Program at
He is a co-founder of the QShine International Conference the Computer Science Department at Kuwait University and
on Quality of Service for Wireless/Wired Heterogeneous a consultant at Kuwait Central Agency for Information
Networks (QShine 2004). He served as General Chair of the Technology. His research interest is in the areas of dis-
Eighth ACM/IEEE Symposium on Modeling, Analysis, and tributed simulation, distributed computing, modeling and
Simulation of Wireless and Mobile Systems, and the Ninth simulation of parallel methodologies, and artificial intelli-
ACM/IEEE Symposium on Distributed Simulation and Real- gence.

IEEE Wireless Communications • December 2013 43

Das könnte Ihnen auch gefallen