Beruflich Dokumente
Kultur Dokumente
I. I NTRODUCTION
Many applications benefit from the availability of information about fish abundance and distribution. Examples
include ecological and environmental studies of fish communities [1], design and placement of fish ladders in dams
for hydroelectric power generation [2], feeding strategies
used by fish farmers [3] and stock assessment for fishery
management [4]. Accurate stock assessment is particularly
important in a scenario where human population and
the demand for fish are increasing. Attempting to match
natural stock fluctuation with fishing effort may help to
avoid any further long term damage of exploited species.
The consequences of over-fishing can be catastrophic,
since fish provide vital contributions to food supplies and
influence employment in coastal areas (see Figure 1).
Various methods have been applied to calculate estimates of stock sizes. It is apparent that stock assessment
techniques are highly dependent on available data and
whether long or short-term predictions are the aim. For
correct predictions many techniques require large inputs
of unbiased data and, therefore, the strength of any stock
biomass prediction will be influenced by the weakness of
the available inputs, validating final modal estimates of a
fishery [4], [5].
A critical task in all of the afore-mentioned applications
is fish species classification, that is, the task of grouping
and categorizing fish species according to shared physical
characteristics. Unfortunately, even though fish species
classification may be performed by human observers,
Immune Network (aiNet) [9] and Adaptive Radius Immune Algorithm (ARIA) [10] for clustering individuals
of the same species, and (iii) a simple nearest neighbor
classification strategy. A block diagram with the main
modules of the proposed framework is depicted in Figure
2. As it will be later demonstrated by our experimental
results, even though a very simple classifier has been
used, we have obtained very good performance levels,
what confirms the robustness of the feature extraction
techniques used.
In spite of the fact that vision-based systems for automatic classification of fish species have several applications, so far only a handful of works have proposed
effective solutions for that problem [6], [11][15]. In
[11], the authors use artificial neural networks for the
classification of digital echo recordings of fish schools.
Energetic, morphometric and bathymetric school descriptors were extracted. Several fish species were considered,
including anchovy, rough scad and blue whiting. Correct
classification rates up to 96% were obtained. That method
is best suited to coarser detections, such as finding fish
schools, and can not be applied in multispecies environments characterized by continuous changes in the species
composition of the schools.
The methodology used in [12] is the most closely
related to our approach. The authors present a system
for classifying four fish species, namely: Pacu, Carpa,
Surubim and Cascudo. That system is based on the use
of PCA and aiNet algorithms, achieving overall accuracy
higher than 80%. Unlike that work, we present a flexible
framework that allows users to apply not only PCA but
also the SIFT algorithm, as well as one of two possible
artificial immune systems: ARIA and aiNet. Differently
from PCA features, the SIFT features are based on the
appearance of the object at interest points and are invariant
to image scale and rotation [7]. Additionally, the ARIA
algorithm has a better performance than aiNet in several
data clustering applications [10], since it is able to preserve
the density information of data. Finally, our approach was
tested with a larger dataset, composed by nine fish species
of significant economic impact.
In [13], the authors present a deformable template
object recognition method for classifying two fish species
in underwater video. Specifically, a deformable template
matching is used, which employs shape contexts and largescale spatial structure preservation. A SVM texture classification is performed, achieving a classification accuracy
of 90%. Unlike our method, that approach was validated
with a very limited fish species group (only two species)
and its computational cost may prevent its application in
a real-time scenario composed by more fish species.
In [6], a feature selection methodology is proposed for
fish species classification. Given a set of available descriptors, the method builds the feature vector by analyzing
each individual characteristic contribution to the overall
classification performance. A couple of statistical measures, namely discrimination and independence, are used
to aid in the feature selection process. The authors report
a classification accuracy of about 85%. Similarly to our
approach, instead of studying techniques for improving
the classifiers structure itself, the authors consider it as
a black box and focus on the study of robust feature
determination methods.
In [14], a shape analysis algorithm was developed for
removing edge noise and redundant data points. A curvature function analysis was used to locate critical landmark
points. The fish contour segments of interest were then
extracted based on these landmark points for species
classification. Similarly to our approach, that method was
tested with a large group of fish species (nine species).
However, the authors performed experiments with a much
smaller dataset, composed by only 22 sample images.
Finally, in [15], an infrared silhouette sensor is used to
acquire contours of fish in constrained flow. Classification
is based on the combined results of three different classifiers which use invariant moments and Fourier boundary
descriptors for fish silhouette recognition. Those features,
however, do not perform well for noisy images. The
authors report a classification accuracy of about 78%.
The remainder of this paper is organized as follows:
Sections 2 covers our automatic framework for fish species
classification. Experimental results are presented in Section 3, followed by the conclusions and discussion in
Section 4.
II. C LASSIFICATION F RAMEWORK
The operation of the framework for automatic classification of fish species is divided in two main steps (see Figure
2). The first step is executed offline and is responsible for
estimating a knowledge base by using feature extraction
and clustering techniques. These techniques are applied
to a training set containing image samples of the nine
fish species considered in this work (see Figure 3). The
knowledge base is further used by the framework in a
second step, which is executed online and is responsible
for performing the classification of new detected fish.
A. Feature Extraction
Stable local feature extraction and representation is a
fundamental component of systems for automatic classification of fish species [13]. To perform a robust feature extraction, the proposed framework provides two
R = .
(6)
As R is symmetric, we have = , that is, R = .
After some additional mathematical manipulations, we
get:
= R,
(7)
(4)
Assuming that ui has expected value zero, that is, E[ui ] =
as follows:
0, i, we compute the covariance matrix of
2 = T RT,
(5)
where R = E p1 .
The coordinate system T that results in the highest
possible value for covariance is computed by finding a
singular value decomposition for R, as follows:
(8)
In Figures 4(a) and 4(b), we illustrate a PCA feature space
obtained by using the two components with highest values
(k = 2), which were computed by applying PCA to matrix
. In this case, encodes the color information of our
image samples of fish conserved in formaldehyde solution.
Alternatively, our framework may use the SIFT algorithm to detect local features in fish images. It consists of four steps [7]: (i) scale-space extrema detection;
(ii) keypoint localization; (iii) orientation assignment and
(iv) keypoint descriptor. In the first step, some interest
points (keypoints) are detected. For this, the image is
convolved with Gaussian filters at different scales, and
then the difference of successive Gaussian-blurred images
are taken. Keypoints are searched as maxima/minima of
the Difference of Gaussians that occur at multiple scales.
In the second step, candidate keypoints are localized and
eliminated if found to be unstable. In the third step,
each keypoint is assigned one or more orientations based
on local image gradient directions. The assigned orientation(s), scale and location for each keypoint enables
SIFT to construct a canonical view for the keypoint that
is invariant to similarity transforms [16]. The final step
computes descriptor vectors for these keypoints.
Specifically, a keypoint descriptor used by SIFT is
created by sampling the magnitudes and orientations of
the image gradient in the patch around the keypoint and
building orientation histograms to capture the relevant
aspects of the patch. Histograms contain 8 bins each, and
each descriptor contains a 4 4 array of 16 histograms
around the keypoint. This leads to a SIFT feature vector
with 4 4 8 = 128 elements. This 128-element vector
is then normalized to unit length to enhance invariance to
changes in illumination.
The SIFT keypoint descriptor representation is designed
to avoid problems due to boundary effects [7]. Therefore,
smooth changes in location, orientation and scale do not
cause radical changes in the feature vector. Moreover, it
(a)
(b)
Figure 4. PCA feature space for the two components with highest values, computed by applying PCA to matrix . In this case, encodes color
information of our image samples of fish conserved in formaldehyde solution. (a) Antibodies obtained by aiNet and (b) Antibodies obtained by ARIA.
Algorithm 1
Algorithm 2
1. Initialize variables
2. For iteration 1 to G do:
2.1 For each antigen xi , i = 1, . . . , p, do:
2.1.1 Select the best matching antibody Ab ;
2.1.2 Mutate Ab with rate ;
end;
2.2 Kill those antibodies that are not stimulated;
2.3 Clone those antibodies that recognize antigens located at
a distance larger than its radius R;
2.4 Calculate the local density for each Ab ;
2.5 Calculate the suppression threshold of each Ab making
RAb = r(Demax /De )(1/Di ) ;
2.6 Suppress antibodies giving survival priority for those with
smaller R;
2.7 Make E = mean(R);
2.8 If current generation is greater than G/2:
2.8.1 Reduce ( = *);
end;
end;
Table I
PARAMETERS VALUES OF AI N ET.
Parameters
G
d
s
n
Meaning
Number of generations
Natural death threshold
Suppression threshold
No of cloned antibodies
Percentage of the mature antibodies to be selected
Value
10
1
0.01
4
0.2
Meaning
Number of generations
Di
Value
10
Initially drawn in
[0.01, 0.09]
0.01
Initially set to 1
1
10 or 128
(a)
(b)
Figure 7. (a) Overall accuracy of the framework as a function of the number of principal components. (b) Frameworks accuracy per each fish
species, when SIFT is applied to the images.
(a)
(b)
Figure 8. (a) Bar graph containing the numbers of matches between each image sample of a validation individual of species Canivete and the images
of reference individuals of all species. (b) Numbers of matches between the image of a validation individual of species Canivete, rotated at 40 ,
and all the images of the reference individual of the same species.