Figure 2. Sample set of hand-printed Farsi characters
In the first step, a simple Bayesian classifier was
trained on all 81 extracted features. In the test stage,
approximately 77 percent of the samples were
classified correctly.
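As an illustration of this baseline, the following is a minimal sketch of a Bayesian classifier. It assumes a Gaussian naive Bayes model (independent, normally distributed features per class); the text only says "simple Bayesian classifier", so the model form, the function names and the variance floor are assumptions rather than the authors' implementation.

```python
import math
from collections import defaultdict


def train_gaussian_nb(samples, labels, var_floor=1e-6):
    """Estimate per-class log-prior, feature means and feature variances."""
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)
    model = {}
    n_total = len(samples)
    for y, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor (an assumption) avoids division by zero for constant features
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[y] = (math.log(n / n_total), means, variances)
    return model


def predict(model, x):
    """Return the class with the highest log-posterior for feature vector x."""
    def log_posterior(params):
        log_prior, means, variances = params
        return log_prior + sum(
            -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
            for v, m, var in zip(x, means, variances)
        )
    return max(model, key=lambda y: log_posterior(model[y]))


def accuracy(model, samples, labels):
    """Fraction of samples whose predicted class matches the true label."""
    hits = sum(predict(model, x) == y for x, y in zip(samples, labels))
    return hits / len(samples)
```

In the setting described here, each sample would be an 81-element feature vector, and `accuracy` evaluated on held-out test samples would play the role of the recognition percentage.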
In the second step, we performed feature selection
with a randomized genetic algorithm, as described in
Section 4.1. By omitting redundant features and
selecting effective, suitable ones, the genetic
algorithm raised the character recognition rate to
80 percent and also reduced the number of features
used to train the classifier from 81 to 55.
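The selection step can be sketched as a genetic algorithm over 0/1 feature masks. This is only a sketch under stated assumptions: the operators (tournament selection, one-point crossover, bit-flip mutation, elitism), all parameter values, and the toy fitness function are illustrative choices, not the configuration of Section 4.1. In practice the fitness of a mask would be the recognition rate of the classifier trained on the selected features.

```python
import random

INFORMATIVE = {0, 3, 7}  # hypothetical indices of the useful features


def toy_fitness(mask):
    """Stand-in for classifier accuracy: reward useful features, penalize size."""
    useful = sum(mask[i] for i in INFORMATIVE)
    return useful - 0.1 * sum(mask)


def ga_select(n_features, fitness, pop_size=20, generations=30,
              crossover_rate=0.8, mutation_rate=0.02, seed=0):
    """Return the best 0/1 feature mask found by a simple elitist GA."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        scored = [(fitness(ind), ind) for ind in pop]

        def tournament():
            # Pick two individuals at random and keep the fitter one.
            a, b = rng.sample(scored, 2)
            return a[1] if a[0] >= b[0] else b[1]

        nxt = [best[:]]  # elitism: the best mask so far always survives
        while len(nxt) < pop_size:
            p1, p2 = tournament(), tournament()
            if rng.random() < crossover_rate:
                cut = rng.randrange(1, n_features)  # one-point crossover
                child = p1[:cut] + p2[cut:]
            else:
                child = p1[:]
            # Bit-flip mutation: toggle each feature with small probability.
            child = [bit ^ 1 if rng.random() < mutation_rate else bit
                     for bit in child]
            nxt.append(child)
        pop = nxt
        best = max(pop, key=fitness)
    return best
```

Calling `ga_select(81, fitness)` with a classifier-based `fitness` would correspond to the 81-feature problem above. Because of the elitism step, the best fitness in this sketch never decreases from one generation to the next.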
In the third step, to address the shortcomings of
the genetic algorithm, SA was added to the method,
so that feature selection was performed by a
combined genetic algorithm and SA. By omitting
redundant features and selecting suitable feature
sets, this algorithm raised the character recognition
rate to about 82 percent. In this case the number of
selected features was reduced from 81 to 60. Table 1
shows the results.
Figure 3. Comparison of classification-rate progress over
generations for GA, GA+SA, and GA+SA with two (high and
low) thresholds
Figure 3 gives a clear picture of how the algorithms
progress over the generations. The curves for the
genetic algorithm and for the combined genetic
algorithm and SA with two (high and low) thresholds
are ascending, whereas the curve for the combined
genetic algorithm and simple SA sometimes descends
and sometimes ascends. This variation is the effect
of the probability factor mentioned in Section 4.2.
The comparison also shows the superiority of the
combined genetic algorithm and SA with two
thresholds in selecting suitable features for
classification.
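The rise and fall of the simple SA curve can be illustrated with a minimal simulated annealing sketch. It assumes the standard Metropolis acceptance rule, in which a worse candidate is accepted with probability exp(delta / t); the cooling schedule, the single-bit move and all parameter values are illustrative assumptions. The two-threshold variant is not reproduced here, because its definition is not given in this section.

```python
import math
import random


def sa_refine(mask, fitness, t_start=1.0, t_end=0.01, cooling=0.9,
              moves_per_temp=10, seed=0):
    """Refine a 0/1 feature mask; return (best_mask, trace of current fitness)."""
    rng = random.Random(seed)
    current, f_cur = mask[:], fitness(mask)
    best, f_best = current[:], f_cur
    trace = [f_cur]
    t = t_start
    while t > t_end:
        for _ in range(moves_per_temp):
            cand = current[:]
            i = rng.randrange(len(cand))
            cand[i] ^= 1  # move one feature in or out of the subset
            f_cand = fitness(cand)
            delta = f_cand - f_cur
            # Metropolis rule: always accept improvements; accept a worse
            # candidate with probability exp(delta / t).
            if delta >= 0 or rng.random() < math.exp(delta / t):
                current, f_cur = cand, f_cand
                if f_cur > f_best:
                    best, f_best = current[:], f_cur
            trace.append(f_cur)
        t *= cooling  # geometric cooling (an assumption)
    return best, trace
```

The `trace` list records the fitness of the current solution after every move. Since worse candidates are sometimes accepted, this trace can go down as well as up, which is the kind of fluctuation the probability factor introduces; only the separately tracked `best` mask is guaranteed never to get worse.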
VI. SUMMARY
A character recognition system must offer high
accuracy, high speed and easy-to-use tools. Many
methods can be described; they differ in feature
extraction and classification and produce different
results. However, extracting all features is not always
useful. In this article, two algorithms were used to
reduce the problem dimension: a genetic algorithm
and a combined genetic algorithm and SA. It was
ultimately shown that using all features extracted
from an image for classification not only increases
the computational complexity but also does not
always yield the highest recognition rate. Reducing
the problem dimension through such algorithms
therefore appears necessary.
Table 1. Results and comparison

Classifier                                      Classification rate (%)   Number of features
No feature selection                            77                        81
Feature selection by GA                         80                        55
Feature selection by combination of GA and SA   82                        60
using two (high and low) thresholds