Beruflich Dokumente
Kultur Dokumente
(Symbolic methods)
Introduction
What is learning?
) unknownknown
) learning enables to perform better next time
) learning enables to change our knowledge structure
) learning is related to adaptation (change for better)
) learning as search (learning problem solving search)
ID3_type Learning (decision
tree learning)
Motivation
Goalto search for order
to search for structures (tree)
The basic idea of ID3..
Introduction to information
(Theory Coding)
Channel
Input Output
e.g.P1,P2,,Pn is n symbols(Input)
Measure of information (Uncertainty)
( ) 0 p w =>
)surprise contains more information
A highly predictable sequence contains little actual information
Example: 11011011011011011011011011 (whats next?)
Example: I didnt win the lottery this week
A completely unpredictable sequence of n bits contains n bits of information
Example: 01000001110110011010010000 (whats next?)
Example: I just won $10 million in the lottery!!!!
)get more information from an observation of a rare event
S={s
1
,s
2
,,s
n
}
p(s
1
),p(s
2
)p(s
n
)
Question: If s
i
happenshow much
information do we have?
2
( ) log ( )
i i
I S P S =
2
2
2 2
( ) 1 log 1 0
( ) 0 log 0
( ) ( ) log log
i
i
i i i j
p S
p S
p S p S S S
= =
= =
> s
How to measure the information for the distribution
How to measure the information
for the distribution
exp
( ) ( ) log ( )
i
i i
S S
average ectation
I p p S p S
e
Class
HeightHairEyes
attribute
objects
-
NP-
35[3+,5-]
E(S)=E([3+,5-])=-(3/8)log2(3/8)-
(5/8)log2(5/8)=0.96
ID3
Root
Exampleslabel=+Root
Exampleslable=-Root
AttributesRootlable=Examples
AAttributes
RootA
RootA=vi
Example-viExamplesAvi
Examples-vi
lable=Examples
ID3(example-vi,target-
attribute,attributes-|A|
Root
ID3
ID3
ID3
ID3
(1)
overfitting
forward pruning
backward pruning
MDL
2
gain ratioGini-indexdistance
measure
3
Thank You!