Sie sind auf Seite 1von 3

Name Hair Height Weight Lotion Result

Digital Assignment
Course Name & Code: CSE4020 – Machine Learning
Slot: G1
Faculty Name: Dr.V.Manjula Maximum Marks: 10

1. Consider the instance space consisting of integer points in the x, y plane and the set of
hypotheses H consisting of rectangles. More precisely, hypotheses are of the form
, where a, b, c, and d can be any integers.

(a) Consider the version space with respect to the set of positive (+) and negative (-) training
examples shown below. What is the S boundary of the version space in this case? Write out
the hypotheses and draw them in on the diagram.

(b) What is the G boundary of this version space? Write out the hypotheses and draw them in.

(c) Suppose the learner may now suggest a new x, y instance and ask the trainer for its
classification. Suggest a query guaranteed to reduce the size of the version space, regardless
of how the trainer classifies it. Suggest one that will not.

2. Build a decision tree model to learn about the person who gets sunburn based on features
of past experience (training samples) given below by describing ID3 algorithm. Write the
rules from the decision tree.
Sarah blonde average Light No sunburned
(positive)
Dana blonde Tall average Yes none
(negative)
Alex brown short average Yes none
Annie blonde short average No sunburned
Emily red average heavy No sunburned
Pete brown Tall heavy No none
John brown average heavy No none
Katie blonde short Light Yes none

Also predict the given instances {Sam, brown, average, average, no} and {Raj , blonde,
average, average, yes}

b. Consider the above Training example. Find the Hypothesis space and Minimum number of
samples required for PAC learner that guarantees at most 5% and 98% of the time.

3. Study the relationship between the monthly rain fall and the umbrella sold. You have the
survey of the last year in the table given below. Draw a scatter plot and find the equation of
the regression line that fits the data best. Show that Sum of Squares of Error is less for the
regression line. Also forecast the umbrella sold when 175 mm and 190 mm rain fall.

Month Rainfall mm umberlla sold


Jan 82 15
Feb 92.5 25
March 83.2 17
Apr 97.7 28
may 131.9 41
jun 141.3 47
July 165.4 50
Aug 140 46
Sep 126.7 37
4. a) Compute the version space containing all hypotheses that are consistent with an
observed sequence of training examples given below, using Candidate elimination algorithm .
“Buy column is output variable

Company Country OS Weight Type Buy


Nokia Finland Android Moderate Smartphone yes
Oppo China Android Light Smartphone no
Nokia Finland Android Moderate Text pad yes
Vivo China Android Light Smartphone No
Samsung S.korea Blackberry Light Smartphone yes
Xiaomi China Android Light Smartphone No
Apple USA IoS Light Smartphone yes
b) Consider the above training example where all the variables were binary. Find the
Minimum number of sample required for PAC learner that guarantees at most 5% and 98% of
the time.

5. Apply the candidate elimination (CE) algorithm to the sequence of training examples
specified in Table given below and name the contents of the sets S and G after each step.
What happens, if the order of presentation of the training examples is altered?

Training running nose coughing reddened skin fever Classification


d1 + + + - positive(ill)
d2 + + – - positive(ill)
d3 - – + + positive(ill)
d4 – + + - negative(healthy)
d5 – – + - negative(healthy)
d6 – – – - negative(healthy)

Table : List of training instances for the medical diagnosis task.

b) Consider the above Training example where all the variables were binary. Find the
Minimum number of sample required for PAC learner that guarantees at most 10% and 99%
of the time.

Das könnte Ihnen auch gefallen