Willkommen bei Scribd!

Assignment 2

Hochgeladen von

0% fanden dieses Dokument nützlich (0 Abstimmungen)

25 Ansichten3 Seiten

Assignment 2 (due on 10 / 13 / 2008) (Datawarehousing, Classification) by Huzefa Rangwala This is an individual assignment. Please ensure that assignment is submitted in class in hard copy before start class. No late submissions allowed.

Originalbeschreibung:

Copyright

Verfügbare Formate

PDF, TXT oder online auf Scribd lesen

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Dieses Dokument melden

Copyright:

Attribution Non-Commercial (BY-NC)

Verfügbare Formate

Als PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

0% fanden dieses Dokument nützlich (0 Abstimmungen)

25 Ansichten3 Seiten

Assignment 2

Hochgeladen von

Mazhar Bari

Copyright:

Attribution Non-Commercial (BY-NC)

Verfügbare Formate

Als PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

Zu Seite

Sie sind auf Seite 1von 3

Im Dokument suchen

INFS 755 (Fall 2008)

Assignment 2 (Due on 10/13/2008)

(Datawarehousing, Classification)
by Huzefa Rangwala
This is an individual assignment. Please ensure that assignment is submitted in class in
hard copy before start class. No late submissions allowed. The first part of the assignment
introduces you to reading a research article. It is on data warehousing and serves as a
complementary reading to the book. The second part of the assignment are a few
questions on classification algorithms (small exercise in weka) from Chapter 4.

Part 1 (50 points)

Read the research article in the business intelligence journal

(http://www.tdwi.org/research/display.aspx?ID=6810). Based on your reading of this
article do the following.
1. What are the differences between the data warehouse and operational databases
from a user perspective. (5 points)
2. What does Kimball think about snowflake schema ? Do you agree with his
point of view ? (10 points)
3. Understand the steps involved in the converting the ER model for Retail Sales
System to a dimensional model. Now design a data-warehouse for George
Mason University consisting of the following dimensions: student, course,
semester, and instructor. One set of measure I had like to see is student GPA.
Design your dimensions in such a way that you could answer all the sub-
questions below. Please read all questions before beginning your design.
a. Draw a star-schema model
b. Discuss what are the benefits of your designed data warehouse. What
kind of information can I extend quickly and efficiently ?
c. Extend the star schema to snow flake schema. Ensure that you came up
with attributes of dimension tables in part (i) that allowed you to do this.
d. What OLAP operation will allow you to extract the following
information from your warehouse (ex: drilling, slicing ?)
i. Toughmost grader among the students in the Volneague School of
IT &E.
ii. The average GPA for the girl students in the Computer Science
Department for the year 2008. (30 points)
4. Based on the reading how are ER modeling and dimensional modeling related ?
(5 points)
Part 2 (50 points)
1. Given the following training examples for the target concept EnjoySport . (20 points)

Example Sky AirTemp Humidity Wind Water Forecast EnjoySport

1 Sunny Warm Normal Strong Warm Same Yes
2 Sunny Warm High Strong Warm Same Yes
3 Rainy Cold High Strong Cool Change No
4 Sunny Warm High Strong Cool Change Yes
5 Sunny Warm Normal Weak Warm Same No

Draw the decision tree that will be learnt by ID3 based on the training examples.
Show the value for information gain for each of the candidate attributes for each
step in the growing tree.

Now load the dataset in weka and learn a decision-tree using the id3, SimpleCART,
and J48 classiers. Remember to click on “Use training set”. Compare the three trees
you generated. Which one of them you think has the best performance on an unseen
test set and why ?

2. Consider the following data set for a binary class problem (below) (10 points)
a. Calculate the information gain when splitting on A and B. Which attribute
would the decision tree induction algorithm choose?
b. Calculate the gain in the Gini index when splitting on A and B. Which attribute
would the decision tree induction algorithm choose?

A B Class Label
T F +
T T +
T T +
T F -
T T +
F F -
F F -
F F -
T T =
T F -

3. If you had to choose between the na¨ıve Bayes and k-nearest neighbor classifiers,
which would you prefer for a classification problem where there are numerous missing
values in the training and test data sets? Indicate your choice of classifier and briefly
explain why the other one may not work so well? (10 points)

4. Explain why accuracy can be misleading in measuring the performance of

classification models. Give an example. (5 points)
5. How do you handle missing valued data while learning decision trees ? (5 points).

Das könnte Ihnen auch gefallen

Fear: Trump in the White House
Von Everand
Fear: Trump in the White House
Bob Woodward
Bewertung: 3.5 von 5 Sternen
3.5/5 (738)
A Man Called Ove: A Novel
Von Everand
A Man Called Ove: A Novel
Fredrik Backman
Bewertung: 4.5 von 5 Sternen
4.5/5 (4610)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Von Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Bewertung: 3.5 von 5 Sternen
3.5/5 (231)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Von Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Bewertung: 4.5 von 5 Sternen
4.5/5 (121)
Grit: The Power of Passion and Perseverance
Von Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Bewertung: 4 von 5 Sternen
4/5 (588)
Never Split the Difference: Negotiating As If Your Life Depended On It
Von Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Bewertung: 4.5 von 5 Sternen
4.5/5 (838)
The Little Book of Hygge: Danish Secrets to Happy Living
Von Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Bewertung: 3.5 von 5 Sternen
3.5/5 (400)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Von Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Bewertung: 4.5 von 5 Sternen
4.5/5 (266)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Von Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Bewertung: 4 von 5 Sternen
4/5 (5794)
Rise of ISIS: A Threat We Can't Ignore
Von Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Bewertung: 3.5 von 5 Sternen
3.5/5 (137)
Her Body and Other Parties: Stories
Von Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Bewertung: 4 von 5 Sternen
4/5 (821)
A Tree Grows in Brooklyn
Von Everand
A Tree Grows in Brooklyn
Betty Smith
Bewertung: 4.5 von 5 Sternen
4.5/5 (1929)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Von Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Bewertung: 4 von 5 Sternen
4/5 (1090)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Von Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Bewertung: 3.5 von 5 Sternen
3.5/5 (2259)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Von Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Bewertung: 4.5 von 5 Sternen
4.5/5 (345)
Shoe Dog: A Memoir by the Creator of Nike
Von Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Bewertung: 4.5 von 5 Sternen
4.5/5 (537)
The Emperor of All Maladies: A Biography of Cancer
Von Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Bewertung: 4.5 von 5 Sternen
4.5/5 (271)
The Glass Castle: A Memoir
Von Everand
The Glass Castle: A Memoir
Jeannette Walls
Bewertung: 4.5 von 5 Sternen
4.5/5 (1713)
Team of Rivals: The Political Genius of Abraham Lincoln
Von Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Bewertung: 4.5 von 5 Sternen
4.5/5 (234)
John Adams
Von Everand
John Adams
David McCullough
Bewertung: 4.5 von 5 Sternen
4.5/5 (2409)
Principles: Life and Work
Von Everand
Principles: Life and Work
Ray Dalio
Bewertung: 4 von 5 Sternen
4/5 (599)
Yes Please
Von Everand
Yes Please
Amy Poehler
Bewertung: 4 von 5 Sternen
4/5 (1891)
The Light Between Oceans: A Novel
Von Everand
The Light Between Oceans: A Novel
M.L. Stedman
Bewertung: 4.5 von 5 Sternen
4.5/5 (789)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Von Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Bewertung: 4 von 5 Sternen
4/5 (895)
Wolf Hall: A Novel
Von Everand
Wolf Hall: A Novel
Hilary Mantel
Bewertung: 4 von 5 Sternen
4/5 (3811)
The Perks of Being a Wallflower
Von Everand
The Perks of Being a Wallflower
Stephen Chbosky
Bewertung: 4.5 von 5 Sternen
4.5/5 (2104)
The Woman in Cabin 10
Von Everand
The Woman in Cabin 10
Ruth Ware
Bewertung: 3.5 von 5 Sternen
3.5/5 (2322)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Von Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Bewertung: 4.5 von 5 Sternen
4.5/5 (474)
Sing, Unburied, Sing: A Novel
Von Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Bewertung: 4 von 5 Sternen
4/5 (1103)
The Art of Racing in the Rain: A Novel
Von Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Bewertung: 4 von 5 Sternen
4/5 (4200)
Angela's Ashes: A Memoir
Von Everand
Angela's Ashes: A Memoir
Frank McCourt
Bewertung: 4.5 von 5 Sternen
4.5/5 (440)
The Constant Gardener: A Novel
Von Everand
The Constant Gardener: A Novel
John le Carré
Bewertung: 3.5 von 5 Sternen
3.5/5 (104)
The Outsider: A Novel
Von Everand
The Outsider: A Novel
Stephen King
Bewertung: 4 von 5 Sternen
4/5 (1839)
On Fire: The (Burning) Case for a Green New Deal
Von Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Bewertung: 4 von 5 Sternen
4/5 (74)
Brooklyn: A Novel
Von Everand
Brooklyn: A Novel
Colm Tóibín
Bewertung: 3.5 von 5 Sternen
3.5/5 (1937)
The Yellow House: A Memoir (2019 National Book Award Winner)
Von Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Bewertung: 4 von 5 Sternen
4/5 (98)
The Unwinding: An Inner History of the New America
Von Everand
The Unwinding: An Inner History of the New America
George Packer
Bewertung: 4 von 5 Sternen
4/5 (45)
Little Women
Von Everand
Little Women
Louisa May Alcott
Bewertung: 4 von 5 Sternen
4/5 (104)
SFDC PD1
Dokument3 Seiten
SFDC PD1
john
100% (1)
AadhaarTechnologyArchitecture March2014 PDF
Dokument170 Seiten
AadhaarTechnologyArchitecture March2014 PDF
anivar_aravind
Noch keine Bewertungen
Bad Feminist: Essays
Von Everand
Bad Feminist: Essays
Roxane Gay
Bewertung: 4 von 5 Sternen
4/5 (1016)
Database
Dokument4 Seiten
Database
abhinash biswal
Noch keine Bewertungen
Steve Jobs
Von Everand
Steve Jobs
Walter Isaacson
Bewertung: 4.5 von 5 Sternen
4.5/5 (806)
Manhattan Beach: A Novel
Von Everand
Manhattan Beach: A Novel
Jennifer Egan
Bewertung: 3.5 von 5 Sternen
3.5/5 (792)
Security Assignment
Dokument114 Seiten
Security Assignment
Lenovo Legion
Noch keine Bewertungen
SAP Unit Testing
Dokument14 Seiten
SAP Unit Testing
Prudhvikrishna Gurram
100% (1)
Project Report For MCA Final Year
Dokument73 Seiten
Project Report For MCA Final Year
Pankaj Manali
Noch keine Bewertungen
Chapter 22 Telecom Processes
Dokument8 Seiten
Chapter 22 Telecom Processes
maheshg
Noch keine Bewertungen
Introduction To Security
Dokument12 Seiten
Introduction To Security
Sharmistha Roy
Noch keine Bewertungen
Vijayraj 1
Dokument27 Seiten
Vijayraj 1
Real Chimkandi
Noch keine Bewertungen
VP Director Software Engineering in Boston MA Resume Abha Dogra
Dokument2 Seiten
VP Director Software Engineering in Boston MA Resume Abha Dogra
AbhaDogra
100% (1)
Module 3 - Implementing Virtual Machines
Dokument62 Seiten
Module 3 - Implementing Virtual Machines
Job Llanos Montaldo
Noch keine Bewertungen
Monroe: Measuring Mobile Broadband Networks in Europe
Dokument67 Seiten
Monroe: Measuring Mobile Broadband Networks in Europe
not jappening
Noch keine Bewertungen
Questions of SQL Assignment
Dokument3 Seiten
Questions of SQL Assignment
Karan malhi
Noch keine Bewertungen
DATASTAGE Unable To Stop Job in DataStage Director - by Aysenur Karatay - Medium
Dokument8 Seiten
DATASTAGE Unable To Stop Job in DataStage Director - by Aysenur Karatay - Medium
Luis Yoshihira
Noch keine Bewertungen
Log
Dokument21 Seiten
Log
RIZKY•YT
Noch keine Bewertungen
Java Naming Conventions
Dokument3 Seiten
Java Naming Conventions
Tine Valen
Noch keine Bewertungen
Console 1
Dokument314 Seiten
Console 1
Eze Alexander Ik
Noch keine Bewertungen
Dapp University Github
Dokument2 Seiten
Dapp University Github
vsrajkumar
Noch keine Bewertungen
Guidelines 9 - 2022 - GDPR Personal Data Breach Notification
Dokument32 Seiten
Guidelines 9 - 2022 - GDPR Personal Data Breach Notification
raulx
Noch keine Bewertungen
How Do I Setup LDAP SSL and Certificates in ADAM
Dokument3 Seiten
How Do I Setup LDAP SSL and Certificates in ADAM
attiliasp
Noch keine Bewertungen
Liveaction: Gui-Based Management and Visualization For Cisco Intelligent Wan
Dokument12 Seiten
Liveaction: Gui-Based Management and Visualization For Cisco Intelligent Wan
Jitendramatrix
Noch keine Bewertungen
WSO2 Storage Server: Documentation
Dokument336 Seiten
WSO2 Storage Server: Documentation
Ala
Noch keine Bewertungen
BDS Course Handout - Intuit PDF
Dokument6 Seiten
BDS Course Handout - Intuit PDF
Prasanth Tarikoppad
Noch keine Bewertungen
Veritas Backup Exec Migration Assistant
Dokument21 Seiten
Veritas Backup Exec Migration Assistant
ArunMurali
Noch keine Bewertungen
User Wip Guide
Dokument4 Seiten
User Wip Guide
xieing.et.wltfifi
Noch keine Bewertungen
PG NiceLabel WebSDK Eng
Dokument51 Seiten
PG NiceLabel WebSDK Eng
Teddy Dipataruna
Noch keine Bewertungen
NancyFX Succinctly
Dokument106 Seiten
NancyFX Succinctly
Алексей Гоголев
Noch keine Bewertungen
CETPA INFOTECH PVT. LTD-Dehradun, Dehradun Uttaranchal: Summer Training Report On Core Java Undertaken
Dokument46 Seiten
CETPA INFOTECH PVT. LTD-Dehradun, Dehradun Uttaranchal: Summer Training Report On Core Java Undertaken
Aman kumar
Noch keine Bewertungen
Penetration Testing
Dokument21 Seiten
Penetration Testing
Bryam Molina Batista
Noch keine Bewertungen
Practice 1-1 For Evaluting Data
Dokument2 Seiten
Practice 1-1 For Evaluting Data
Ashley
Noch keine Bewertungen