Sie sind auf Seite 1von 5

SRM UNIVERSITY

FACULTY OF ENGINEERING AND TECHNOLOGY


DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
COURSE PLAN

CourseCode : CS1141
CourseTitle : Data Mining
Semester : VI
CourseTime : JAN 2016-MAY2016
Session Details
SECTION DAY ORDER PERIOD TIMINGS
3 3 10.40 to 11.30
Group - I 4 5 1.25 to 2.15
5 5 1.25 to 2.15
3 7 3.15 to 4.05
Group - II 4 5 1.25 to 2.15
5 5 1.25 to 2.15

Location:S.R.M.E.C-TechPark
Faculty Details

Group Name OFFICE OFFICE HOURS Mail id


Tech Park Monday to Saturday
Mrs.B.Sowmya sowmiya.b@ktr.srmuniv.ac.in
603A 8.50 AM to 4.55 PM
I
Tech Park Monday to Saturday ushasukhanya.s@ktr.srmuniv.ac.in
Mrs. UshaSukhanya
003A 8.50 AM to 4.55 PM
Tech Park Monday to Saturday nithyakalyani.a@ktr.srmuniv.ac.in
Ms.Nithyakalyani
003A 8.50 AM to 4.55 PM
II
Tech Park Monday to Saturday girija.s@ktr.srmuniv.ac.in
Ms. S.Girija
003A 8.50 AM to 4.55 PM

Text Book
1. Shawkat Ali A B M, Saleh A.Wasimi, “Data Mining: Methods and Techniques”, Third Indian Reprint,
Cengage Learning, 2010.

Reference
1. Soman K. P., ShyamDiwakar, Ajay V, “Insight into Data Mining Theory and Practice”, Fifth Printing, PHI
Learning, 2011.

OnlineReferences
1. www.autonlab.org/tutorials : Statistical Data mining Tutorials
2. www-db.standford.edu/`ullman/mining/mining.html : Data mining lecture notes
3. ocw.mit.edu/ocwweb/slon-School-of-management/15-062Data-MiningSpring2003/course home/index.htm : MIT Data mining
open courseware
4. www.kdnuggets.com: Data mining resources
Web Links of Similar Courses Offered at Other Universities
1. PurdueUniversity :Introduction to Data mining:
www.cs.purdue.edu/homes/clifton/cs490d/
2. University of New South Wales :Data warehousing and Data mining
www.cse.unsw.edu.au/~cs9318/
3. YorkUniversity: Data mining
www.cs.yorku.ca/course-archieve/2005-06/w/4412/
4. IIT- Madras :Data Mining
www.iitm.ernet.in/~cs672/
5. New yorkUniversity: Data warehousing/mining
www.cs.nyu.edu/courses/spring03/G22.3033-015

Prerequisite
¾ An introductory course on database systems.
¾ Basic concepts in probability and statistics.

Objectives
¾ To learn the concepts of data processing
¾ To understand the different data mining techniques
¾ To perform data mining tasks with relevant tools

AssessmentDetails
Cycle Test – I : 10 Marks
Surprise Test – I : 5 Marks
Cycle Test – II : 10 Marks
Model Exam : 20 Marks
Attendance : 5 Marks

TestSchedule

S.No DATE TEST TOPICS DURATION


1 CycleTest-I UnitI &II 50 minutes
2 CycleTest-II UnitIII &IV 50 minutes
3 Model Exam All5 units 3 hours

Learning Outcome

By the end of this course the student should be able to describe and utilize a range of techniques for designing
data mining and data warehousing systems.

¾ Understand the functionality of the various data mining and data warehousing components
¾ Appreciate the strengths and limitations of various data mining and data warehousing models
¾ Compare the various approaches to data mining and data warehousing implementations

Detailed Session Plan

UNIT 1 - FUNDAMENTALS
DATA MINING, DATA PROCESSING AND DATA WAREHOUSES
Data Mining – History – Strategies – Techniques – Applications – Challenges – Future – Types of Data – Data
Warehouses – Data Processing –Quality Measure – OLAP – Sampling.
DATA TYPES, INPUT AND OUTPUT OF DATA MINING ALGORITHMS
Different Types of features – Concept Learning – Output of Data Mining Algorithms.
PREPROCESSING IN DATA MINING
Steps – Discretization – Feature Extraction, Selection and construction – Missing Data and Techniques for dealing it.
Sessi
Time Teaching
on Topics tobe covered Ref TestingMethod
(min) Method
No.
Data Mining – History - Strategies
1 50 T1 BB Group discussion
2 Techniques - Applications 50 T1 BB Group discussion
Challenges - Future T1 Assignment
3 50 BB
4 Types of data – Data Warehouses 50 T1 BB Group discussion
Data Processing – Quality Measure T1 Group discussion
5 50 BB
OLAP - Sampling T1 Assignment
6 50 BB
DATA TYPES, INPUT AND OUTPUT OF R1 Group discussion
7 DATA MINING ALGORITHMS 50 BB
Different Types of features – Concept Learning
Output of Data Mining R1 Group discussion
AlgorithmsPREPROCESSING IN DATA
50 BB
8
MININGSteps – Discretization
Feature Extraction, Selection and construction T1 Discussion
9 – Missing Data and Techniques for dealing it 50 BB

UNIT 2 - WEKA TOOL


Introduction – Installation – Visualisation – Filtering – Selecting Attributes – Other popular packages.
CLASSIFICATION TASK
Introduction – Decision trees – Naïve Bayes classification – Artificial Neural Networks and Support Vector Machines.

10 Introduction – Installation 50 Demo Discussion


T1
Visualization T1 Demo Illustration byexamples
11 50
Filtering T1 Demo Illustration byexamples
12 50
Selecting Attributes T1 Demo Illustration byexamples
13 50
14 Other popular packages 50 T1 Demo Illustration byexamples
CLASSIFICATION TASK T1 Problem solving
15 50 BB
Decision trees surprisetest
Naïve Bayes classification Problem solving,
16 50 T1 BB
Surprisetest
Artificial Neural Networks T1 Problem solving,
17 50 BB
Assignment
Support Vector Machines T1 Problem solving,
18 50 BB Assignment

UNIT 3 – MODEL EVALUATION TECHNIQUES


Accuracy Estimation – ROC – Lift Charts – Cost – Bagging and Boosting – Model Ranking Approach.
ASSOCIATION RULE MINING:
Concepts, Relevance, Functions of Association rule Mining – Apriori Algorithm – Strengths and Weaknesses of ARM –
Applications.
19 Accuracy Estimation – ROC – Lift Charts 50 BB Discussion
T1
20 Cost – Bagging and Boosting T1 BB Illustration byexamples
50
21 Model Ranking Approach T1 BB Illustration byexamples
50
22 ASSOCIATION RULE MINING: T1 BB Illustration byexamples
50
Concepts, Relevance
23 Functions of Association rule Mining 50 T1 BB Illustration byexamples
24 Apriori Algorithm T1 Problem solving
50 BB
Assignment
25 Apriori Algorithm problems Problem solving,
50 T1 BB
Surprisetest
26 Strengths and Weaknesses of ARM T1 Problem solving,
50 BB
Surprisetest
Applications T1 Problem solving,
27 50 BB Surprisetest

UNIT 4 – CLUSTERING AND ESTIMATION


CLUSTERING TASK: Introduction – Distance Measure – Types – KNN for clustering – Validation – Strengths and
Weaknesses of Algorithms – Applications.
ESTIMATION TASK: Scatter Plots and Correlation – Linear regression models – Logic regression – Regression
Analysis – Strength and Weaknesses of Estimation – Applications.
28 CLUSTERING TASK: Introduction – Distance 50 BB Discussion
Measure - Types
T1
29 KNN for clustering – Validation T1 BB Group Discussion
50
30 Strengths and Weaknesses of Algorithms - T1 BB Group Discussion
50
Applications.
31 ESTIMATION TASK: Scatter Plots and T1 BB Illustration byexamples
Correlation 50
32 Linear regression models – Logic regression 50 T1 BB Illustration byexamples
33 Regression Analysis T1 Problem solving
50 BB
Assignment
34 Strength and Weaknesses of Estimation Group Discussion
50 T1 BB
35 Applications T1 Group Discussion
50 BB

UNIT 5 – MINING OF TIME SERIES


Fundamentals – Time series models – Regression, Periodic Models – Strengths and Weaknesses of Time series
Analysis – Applications.
Text and Web Mining – Privacy, Security and Ethical Issues in Data Mining.

37 Fundamentals of Time series analysis 50 BB Discussion


T1
38 Time series models, Auto regressive models, Problem solving
Moving average models
39 Regression, Periodic Models T1 BB Problem
50 solvingIllustration
40 Strengths and Weaknesses of Time series T1 BB Group discussion
Analysis – Applications.
50
41 Text Mining – Measures of text retrieval T1 BB Group discussion
– Text frequency matrix – Tools - 50
Illustration byexamples
Applications
42 Web Mining 50 T1 BB Group discussion
43 Privacy, Security and Ethical Issues in Data T1 Group discussion
50 BB
Mining.

BB– Black Board


T1 – Text Book 1
R1 – Reference 1

Prepared By
Staff Name: A. NithyaKalyani, AP /CSE
Signature: HOD/CSE

Das könnte Ihnen auch gefallen