Beruflich Dokumente
Kultur Dokumente
Data Mining
Course No(s)
IS ZC415
Credit Units
Credit Model
Content Authors
Course Objectives
No
CO1
CO2
CO3
Text Book(s)
T1
Tan P. N., Steinbach M & Kumar V. Introduction to Data Mining Pearson Education, 2006
T2
Data Mining: Concepts and Techniques, Third Edition by Jiawei Han, Micheline Kamber and
Jian Pei Morgan Kaufmann Publishers
Reference Book(s) & other resources
R1
Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner by Vijay Kotu
and Bala Deshpande Morgan Kaufmann Publishers 2015
Content Structure
Modules
No.
M1
M2 Data Preprocessing:
To understand the need for data preprocessing and various techniques used in the context of
Data Mining
M3 Data Exploration:
A preliminary exploration of the data to better understand its characteristics
M4 Classification and prediction:
To learn different techniques and algorithms for classification, a major predictive and
supervised Data Mining task
M5 Association Analysis:
To understand the descriptive relation between the entities by identifying associations among
them and to learn various algorithms to find them
M6 Clustering:
To learn different techniques and algorithms for clustering, a major descriptive and
unsupervised Data Mining task
M7 Anomaly Detection:
Detecting outliers and noise in data sets is an important Data Mining task. This module focuses
on techniques needed for anomaly detection
M8 Data Mining on unstructured(Big) data:
Graph Mining, Social Network Analysis, Multimedia Data Mining, Text Mining, Mining the
World Wide Web
M9 Data Mining Applications:
Recommendation Systems
Fraud Detection
Sentiment Analysis
Glossary of Terms:
1. Contact Hour (CH) stands for a hour long live session with students conducted either in a physical
classroom or enabled through technology. In this model of instruction, instructor led sessions will be
for 20 CH.
a. Pre CH = Self Learning done prior to a given contact hour
b. During CH = Content to be discussed during the contact hour by the course instructor
c. Post CH = Self Learning done post the contact hour
2. RL stands for Recorded Lecture or Recorded Lesson. It is presented to the student through an online
portal. A given RL unfolds as a sequences of video segments interleaved with exercises
3. SS stands for Self-Study to be done as a study of relevant sections from textbooks and reference
books. It could also include study of external resources.
4. LE stands for Lab Exercises
5. HW stands for Home Work will consist of discussed/new problems; could be a selection of problems
RL1.2
RL1.3
CS1.1
CS1.1.1 = Review of Data Mining basics Examples of patterns that can be mined
CS1.1.2 = Examples of technologies used in DM Approaches to overcome challenges.
Discuss one example Case Study for data mining
LE1.1
SS1.1
HW1.1
QZ1.1
M2: Data Preprocessing
Type
Description/Plan/Reference
RL2.1
RL2.2
CS2.1
LE2.1
SS2.1
HW2.1
QZ2.1
RL3.2
CS3.1
LE3.1
SS3.1
HW3.1
QZ3.1
M4: Classification and Prediction
Type
Description/Plan/Reference
RL4.1
RL4.2
CS4.1
CS4.1.1 = Review of concepts of recorded lectures, Algorithm for Decision trees induction,
Classification by back propagation, Comparison of methods of classification
CS4.1.2 = Prediction: Other Regression-Based Methods.
LE4.1
SS4.1
HW4.1
QZ4.1
M5: Association Analysis
Type
Description/Plan/Reference
RL5.1
RL5.2
CS5.1
LE5.1
SS5.1
HW5.1
QZ5.1
M6: Clustering
Type
Description/Plan/Reference
RL6.1
RL6.2
CS6.1
LE6.1
SS6.1
HW6.1
QZ6.1
M7: Anomaly Detection
Type
Description/Plan/Reference
RL7.1
RL7.1.1 = Preliminaries
RL7.1.2 = Statistical approach
RL7.2
CS7.1
LE7.1
SS7.1
HW7.1
QZ7.1
M8: Data mining on unstructured (Big) data
Type
Description/Plan/Reference
RL8.1
RL8.1.1 = Graph Mining methods and applications- Graph Indexing, Similarity Search,
Classification, and Clustering
RL8.1.2 = Multimedia Data Mining- Classification and Prediction Analysis of Multimedia
Data, Mining Associations in Multimedia Data, Audio and Video Data Mining
RL8.2
CS8.1
LE8.1
SS8.1
HW8.1
QZ8.1
M9: Data Mining Applications
Type
Description/Plan/Reference
RL9.1
RL9.2
CS9.1
LE9.1
SS9.1
HW9.1
QZ9.1
Academic Term
Course Title
Data Mining
Course No
IS ZC415
Content Developer
Contact hour
RL 1.1, RL 1.2
CS 1.1
RL 1.3
CS1.2
RL2.1
CS2.1
RL2.2
CS2.2
RL3.1
CS3.1
RL3.2, RL3.3
CS3.2
CS4.1
RL4.4
CS4.2
RL5.1, RL5.2
CS5.1
10
Review
11
Review
12
RL5.3, RL5.4
CS5.2
13
RL6.1
CS6.1
14
CS6.2
15
RL7.1
CS7.1
16
RL7.2, RL7.3
CS7.2
17
CS8.1, CS8.2
18
CS9.1
19
20
21
Review
22
Review
Notes:
Post-contact hour
LE1.1, HW1.1 ,SS1.1
LE2.1, SS2.1, HW2.1
LE3.1, SS3.1, HW3.1
LE4.1, SS4.1, HW4.1
Evaluation Scheme:
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session
No
Name
Type
Duration Weight Day, Date, Session, Time
EC-1
Quiz-I/ Assignment-I Online
5%
September 1-10, 2016
Quiz-II
Online
5%
October 1-10, 2016
Lab
Online
10%
To be announced
EC-2
Mid-Semester Test
Closed Book 2 hours
30%
24/09/2016 (AN) 2 PM TO 4 PM
EC-3
Comprehensive Exam Open Book 3 hours
50%
05/11/2016 (AN) 2 PM TO 5 PM
Syllabus for Mid-Semester Test (Closed Book): Topics in Session Nos. 1 to 11
Syllabus for Comprehensive Exam (Open Book): All topics (Session Nos. 1 to 22)
Important links and information:
Elearn portal: https://elearn.bits-pilani.ac.in
Students are expected to visit the Elearn portal on a regular basis and stay up to date with the latest
announcements and deadlines.
Contact sessions: Students should attend the online lectures as per the schedule provided on the Elearn
portal.
Evaluation Guidelines:
1. EC-1 consists of either two Assignments or three Quizzes. Students will attempt them through the
course pages on the Elearn portal. Announcements will be made on the portal, in a timely manner.
2. For Closed Book tests: No books or reference material of any kind will be permitted.
3. For Open Book exams: Use of books and any printed / written reference material (filed or bound) is
permitted. However, loose sheets of paper will not be allowed. Use of calculators is permitted in all
exams. Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
4. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the student
should follow the procedure to apply for the Make-Up Test/Exam which will be made available on
the Elearn portal. The Make-Up Test/Exam will be conducted only at selected exam centres on the
dates to be announced later.
It shall be the responsibility of the individual student to be regular in maintaining the self study schedule as
given in the course handout, attend the online lectures, and take all the prescribed evaluation components
such as Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the evaluation scheme
provided in the handout.