Beruflich Dokumente
Kultur Dokumente
(DM)
SPRING 2017
Lecture 1:
Introduction
12
What is Data Mining?
Definition
Data mining (knowledge discovery from data)
Extraction of interesting (non-trivial, implicit,
previously unknown and potentially useful) patterns or
knowledge from huge amount of data.
Process of semiautomatically automatically
analyzing large databases to find patterns that are:
valid: hold on new data with some certainty
novel: nonobvious to the system
useful : should be possible to act on the item
understandable: humans should be able to interpret the
pattern
What Is Data Mining?
Alternative names
Knowledge discovery (mining) in databases (KDD),
knowledge extraction, data/pattern analysis, data
archeology, data dredging, information harvesting,
business intelligence, etc.
Watch out: Is everything data mining?
Simple search and query processing
(Deductive) expert systems
15
Knowledge Discovery (KDD) Process
This is a view from typical
Knowledge
database systems and data
warehousing communities
Pattern Evaluation
Data mining plays an essential
role in the knowledge discovery
process
Data Mining
Task-relevant Data
Data Cleaning
Data Integration
Databases
16
Data Mining in Business Intelligence
End User
Increasing potential Decisio
to support
n
business decisions
Making
Data Presentation Business
Analyst
Visualization Techniques
Data Mining Data
Information Discovery Analyst
Data Exploration
Statistical Summary, Querying, and Reporting
Patte
Inform rn
a
Input Data Data Pre- Data Post- Know tion
Processin ledge
Processing Mining
g
19