ASET

Amity School of Engineering & Technology


B. Tech. (CS & IT), VI Semester
E-commerce & ERP
Sunil Vyas

Data Mining

New buzzword, old idea.
Inferring new information from already collected data.
Traditionally the job of data analysts; computers have changed this.
It is far more efficient to comb through data using a machine than to eyeball statistical data.

Data Mining: Two Main Components

Wikipedia definition: Data mining is the entire process of applying computer-based methodology, including new techniques for knowledge discovery, from data.
Knowledge Discovery: Concrete information gleaned from known data. Information you may not have known, but which is supported by recorded facts.

Data Mining vs. Data Analysis


In terms of software and the marketing thereof, Data Mining != Data Analysis.

Data Mining implies that the software applies some intelligence, beyond simple grouping and partitioning of data, to infer new information.

Data Analysis is more in line with standard statistical software (e.g. web stats). Such tools usually present information about subsets and relations within the recorded data set (e.g. browser/search engine usage, average visit time, etc.).
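
To make the contrast concrete, here is a minimal sketch of the "simple grouping" style of data analysis, assuming a tiny web log of (browser, visit time) pairs. The records and the function names `browser_usage` and `average_visit_time` are made up for illustration and do not come from any real statistics package.

```python
from collections import Counter

# Hypothetical web-log records: (browser, visit duration in seconds).
visits = [
    ("Firefox", 120),
    ("Chrome", 45),
    ("Chrome", 300),
    ("Safari", 60),
    ("Chrome", 75),
]


def browser_usage(records):
    """Count visits per browser: plain grouping, no inference."""
    return Counter(browser for browser, _ in records)


def average_visit_time(records):
    """Mean visit duration in seconds over all records."""
    return sum(seconds for _, seconds in records) / len(records)


usage = browser_usage(visits)
avg = average_visit_time(visits)
print(usage.most_common())
print(avg)
```

Nothing here infers anything new: the output is a summary of what was recorded, which is the sense in which the slides separate Data Analysis from Data Mining.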

Data Mining Subtypes


Data Dredging: The process of scanning a data set for relations and then coming up with a hypothesis for the existence of those relations.
MetaData: Data that describes other data. Can describe an individual element, or a collection of elements.
Example: In a library, where the data is the content of the titles stocked, metadata about a title would typically include a description of the content, the author, the publication date, and the physical location.
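
The library example can be sketched as a small data structure. This is only an illustration: the class name `TitleMetadata`, its field names, and the catalog entries are all hypothetical, chosen to mirror the four metadata items listed above.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class TitleMetadata:
    """Metadata about one library title (not the title's content itself)."""
    description: str
    author: str
    publication_date: date
    shelf_location: str


# Hypothetical catalog: the key stands in for the title, the value is its metadata.
catalog = {
    "Example Title A": TitleMetadata(
        "Introductory text", "A. Author", date(2001, 5, 1), "Aisle 3, Shelf B"
    ),
    "Example Title B": TitleMetadata(
        "Reference manual", "B. Writer", date(1998, 9, 15), "Aisle 7, Shelf A"
    ),
    "Example Title C": TitleMetadata(
        "Follow-up volume", "A. Author", date(2004, 2, 20), "Aisle 3, Shelf C"
    ),
}


def titles_by_author(catalog, author):
    """Find titles using metadata alone, without reading any content."""
    return sorted(t for t, meta in catalog.items() if meta.author == author)


print(titles_by_author(catalog, "A. Author"))
```

The lookup never touches the content of a title; it works entirely on data that describes the data.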

Key Component of Data Mining


Whether Knowledge Discovery or Knowledge Prediction, data mining takes information that was once quite difficult to detect and presents it in an easily understandable format (e.g. graphical or statistical).
Data mining techniques involve sophisticated algorithms, including Decision Tree Classification, Association Detection, and Clustering.
Since Data Mining is not on the test, I will keep things superficial.
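
As a rough illustration of Association Detection, the sketch below counts how often pairs of items appear together in a handful of shopping baskets and derives simple "if A then B" rules with their support and confidence. The baskets, the function name `pair_rules`, and the `min_support` threshold are assumptions made for this example; real tools use far more scalable algorithms than this brute-force pair count.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, one set of items per transaction.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "eggs"},
]


def pair_rules(baskets, min_support=0.4):
    """Return {(antecedent, consequent): (support, confidence)} for item pairs.

    support    = fraction of baskets containing both items
    confidence = fraction of baskets with the antecedent that also
                 contain the consequent
    """
    n = len(baskets)
    item_counts = Counter(item for basket in baskets for item in basket)
    pair_counts = Counter(
        pair for basket in baskets for pair in combinations(sorted(basket), 2)
    )
    rules = {}
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue  # pair is too rare to be interesting
        rules[(a, b)] = (support, count / item_counts[a])
        rules[(b, a)] = (support, count / item_counts[b])
    return rules


rules = pair_rules(baskets)
for (a, b), (support, confidence) in sorted(rules.items()):
    print(f"{a} -> {b}: support={support:.2f}, confidence={confidence:.2f}")
```

A rule such as "butter -> bread" with high confidence is the kind of relation that was present in the recorded data all along but is hard to spot by eye.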

Uses of Data Mining


AI/Machine Learning
  Combinatorial/Game Data Mining: Good for analyzing winning strategies to games, and thus developing intelligent AI opponents (e.g. Chess).
Business Strategies
  Market Basket Analysis: Identify customer demographics, preferences, and purchasing patterns.
Risk Analysis
  Product Defect Analysis: Analyze product defect rates for given plants and predict possible complications (read: lawsuits) down the line.

Uses of Data Mining (Contd.)

User Behavior Validation
  Fraud Detection
    In the realm of cell phones: comparing phone activity to calling records can help detect calls made on cloned phones.
    Similarly, with credit cards: comparing purchases with historical purchases can detect activity with stolen cards.
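
The credit card case can be sketched as a comparison of a new purchase against the cardholder's purchase history. The sketch below assumes a deliberately simple rule, flagging any amount more than three sample standard deviations from the historical mean; the function name `is_suspicious`, the threshold, and the amounts are hypothetical, and production fraud detection relies on much richer signals.

```python
from statistics import mean, stdev


def is_suspicious(history, amount, threshold=3.0):
    """Flag `amount` if it deviates strongly from past purchase amounts.

    Assumption for this sketch: a purchase is suspicious when it lies more
    than `threshold` sample standard deviations from the historical mean.
    """
    if len(history) < 2:
        return False  # not enough history to judge
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        return amount != mu  # every past purchase was identical
    return abs(amount - mu) / sigma > threshold


# Hypothetical purchase history for one card.
history = [20.0, 35.5, 18.0, 42.0, 27.5, 30.0]

print(is_suspicious(history, 31.0))
print(is_suspicious(history, 950.0))
```

The same pattern applies to the cell phone case: replace purchase amounts with a measure of calling activity and compare it to the recorded calling history.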

Sources of Data for Mining


Databases (most obvious)
Text Documents
Computer Simulations
Social Networks

Bottom Line
Data obtained through Data Mining is incredibly valuable.

Companies are understandably reluctant to give up data they have obtained.
