Sie sind auf Seite 1von 5

27/9/2019 Data Mining For Dummies Cheat Sheet

  Programming  Big Data  Data Science  Data Mining For Dummies Cheat Sheet

Cheat Sheet

Data Mining For Dummies Cheat Sheet


From Data Mining For Dummies

By Meta S. Brown

Data mining is the way that ordinary businesspeople use a range of data analysis techniques to uncover
useful information from data and put that information into practical use. Data miners don’t fuss over
theory and assumptions. They validate their discoveries by testing. And they understand that things
change, so when the discovery that worked like a charm yesterday doesn’t hold up today, they adapt.

The 9 Laws of Data Mining: A Reference Guide


Pioneering data miner Thomas Khabaza developed his “Nine Laws of Data Mining” to guide new data
miners as they get down to work. This reference guide shows you what each of these laws means to
your everyday work.

1st Law of Data Mining, or “Business Goals Law”: Business objectives are the origin of every data
mining solution.

A data miner is someone who discovers useful information from data to support speci c business
goals. Data mining isn’t de ned by the tool you use.

2nd Law of Data Mining, or “Business Knowledge Law”: Business Knowledge is central to every
step of the data mining process.

https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 1/5
27/9/2019 Data Mining For Dummies Cheat Sheet

You don’t have to be a fancy statistician to do data mining, but you do have to know something
about what the data signi es and how the business works.

3rd Law of Data Mining or “Data Preparation Law”: Data preparation is more than half of every
data mining process.

Pretty much every data miner will spend more time on data preparation than on analysis.

4th Law of Data Mining, or “No Free Lunch for the Data Miner”: The right model for a given
application can only be discovered by experiment.

In data mining, models are selected through trial and error.

5th Law of Data Mining: There are always patterns in the data.

Enov8 Data Compliance Suite - (DevOps Edition)


OPEN
Automatically pro le your data, identify risks, remediate/mask and validate compliance.
enov8.com

As a data miner, you explore data in search of useful patterns. Understanding patterns in the data
enables you to in uence what happens in the future.

6th Law of Data Mining, or “Insight Law”: Data mining ampli es perception in the business
domain.

Data mining methods enable you to understand your business better than you could have done
without them.

7th Law of Data Mining or “Prediction Law”: Prediction increases information locally by
generalization.

Data mining helps us use what we know to make better predictions (or estimates) of things we
don’t know.

Data obfuscation tool - Demonstrate software


safely OPEN

Make test data which looks like real data rststrike.com.au

https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 2/5
27/9/2019 Data Mining For Dummies Cheat Sheet

8th Law of Data Mining, or “Value Law”: The value of data mining results is not determined by the
accuracy or stability of predictive models.

Your model must produce good predictions, consistently. That’s it.

9th Law of Data Mining, or “Law of Change”: All patterns are subject to change.

Any model that gives you great predictions today may be useless tomorrow.

Phases of the Data Mining Process


The Cross-Industry Standard Process for Data Mining (CRISP-DM) is the dominant data-mining process
framework. It’s an open standard; anyone may use it. The following list describes the various phases of
the process.

Data obfuscation tool - Demonstrate software


safely OPEN

Make test data which looks like real data rststrike.com.au

Business understanding: Get a clear understanding of the problem you’re out to solve, how it
impacts your organization, and your goals for addressing it. Tasks in this phase include:

Identifying your business goals

Assessing your situation

De ning your data mining goals

Producing your project plan

Data understanding: Review the data that you have, document it, identify data management and
data quality issues. Tasks for this phase include:

Gathering data

Describing

Exploring

https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 3/5
27/9/2019 Data Mining For Dummies Cheat Sheet

Verifying quality

Data preparation: Get your data ready to use for modeling. Tasks for this phase include:

Selecting data

Cleaning data

Constructing

Integrating

Formatting

Modeling: Use mathematical techniques to identify patterns within your data. Tasks for this phase
include:

Selecting techniques

Designing tests

Building models

Assessing models

Evaluation: Review the patterns you have discovered and assess their potential for business use.
Tasks for this phase include:

Evaluating results

Reviewing the process

Determining the next steps

Deployment: Put your discoveries to work in everyday business. Tasks for this phase include:

Planning deployment (your methods for integrating data mining discoveries into use)

Reporting nal results

Reviewing nal results

https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 4/5
27/9/2019 Data Mining For Dummies Cheat Sheet

https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 5/5

Das könnte Ihnen auch gefallen