Beruflich Dokumente
Kultur Dokumente
Programming Big Data Data Science Data Mining For Dummies Cheat Sheet
Cheat Sheet
By Meta S. Brown
Data mining is the way that ordinary businesspeople use a range of data analysis techniques to uncover
useful information from data and put that information into practical use. Data miners don’t fuss over
theory and assumptions. They validate their discoveries by testing. And they understand that things
change, so when the discovery that worked like a charm yesterday doesn’t hold up today, they adapt.
1st Law of Data Mining, or “Business Goals Law”: Business objectives are the origin of every data
mining solution.
A data miner is someone who discovers useful information from data to support speci c business
goals. Data mining isn’t de ned by the tool you use.
2nd Law of Data Mining, or “Business Knowledge Law”: Business Knowledge is central to every
step of the data mining process.
https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 1/5
27/9/2019 Data Mining For Dummies Cheat Sheet
You don’t have to be a fancy statistician to do data mining, but you do have to know something
about what the data signi es and how the business works.
3rd Law of Data Mining or “Data Preparation Law”: Data preparation is more than half of every
data mining process.
Pretty much every data miner will spend more time on data preparation than on analysis.
4th Law of Data Mining, or “No Free Lunch for the Data Miner”: The right model for a given
application can only be discovered by experiment.
5th Law of Data Mining: There are always patterns in the data.
As a data miner, you explore data in search of useful patterns. Understanding patterns in the data
enables you to in uence what happens in the future.
6th Law of Data Mining, or “Insight Law”: Data mining ampli es perception in the business
domain.
Data mining methods enable you to understand your business better than you could have done
without them.
7th Law of Data Mining or “Prediction Law”: Prediction increases information locally by
generalization.
Data mining helps us use what we know to make better predictions (or estimates) of things we
don’t know.
https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 2/5
27/9/2019 Data Mining For Dummies Cheat Sheet
8th Law of Data Mining, or “Value Law”: The value of data mining results is not determined by the
accuracy or stability of predictive models.
9th Law of Data Mining, or “Law of Change”: All patterns are subject to change.
Any model that gives you great predictions today may be useless tomorrow.
Business understanding: Get a clear understanding of the problem you’re out to solve, how it
impacts your organization, and your goals for addressing it. Tasks in this phase include:
Data understanding: Review the data that you have, document it, identify data management and
data quality issues. Tasks for this phase include:
Gathering data
Describing
Exploring
https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 3/5
27/9/2019 Data Mining For Dummies Cheat Sheet
Verifying quality
Data preparation: Get your data ready to use for modeling. Tasks for this phase include:
Selecting data
Cleaning data
Constructing
Integrating
Formatting
Modeling: Use mathematical techniques to identify patterns within your data. Tasks for this phase
include:
Selecting techniques
Designing tests
Building models
Assessing models
Evaluation: Review the patterns you have discovered and assess their potential for business use.
Tasks for this phase include:
Evaluating results
Deployment: Put your discoveries to work in everyday business. Tasks for this phase include:
Planning deployment (your methods for integrating data mining discoveries into use)
https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 4/5
27/9/2019 Data Mining For Dummies Cheat Sheet
https://www.dummies.com/programming/big-data/data-science/data-mining-for-dummies-cheat-sheet/ 5/5