Sie sind auf Seite 1von 5

@ VESPA we culture Data Scientists who are statistically significant

VESPA Course Framework: Total Hours #30+10

Objective
This course provides understanding life cycle of data, techniques for data cleaning, visualizing the
data, basic concepts of statistics, modelling techniques, using the Industry Famous Analytics using
R language, Python, Knime and Excel.

Course Duration & Features

• VESPA Course Framework recommended hours are 40 hours (30 Hours Theory, Tools
Introduction, Business Problem Examples + 10 Hours on Projects & Exercises
• Hands-on training
• Contains real word business applications and examples.
• Rich material for reference

After Completion of this Training

• Gain exposure to key disciplines and skills needed to fulfil the role of a Statistician/Data
analyst. On the way to Data scientist
• Build predictive models using linear, logistic regression and decision trees.
Tools

• Open source tools such as R , R-Studio, Gretl, Python, Hive, Hadoop, PSPP

http://www.vespaanalytics.com
@ VESPA we culture Data Scientists who are statistically significant

Evaluation

Assignment and Evaluation

Data Set for Assignment on Basic Explanation of the Data set and its contents
Stats, Logistic Regression and
Cluster Analysis
Questions

Evaluation of the Assignments


Basic Stats, Logistic Regression and Evaluation based on the way candidates solved the assignment
Cluster Analysis

Pre-Requisites

Expect people to know these things

Basic Statistics Introduction to Probability Distributions


Measures of central tendency and dispersion (Mean, Median , Mode, SD and CV)
Normal /Binomial/Poison Distribution and Properties
Hypothesis Null/Alternative Hypothesis formulation
Testing
One Sample, two sample (Paired and Independent) T/Z Test
P Value Interpretation

http://www.vespaanalytics.com
@ VESPA we culture Data Scientists who are statistically significant

SESSION WISE TOPICS

Topic No-1 2.0 hours


Session Title Introduction to Big Data Sciences
Pedagogy What big data is, and why it is so important
How different institutes are leveraging the
advantage of big data analytics
Basic tools and concepts of big data

Topic No-2 2.5 hours


Session Title VESPA Frame Work

Introduction tools Gretl/R/Tableau/PSPP/ Business Data systems


–SQL/Hadoop/Hive
Content R programming as a tool for analytics

Pedagogy Some common analytical tasks

Topic No-3 1.5 hours


Session Title Visualization Framework
Content Concepts of Visualization & Story Boarding
Intro tools Descriptive/Explorative analytics with either
of the Qlik view/Tableau/power BI/ intro to
tools
Topic No-4 1.5 hour
Session Title Visualization
Tool Tableau
Case Title Food Supply Retail Chain
Pedagogy Visualization importance and its impact on
the business
Topic No-5 1.5 hour
Session Title Introduction To Econometrics
Contents Cross Sectional /Time Series/Panel Data

http://www.vespaanalytics.com
@ VESPA we culture Data Scientists who are statistically significant

Topic-6 1.5 hours


Session Title Cross-Sectional Data
Content Choosing to Join the Course-Education
Campaign Education Analysis

Pedagogy Applying the Logistic Regression Model


Topic No-7 4.5 hours
Session Title Time Series Analysis
Content Forecasting with time series
Case Title Stock Market Prediction
Pedagogy Decomposition of Data and
Creating the models
Validating the models -ARIMA
Topic No-8 2.0 hours
Session Title Time Series Analysis
Case Title Demand prediction for customer service
orders – call center tickets and staffing
Pedagogy ideating – structural/Arimax modeling
Topic No-9 2.5 hours
Session Title Predictive modeling Essentials
Reading Material Slides
Tool R/Knime
Case Title Credit card Banking Analysis
Topic No-10 3.5 hours
Session Title Predictive modeling –Clustering &
decomposition analysis
Reading Material Slides
Tool R/Knime/Python
Case Title Supply chain Procurement -analysis

http://www.vespaanalytics.com
@ VESPA we culture Data Scientists who are statistically significant

Topic No-11 3.5 hours


Session Title Data mining Predictive modeling Methods
Tool Python /Rattle
Case Title Decision trees VS Random forests
Topic No-12 3.5 hours
Session Title Text mining & Social media analytics
Tool R/Knime
Case Title Call center data analytics

http://www.vespaanalytics.com

Das könnte Ihnen auch gefallen