Length of Program*: Term 1: 3 months (130 hrs), Term 2: 4 months (170 hrs)
Frequency of Classes: The program is self-paced within two terms. Term 1 is 3 months long and Term 2 is 4
months long. All projects must be completed by the end of the term.
Textbooks required: None
Textbooks optional: Elements of Statistical Learning, Machine Learning: A Probabilistic Perspective, Python
Machine Learning
Instructional Tools Available: Video lectures, mentors, Slack channel, project reviews
*The length of this program is an estimate of the total hours the average student may take to complete all
required coursework, including lecture and project time. If you spend about 10 hours per week working
through the program, you should finish within the time provided. Actual hours may vary.
NAIVE BAYES
➔ Learn Bayes' rule, and how to apply it to predicting data using the Naive Bayes algorithm
➔ Train models using Bayesian Learning
➔ Use Bayesian Inference to create Bayesian Networks of several variables
➔ Bayes NLP Mini-Project
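As a preview of the kind of exercise this unit covers, here is a minimal Naive Bayes classifier built from Bayes' rule, using a made-up toy corpus (the messages, labels, and smoothing value are illustrative assumptions, not course materials):

```python
from collections import Counter

# Hypothetical toy corpus: each message is labeled "spam" or "ham".
train = [
    ("spam", "win money now"),
    ("spam", "win a prize now"),
    ("ham", "meeting at noon"),
    ("ham", "lunch money for the meeting"),
]

# Count word frequencies per class and how often each class appears.
word_counts = {"spam": Counter(), "ham": Counter()}
class_counts = Counter()
for label, text in train:
    class_counts[label] += 1
    word_counts[label].update(text.split())

def predict(text, alpha=1.0):
    """Score each class with Bayes' rule: P(c | words) is proportional to
    P(c) * product of P(w | c). alpha is Laplace smoothing so an unseen
    word does not zero out a class."""
    vocab = set(word_counts["spam"]) | set(word_counts["ham"])
    scores = {}
    for c in class_counts:
        total = sum(word_counts[c].values())
        score = class_counts[c] / sum(class_counts.values())  # prior P(c)
        for w in text.split():
            score *= (word_counts[c][w] + alpha) / (total + alpha * len(vocab))
        scores[c] = score
    return max(scores, key=scores.get)

print(predict("win money"))
print(predict("meeting at noon"))
```

The "naive" part is the independence assumption: each word's likelihood is multiplied in as if words were independent given the class.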
EVALUATION METRICS
➔ Learn about metrics such as accuracy, precision, and recall, used to measure the performance of your models
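These three metrics are simple enough to compute by hand; a sketch with made-up labels (the example arrays are arbitrary):

```python
def accuracy(y_true, y_pred):
    # Fraction of predictions that match the true labels.
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def precision(y_true, y_pred):
    # Of everything predicted positive, how much really was positive?
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    return tp / (tp + fp)

def recall(y_true, y_pred):
    # Of everything actually positive, how much did the model catch?
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return tp / (tp + fn)

y_true = [1, 1, 0, 0, 1]
y_pred = [1, 0, 0, 1, 1]
print(accuracy(y_true, y_pred), precision(y_true, y_pred), recall(y_true, y_pred))
```

Accuracy alone can mislead on imbalanced data, which is why precision and recall are taught alongside it.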
TRAINING AND TUNING
➔ Choose the best model using cross-validation and grid search
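The idea behind this unit can be sketched in a few lines: k-fold cross-validation scores each candidate, and grid search picks the best. This example uses synthetic data and closed-form ridge regression purely for illustration (the data, grid values, and fold count are assumptions):

```python
import numpy as np

# Synthetic regression data with a small amount of noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=60)

def ridge_fit(X, y, alpha):
    # Closed-form ridge regression: w = (X^T X + alpha I)^-1 X^T y
    return np.linalg.solve(X.T @ X + alpha * np.eye(X.shape[1]), X.T @ y)

def cv_score(alpha, k=5):
    # k-fold cross-validation: average held-out mean squared error.
    folds = np.array_split(np.arange(len(X)), k)
    errs = []
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        w = ridge_fit(X[train], y[train], alpha)
        errs.append(np.mean((X[test] @ w - y[test]) ** 2))
    return np.mean(errs)

# Grid search: evaluate every candidate, keep the lowest CV error.
grid = [0.01, 0.1, 1.0, 10.0]
best = min(grid, key=cv_score)
print(best, cv_score(best))
```

The key point is that the model is always scored on data it was not fitted on, so the chosen hyperparameter generalizes rather than memorizes.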
INTRODUCTION TO NEURAL NETWORKS
➔ Acquire a solid foundation in deep learning and neural networks
➔ Implement gradient descent and backpropagation in Python
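Gradient descent is the core loop here; backpropagation is the chain rule that supplies the gradient for deeper models. A minimal sketch on a single linear unit with synthetic, noise-free data (all values are illustrative assumptions):

```python
import numpy as np

# Synthetic data: y is an exact linear function of X, weights [2, -1].
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))
y = X @ np.array([2.0, -1.0])

w = np.zeros(2)   # start from zero weights
lr = 0.1          # learning rate (step size)
for _ in range(200):
    # Gradient of the mean squared error with respect to w.
    grad = 2 * X.T @ (X @ w - y) / len(X)
    # Step downhill: subtract the gradient scaled by the learning rate.
    w -= lr * grad

print(w)  # should approach the true weights [2, -1]
```

With multiple layers the same update applies; backpropagation just computes each layer's gradient by chaining derivatives from the output back to the input.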
TRAINING NEURAL NETWORKS
➔ Learn techniques for improving the training of a neural network, such as early stopping, regularization, and dropout
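Early stopping, for instance, is framework-independent: watch the validation loss and keep the weights from the epoch where it bottomed out. A sketch with a made-up loss curve (the numbers and patience value are illustrative assumptions):

```python
# A made-up per-epoch validation loss: it improves, then starts overfitting.
val_losses = [0.9, 0.7, 0.6, 0.55, 0.56, 0.58, 0.60, 0.63]

def early_stop_epoch(losses, patience=2):
    """Return the epoch whose weights you would keep: the last improvement
    before the loss went `patience` epochs without getting better."""
    best, best_epoch, waited = float("inf"), 0, 0
    for epoch, loss in enumerate(losses):
        if loss < best:
            best, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                break  # validation loss has stalled; stop training
    return best_epoch

print(early_stop_epoch(val_losses))
```

Regularization and dropout attack the same overfitting problem from the other direction, by constraining the weights during training rather than cutting training short.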
KERAS
➔ Learn how to use Keras for building deep learning models
DEEP LEARNING WITH PYTORCH
➔ Learn how to use PyTorch for building deep learning models
RANDOM PROJECTIONS & INDEPENDENT COMPONENT ANALYSIS
➔ Explore noise signal data from a cello, a television, and a piano using additional methods of dimensionality reduction
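The random-projection half of this unit rests on a simple fact: multiplying data by a random Gaussian matrix shrinks the dimensionality while roughly preserving pairwise distances (the Johnson-Lindenstrauss lemma). A NumPy sketch with arbitrary, assumed dimensions:

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(50, 1000))   # 50 points in 1000 dimensions
k = 200                           # assumed target dimension

# Random Gaussian projection matrix, scaled so distances are preserved
# in expectation.
R = rng.normal(size=(1000, k)) / np.sqrt(k)
X_low = X @ R

# Compare one pairwise distance before and after projection.
d_high = np.linalg.norm(X[0] - X[1])
d_low = np.linalg.norm(X_low[0] - X_low[1])
print(d_high, d_low)
```

Unlike PCA, nothing is learned from the data; the projection is cheap and data-independent, which is why it scales to very wide datasets. ICA, covered alongside it, instead searches for statistically independent source signals, as in the cello/television/piano exercise.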
DATA VISUALIZATION WITH PYTHON
➔ Apply data visualization for exploratory and explanatory purposes
➔ Understand how design principles are vital to effective data visualizations
➔ Build univariate, bivariate, and multivariate visualizations in Python
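A minimal sketch of the first two plot types, assuming matplotlib is available (the data is synthetic and the titles are placeholders):

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so this runs anywhere
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = 2 * x + rng.normal(size=200)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 3))
ax1.hist(x, bins=20)        # univariate: distribution of one variable
ax1.set(title="Histogram of x", xlabel="x", ylabel="count")
ax2.scatter(x, y, s=10)     # bivariate: relationship between two variables
ax2.set(title="y vs. x", xlabel="x", ylabel="y")
fig.tight_layout()
fig.savefig("example.png")
```

Labeled axes and a clear title are the first of the design principles the unit emphasizes: a chart a stakeholder cannot read is a chart that failed.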
COMMUNICATING WITH STAKEHOLDERS
➔ Implement best practices in sharing your code and written summaries
➔ Learn what makes a great data science blog
➔ Learn how to share your ideas with the data science community
MACHINE LEARNING PIPELINES
➔ Understand the advantages of using machine learning pipelines to streamline the data preparation and modeling process
➔ Chain data transformations and an estimator with scikit-learn’s Pipeline
➔ Use feature unions to perform steps in parallel and create more complex workflows
➔ Grid search over the pipeline to optimize parameters for the entire workflow
➔ Complete a case study to build a full machine learning pipeline that prepares data and creates a model for a dataset
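The bullets above fit together in one short script: a FeatureUnion runs two transformers in parallel, a Pipeline chains the union with an estimator, and GridSearchCV tunes parameters of the whole workflow at once. A sketch on synthetic data (the dataset, steps, and grid values are illustrative assumptions):

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import FeatureUnion, Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# FeatureUnion: run PCA and scaling in parallel, concatenate the outputs.
features = FeatureUnion([
    ("pca", PCA(n_components=3)),
    ("scaled", StandardScaler()),
])

# Pipeline: chain the feature step with a classifier.
pipe = Pipeline([
    ("features", features),
    ("clf", LogisticRegression(max_iter=1000)),
])

# Grid search over the whole workflow; parameter names use step__param.
params = {"features__pca__n_components": [2, 3], "clf__C": [0.1, 1.0]}
search = GridSearchCV(pipe, params, cv=3)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```

Because the transformers are refit inside every cross-validation fold, the pipeline also guards against leaking test-set statistics into preprocessing.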
NATURAL LANGUAGE PROCESSING
➔ Prepare text data for analysis with tokenization, lemmatization, and stop-word removal
➔ Use scikit-learn to transform and vectorize text data
➔ Build features with bag of words and tf-idf
➔ Extract features with tools such as named entity recognition and part-of-speech tagging
➔ Build an NLP model to perform sentiment analysis
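Tokenization, stop-word removal, and tf-idf can all be written from scratch in a few lines, which makes the scikit-learn vectorizers less mysterious. A sketch with a made-up three-document corpus and a hand-picked stop list (both are assumptions for illustration):

```python
import math
from collections import Counter

docs = [
    "the movie was great great fun",
    "the movie was boring",
    "what a great film",
]
stop_words = {"the", "was", "a", "what"}  # tiny hand-picked stop list

def tokenize(text):
    # Tokenization here is just lowercasing plus whitespace splitting,
    # with stop words dropped; real pipelines also lemmatize.
    return [w for w in text.lower().split() if w not in stop_words]

tokenized = [tokenize(d) for d in docs]
vocab = sorted({w for doc in tokenized for w in doc})

def tf_idf(doc):
    # tf-idf: term frequency in this doc, weighted down for words
    # that appear in many documents.
    counts = Counter(doc)
    vec = []
    for w in vocab:
        tf = counts[w] / len(doc)
        df = sum(1 for d in tokenized if w in d)
        idf = math.log(len(docs) / df)
        vec.append(tf * idf)
    return vec

for doc in tokenized:
    print(dict(zip(vocab, tf_idf(doc))))
```

The bag-of-words vector is just the raw `counts` over `vocab`; tf-idf rescales it so distinctive words outweigh common ones.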
MATRIX FACTORIZATION FOR RECOMMENDATIONS
➔ Learn about the role of matrix factorization in machine learning
➔ Implement matrix factorization techniques using real data in Python
➔ Interpret the results of matrix factorization to better understand latent features of customer data
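The core of this unit is approximating a sparse ratings matrix R as a product of two thin matrices, U (user factors) and V (item factors), fitting only the observed entries. A NumPy sketch with a made-up 4x4 ratings matrix (the data, latent dimension, and learning rates are illustrative assumptions):

```python
import numpy as np

# A tiny user-item ratings matrix; 0 marks a missing rating.
R = np.array([
    [5, 4, 0, 1],
    [4, 0, 0, 1],
    [1, 1, 0, 5],
    [0, 1, 5, 4],
], dtype=float)

rng = np.random.default_rng(0)
k = 2  # number of latent features (assumed; tuned in practice)
U = 0.1 * rng.normal(size=(R.shape[0], k))
V = 0.1 * rng.normal(size=(R.shape[1], k))

lr, reg = 0.02, 0.02
for _ in range(500):
    # Stochastic gradient descent over each observed rating.
    for i, j in zip(*np.nonzero(R)):
        err = R[i, j] - U[i] @ V[j]
        U[i] += lr * (err * V[j] - reg * U[i])
        V[j] += lr * (err * U[i] - reg * V[j])

pred = U @ V.T            # dense predictions, including the missing cells
mask = R > 0
rmse = np.sqrt(np.mean((R[mask] - pred[mask]) ** 2))
print(rmse)
```

The filled-in cells of `pred` are the recommendations, and the rows of U and V are the latent features the bullet above asks you to interpret.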
DEPLOYING YOUR RECOMMENDATION ENGINE
➔ Integrate your machine learning algorithm into an online application using Flask and Bootstrap
➔ Put it all together: use machine learning and software engineering to recommend new movies to users
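Serving predictions over HTTP is a small amount of Flask code. This sketch swaps the trained recommender for a hard-coded lookup table so it stays self-contained (the route name, users, and movie titles are all made-up placeholders):

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

# Stand-in "model": a real app would load a trained recommender here.
FAVORITES = {"alice": ["The Matrix", "Arrival"], "bob": ["Up"]}

@app.route("/recommend")
def recommend():
    user = request.args.get("user", "")
    # Look up (or, in a real app, compute) recommendations for this user.
    return jsonify(recommendations=FAVORITES.get(user, []))

# Exercise the route without starting a server, via Flask's test client.
# In production you would run the app behind a WSGI server instead.
with app.test_client() as client:
    print(client.get("/recommend?user=alice").get_json())
```

Bootstrap enters on the template side, styling the HTML page that calls an endpoint like this one.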
ELECTIVE 1: DOG BREED CLASSIFICATION
➔ Use convolutional neural networks to classify dogs according to their breeds
➔ Deploy your model so others can upload images of their dogs and receive the corresponding breed predictions
ELECTIVE 2: SPARK FOR BIG DATA
➔ Take a Spark course and complete a project using a massive dataset of Spotify data to predict customer churn
➔ Use Spark to wrangle streaming data and make real-time predictions
ELECTIVE 3: ARVATO FINANCIAL SERVICES
➔ Work through a real-world dataset and challenge provided by Arvato Financial Services, a Bertelsmann company
➔ Top performers have a chance at an interview with Arvato or another Bertelsmann company!
ELECTIVE 4: YOUR CHOICE
➔ Use your skills to tackle any other project of your choice