Beruflich Dokumente
Kultur Dokumente
Summary
- Diverse background in statistics and engineering, strong analytical and programming skills - Hands-on
experience working with big data, machine learning algorithms and predictive models - Work well both
individually and in a team environment, good written and verbal presentation skills
Experience
Data Scientist at MasterCard
July 2013 - Present (1 year 9 months)
# Consumer Analytics: Enhanced consumer insights by creating profiles for millions of users. Performed
user segmentation (R) and generated personalized labels for consumers based on their shopping history.
Improved the capability of personalized campaign targeting and offer distribution. # Predictive Modeling:
Built propensity model to predict users probability of shopping at a merchant in the next 14 days. Generated
innovative features from shopping history (MapReduce), and built logistic regression and random forest
models to project future transaction (R and Python Scikit-Learn). # Merchant Analytics: Analyzed
spend behavior of millions of users on thousands of merchants. Calculated the correlation between pairs
of merchants, i.e. normalized probability of shopping at A given shopped at B. # A/B Testing: Created
framework to perform A/B testing in order to compare different modeling algorithms. Determined control
group sizes required for statistical significance. Design and carried out the tests.
Research Assistant at University of Illinois at Urbana-Champaign
January 2009 - May 2011 (2 years 5 months)
Modified ultra-high-vacuum scanning tunneling microscope to achieve selective silicon growth in nanometer
scale
Languages
English
Chinese
Page1
Python
Hadoop
Big Data
Statistics
SAS
SQL
C++
Microsoft Office
C
Education
University of Illinois at Urbana-Champaign
Master of Science (MS), Statistics, 2012 - 2013
Grade: 4.0
University of Illinois at Urbana-Champaign
Master of Science (MS), Electrical and Computer Engineering, 2009 - 2011
Grade: 3.95
National University of Singapore
Bachelor of Engineering (BEng), Electrical and Computer Engineering, 2004 - 2008
Grade: 3.74
Certifications
SAS Certified Base Programmer for SAS 9 Credential
SAS
February 2013
Machine Learning
Coursera
July 2013
Courses
Master of Science (MS), Statistics
University of Illinois at Urbana-Champaign
Statistical Learning
STAT 542
Page2
STAT 410
STAT 426
STAT 429
STAT 440
STAT 448
STAT 425
STAT 510
STAT 510
ECE 534
Projects
Statistical Learning (R)
Members:Fan Z.
Analyzed and performed binary classification on large data set by applying various supervised learning
methods, including linear / logistic regression, k-nearest neighbor, linear and quadratic discriminant analysis,
support vector machine, tree model and random forest. Explored variable selection and dimension reduction
techniques.
Time Series Analysis (R)
Members:Fan Z.
Built seasonal ARIMA model based on past data, and used the model to forecast upcoming events. Applied
model selection techniques and performed model diagnostics.
Linear Regression (R)
Members:Fan Z.
Analyzed effect of analgesic ketorolac on post-operation morphine use and hospital stay based on clinical
data. Built a multivariate linear regression model, applied variable selection techniques and performed model
diagnostics.
Data Management (SAS)
Members:Fan Z.
Cleaned large raw data files and inputted into SAS data set. Applied techniques such as merging / subsetting
data sets, creating format and variables. Prepared summary reports on the data
Page3
Fan Zhang
Data Scientist at MasterCard
Page4