Sie sind auf Seite 1von 29

Course Title : Introduction to R in

Business Applications
Ram Mohan Dhara|
IMTG/ PGDM/ Term-V / 2019-2021
Session 1 : Introduction to R and R-installation
Split your screen – one for Your Profs are rationally
log-in and other for hands- bounded! Q&A session only
on practice in last 15 mins…

Stay alert! You have a quiz


every 15 minutes…and Please share feedback on today’s
that’s evaluative! session. Help your Prof to make the
sessions better

Every session has an assignment You can’t present; your video and
to be submitted same day by audio are in mute mode. Your are
23.59 PM…and that’s evaluative! not supposed to mute/ remove
your prof in session.
R – installation
R Installation

R basic http://cran.r-project.org
 
R Studio https://www.rstudio.com/products/rstudio/download/
R installation Find the .pdf file
guide
Click this folder and install R
Click this folder and install R Studio
After completing this session, you will be able to –

Session • Understand ‘what is R’ and its usefulness


objectives • Install R package
• Discuss the R workplace and packages
What is R ?
• R is a programming language developed at AT&T Bell Laboratories by
Robert Gentleman and Ross Ihaka
• It is a free, open source language, with highly active community
members
• It is available across all platforms (Linux, Mac, Windows)
• It is extremely useful for statistical computation and graphic
visualization.
What is R/ CRAN ?
What are R and CRAN (Comprehensive R Archive Network)?

• R is a freely available language and environment for statistical computing and graphics
which provides a wide variety of statistical and graphical techniques: linear and
nonlinear modelling, statistical tests, time series analysis, classification, clustering, etc.

• CRAN is a network of FTP and web servers around the world that store codes and
documentation for R.

• To “submit” a package to CRAN, check that your submission meets the CRAN
Repository Policy and then use the web form.
What is R/ CRAN/ R Studio ?
• R has a worldwide repository system—
Comprehensive R Archive Network (CRAN). It can
be accessed at http://cran.r-project.org
• As of now, there are than 10,000 such packages
hosted on CRAN and many more on other
websites.
• R is used by various reputed companies, such as:
Microsoft, Google , FaceBook etc…
• R Studio is an integrated development
environment (IDE) for R. It includes a console,
syntax-highlighting editor that supports direct
code execution, as well as tools for plotting,
history, debugging and workspace management. 
Syntax editor History

Console Help / Viewer


Some basics about R coding
• The assignment operator in R is "<-" (Although "=" also works)
• R statements or commands can be separated by a semicolon (;) or a new
line.
• All characters after # are treated as comments.
• There are no multiline or block-level comments.
• The $ (dollar) operator in R is analogous to a “.” (dot) operator in other
languages.
• Single inverted comma ‘ ’ and double inverted comma “ ” work similarly
• First bracket ( ) is used quite frequently; but second bracket { } and third
bracket [ ] hardly are in use.
Functions and Help in R
• There are over 1,000 functions at the core of R, and new R functions
are created all the time.
• Each R function comes with its own help page. To access a function’s
help page, type a question mark followed by the function’s name in
the console.
R Packages
• Many data scientist programmers and statisticians use R to design
tools for analysing data and to contribute their codes as pre-
assembled collections of functions and objects called packages. Each
R package is hosted at http://cran.r-project.org.
Reference books used in this course
1. R Cookbook by Paul Teetor (O’Reilly)
2. R for Data Science, Hadley Wickham (O’Reilly)
3. R Graphics Cookbook, Winston Chang (O’Reilly)
4. Data Mining and Predictive Analytics, Larose and
Larose, Wiley
5. Discovering Statistics using R, Andy Field, Sage
6. Statistical Analyses Using R, Brian et.al., CRC
publication
7. Multivariate Statistics using R, Randall Schumacker
8. Statistics for Management, Levin and Rubin
9. Marketing Research, Malhotra and Dash
10. Data Analytics with R, Bharti Motwani, Wiley
Reference materials / other R resources

1. R-blogs : https://www.r-bloggers.com
2. R-blogs: https://rpubs.com/
3. R tutorials :
https://www.programiz.com/r-programming
/
4. R Video book :
https://www.r-bloggers.com/in-depth-introd
uction-to-machine-learning-in-15-hours-of-e
xpert-videos/
Reference materials / other analytics
resources
1. www.analyticsmag.com
2. www.kdnuggets.com
3. www.analyticsbridge.com
4. www.datapine.com
5. www.datasciencecentral.com
Reference materials / online data sources

1. Kaggle.com
2. UCI machine Learning
3. www.datasciencecentral.com
4. https://data.gov.in/ (A digital
India Initiative)
5. https://www.data.gov/ (US
Govt. open data source)
Summary : what we have learnt
• R is a programming language developed as an
alternative to the S language.
• R is available across all platforms—Windows, Mac,
and Linux.
• R is most useful for statistical computation and
visualization.
• Each R function comes with its own help page.
• CRAN is a network of FTP and web servers around
the world that store codes and documentation for R.
• R Studio is an integrated development environment
(IDE) for R. R workspace is divided into four parts.
This concludes the session :
Introduction to R

Next session : Basic R-


Programming

Das könnte Ihnen auch gefallen