15
INTRODUCTION TO SETIA PRAMANA [email protected]

Introduction to R

Embed Size (px)

Citation preview

Page 1: Introduction to R

INTRODUCTION TO SETIA PRAMANA

[email protected]

Page 2: Introduction to R

WHAT IS R?

• A language and environment for statistical computing and graphics.

• An integrated suite of software facilities for data manipulation, calculation and graphical display.

• First appeared in 1996 by Prof. Ross Ihaka and Robert Gentleman of the University of Auckland, NZ.

• GNU software -> Free. Similar like S language.

• open source, maintained and developed by a community of developers.

• Works in Windows, Unix, MacOs

Page 3: Introduction to R

R INCLUDES

• an effective data handling and storage facility,

• a suite of operators for calculations on arrays, in particular matrices,

• a large, coherent, integrated collection of intermediate tools for data analysis,

• graphical facilities for data analysis and display either on-screen or on hardcopy, and

• a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.

http://www.r-project.org/

Page 4: Introduction to R

WHY R?

• R is HOT !

Page 5: Introduction to R

WHY R?

• R is HOT !

• It's free, open source, powerful and highly extensible

• It is not only statistical software but also a language

• > 5000 add-on packages lots of pre-prepared packages (http://cran.r-project.org/web/packages/)

• With many applications http://cran.r-project.org/web/views/

• Access to powerful, cutting-edge analytics

Page 6: Introduction to R

WHY R?

• Flexible (complex or standard statistical practices, bayesian modelling, GIS map building, building interactive web applications, building interactive tests, etc.   )

• We can make our own package and publish it

• Great Graphics and data visualization

• Can be used for High Performance Computer Clusters

• Well Supported by R Community (http://www.inside-r.org/r-resources-web)

• And many more…..

Page 7: Introduction to R

BUT…..

• R has no warranty

• Command Line Interface : difficult for some users.

• Users must learn a new way of thinking about data and data analysis sequence

• ….. I guess

Page 8: Introduction to R

COMPANIES USING R IN 2013

• The New York Times routinely uses R for interactive and print data visualization.

• Google has more than 500 R users.

• The FDA supports the use of R for clinical trials of new drugs.

• The National Weather Service uses R to predict the extent of flooding events.

• Zillow uses R to model housing prices.

• The Consumer Financial Protection Bureau uses R and other open source tools.

• Twitter uses R for data science applications on the Twitter database.

• FourSquare uses R to develop its recommendation engine.

• Facebook uses R to model all sorts of user behaviour. Source: Revolutionanalytics

Page 9: Introduction to R

B P S ?

Page 10: Introduction to R

R• Download from CRAN.

• Command Line Interface (for some is nightmare, for others advantages)

• Also provide Graphical User Interface: RCommander, IsoGeneGUI, Shiny, etc.

Page 11: Introduction to R

R STUDIO

• Download from Rstudio.com

• Powerfull IDE (Integrated Development Environment) for R.

Page 12: Introduction to R

R COMMANDER

• Rcommander.com

• Helpful for R beginner

• Install inside R

Page 13: Introduction to R

R LIBRARY/PACKAGES

R Base Packages

lme4IsoGene

foreign

survivalzoo

Page 14: Introduction to R

I CONTRIBUTES TO THESE R PACKAGES

• IsoGene

• IsoGeneGUI

• nea

• neaGUI

• biclustGUI

• OCRME

• More detail: http://setiopramono.wordpress.com/r-programming/

Page 15: Introduction to R

PRACTICE TIME