RStudio provides the premiere open source and enterprise-ready professional software for R, including RStudio Desktop, RStudio Server, RStudio Connect, Shiny Server, and shinyapps.io. The tidyverse, shiny, ggplot, ggvis, dplyr, knitr, R Markdown, and packrat are R packages from RStudio that every data scientist will want to enhance the value, reproducibility, and appearance of their work.

Date: June 7th
Time: 11:00 a.m. EDT

Description:

R has an excellent framework for specifying models using formulas. While elegant and useful, it was designed in a time when models had small numbers of terms and complex preprocessing of data was not commonplace. As such, it has some limitations. In this talk, a new package called recipes is shown where the specification of model terms and preprocessing steps can be enumerated sequentially. The recipe can be estimated and applied to any dataset. Current options include simple transformations (log, Box-Cox, interactions, dummy variables, ...), signal extraction (PCA, ICA, MDS), basis functions (splines, polynomials), imputation methods, and others.



Logistics:

Only 1,000 live attendees are allowed in the Webinar on a first come first serve basis. It is typical for many people who register to not attend (which is why registration does not guarantee access.) If for any reason you cannot make the webinar or cannot get in we will provide links to the recording as well as all materials within 48 hours.

Creating and Preprocessing a Design Matrix with Recipes Webinar Registration:


Presenter:

JeffAllenHS.jpg Max Kuhn, Software Engineer -  Max is currently working on improving R's modeling capabilities. He has a Ph.D. in Biostatistics.

Previously, Max was a Director of Nonclinical Statistics at Pfizer Global R&D in Connecticut. He was applying models in the pharmaceutical and diagnostic industries for over 18 years.

Max is the author of eight R packages for techniques in machine learning and reproducible research and is an Associate Editor for the Journal of Statistical Software. He, and Kjell Johnson, wrote the book Applied Predictive Modeling, which won the Ziegel award from the American Statistical Association, which recognizes the best book reviewed in Technometrics in 2015.

He has taught courses on modeling, including many classes for Predictive Analytics World, the useR! conference, the Open Data Science Conference, the India Ministry of Information Technology, and others.


Webinar Recordings:

We try to record every webinar we host and post all materials on our website.
http://www.rstudio.com/resources/webinars/

Slides & Code:

We've started a Github repository with all webinar materials. Speakers for this webinar and all future webinars will add their materials to the repository.
https://github.com/rstudio/webinars


Live on June 7th at 11am EDT
Approximately 45 minutes of presentation followed by 15 Minutes of Q&A.