This course provides a brief introduction to the fundamentals of the R language and focuses on its use for data analysis--including exploratory data analysis, linear and logistic regression, variable selection, model diagnostics, and prediction.
Please follow the Pre-Workshop Instructions prior to coming to the workshop.
- Fundamentals of R & RStudio: the basics--including objects, subsetting, indexing, data I/O, and control structures.
- Exploratory Data Analysis: all the necessary tools to investigate your data before performing any formal modeling--from summary statistics to visualization including plotting histograms, boxplots, and scatterplots
- Linear Regression: everything you need to know to begin fitting linear models--from simple t-tests to estimation of regression coefficients, variable selection, model diagnostics, and prediction
- Logistic Regression: the basics of generalized linear models (GLMs) with an emphasis on binary response data--we extend the theory and modeling strategies of linear regression
Time | ||
---|---|---|
8:30-9:00 | Registration & Breakfast | |
9:00-10:15 | Session 1: Fundamentals of R | Shell Code, Complete Code |
10:15-10:45 | Exercise 1 | |
10:45-11:00 | Break | |
11:00-11:15 | Discuss Exercise 1 Solutions | Chris' Solution Code |
11:15-12:15 | Session 2: Exploratory Data Analysis | Shell Code, Complete Code |
12:15-12:30 | Exercise 2 | |
12:30-1:00 | Lunch | |
1:00-1:15 | Discuss Exercise 2 Solutions | Chris' Solution Code |
1:15-2:30 | Session 3: Linear Regression | Shell Code, Complete Code |
2:30-3:15 | Exercise 3 | |
3:15-3:30 | Break | |
3:30-4:00 | Discuss Exercise 3 Solutions | Chris' Solution Code |
4:00-4:50 | Session 4: Logistic Regression | Shell Code, Complete Code |
4:50-5:00 | Course Evaluation Survey |
First, visit The R Project for Statistical Computing. Click on CRAN
under the Download section on the left-hand side of the page. Then, click on any of the nearby websites under the USA section near the bottom of the page. For example, the link from the University of California, Berkley, CA or University of California, Los Angeles, CA are both fine. Download R for your platform (Linux, Mac, or Windows), open the downloaded file and follow the instructions.
RStudio is a set of integrated tools designed to help you be more productive with R. Also, it is far more user-friendly than base R. You will be doing essentially all of your programming in RStudio. To download RStudio, visit the download page, scroll down to "Installers for Supported Platforms," and click on the appropriate installer for your platform. Finally, open the downloaded file and follow the instructions.
In R, packages are used to share code. A package bundles together code, data, documentation, and tests. As of February 2018, there were over 12,000 packages available on the Comprehensive R Archive Network, or CRAN. This huge variety of packages is one of the reasons that R is so successful: the chances are that someone has already solved a problem that you’re working on, and you can benefit from their work by downloading their package and using their code.
In this workshop, we will be using three packages-- car
, leaps
, and here
. If everyone attempts to download them at the same time, the wireless internet will become very upset with us and work slowly. Therefore, I suggest installing them before coming to class. There are a few ways to do this (which we will discuss more during the workshop). For now, please download install_packages.R and open it in RStudio (Right click > Open with > RStudio
). Once in RStudio, you should see the following three lines of code:
install.packages("car", dependencies = TRUE)
install.packages("leaps", dependencies = TRUE)
install.packages("here", dependencies = TRUE)
Highlight all of this code (using your cursor, Ctrl + A
on Windows, or Cmd + A
on Mac) and click the Run icon (or hit Ctrl + Enter
on Windows or Cmd + Enter
on Mac). Additional "dependencies," or other packages necessary to run the three above, will also be installed. This make take a few minutes.
We have a chatroom set up at ida-with-r.slack.com for you to have discussions and ask/answer questions that may arise throughout the day.
Chris Galbraith ([email protected])
Emily Smith ([email protected])