The dplyr package is a popular R package that people often use to manipulate and join datasets. You will need to have either some basic knowledge about using R or have previously attended our Introduction to R for Absolute Beginners workshop in order to take this one. Join JHU Data Services for this workshop and you will learn to use several functions, including mutate()
, filter()
, select()
, summarize()
and group_by()
, in dplyr to manipulate data for the first half of the workshop. For the second part of this workshop, you will learn the join functions (e.g. left_join()
, right_join()
, inner_join()
, semi_join()
, anti_join()
, full_join()
, bind_rows()
and bind_cols()
) and set operations (e.g. union()
, intersect()
and setdiff()
) in dplyr to combine two datasets. You will have plenty of opportunities to do hands-on activities on your laptop and work on datasets provided by instructors.
Website: dataservices.library.jhu.edu/
Contact us: [email protected]
JHU Data Services, part of the Johns Hopkins University Sheridan Libraries, helps the JHU community find, use, visualize, manage, and share data. We offer live webinars and self-paced online trainings on computational research and coding, GIS, data management, data visualization, and more. See all of our training topics on our website.
This repository contains materials for one of our live webinars open to JHU students, faculty, and staff. Please contact us with any questions.
As of March 2020, Data Services workshops are being held virtually on Zoom. See our calendar to register for upcoming workshops.
Before the class, make sure you have a recent version of R and RStudio installed: https://posit.co/download/rstudio-desktop/
Please also install the tidyverse
package with the R command install.packages("tidyverse")
or using the package manager.
- In-ClassScripts: This folder contains code files you will need for the workshop:
- class_script.R: this file contains the script we wrote together during the workshop
- PresentationMaterials: This folder contains PowerPoint slides and other presentation materials used in the workshop
- Manipulating and Joining Data in R with dplyr.html: These are the slides, which can be viewed in the browser (recommended)
- Manipulating and Joining Data in R with dplyr.pdf: If you can't view the browser version. this is a PDF copy of the slides.
- Quarto: this folder contains the Quarto documents needed to edit and generate the presentation
- Resources: This folder contains cheatsheets to assist you during the workshop
If you have taken the live webinar for this class, please take this survey: Manipulating and Joining Data in R
The presentation materials are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0), attributable to Data Services, Johns Hopkins University.
See LICENSE file for additional code licensing and re-use information.
The images, external resources, and cheatsheets linked in this repository may have other licenses and terms of use.
Please cite this material as:
Johns Hopkins University Data Services. October 9 2024. Manipulating and Joining Data in R with dplyr. https://github.com/jhu-data-services/.reshaping-joining-R-dplyr