Skip to content

jhu-data-services/reshaping-joining-R-dplyr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Manipulating and Joining Data in R with dplyr

The dplyr package is a popular R package that people often use to manipulate and join datasets. You will need to have either some basic knowledge about using R or have previously attended our Introduction to R for Absolute Beginners workshop in order to take this one. Join JHU Data Services for this workshop and you will learn to use several functions, including mutate(), filter(), select(), summarize() and group_by(), in dplyr to manipulate data for the first half of the workshop. For the second part of this workshop, you will learn the join functions (e.g. left_join(), right_join(), inner_join(), semi_join(), anti_join(), full_join(), bind_rows() and bind_cols()) and set operations (e.g. union(), intersect() and setdiff()) in dplyr to combine two datasets. You will have plenty of opportunities to do hands-on activities on your laptop and work on datasets provided by instructors.

JHU Data Services

Website: dataservices.library.jhu.edu/
Contact us: [email protected]
JHU Data Services, part of the Johns Hopkins University Sheridan Libraries, helps the JHU community find, use, visualize, manage, and share data. We offer live webinars and self-paced online trainings on computational research and coding, GIS, data management, data visualization, and more. See all of our training topics on our website.

This repository contains materials for one of our live webinars open to JHU students, faculty, and staff. Please contact us with any questions.

As of March 2020, Data Services workshops are being held virtually on Zoom. See our calendar to register for upcoming workshops.

Pre-Class Instructions

Before the class, make sure you have a recent version of R and RStudio installed: https://posit.co/download/rstudio-desktop/

Please also install the tidyverse package with the R command install.packages("tidyverse") or using the package manager.

Description of Files

  • In-ClassScripts: This folder contains code files you will need for the workshop:
    • class_script.R: this file contains the script we wrote together during the workshop
  • PresentationMaterials: This folder contains PowerPoint slides and other presentation materials used in the workshop
    • Manipulating and Joining Data in R with dplyr.html: These are the slides, which can be viewed in the browser (recommended)
    • Manipulating and Joining Data in R with dplyr.pdf: If you can't view the browser version. this is a PDF copy of the slides.
    • Quarto: this folder contains the Quarto documents needed to edit and generate the presentation
  • Resources: This folder contains cheatsheets to assist you during the workshop

Post-Class Survey

If you have taken the live webinar for this class, please take this survey: Manipulating and Joining Data in R

License and Terms of Use

The presentation materials are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0), attributable to Data Services, Johns Hopkins University.

See LICENSE file for additional code licensing and re-use information.

The images, external resources, and cheatsheets linked in this repository may have other licenses and terms of use.

Citation

Please cite this material as:
Johns Hopkins University Data Services. October 9 2024. Manipulating and Joining Data in R with dplyr. https://github.com/jhu-data-services/.reshaping-joining-R-dplyr

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published