Data Wrangling in R Parts I & II
Online workshop series: October 10 & 17, 2025. 1:00 - 3:00 pm
Overview
"Data wrangling" refers to all the manipulations commonly needed to prepare tabular data for visualization or analysis. This could include subsetting the data, manipulating rows and columns, changing data types, going from long-to-wide formats or vice-versa, parsing text, dealing with missing values, joining, stacking, or splitting tables, and aggregating rows for higher level units of analysis.
In the real-world, data wrangling is frequently a pain point for researchers. It is often said that up to 80% of the time spent on data analysis is spent preparing the data, the so-called 80/20 rule.
R is an outstanding platform for wrangling data. This two-part workshop series will cover data wrangling methods using the tidyverse family of R packages, including dplyr, tidyr, lubridate, stringr, ggplot, and others. The topics in this workshop are part of the foundation for doing data analysis in R, including spatial data.
Agenda
Instructor
- Andy Lyons, Program Coordinator, IGIS
Audience
- UC Cooperative Extension
- UC students & faculty
- AIFS affiliates (including Davis, Berkeley, Cornell, U. Illinois, & USDA)
Cost
This is a free workshop
Requirements
Parts of the workshop will require you to follow along as the instructor demonstrates code to solve a problem. It is therefore highly recommended that you have a computer with two monitors, so you can view the instructor's screen on one monitor, and work in RStudio in the other. Participants who only have a small laptop with no external monitor may find it challenging to complete the hands-on exercises, and may just want to watch.
Registration
Registration is closed. The recordings are available below and on the IGIS YouTube Channel.
Recordings
This course content is the result of a collaborative effort between UC ANR IGIS Statewide Program, and the USDA-NIFA/NSF AI Institute for Next Generation Food Systems (AIFS).