CS506: Data Wrangling and Management

Fall 2025

Author: Marc Tollis, PhD

Affiliation: Northern Arizona University

Preface

Welcome to CS506: Data Wrangling and Management. This course introduces graduate students to data wrangling and management using R and the Tidyverse ecosystem. Students will learn to import, manipulate, clean, and visualize data with a strong emphasis on practical applications and reproducible workflows.

Please access the course syllabus.

The course uses the free textbook R for Data Science by Hadley Wickham and Garrett Grolemund.

Course Objectives: Upon successful completion of the course, students will be able to:

  • Develop an understanding of R and the Tidyverse ecosystem

  • Import structured and unstructured data into R

  • Clean and transform data using dplyr, tidyr, and other core Tidyverse packages

  • Visualize data effectively using ggplot2

  • Conduct exploratory data analysis (EDA)

  • Apply data wrangling techniques to real-world datasets

Textbook: R for Data Science by Hadley Wickham & Garrett Grolemund (Available for free: https://r4ds.hadley.nz/)

Software Requirements:
