Data Wrangling with Tidyverse (part 2)

Опубликовано: 02 Май 2024
на канале: Sharcnet HPC
144
0

Tidyverse is an cohesive set of packages for doing data science in R. In an earlier talk, we began reviewing the data munging portions of tidyvese (dplyr, forcats, tibble, readr, stringr, tidyr, and purr) by using it to reconstruct the data hierarchy in a 500 pages reference PDF given only the words on each page and their bounding boxes.

Part 1:    • Data Wrangling with Tidyverse (part 1)  
Part 2 (this video):    • Data Wrangling with Tidyverse (part 2)  
Part 3:    • Data Wrangling with Tidyverse (part 3)  
_______________________________________­________

This webinar was presented by Tyson Whitehead (SHARCNET) on April 24th, 2024, as a part of a series of weekly Compute Ontario Colloquia. The webinar was hosted by SHARCNET. The colloquia cover different advanced research computing (ARC) and high performance computing (HPC) topics, are approximately 45 minutes in length, and are delivered by experts in the relevant fields. Further details can be found on this web page: https://www.computeontario.ca/trainin... . Recordings, slides, and other materials can be found here: https://helpwiki.sharcnet.ca/wiki/Onl...

SHARCNET is a consortium of 19 Canadian academic institutions who share a network of high performance computers (http://www.sharcnet.ca). SHARCNET is a part of Compute Ontario (http://computeontario.ca/) and Digital Research Alliance of Canada (https://alliancecan.ca).