Including data in a package can be a useful way to demonstrate the utility of your R package, as well as a convenient way to give your users what they need to run your functions. In this Code Club, Pat shows how to use the {devtools} tools to create two datasets that will potentially ship with his {phylotypr} R package. Along the way, he'll discuss file compression, documentation, and file sizes. This episode is part of an ongoing effort to develop an R package that implements the naive Bayesian classifier.
If you want to get a physical copy of R Packages: https://amzn.to/43pMR8L
If you want a free, online version of R Packages: https://r-pkgs.org/
You can find my blog post for this episode at https://www.riffomonas.org/code_club/....
Check out the GitHub repository at the:
Beginning of the episode: https://github.com/riffomonas/phyloty...
End of the episode: https://github.com/riffomonas/phyloty...
#rstats #readr #vroom #data.table #read.delim #rdp #16S #classification #classifier #microbialecology #microbiome
Support Riffomonas by becoming a Patreon member!
/ riffomonas
Want more practice on the concepts covered in Code Club? You can sign up for my weekly newsletter at https://shop.riffomonas.org/youtube to get practice problems, tips, and insights.
If you're interested in purchasing a video workshop, be sure to check out https://riffomonas.org/workshops/
You can also find complete tutorials for learning R with the tidyverse using...
Microbial ecology data: https://www.riffomonas.org/minimalR/
General data: https://www.riffomonas.org/generalR/
0:00 Introduction
3:59 Creating data to be exported
10:51 Compressing data
14:07 Writing documentation for dataset
21:46 Passing "check"
23:11 Creating another dataset
28:30 Updating vignette to use trainset19