As mentioned last time, Github is a widely used tool in the data science world. While the primary purpose is software development, data science has made heavy use of the environment with all of the advantages (and potential headaches).
Was a minimal collaboration using Github. There were two ways to accomplish it. My example contains elements of both, first 2 then 1.
At least one group tried it both ways. Let’s talk this through.
There are tons. I tend to use GitKraken because it was the first thing that I tried and it works. YMMV.
usethis
Setting this up makes interacting with Git
far easier from RStudio.
#tidyTuesday
as a source of data and an awesome collection of neat visualizationsstackoverflow
and Posit CommunityTips: echo=FALSE
and results="hide"
, include=FALSE
The first three are crucial. Four and five depend on the analytical task. For throwaways, this is overkill. For repeated tasks, going at least through four is ideal. For oft-repeated tasks, all of them make sense.
Emily’s Talk at RStudio::conf 2020 is definitely worth checking out.
Preliminary questions:
I can get a something up in only a few minutes. Let’s walk through that.
Let’s build at least a barebones portfolio. I don’t care which method you choose though I have used blogdown
for years and am somewhat new to quarto
.
If you want to use blogdown, I would strongly encourage you to basically follow along here. It is a nice walkthrough.
Partly for next week’s assignment, browse the tidyTuesday archives, find a visualization, and try a modification of it in a post. Or some other post of interest. So that we know how to extend it. We are going to add to it from here.
Communicating With Data: Week 2 (23 Jan 2023)