Meeting information

M/W/F from 10:30-11:20 via Zoom


Please note that the schedule and topics may change as the course progresses.

Date Topics Notes (html) Source (Rmd) Zoom recording Background
4 Jan Course overview
What is data science?
Types of questions
Storytelling
Leek & Peng (2015)
D’Agostino McGowan (2020)
Peng (2018)
6 Jan Steps to an analysis
Informal vs formal
Rubrics
Required software
Cooper & Hsing (2017)
8 Jan Organization & styles
Research compendia
Coding styles
Marwick et al (2018)
Tidyverse Style Guide
11 Jan Introduction to GitHub
Repositories
Markdown
Issues
Projects
Intro to GitHub
13 Jan Version control with Git
Adding files
Committing file
Exploring history
Reverting files
Blischak et al (2016)
Chap 20 of HGG
15 Jan Remote repositories
Remotes in GitHub
RStudio projects
Pushing/pulling
RStudio cheatsheet
Chaps 9 & 10 of HGG
18 Jan Martin Luther King, Jr. Day
No class

20 Jan Understanding workflows
Centralized
Feature branch
Forking
(1)
(2)
(1)
(2)
Sec VI of HGG
Learn Git branching
22 Jan Working with data
Chain of custody
Tidy data
Manual entry
Ellis & Leek (2017)
Broman & Woo (2018)
25 Jan Getting data
Importing text files
Reading from Excel & Google
readr pkg
readxl pkg
googlesheets pkg
jsonlite pkg
27 Jan Getting help
Minimal
Complete
Reproducible
Maelle Salmon’s blog
Reprex do’s & don’ts
29 Jan Kara Woo
Data science for fun and nonprofit
Guest presentation
Kara’s website
1 Feb Jennifer Scheuerell
Intro to relational databases
SQL programming language
PostgreSQL & its community
About PostgreSQL
Installing PostgreSQL
Troubleshooting PostgreSQL
3 Feb Jennifer Scheuerell
Creating a database
New users & databases
Adding data

5 Feb Jennifer Scheuerell
Working with a database
Cleaning data
Joining tables

8 Feb Review session
Workflows
Reprexes
Databases

10 Feb Metadata
Types of metadata
Ecological Metadata Language
Michener (2005)
Fegraus et al (2005)
EML specification
EML package
12 Feb Data analysis & visualization
Data summaries
Design principles
Exploratory figures
Expository figures
R Graph Gallery
High Quality Graphics in R
15 Feb President’s Day
No class

17 Feb R Markdown
Introduction
Publishing papers
Intro to R Markdown
19 Feb R Markdown
Publishing papers
Reporting
Chap 6 of Markdown Cookbook
22 Feb Dr. Christine Stawitz
Data science skills for a career in fisheries management
Guest presentation

24 Feb R packages
Functions
Documentation
Wickham & Bryan (2020)
26 Feb R packages
Unit tests
Vignettes
Websites
Wickham & Bryan (2020)
1 Mar Dr. Margaret Siple
Shiny apps
Introduction
RStudio Shiny apps
Sievert (2019)
3 Mar Dr. Margaret Siple
Shiny apps
Development
Deployment
Shiny App Contest
5 Mar R Markdown
Designing presentations
Chap 4 of Xie et al. (2014)
8 Mar R Markdown
Building simple websites
GitHub Pages
Chap 17 of HGG
10 Mar Code reviews
Checklists
Do’s and Don’ts
rOpenSci Guide
12 Mar Ethics in data science
Data selection
Analysis & inference
FAIR data principles
CARE data principles
Lum (2018) Bias in data
Fox (2020) Faculty job market