All Lecture Materials are listed below in order. The first 7 lectures were designed for html, so the pdf versions are likely to have some iffy formatting.
- Lecture 1 – Admin, Syllabus, Stats 101, FDR Control – .Rmd file and PDF
- Lecture 2 – Predictions, FDR Control, and Loss – .Rmd and PDF
- Lipids example:
- Code.
- Data is on canvas under lecture 2. “jointGwasMc_LDL.txt”
- Diabetes Example:
- Simulations in lecture: Code
- Lecture 3 – Regression: Linear and Logit – Rmd and PDF
- Lecture 4 – Deviance, OOS, Bootstrap – Rmd and PDF
- Lecture 5 – Variable Selection – Rmd and PDF
- For those of you struggling with homeworks, there is a lot of valuable code in the old Lecture Rmd files
- Semiconductors: Data, code in lecture Rmd
- Comscore: Code in lecture Rmd, Data: domains, sites, total spend
- Lecture 6 – Cross Validation – Rmd and PDF
- The last few slides include a lot of useful code. If you can interpret and run the cross-validation code there, you should gain a solid grasp on cross-validation generally.
- Lecture 7 – Classification – Rmd, and PDF
- Lecture 8 – Classification 2 – Rmd
- Lecture 9 – ROC and Trees – Rmd
- Lecture 10 – Trees and Forests – and Rmd
- Lecture 11 – Forests and Boosting – Rmd
- Lecture 12 – Boosting, Ensembles, and Rolling Block CV – Rmd
- Lecture 13 – Data Cleaning – Rmd
- Lecture 14 – Data Cleaning and Bayes – Rmd
- Lecture 15 – Causal Inference 1: RCTs – Rmd
- Lecture 16 – Causal Inference 2: Targeting, Observational Methods – Rmd
- Lecture 17 – Neural Nets, SGD, Optimization – Rmd
- Lecture 18 – Review – Rmd