STATS 506, Fall 2024
Computational Methods and Tools in Statistics
Course Material
Syllabus
Assignments
Preliminary Assignment - Git/GitHub
Problem Set 1 due Sep 12
Problem Set 1 Solutions
Problem Set 2 due Sep 26
Problem Set 2 Solutions
Problem Set 3 due Oct 10
Problem Set 3 Solutions
Problem Set 4 due Oct 31
Problem Set 4 Solutions
Problem Set 5 due Nov 21
Problem Set 5 Solutions
Problem Set 6 due Dec 5
Problem Set 6 Solutions
Quiz 1 Solutions
Quiz 2 Solutions
Quiz 3 Solutions
Quiz 4 Solutions
Quiz 5 Solutions
Midterm & Final project
Midterm details
Midterm Solutions
Final project instructions
Lecture Notes
01 - Introduction to R
02 - Quarto and RMarkdown
03 - Version Control and Git
04 - Vectorization and Monte Carlo
05 - Debugging Function in R
06 - Fitting Models in R
07 - SQL
08 - Regular Expressions
09 - The Tidyverse
10 - R Visualization - Base R and ggplot2
11 - Other Statistical Software
12 - R’s OOP
13 - R Visualization - plotly
14 - Rcpp
15 - data.table
16 - Parallel Processing
17 - Futures
18 - High Performance Computing
19 - Unix Command Line
20 - Interactive Apps with shiny
21 - R Packages, RationalPackage repo
Scripts from Lecture
These are the scripts I work on during class. I make no promises that these are complete or useful.
Case Studies
Case Study 1: MS Daily Data (10/19/24)
Files: msdaily-tidyverse.R, msdaily-baseR.R
Case Study 2: InfinitySparseMatrix implementation (11/1/24)
- InfinitySparseMatrix.R
- Matching slides - Slides with more background on matching and optmatch, for context on InfinitySparseMatrix.
Case Study 3: Shiny Web Apps
- GitHub repository of demos
- False Discovery Rate calculator (on shinyapps.io) - please don’t use too much of this version’s bandwidth!
- False Discovery Rate calculator (converted to JavaScript)
Miscellaneous
Using LLMs
Developing a Consistent R Style
Blank SQL joins worksheet
Completed SQL joins worksheet