Statistics 506, Fall 2016

Course Materials

Syllabus
Programming environments for data management and analysis
Linux shell skills
Basic computer architecture for data-oriented computing
Minimal introduction to Stata
Problem set 1
Data formats and data structures
Introduction to R
R tips and common errors
Vectorization in R
Problem set 2
The R language
Previous exams
R Dataframes and dplyr
Problem set 3
Midterm
Large data sets in R
Basic SAS
Problem set 4
SAS Case Study: flights data
Project
Concurrent programming in R
Distributed computing
Problem set 5
Final exam