Last updated: March 07, 2017
Note: This is a draft of the syllabus. Details are subject to change, though overall topics covered and flavor of the course will not.
Several homeworks/projects (a goal of 3-4). Open ended where possible, though there will be some normal practice questions. The assignments should be done in RMarkdown, where any code is documented and choices justified. (If you want to use something besides RMarkdown, talk to me ASAP to see if we can work it out.)
Programming is not a solo activity - I encourage you to seek out resources online or to discuss with classmates. However
Stats 506 (Computational methods and tools in statistics) or equivalent experience. The course will be taught using R. We will use basic statistical techniques.
This course will use real-world data to explore the issues surrounding the handling of raw data. Often in courses, the data are provided by the instructor and have undergone some cleaning. We will be using real world data from sources such as data.gov, as well as obtaining our own data by web scraping. Time allowing, we’ll also discuss approaches to dealing with big data.
Students are encouraged (not required) to bring laptops to class to aid with hands-on learning.
This list of topics is ambitious. We will most likely not have time to get to everything. If any of these topics are of special interest, please let me know as soon as possible and I will try and ensure we cover them.