Tag Archives: Teaching resources

PMean: Think positively, what has research done for us

Several years ago, I was part of a panel presentation at the Joint Statistical Meetings. My talk was on how to teach Statistics from an evidence-based perspective. A question came up from the audience about the quality of medical research, a topic that draws a lot of cynicism in the Statistics community. Each comment from the audience seemed to get more negative, so I stepped in to offer a counterargument. The research process has a lot of flaws, but we have made a ton of progress in how we provide medical care thanks to the careful and rigorously designed studies that have been done. I didn’t convince anyone, but it felt good to stand up for something I strongly believe in. Recently, I had to look for examples of research that has changed clinical practice for the better, and found several interesting articles.

PMean: Grading rubric for computer assignments

I’ve been teaching a variety of classes that require students to run a statistical analysis in a package like SAS or R and report the results. There is a tremendous variety of formats that students use, and I thought it would be helpful to offer some guidance. It would save me time in grading, but more importantly it would emphasize that students need to think about what they produce rather than just tossing together whatever comes out of the computer. The five requirements for homework assignments are that they be complete, concise, clear, error-free, and interpretable.

Recommended: Textbook Examples Applied Survival Analysis

I’m teaching an online workshop for The Analysis Factor on survival analysis. It’s not announced yet, and I have a LOT of work to do before it is ready. One thing that will save me time is that I am taking many of my examples from the excellent textbook, Applied Survival Analysis, Second Edition. One nice perk of this book is that the helpful folks at UCLA have taken every textbook example and written up code (with comments!) to reproduce the book’s results. With the exception of a few advanced methods in later chapters, where only one or two software packages have the right capability, the code is written in parallel in R, SAS, SPSS, and Stata. They also have links to the raw data at the publisher’s website, and datasets stored in SAS format and SPSS format. How nice! Browse around and you’ll find software code for all the examples in other popular statistics textbooks as well.

Warning! The R examples look like they are from the first edition, not the second edition. A small nitpick for an otherwise very nice resource.

Recommended: beanumber repository

This is the GitHub repository of Ben Baumer. He is one of the co-authors of “Modern Data Science with R” and the data and code from that book are available here. He also provides code and data for OpenWAR, an open source method for calculating a baseball statistic, Wins Above Replacement. Finally, there is an R library for extracting, transforming, and loading “medium” sized datasets into SQL. Medium here means multi-gigabyte sized files. Related to this are a couple of “medium” sized data sets from the Internet Movie Database and from the NYC CitiBike dataset.