Category Archives: Statistics

PMean: Can you recommend an introductory book on Bayesian Statistics

I got an email asking for a recommendation for an introductory book on Bayesian Statistics from someone who recently graduated from our program. It’s kind of a difficult request because the mathematical demands needed to understand Bayesian statistics are not trivial. Here’s what I recommended. Continue reading

PMean: Introduction to R, Monday, August 8 through Friday, August 12, 2016

I will be teaching a one semester hour class at UMKC, Introduction to R (MEDB 5505) on Monday, August 8, 2016 through Friday, August 12, 2016. It runs from 9am to noon on all five days. This is part of a series of classes that cover a basic introduction to statistical packages: data import, data management, simple graphs, and simple descriptive statistics. The other classes (MEDB 5506 and MEDB 5507) cover SPSS and SAS.

Here are some details about this class. Continue reading

PMean: Examining the storage format for sparse matrices in R

I’ve been working with sparse matrices a bit for my work with the Greater Plains Collaborative. They are a very useful way of storing matrices where most of the entries are zero. This occurs quite often in medical data. There are thousands of medical procedures that you can torture your patients with, so any matrix that has indicator variables for every medical procedure will be quite big. Fortunately, both for us and for the patients, the number of procedures that a particular patient has to endure is quite a bit smaller. So for each row of the matrix, the number of non-zero entries will be very small, probably in the single digits. A sparse matrix will be much smaller because it stores only the location of the non-zero entries. Here’s some R code that shows how this works. I have the code available at my new github site. Continue reading