Author Archives: pmean

PMean: Who is your authorized official?

This page has moved to a new website.

PMean: A biased sample of car speeds

Dear Professor Mean, I read a newspaper report about speed limits and how few people obeyed them. A reporter decided to collect some hard data and drove exactly at the speed limit (55 mph in this particular setting). The reporter noticed that nine cars passed his car for every car that he passed, and concluded that most people are breaking the speed limit. I’m wondering if this is really a valid way to collect data. Continue reading →

Recommended: MLPowSim software

This site provides description of a free software package, MLPowSim, that calculates power for complex random effects models. It was developed by the Centre for Multilevel Modelling, the same group that developed the LMwiN package for analysis of complex random effects models. Continue reading →

PMean: Sample size for a study of reproducibility

Dear Professor Mean: I am using a risk stratification tool for patients presenting to the ED with chest pain. This has been a well validated tool in the ED, but I want to show that the scores are reproducible irrespective of the grade of doctor or assessment nurse calculating the score. I’m going to collect a convenience sample of patients presenting to the ED, and after I get informed consent, I will have those patients assessed separately by a triage-trained nurse, an intern doctor, a registrar and a consultant. I will calculation agreement using the intraclass correlation coefficient (ICC). My question is: How do I calculate the sample size in this context? Continue reading →

Recommended: Comparisons within randomised groups can be very misleading

In studies with a baseline, examining the decline exclusively within the treated group, or examining the decline in the treated group and then separately examining the decline in the control group is a bad idea, notes two famous statisticians in the British Medical Journal. They explain why you need to look first at comparisons between the two groups, ideally with analysis of covariance. Continue reading →

PMean: History of SPSS

I’m helping to put together three separate classes, Basic data management and analysis with R [SAS / SPSS]. As part of these classes, I need to discuss the history of these programs, because understanding that history will help you better understand the strengths and weaknesses of each statistical package. Here’s a brief history of SPSS. Continue reading →

PMean: What should go into a data codebook

Before you start your data entry, you should create a data codebook. If you don’t have a data codebook when you hand your data over to someone else, take the time to create one for their benefit and yours. The data codebook contains a description of your data set. There’s no standard form for a data codebook, and what you describe may depend on a variety of factors, such as the complexity of your data set, the number of people involved in data collection and data entry, and the number of people that you are likely to share your data with. Here are some of the elements that you should think about putting in a data codebook. Continue reading →

PMean: History of SAS

Recommended: FDA: R OK for drug trials

This blog post reviews a presentation by Jae Brodowsky, a statistician with the U.S. Food and Drug Administration that put to bed the rumor that FDA will only accept submissions where the data analysis was done by SAS. The summary does mention that FDA has certain regulatory requirements for R (or any other statistical package, including SAS). Continue reading →

PMean

A blog about statistics, evidence-based medicine, and research ethics