Tag Archives: Datasets

Recommended: ENCODE: Encyclopedia of DNA Elements

The genetics research community should be lauded for the openness with which it shares research data. You can find numerous data sources that are free and without ANY restrictions. One very good example is ENCODE, the Encyclopedia of DNA Elements. This repository holds mostly human data, but it includes some mouse, fruit fly, and roundworm data as well. It has data from many different assays, including ChIP-seq, RNA-seq, and DNase-seq. It looks like a great teaching resource, though it does require a fairly hefty understanding of genetics to browse through the data.

PMean: Simple longitudinal data sets to illustrate data management

I am working on a class that will teach basic data management and graphics using the R programming language, with parallel classes in SPSS and SAS. On the third or fourth day of the class, we will look at managing longitudinal data sets, as these require special skills. I wanted to find a couple of reasonably simple longitudinal data sets that were available on the web and that had at least a few missing values in them. Here are a couple of data sets that might work.