I heard about this from the UMKC Bioinformatics twitter feed. Google has a blog entry highlighting a new search feature they’ve developed, Dataset Search. It lets you find interesting data sets using standard Google search criteria. The system only works if people on the web provide reasonable documentation of their data sets. I’ve not had a chance to work with this yet, but it looks interesting. Continue reading
This is a nice overview of the various dimensions of security that you need to consider when making research data available on the Internet. Continue reading
I’ve been looking for something like this for a while. It is a repository for data sets associated with peer-reveiwed publicattions. I have only glanced at it briefly, but it looks fairly easy to use with a fair number of interesting data sets/publications. Continue reading
I’ve always been supportive of efforts to share data. For me, it’s a bit selfish, because I want to find interesting real world examples to use in teaching and on my web pages. But the issue goes way beyond this, of course. Sharing data is an ethical imperative, especially for federally funded research or research that relies on volunteer subjects. It has led to many important discoveries beyond the realm of the original context in which the data was collected. In order for data sharing to be effective, you need to embrace four guiding principles: your data needs to findable, accessible, interoperable, and re-usable. This paper highlights those principles and offers some current examples of data sharing systems.
I don’t have time to follow the journals as closely as I should, but I was shocked to find two very nasty editorials in recent issues of the New England Journal of Medicine. They are sharply critical of open sharing of data and of quality improvement efforts. Continue reading