Category Archives: Recommended

Recommended: Make PowerPoint Presentations with R Markdown

This page is moving to a new website.

This is a 42-minute presentation that covers the basics of using R Markdown to produce PowerPoint files. It touches on two other RStudio products, RStudio Connect and Shiny, and covers a lot of customization issues. Also see Rendering PowerPoint Presentations with RStudio.

Recommended: Welcome to DASL – The Data And Story Library

The Data and Story Library (DASL) is a collection of small and simple data sets useful for teaching basic statistical concepts. It was originally housed on the Carnegie Mellon website, but (like many classic websites) it disappeared one day. The nice folks at Data Description, Inc. (makers of Data Desk software) have revived and updated this resource.

Recommended: 1.1 Billion Taxi Rides with Spark 2.2 & 3 Raspberry Pi 3 Model Bs

Mark Litwintschik took a large open source data set (1.1 billion taxi rides, with data storage on the order of hundreds of gigabytes) and ran some benchmark queries on a variety of different systems. Perhaps the most humble of these systems is a cluster of three Raspberry Pi computers. This webpage talks about how he set up the software on this cluster.

Recommended: Accessible R Markdown Documents

A class covering online teaching has reminded me about accessibility issues. These include accessibility for blind students who rely on screen readers. This post covers some very simple things you can do to make life a lot easier for students with impaired vision.

Recommended: Making it easier to discover data sets

I heard about this from the UMKC Bioinformatics Twitter feed. Google has a blog entry highlighting a new search feature they’ve developed, Dataset Search. It lets you find interesting data sets using standard Google search criteria. The system only works if people on the web provide reasonable documentation of their data sets. I’ve not had a chance to work with this yet, but it looks interesting.

Recommended: Use of Electronic Health Record Data in Clinical Investigations. Guidance for Industry

The U.S. Food and Drug Administration (FDA) is encouraging greater use of electronic health record data to supplement traditional randomized clinical trials. But you need to use care. Here is some guidance on what the FDA is recommending to industry.