Tag Archives: Cluster computing

Recommended: 1.1 Billion Taxi Rides with Spark 2.2 & 3 Raspberry Pi 3 Model Bs

Mark Litwintschik has taken a large open source data set (1.1 billion taxi rides with data storage on the order of hundreds of gigabytes) and ran some benchmark queries on a variety of different systems. Perhaps the most humble of these systems is a cluster of three Raspberry Pi computers. This webpage talks about how he set up the software on this cluster. Continue reading

PMean: Python, Raspberry Pi, and cluster computing

I’ve been experimenting with connecting a small number of Raspberry Pi in a cluster computer, and a good place to start is MPI (Message Passing Interface). Unfortunately, many of the books and websites that I have looked at use examples in C and FORTRAN. These are fine languages, but ones that I am unlikely to need in the future. I want to explore MPI from with a newer programming language, Python. Here are some resources I have leaned on in getting this started. Continue reading