Essential Libraries for Data Science on a Mac

I recently ran a fresh install on my Mac and thought I’d take the opportunity to document the libraries and programs I find incredibly useful. The Python libraries I’ll frequently pip3 install include:

Read More
Multiple plots in figures with R

The other day I was working with R in a Jupyter Notebook when I discovered that I needed to include multiple plots in the same figure. Surprisingly, R doesn’t include this capability out of the box, so I went searching and found a function that does the job. I’ve included the code below for my …

Read More
Never trust your factors

I recently helped a friend out with a dataset – she was struggling to merge two dataframes, each read from a CSV file, into one dataframe in R. I thought this would be quite simple, yet I could not get it to work with merge or dplyr – it just kept giving me weird results. The problem was …

Read More
Interesting Readings

Work has kept me pretty busy lately, but I’ve been meaning to try to put together another post with some of the interesting readings I’ve come across. The first thing I’ll mention is that the IEEE (Institute of Electrical and Electronics Engineers) have released their rankings for programming language popularity. Python (ranked #4) and R (ranked #6) …

Read More
Parallel operations in R

I thought I’d start a list of code examples I’ve found online that let you perform parallel operations in R and take advantage of multi-core processors:

By-Group Aggregation in Parallel
Row Search in Parallel

I’ll try to add to this list from time to time when I come across new examples.

Read More
Where to find R tutorials

Here’s a list of a few R tutorials (in addition to the one I wrote) which I’ve found (or which look) rather useful:

Google Developers R Tutorials
Playlist of R tutorials on YouTube
A slightly Different Introduction to R: Part I, Part II, Part III, Part IV, Part V

Read More
Useful links for using ggplot2 in R

The R package ggplot2 is one of the best data visualisation tools I’ve come across, and while it simplifies generating impressive graphics, there’s still a bit you have to learn to use it. Here are a few of the posts I’ve found really handy when using this package:

Quick Introduction to ggplot2
ggplot2: Cheatsheet for …

Read More
R Tutorial

This tutorial is a beginner’s guide to getting started with R. Once you complete it, you should have R installed on your computer and be able to import data, perform basic statistical tests and create graphics. Index: Getting started, Basics, Importing Data, Tests, Graphics, Packages. Getting Started: The first things you will have to do …

Read More
Running an R script through Python

Before I made the switch to developing on a Linux machine, I noticed that the Python module for calling R (RPy2) seemed to be having some problems on Windows. This gave me an excuse to play around with writing my own Python script to create and run an R script. As you’ll see in the …
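
The general idea of creating and running an R script from Python can be sketched roughly as follows. This is not the script from the post: the function names (write_r_script, build_command, run_r_script) and the example R code are hypothetical, and it assumes the Rscript executable that ships with R is available on the PATH.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def write_r_script(path, r_code):
    """Write R source code to `path` and return it as a Path (hypothetical helper)."""
    path = Path(path)
    path.write_text(r_code, encoding="utf-8")
    return path


def build_command(script_path, rscript="Rscript"):
    """Build the argument list; --vanilla skips saved workspaces and profile files."""
    return [rscript, "--vanilla", str(script_path)]


def run_r_script(script_path, rscript="Rscript"):
    """Run the script and return its standard output.

    Raises FileNotFoundError if the Rscript executable cannot be found,
    and subprocess.CalledProcessError if R exits with an error.
    """
    if shutil.which(rscript) is None:
        raise FileNotFoundError(f"{rscript} not found on PATH")
    result = subprocess.run(
        build_command(script_path, rscript),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Illustrative R code only: compute and print a mean.
    r_code = 'x <- c(1, 2, 3, 4)\ncat("mean:", mean(x), "\\n")\n'
    with tempfile.TemporaryDirectory() as tmp:
        script = write_r_script(Path(tmp) / "example.R", r_code)
        try:
            print(run_r_script(script))
        except (FileNotFoundError, subprocess.CalledProcessError) as err:
            print(f"Could not run R: {err}")
```

Passing the command as a list rather than a single shell string avoids quoting problems with paths that contain spaces, which can be a common stumbling block on Windows.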

Read More