Multiple comparisons of group-level means is a tricky problem in statistical inference. A standard practice is to adjust the threshold for statistical significance according to the number of pairwise tests...
In my previous post, I showed how to use the cdata package along with ggplot2's faceting facility to compactly plot two related graphs from the same data. This got me thinking:...
All that noise, and all that sound, all those places I have found (Speed of Sound, Coldplay). Some days ago, my friend Jorge showed me one of the coolest datasets...
In this post, we’ll return to the Kaggle data containing information on Pitchfork music reviews. In a previous post, I used this dataset to cluster music genres. In the current...