Multiple comparisons of group-level means is a tricky problem in statistical inference. A standard practice is to adjust the threshold for statistical significance according to the number of pairwise tests...continue reading.
One of the concepts we teach in both Practical Data Science with R and our theory of data shaping is the importance of identifying the roles of columns in...continue reading.
R is an interpreted programming language with vectorized data structures. This means a single R command can ask for very many arithmetic operations to be performed. This also means R...continue reading.
In my previous post, I showed how to use the cdata package along with ggplot2's faceting facility to compactly plot two related graphs from the same data. This got me thinking:...continue reading.
Authors: John Mount and Nina Zumel, 2018-10-25. As a follow-up to our previous post, this post goes a bit deeper into reasoning about data transforms using the cdata package. The...continue reading.