Monthly Archive: March 2015
My first CRAN package, ggExtra, contains several functions to enhance ggplot2, with the most important one being ggExtra::ggMarginal() – a function that finally allows easily adding marginal density plots or...continue reading.
I just published a new interactive visualization in my series of basic statistical concepts and techniques. This time I am trying to show how p-values are distributed. Check it out...continue reading.
Cubist is a tree-based model with a OLS regression attached to each terminal node and is somewhat similar to mob() function in the Party package (https://statcompute.wordpress.com/2014/10/26/model-segmentation-with-recursive-partitioning). Below is a demonstrate...continue reading.
We are pleased to announce our proposal ‘Subsetted and parallel computations in matrixStats’ for Google Summer of Code. The project is aimed for a student with experience in R and...continue reading.
In August 2014, I created a 40-minute video tutorial introducing the key functionality of the dplyr package in R. dplyr continues to be my “go-to” package for data exploration and...continue reading.
An universally used generative unsupervised clustering is Gaussains Mixture Model (GMM) which is also known as “EM Clustering”. The idea of GMM is very simple: for a given dataset, each...continue reading.