Monthly Archive: May 2017
Even though the concept of first-order Markov chains is pretty simple, you can face other issues and challenges when implementing the approach in practice. We will review some of them. This is...continue reading.
This article explains how to select important variables using the Boruta package in R. Variable selection is an important step in a predictive modeling project. It is also called ‘Feature Selection’....continue reading.
Together with David Kellen I am currently working on an introductory chapter to mixed models for a book edited by Dan Spieler and Eric Schumacher (the current version can be...continue reading.
In my last post, I discussed modeling wine price using Lasso regression. In this post, I’ll return to this dataset and describe some analyses I did to predict wine type...continue reading.
It took us quite a while but we have finally released a new version of rtdists to CRAN which provides a few significant improvements. As a reminder, rtdists [p]rovides response...continue reading.
It’s been a while since my last post on some TB WHO data. A lot has happened since then, including the opportunity to attend the Open Data Science Conference (ODSC) East...continue reading.
In April I attended the 2017 New York R conference, hosted by Lander Analytics and Work-Bench. It was both the third time the conference was held and the third time...continue reading.
Many precious hours have been lost to character encoding errors and EOF character errors in CSV files being read by the pandas read_csv function. This is an incredibly frustrating start...continue reading.
We are ready for the third R-Lab, the monthly appointment where we co-work together on a real data science problem using R. This time the R-Lab is promoted by nothing...continue reading.