Thanks for Reading!
As I’ve been blogging more about statistics, R, and research in general, I’ve been trying to increase my online presence, sharing my blog posts in groups of like-minded people. Those...continue reading.
As I’ve been blogging more about statistics, R, and research in general, I’ve been trying to increase my online presence, sharing my blog posts in groups of like-minded people. Those...continue reading.
I haven’t had a lot of time to play with this but yesterday, I discovered the tuber R package, which allows you to interact with the YouTube API.To use the...continue reading.
Today I decided to create a new repository on GitHub where I am sharing code to do spreadsheet data manipulation in R.The first version of the repository and R script...continue reading.
OverviewIn this post, I would like to introduce my new R package GLMMadaptive for fitting mixed-effects models for non-Gaussian grouped/clustered outcomes using marginal maximum likelihood.Admittedly, there is a number of...continue reading.
How to Read in and Clean Your Facebook Data – I recently learned that you can download all of your Facebook data, so I decided to check it out and...continue reading.
I recently discovered the R Graph Gallery, where users can share the beautiful visualizations they’ve created using R and its various libraries (especially ggplot2). One of my favorite parts about...continue reading.
Cloudy with a Chance of Words Lots of fun projects in the works, so today’s post will be short – a demonstration on how to create wordclouds, both with and...continue reading.
Most people know KEGG pathway, but not everyone knows that it costs at least $2000 to subscribe its database. If you want to save the cost a bit, you can...continue reading.
In this post, we’ll return to the Kaggle data containing information on Pitchfork music reviews. In a previous post, I used this dataset to cluster music genres. In the current...continue reading.
With fewer than three weeks left in the June 7 provincial elections in Ontario, Canada’s most populous province with 14.2 million persons, the expected outcome is far from certain.The weekly...continue reading.
The Delaware River Basin Commission’s Delaware Estuary water quality monitoring program, which was initiated in 1967, is one of the longest running monitoring programs in the world. One advantage of...continue reading.
There are a few reasons why you might want to send tweets from R. You might want to write a Twitter bot or – as in my case – you...continue reading.
About two months ago, the German online magazine ‘Informatik Aktuell’ asked me to write an introductory article on R. And so I did.It’s now only a few days ago that...continue reading.
My package ‘quantification’ is now on GitHub: https://github.com/jsugarelli/quantification.’quantification’ is a package that provides functions for quantifying qualitative survey data. It supports the Carlson-Parkin method, the regression approach, th…continue reading.
Here is some updated R code from my previous post. It doesn’t throw any warnings when importing tracks with and without heart rate information. Also, it is easier to distinguish...continue reading.
In this post, we’ll return to analyzing rap lyrics using statistical and data analytic tools (the first posts of this blog dealt primarily with this topic). Specifically, in this post...continue reading.
Before to continue with the posts about how to do things with R, I have decided to describe how I lead the creation of an analytics team starting from zero.My...continue reading.
The relationship between paired t-tests and linear mixed models in a 2×2 repeated measures design Assume that we have four random variables \(X_1,\dots,X_4\), each has standard deviation \(\sigma\) and pairwise...continue reading.
A little-known fact: The paired t-test is equivalent to a linear mixed model with varying intercepts Given the assumptions above, the paired t-test is equivalent to fitting a varying intercept...continue reading.
[Thanks to Scott Glover for correcting me.] Someone recently said to me that the lower the p-value, the higher the likelihood ratio under the alternative vs the null. The arXiv...continue reading.