Alright, seems like this is developing into a blog where I am increasingly investigating my own music listening habits.Recently, I’ve come across the analyzelastfm package by Sebastian Wolf. I used...continue reading.
Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda
In this post we will return to the Pitchfork music review data, parts of which I’ve analyzed in previous posts. Our goal here will be to use text mining and...continue reading.
Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.
Prerequisites:What you should have installed:Java, it can be OpenJDK, you can get it from here: https://github.com/ojdkbuild/ojdkbuildTomcat, any version from 8 up.Eclipse EE: Eclipse IDE for Java EE.Spring Tools Suite For Eclipse: https://sp…continue reading.
This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs
Giddy up, giddy it upWanna move into a fool’s gold roomWith my pulse on the animal jewelsOf the rules that you choose to use to get looseWith the luminous movesBored...continue reading.
In this post we will return to the Pitchfork music data and use recurrent neural networks (a “deep learning” technique) to automatically generate band names.The DataFor this analysis, we will...continue reading.
In my last Statistics Sunday post, I briefly mentioned the concept of regular expressions, also known as regex (though note that in some contexts, these refer to different things -...continue reading.
First Statistics Sunday in far too long! It’s going to be a short one, but it describes a great trick I learned recently while completing a time study for our...continue reading.
In R we have the qcc package but charts are not very nice, specially if you want to put your chart in a HTML file.Here I describe the process of...continue reading.
Multiple comparisons of group-level means is a tricky problem in statistical inference. A standard practice is to adjust the threshold for statistical significance according to the number of pairwise tests...continue reading.
Two statistical indices crossed my inbox in the last week, both of which use fast food restaurants to measure a concept indirectly.First up, in the wake of recent hurricanes, is...continue reading.
Data manipulation is a breeze with amazing packages like plyr and dplyr. Recoding factors, which could prove to be a daunting task especially for variables that have many categories, can...continue reading.
Several people wanted to have the slides from Betancourt’s lectures at SMLP2018. It is possible to recreate most of the course from his writings:1. Intro to probability:https://betanalpha.github.io/assets/case_studies/probability_theory.html2. Workflow…continue reading.