Introduction: We will identify anomalous products on the production line by using measurements from testing stations and deep learning models. Anomalous products are not failures; these anomalies are products close to...continue reading.
As I conduct some analysis for a content validation study, I wanted to quickly blog about a fun plot I discovered today: ggpairs, which displays scatterplots and correlations in a...continue reading.
A Tale of Two (Small Belgian) Cities with Open Data: Official Crime Statistics and Self-Reported Feelings of Safety in Leuven and Vilvoorde
In this post, we will analyze government data from the Flemish region in Belgium on A) official crime statistics and B) self-reported feelings of safety among residents of Flanders. We...continue reading.
Stacked Bar Chart for Rank Data
At work on Friday, I was trying to figure out the best way to display some rank data. What I had were rankings from...continue reading.
Alright, seems like this is developing into a blog where I am increasingly investigating my own music listening habits. Recently, I’ve come across the analyzelastfm package by Sebastian Wolf. I used...continue reading.
Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda
In this post we will return to the Pitchfork music review data, parts of which I’ve analyzed in previous posts. Our goal here will be to use text mining and...continue reading.
Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.
Prerequisites: What you should have installed: Java (it can be OpenJDK; you can get it from here: https://github.com/ojdkbuild/ojdkbuild); Tomcat, any version from 8 up; Eclipse EE: Eclipse IDE for Java EE; Spring Tools Suite For Eclipse: https://sp…continue reading.
This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs
Giddy up, giddy it up / Wanna move into a fool’s gold room / With my pulse on the animal jewels / Of the rules that you choose to use to get loose / With the luminous moves / Bored...continue reading.
In this post we will return to the Pitchfork music data and use recurrent neural networks (a “deep learning” technique) to automatically generate band names. The Data: For this analysis, we will...continue reading.
In my last Statistics Sunday post, I briefly mentioned the concept of regular expressions, also known as regex (though note that in some contexts, these refer to different things -...continue reading.
First Statistics Sunday in far too long! It’s going to be a short one, but it describes a great trick I learned recently while completing a time study for our...continue reading.
In R we have the qcc package, but its charts are not very nice, especially if you want to put your chart in an HTML file. Here I describe the process of...continue reading.
Multiple comparisons of group-level means is a tricky problem in statistical inference. A standard practice is to adjust the threshold for statistical significance according to the number of pairwise tests...continue reading.
Two statistical indices crossed my inbox in the last week, both of which use fast food restaurants to measure a concept indirectly. First up, in the wake of recent hurricanes, is...continue reading.