Several people wanted to have the slides from Betancourt’s lectures at SMLP2018. It is possible to recreate most of the course from his writings:
1. Intro to probability: https://betanalpha.github.io/assets/case_studies/probability_theory.html
2. Workflow…continue reading.
In this post we will return to the data on Pitchfork music reviews, parts of which I’ve analyzed previously. The goal of this post will be to gain an understanding...continue reading.
Click here for the interactive visualization.
If you’re interested in the visualisation of networks or graphs, you might’ve heard of the great package “visNetwork”. I think it’s a really great package...continue reading.
Clustered/Grouped Count Data
Clustered/grouped count data often exhibit extra zeros and over-dispersion. To account for these features, Poisson and negative binomial mixed effects models with an extra zero-inflation part are used....continue reading.
Using Text Analysis to Become a Better Writer
We all have words we love to use, and that we perhaps use too much. As an example: I have a tendency to...continue reading.
We wrote a short tutorial on contrast coding, covering the common contrast coding scenarios, among them: treatment, Helmert, ANOVA, sum, and sliding (successive differences) contrasts. The target audience is psychologists...continue reading.
This is a short post following the previous one (PCA revisited). In this post I’m going to apply PCA to a toy problem: the classification of faces. Again I’ll be working...continue reading.
IRA Tweet Data
You may have heard that two researchers at Clemson University analyzed almost 3 million tweets from the Internet Research Agency (IRA) – a “Russian troll factory”. In...continue reading.
Highlighting Specific Cases in ggplot2
Here’s my belated Statistics Sunday post, using a cool technique I just learned about: gghighlight. This R package works with ggplot2 to highlight a subset of...continue reading.
Using Text Mining on Open Ended Items
Good survey design is both art and science. You have to think about how people will read and process your questions, and what sorts...continue reading.
Statistics Sunday: Term Frequency and Inverse Document Frequency
As a mixed methods researcher, I love working with qualitative data, but I also love the idea of using quantitative methods to...continue reading.
I just got back from attending this amazing conference in Paris: http://www.go-isop.org/stan-for-pharmacometrics—paris-france
A few people were disturbed/surprised by the fact that I am a linguist (“what are you doing at a pharmacometrics conference?”)….continue reading.
To do this I will create a prediction of the open values for Bitcoin in the next 3 days. The process I follow is based on the CRISP-DM methodology: https://www.datasciencecentral.com/profiles/blogs/crisp-dm-a-standard-methodology-to-ensure-a-good-outcome
1.- Planning the activities. To...continue reading.