I’m currently working on a paper (with my colleague Vincent Vergnat, who is also a PhD candidate at BETA) where I want to estimate the causal impact of the birth...continue reading.
R News from another blog for R community
Below is a piece of Python code for downloading option chains from the NASDAQ website. It is basically a big function relying heavily on BeautifulSoup and wrapped into a class (‘c’est chic’)...continue reading.
A new release of stringdist has been accepted on CRAN. stringdist offers a number of popular distance functions between sequences of integers or characters that are independent of character encoding....continue reading.
I will be teaching a course on statistical regression models for repeated-measures data, and I thought of creating a Shiny app to let students run the code used in...continue reading.
When modeling the frequency measure in operational risk with regressions, modelers often prefer Poisson or Negative Binomial regressions as industry best practice. However, as an alternative...continue reading.
Consider the following two Spark dataframes:
df1.show()
+----+------+-------+
|id_a|time_a|value_a|
+----+------+-------+
|   1|     1|     CA|
|   1|     2|     CA|
|   2|     1|     TX|
|   3|     5|     NE|
|   4|     6|     WA|
+----+------+-------+
df2.show(…continue reading.
We will study how to use the LifeCycle Grids concept to measure the health of a business via the Delta Analysis technique. There are several posts connected with LifeCycle Grids on this blog. If you...continue reading.
By Kristoffer Magnusson. My Statistical Power and Significance Testing Visualization now lets you vary effect size, sample size, power and significance level. There’s also a new feature to rescale the...continue reading.
Google Summer of Code 2015 is coming to an end. During this summer, I have learned too many things to list here about statistical modeling, Ruby and software development in...continue reading.