Being data curious: the strange case of lamb consumption in NZ

Sheep meat consumption, kg/person. Data from OECD statistics.

There is a lot of talk about the skills needed for working in Statistics/Data Science, with the discussion often focusing on theoretical understanding, programming languages, exploratory data analysis, and visualization. There are many good blog posts dealing with how you get data, process it with your favorite language and then creating some good-looking plots. However,… Continue reading Being data curious: the strange case of lamb consumption in NZ

Functions with multiple results in tidyverse

Mobile phone antenna in church (Photo: Luis, click to enlarge).

I have continued playing with the tidyverse for different parts of a couple of projects. Often I need to apply a function by groups of observations; sometimes, that function returns more than a single number. It could be something like for each group fit a distribution and return the distribution parameters. Or, simpler for the… Continue reading Functions with multiple results in tidyverse

Turtles all the way down

Door in Valparaíso.

One of the main uses for R is for exploration and learning. Let's say that I wanted to learn simple linear regression (the bread and butter of statistics) and see how the formulas work. I could simulate a simple example and fit the regression with R:

The formulas for the intercept ($latex b_0$) and… Continue reading Turtles all the way down

Cute Gibbs sampling for rounded observations

I was attending a course of Bayesian Statistics where this problem showed up: There is a number of individuals, say 12, who take a pass/fail test 15 times. For each individual we have recorded the number of passes, which can go from 0 to 15. Because of confidentiality issues, we are presented with rounded-to-the-closest-multiple-of-3 data… Continue reading Cute Gibbs sampling for rounded observations