Month: May 2012

Harvesting Eucalyptus urophylla x E. grandis hybrid clones in Brazil (Photo: Luis).

Review: “Forest Analytics with R: an introduction”

2012-05-30 / Luis

Forestry is the province of variability. From a spatial point of view this variability ranges from within-tree variation (e.g. modeling wood properties) to billions of trees growing in millions of hectares (e.g. forest inventory). From a temporal point of view we can deal with daily variation in a physiological model to many decades in an empirical growth and yield model. Therefore, it is not surprising that there is a rich tradition of statistical applications to forestry problems.

At the same time, the scope of statistical problems is very diverse. As the saying goes forestry deals with “an ocean of knowledge, but only one centimeter deep”, which is perhaps an elegant way of saying a jack of all trades, master of none. Forest Analytics with R: an introduction by Andrew Robinson and Jeff Hamann (FAWR hereafter) attempts to provide a consistent overview of typical statistical techniques in forestry as they are implemented using the R statistical system.

Unfortunately named Fiat dealer in Southern Brazil. Ideal if you want to zipoisson your way around.

End of May flotsam

2012-05-29 / Luis

The end is near! At least the semester is coming to an end, so students have crazy expectations like getting marks back for assignments, and administrators want to see exam scripts. Sigh! What has been happening meanwhile in Quantum Forest?

Luis in Sydney Botanical Gardens (Photo: Orlando).

On point of view

2012-05-21 / Luis

Often times we experience mental paralysis: we can only see a problem, a situation or a person from a single point of view (mea culpa). Some times the single mindedness of our view point becomes so bad that we inexorably drift to complete silliness. This is the case when one keeps on insisting on a point that has been shown to be, how to put it, wrong.

Photography is a fascinating hobby. I think it was around 30 years ago, may be a bit earlier, that I started taking it more seriously. Learned to process film and to use an enlarger and to witness the magic of an image slowly appearing on paper submerged in developer, while a dim red light bathed the room. A few years later I stopped taking pictures, mostly due to economic problems: I was not able to even buy film, let alone to process it. Photography stayed dormant for many years, then resurfaced in the digital area, but it did not feel the same.

Gratuitous picture: Firescapes II, night illuminated by bonfire (Photo: Luis).

R’s increasing popularity. Should we care?

2012-05-17 / Luis

Some people will say ‘you have to learn R if you want to get a job doing statistics/data science’. I say bullshit, you have to learn statistics and learn to work in a variety of languages if you want to be any good, beyond getting a job today coding in R.

R4stats has a recent post discussing the increasing popularity of R against other statistical software, using citation counts in Google Scholar. It is a flawed methodology, at least as flawed as other methodologies used to measure language popularities. Nevertheless, I think is hard to argue against the general trend: R is becoming more popular. There is a deluge of books looking at R from every angle, thousands of packages and many jobs openings asking for R experience, which prompts the following question:

Gratuitous picture: Trees at 8 pm illuminated by bonfire and full moon (Photo: Luis).

Bivariate linear mixed models using ASReml-R with multiple cores

2012-05-07 / Luis

A while ago I wanted to run a quantitative genetic analysis where the performance of genotypes in each site was considered as a different trait. If you think about it, with 70 sites and thousands of genotypes one is trying to fit a 70×70 additive genetic covariance matrix, which requires 70*69/2 = 2,415 covariance components. Besides requiring huge amounts of memory and being subject to all sort of estimation problems there were all sort of connectedness issues that precluded the use of Factor Analytic models to model the covariance matrix. The best next thing was to run over 2,000 bivariate analyses to build a large genetic correlation matrix (which has all sort of issues, I know). This meant leaving the computer running for over a week.
Continue reading