[R] popular R packages

Duncan Murdoch murdoch at stats.uwo.ca
Sun Mar 8 18:27:38 CET 2009

On 08/03/2009 12:08 PM, Barry Rowlingson wrote:
>> I think the situation is worse than messy.  If a client comes in with data
>> that doesn't address the question they're interested in, I think they are
>> better served to be told that, than to be given an answer that is not
>> actually valid.  They should also be told how to design a study that
>> actually does address their question.
>> You (and others) have mentioned Google Analytics as a possible way to
>> address the quality of data; that's helpful.  But analyzing bad data will
>> just give bad conclusions.
>  As long as we say 'package Foo is the most downloaded package on
> CRAN', and not 'package Foo is the most used package for R', we can
> leave it to the user to decide if the latter conclusion follows from
> the former.

But we don't even have that data, since CRAN is distributed across lots 
of mirrors.

Duncan Murdoch

  In the absence of actual usage data I would think it a
> good approximation. Not that I would risk my life on it.
>  Pop music charts are now based on download counts, but I wouldn't
> believe they represent the songs that are listened to the most times.
> Nor would I go so far as to believe they represent the quality of the
> songs...
>  Should R have a 'Would you like to tell CRAN every time you do
> library(foo) so we can do usage counts (no personal data is
> transmitted blah blah) ?'? I don't think so....
> Barry

More information about the R-help mailing list