[R] Issue with dataset inclusion in CRAN packages
Frank Harrell
f.harrell at vanderbilt.edu
Sun Jun 26 23:12:12 CEST 2011
I was wrong about this. The dataset is small. Most of the space is taken up
by a nice tutorial on rpart.plot. Still I would favor linking to datasets
rather than duplicating part of them.
Thanks
Frank
Frank Harrell wrote:
>
> I was glad to see the new rpart.plot package by Stephen Milborrow. I was
> however a bit concerned that Stephen distributed a dataset I created, and
> renamed the dataset (from titanic3 to ptitanic) in the process [with some
> justification, as some variables were omitted]. Fortunately Stephen
> included the script he used to download the dataset from our web site, and
> gave full credit to us. What concerns me is that the rpart.plot package
> does not contain many functions but the package is as large as packages
> containing hundreds of functions. This is due to the inclusion of the
> dataset. I would prefer that authors provide the URL so that users can
> easily install the binary R binary dataframe directly from our web site
> (we even have an automated way to do this: require(Hmisc);
> getHdata(titanic3)). This will allow users to profit from possible future
> data corrections as well as making the package much more compact. Thanks
> for listening. I'm writing to r-help because this may applied to other R
> packages as well.
>
> Frank
>
-----
Frank Harrell
Department of Biostatistics, Vanderbilt University
--
View this message in context: http://r.789695.n4.nabble.com/Issue-with-dataset-inclusion-in-CRAN-packages-tp3626536p3626568.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help
mailing list