[Rd] A couple of issues with colClasses/setAs

Prof Brian Ripley ripley at stats.ox.ac.uk
Wed Sep 8 18:49:52 CEST 2004


On 8 Sep 2004, Peter Dalgaard wrote:

> Peter Dalgaard <p.dalgaard at biostat.ku.dk> writes:
> 
> > To make it actually work, we should probably fix up the "what" that is
> > being passed to scan a bit further upstream.
> > 
> > 
> > > Might be a good idea to teach colClasses about "factor".
> > 
> > That's what I thought. Other ideas would be to predefine some standard
> > date classes (it's a bit annoying that there's no way to give
> > auxiliary information like formats), and maybe to allow a second
> > header line containing class names.
> 
> Done already, I see (except the class-header idea, which probably
> requires more thought). Thanks.

I've done some.  If you allow columns to be dropped it gets increasingly 
hard to work out what as.is, for example, refers to, so I am looking at 
allowing named vectors.
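
As an illustration only: current versions of R document that a named
colClasses vector is matched against the column names, with unnamed
columns left to automatic conversion, and that "NULL" skips a column.
A minimal sketch, in which the input text and the column names a, b, c
are made up for the example:

```r
# Hypothetical three-column input; names and values are made up.
txt <- "a b c\n1 x u\n2 y v"

# "NULL" drops column b, c is read as a factor, and a is left
# unspecified so it is converted automatically.
dat <- read.table(text = txt, header = TRUE,
                  colClasses = c(b = "NULL", c = "factor"))

str(dat)
```

Naming the entries means that dropping b does not shift which column
"factor" applies to.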

> While we're in there should we perhaps consider adding pseudo-classes
> 
> "Ymd"
> "dmY"
> "mdY"
> "Y/m/d"
> "d/m-Y"
> "m/d/Y"
> 
> etc?  Or perhaps a catch-all "Date-fmt" would be better, e.g.
> "Date-%Y%m%d"? That'd be fairly easy to code:
> 
> else if (grepl("^Date-", colClasses[i]))
>         as.Date(data[[i]], format = sub("^Date-", "", colClasses[i]))
> 
> although perhaps not so easy to document concisely.
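
To make the proposal concrete, here is a rough sketch of the
"Date-fmt" idea as a stand-alone step applied after reading. The
helper convert_date_columns() is hypothetical and not part of R, and
the data frame is made up; it only shows how the format could be
pulled out of the pseudo-class string.

```r
# Hypothetical helper, not part of R: apply "Date-<format>" pseudo-classes
# to columns that were read as character.
convert_date_columns <- function(data, colClasses) {
  for (i in seq_along(colClasses)) {
    if (grepl("^Date-", colClasses[i])) {
      # Everything after the "Date-" prefix is taken as the strptime format.
      fmt <- sub("^Date-", "", colClasses[i])
      data[[i]] <- as.Date(data[[i]], format = fmt)
    }
  }
  data
}

# Made-up example data with dates stored as "YYYYmmdd" strings.
raw <- data.frame(id = 1:2, when = c("20040908", "20041231"),
                  stringsAsFactors = FALSE)
out <- convert_date_columns(raw, c("integer", "Date-%Y%m%d"))
```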

Is it really so hard to convert such columns from character afterwards?
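
For comparison, a minimal sketch of the after-the-fact route: read the
columns as character and convert each with its own format. The input
text and the column names id, start and end are made up for the
example.

```r
# Hypothetical input with two date columns in different formats.
txt <- "id,start,end\n1,20040908,08/09/2004\n2,20041231,31/12/2004"

# Read both date columns as plain character strings.
dat <- read.csv(text = txt,
                colClasses = c("integer", "character", "character"))

# Convert afterwards, one format per column.
dat$start <- as.Date(dat$start, format = "%Y%m%d")
dat$end   <- as.Date(dat$end,   format = "%d/%m/%Y")
```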


-- 
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595


