[R] subsetting and Dates
arun
smartpink111 at yahoo.com
Fri May 24 03:44:19 CEST 2013
You could convert those columns to "Date" class by:
Data[,c(4,6)]<-lapply(Data[,c(4,6)],as.Date,origin="1970-01-01")
#or
Data[,c(4,6)]<-lapply(Data[,c(4,6)],function(x) structure(x,class="Date"))
# dat1 dat2 Dat1a Dat1b Dat2a Dat2b
#1 41327 41327 2013-02-22 2013-02-22 2013-02-22 2013-02-22
#2 41334 41334 2013-03-01 2013-03-01 2013-03-01 2013-03-01
#3 41341 41341 2013-03-08 2013-03-08 2013-03-08 2013-03-08
#4 41348 41348 2013-03-15 2013-03-15 2013-03-15 2013-03-15
#5 41355 NA 2013-03-22 2013-03-22 <NA> <NA>
#6 41362 41362 2013-03-29 2013-03-29 2013-03-29 2013-03-29
#7 41369 41369 2013-04-05 2013-04-05 2013-04-05 2013-04-05
#8 41376 41376 2013-04-12 2013-04-12 2013-04-12 2013-04-12
#9 41383 NA 2013-04-19 2013-04-19 <NA> <NA>
#10 41390 41390 2013-04-26 2013-04-26 2013-04-26 2013-04-26
#11 41397 41397 2013-05-03 2013-05-03 2013-05-03 2013-05-03
A.K.
----- Original Message -----
From: Denis Chabot <chabot.denis at gmail.com>
To: R-help at r-project.org
Cc:
Sent: Thursday, May 23, 2013 5:35 PM
Subject: [R] subsetting and Dates
Hi,
I am trying to understand why creating Date variables does not work if I subset to avoid NAs.
I had problems creating these Date variables in my code and I thought that the presence of NAs was the cause. So I used a condition to avoid NAs.
It turns out that NAs are not a problem and I do not need to subset, but I'd like to understand why subsetting causes the problem.
The strange numbers I start with are what I get when I read an Excel sheet with the function read.xls() from package gdata.
dat1 = c(41327, 41334, 41341, 41348, 41355, 41362, 41369, 41376, 41383, 41390, 41397)
dat2 = dat1
dat2[c(5,9)]=NA
Data = data.frame(dat1,dat2)
keep1 = !is.na(Data$dat1)
keep2 = !is.na(Data$dat2)
Data$Dat1a = as.Date(Data[,"dat1"], origin="1899-12-30")
Data$Dat1b[keep1] = as.Date(Data[keep1,"dat1"], origin="1899-12-30")
Data$Dat2a = as.Date(Data[,"dat2"], origin="1899-12-30")
Data$Dat2b[keep2] = as.Date(Data[keep2,"dat2"], origin="1899-12-30")
Data
dat1 dat2 Dat1a Dat1b Dat2a Dat2b
1 41327 41327 2013-02-22 15758 2013-02-22 15758
2 41334 41334 2013-03-01 15765 2013-03-01 15765
3 41341 41341 2013-03-08 15772 2013-03-08 15772
4 41348 41348 2013-03-15 15779 2013-03-15 15779
5 41355 NA 2013-03-22 15786 <NA> NA
6 41362 41362 2013-03-29 15793 2013-03-29 15793
7 41369 41369 2013-04-05 15800 2013-04-05 15800
8 41376 41376 2013-04-12 15807 2013-04-12 15807
9 41383 NA 2013-04-19 15814 <NA> NA
10 41390 41390 2013-04-26 15821 2013-04-26 15821
11 41397 41397 2013-05-03 15828 2013-05-03 15828
So variables Dat1b and Dat2b are not converted to Date class.
sessionInfo()
R version 2.15.2 (2012-10-26)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)
locale:
[1] fr_CA.UTF-8/fr_CA.UTF-8/fr_CA.UTF-8/C/fr_CA.UTF-8/fr_CA.UTF-8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] gdata_2.12.0
loaded via a namespace (and not attached):
[1] gtools_2.7.0
Thanks in advance,
Denis
______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list