[R] Imputing missing values
Manoj - Hachibushu Capital
Wanzare at HCJP.com
Wed Sep 1 11:01:36 CEST 2004
How about the following code below?
Price[is.na(price)] = mean(Price[-which(is.na(price))]);
HTH
Manoj
-----Original Message-----
From: r-help-bounces at stat.math.ethz.ch
[mailto:r-help-bounces at stat.math.ethz.ch] On Behalf Of Jan Smit
Sent: Wednesday, September 01, 2004 5:44 PM
To: R-help at stat.math.ethz.ch
Subject: [R] Imputing missing values
Dear all,
Apologies for this beginner's question. I have a
variable Price, which is associated with factors
Season and Crop, each of which have several levels.
The Price variable contains missing values (NA), which
I want to substitute by the mean of the remaining
(non-NA) Price values of the same Season-Crop
combination of levels.
Price Crop Season
10 Rice Summer
12 Rice Summer
NA Rice Summer
8 Rice Winter
9 Wheat Summer
Price[is.na(Price)] gives me the missing values, and
by(Price, list(Crop, Season), mean, na.rm = T) the
values I want to impute. What I've not been able to
figure out, by looking at by and the various
incarnations of apply, is how to do the actual
substitution.
Any help would be much appreciated.
Jan Smit
______________________________________________
R-help at stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide!
http://www.R-project.org/posting-guide.html
More information about the R-help
mailing list