[R] Finding values in a dataframe at a specified hour
Alexandra Catena
amc5981 at gmail.com
Fri Apr 10 22:07:32 CEST 2015
Hello,
I have a large dataframe (windHW) of wind speeds (ws) at each hour
from many days over a set of years. Some of these values are
obviously wrong (600 m/s) and I want to get rid of all the values that
are larger than 5*sigma for each hour. The 5*sigma (variable name
sigma5) values are located in different dataframes for each season,
with each dataframe titled as a season. For example, in the
dataframe, spring, the 5*sigma value is 79.6 m/s for hour 1.
So my question is as follows: how can I get it so that the code will
be able to find all the wind speed values in the dataframe, windHW, of
a specific hour be higher than the 5*sigma value at that hour?
For example, I would like to find if any of the wind speed values at
hour 1 are higher than 79.6 m/s, and if so, then replace that value
with NA.
I have something like this but I can't seem to figure out how to get
it for specific hours:
windHW$ws[windHW$ws>=spring$sigma5] <- NA
I imported the data using readLines and into the dataframe windHW. I
also have R version 3.1.1
Any help would be appreciated!
Thanks,
Alexandra
More information about the R-help
mailing list