[R] Conditional Statistics
s.blomberg1 at uq.edu.au
Tue Jan 8 05:30:30 CET 2013
You can use the tapply function to do this. You can't type a line into
the mean statement. (See ?mean for what you can type in there). The
general approach is to have a vector of data (stock prices) and a
categorical variable (day of week). Then break up the data vector
according to the levels in the categorical variable, and calculate the
Weekmeans <- tapply(data.vector, catvariable, mean)
This will give you the means for all days. If you really just want one
mean (just monday), you could do:
Monmean <- mean(data.vector[catvariable=="Monday"])
Similarly, if you want the standard deviation for each day of the week,
you would use:
WeekSD <- tapply(data.vector, catvariable, sd)
MonSD <- sd(data.vector[catvariable=="Monday"])
You will find that some things that are easy in SAS require a little
more thought in R, and vice versa. Certainly, the philosophical approach
to data analysis in R is different to that in SAS. There are a couple of
books for R for SAS users. They might help you.
On 08/01/13 11:17, Joseph Norman Thomson wrote:
> I am a new user of R. I am coming from SAS and do statistics on stock
> market data, economic data, and social data. My question is this: How
> can you get the mean, standard dev, etc. of a variable based on a
> conditional statement on either the same variable or a different
> variable in the same data set? So if I had the closing prices of the
> S&P from 01/01/1990-12/31/1990, how could I get the average price of
> the S&P from 02/01/1990-03/15/1990? Or the average price of the S&P on
> Mondays (assuming a dummy var is created for 1 = Monday, 0 = else). I
> understand that you can create subsets and new data sets based on the
> conditional statements; but is there an easier way to do this by
> typing a line into the mean() statement? That was extremely easy in
> SAS where you could say:
> proc means data=sp500;
> var price;
> where monday = 1;
> Thank you for your help.
> R-help at r-project.org mailing list
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
Simon Blomberg, BSc (Hons), PhD, MAppStat, AStat.
Lecturer and Consultant Statistician
School of Biological Sciences
The University of Queensland
St. Lucia Queensland 4072
T: +61 7 3365 2506
1. I will NOT analyse your data for you.
2. Your deadline is your problem.
Statistics is the grammar of science - Karl Pearson.
More information about the R-help