[R] anova analysis on factors...
David Winsemius
dwinsemius at comcast.net
Thu Dec 8 22:17:04 CET 2011
On Dec 8, 2011, at 3:28 PM, Michael wrote:
> Hi all,
>
> If we wanted to study the effect on the mean of the hourly data
> based on
> the hours within a day...
>
> and we wanted to do Anova analysis...
>
> We have two choices:
Who is "we" and how were these constraints imposed?
>
> Please see below:
>
> Why are these two approaches giving very different p-values?
They are markedly different statistical models.
> And which one
> shall I use?
>
Without knowing your situation better and the eventual purposes of
this analysis, it would be difficult to give sensible advice. I
suspect the answer is "neither".
--
David.
> Thanks a lot!
>
> 1. treating the hours as double/floating numbers:
>
>
> anova(lm(hourlydata~as.double(hours_factors)))
>
> Df Sum Sq Mean Sq F value Pr(>F)
>
> as.double(hours_factors) 1 0.0002 0.00019876 1.3425 0.2466
>
> Residuals 14868 2.2013 0.00014806
>
> 2. treating the hours as factors:
>
>
>
> anova(lm(hourlydata~hours_factors))
>
> Df Sum Sq Mean Sq F value Pr(>F)
>
> hours_factors 9 0.00077 8.5979e-05 0.5806 0.8142
>
> Residuals 14860 2.20072 1.4810e-04
>
> [[alternative HTML version deleted]]
David Winsemius, MD
West Hartford, CT
More information about the R-help
mailing list