[R] Unexplained behavior of level names when using ordered factors in lm?

Bert Gunter gunter.berton at gene.com
Fri Dec 2 16:06:30 CET 2011


?ordered
?C
?contr.poly

If you don't know what polynomial contrasts are, consult any good
linear models text. MASS (Venables and Ripley, Modern Applied Statistics
with S) has a good, though somewhat terse, section on this.
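
To make the connection to the help pages concrete, here is a minimal
sketch using the data from the question quoted below. It assumes R's
default options("contrasts"), under which ordered factors are coded with
contr.poly; the object names fit_poly and fit_trt are only illustrative.
The suffixes .L, .Q, .C and ^4 label the linear, quadratic, cubic and
fourth-degree polynomial terms, not factor levels.

```r
# Data as in the question (T spelled out as TRUE).
set.seed(4254)
a <- data.frame(y = rnorm(40), x = ordered(sample(1:5, 40, TRUE)))

# Default contrast functions for unordered and ordered factors.
getOption("contrasts")

# The contrast matrix attached to the ordered factor: one column per
# polynomial degree rather than one per level.
contrasts(a$x)
colnames(contr.poly(5))   # ".L" ".Q" ".C" "^4"

# Default fit: coefficients are named after the polynomial contrasts.
fit_poly <- lm(y ~ x, data = a)
names(coef(fit_poly))

# Same model with treatment contrasts: coefficients are named after
# the factor levels (e.g. x2, x3, ...) instead.
fit_trt <- lm(y ~ x, data = a, contrasts = list(x = "contr.treatment"))
names(coef(fit_trt))

# Only the parameterization changes; the fitted values agree.
all.equal(fitted(fit_poly), fitted(fit_trt))
```

Both fits describe the same model, so the residuals, R-squared and overall
F-statistic are unchanged; only the meaning and names of the individual
coefficients differ.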

-- Bert

On Fri, Dec 2, 2011 at 6:51 AM, Tal Galili <tal.galili at gmail.com> wrote:
> Hello dear all,
>
> I am unable to understand the output I get when I run the following three lines:
>
>> set.seed(4254)
>> a <- data.frame(y = rnorm(40), x = ordered(sample(1:5, 40, TRUE)))
>> summary(lm(y ~ x, a))
>
>
> The output I get includes coefficient names that do not correspond to the
> factor levels I am actually using:
>
>> Call:
>> lm(formula = y ~ x, data = a)
>>
>> Residuals:
>>     Min      1Q  Median      3Q     Max
>> -1.4096 -0.6400 -0.1244  0.5886  2.1891
>>
>> Coefficients:
>>             Estimate Std. Error t value Pr(>|t|)
>> (Intercept) -0.03276    0.15169  -0.216    0.830
>> x.L         -0.28968    0.33866  -0.855    0.398
>> x.Q         -0.38813    0.33851  -1.147    0.259
>> x.C         -0.27183    0.34027  -0.799    0.430
>> x^4          0.25993    0.33935   0.766    0.449
>>
>> Residual standard error: 0.9564 on 35 degrees of freedom
>> Multiple R-squared: 0.08571, Adjusted R-squared: -0.01878
>> F-statistic: 0.8202 on 4 and 35 DF,  p-value: 0.5211
>
>
> I am guessing that this has something to do with the contrast matrix
> that is used, but this is not clear to me.
> Can anyone suggest a good read, or an explanation?
>
> Thanks.



-- 

Bert Gunter
Genentech Nonclinical Biostatistics



