[R] Summary coefficients give NA values because of singularities

Gathurst ajdowd at hotmail.co.uk
Mon Dec 5 21:57:34 CET 2011


Hello,

I have a data set and I am trying to find the model that includes the most
significant parameters. Most importantly, I need the p-values for all of
them.  The full model is of the form:
  sad[,1] ~ b_1 sad[,2] + b_2 sad[,3] + b_3 sad[,4] + b_4 sad[,5] + b_5 sad[,6]
            + b_6 sad[,7] + b_7 sad[,8] + b_8 sad[,9] + b_9 sad[,10],
where the 9 variables on the right hand side are all indicator variables.
What I don't understand is the line
  sad[, 10]         NA         NA      NA       NA
in the output, which comes with the note 'Coefficients: (1 not defined
because of singularities)'.

Based on my previous attempts at figuring this out, I think the output is
taking sad[,10] as the intercept, which I find a bit weird considering that
sad[,10] is either 0 or 1.  How do I produce the correct output showing all
p-values?

My code and output are as follows:

sad <- matrix(1, ncol = 11, nrow = 486)
# Fill columns 1 to 10 with the category code (column 2 of d, row 357 dropped)
sad[, 1:10] <- d[, 2][-357]
# Column 1 holds the response (column 29 of d, row 357 dropped)
sad[, 1] <- d[, 29][-357]
# Turn column k + 1 into a 0/1 indicator for category code k, k = 1..9
for (k in 1:9) {
  sad[, k + 1] <- as.numeric(sad[, k + 1] == k)
}
sad

summary(lm(sad[,1]~sad[,2]+sad[,3]
+sad[,4]+sad[,5]+sad[,6]
+sad[,7]+sad[,8]+sad[,9]+sad[,10]))

Call:
lm(formula = sad[, 1] ~ sad[, 2] + sad[, 3] + sad[, 4] + sad[, 
    5] + sad[, 6] + sad[, 7] + sad[, 8] + sad[, 9] + sad[, 10])

Residuals:
    Min      1Q  Median      3Q     Max 
-3.3191 -0.3893  0.0519  0.7436  1.0519 

Coefficients: (1 not defined because of singularities)
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)  4.34091    0.14495  29.947   <2e-16 ***
sad[, 2]    -0.16142    0.18128  -0.890   0.3737    
sad[, 3]    -0.23221    0.20275  -1.145   0.2527    
sad[, 4]     0.17832    0.19695   0.905   0.3657    
sad[, 5]     0.06450    0.21447   0.301   0.7638    
sad[, 6]    -0.15909    0.18713  -0.850   0.3957    
sad[, 7]    -0.39286    0.18171  -2.162   0.0311 *  
sad[, 8]    -0.08450    0.21146  -0.400   0.6896    
sad[, 9]    -0.02176    0.20170  -0.108   0.9141    
sad[, 10]         NA         NA      NA       NA    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 

Residual standard error: 0.9615 on 477 degrees of freedom
Multiple R-squared: 0.02984,    Adjusted R-squared: 0.01357 
F-statistic: 1.834 on 8 and 477 DF,  p-value: 0.06869 
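
For readers without access to d, the NA row can be reproduced with simulated
data. This is only a minimal sketch, and it rests on an assumption that is not
confirmed by the data above: that every observation falls in exactly one of
nine categories coded 1 to 9. The names grp, y and X are made up for
illustration.

```r
# Simulated stand-in for the real data; 'grp', 'y' and 'X' are hypothetical names.
set.seed(1)
n   <- 486
grp <- sample(rep_len(1:9, n))   # assumption: each row has exactly one code in 1..9
y   <- rnorm(n, mean = 4)        # response generated independently of grp

# Nine 0/1 indicator columns, one per category code
X <- sapply(1:9, function(k) as.numeric(grp == k))

fit <- lm(y ~ X[, 1] + X[, 2] + X[, 3] + X[, 4] + X[, 5] +
              X[, 6] + X[, 7] + X[, 8] + X[, 9])

# Under the assumption above, the nine indicators add up to the intercept column
all(rowSums(X) == 1)

# The coefficient of the last indicator is reported as NA
coef(fit)
```

With this setup the fit also has 477 residual degrees of freedom, the same as
in the summary above.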

Thanks in advance.
