[R] Regsubsets model selection

Maximilian Lklweryc maxlklweryc at gmail.com
Tue Sep 25 14:22:21 CEST 2012


Hi,
I have 12 independent variables and one dependent variable. Now I want to
select the best adj. R squared model by using the regsubsets command, so I
code:

> plot(regsubsets(Gesamt ~ CommunistSocialist + CountrySize + GNI + Lifeexp
+ Schoolyears + ExpMilitary + Mortality +
+   PopPoverty + PopTotal + ExpEdu + ExpHealth, data=olympiadaten, nbest=1,
nvmax=12), scale='adjr2')

Then I get the picture I attached. The problem is, that the best model has
an adjusted R squard of 0.49. But if I regress e.g. my y on only the
variable PopTotal, then I already get an adjusted R squared of 0.779! So
this simple model is way better but it is not recognized by the regsubsets
command. I don't know why R does this and how can I change this?

And a general question: If I take the best model by AIC, does this model
also has the highest (best) adj. R squared? Should I select my models by
information criterions or by R squared? And what is exactly the difference,
I mean, both take into account the fitting and the nunber of variables
right? Thanks a lot!


Thanks a lot for your help!
-------------- n?chster Teil --------------
Ein Dateianhang mit Bin?rdaten wurde abgetrennt...
Dateiname   : subsets.png
Dateityp    : image/png
Dateigr??e  : 8196 bytes
Beschreibung: nicht verf?gbar
URL         : <https://stat.ethz.ch/pipermail/r-help/attachments/20120925/9dd3e1ba/attachment.png>


More information about the R-help mailing list