[R-sig-Geo] To validate logistic regression

Dylan Beaudette debeaudette at ucdavis.edu
Tue Apr 26 15:58:56 CEST 2011


Another approach:

See ?lrm, ?validate, and ?calibrate from the rms package. 

Dylan

On Tuesday, April 26, 2011, Bram Van Moorter wrote:
> Dear Komine,
> Not sure whether this is the easiest way, but it has worked for me:
> 
> set.seed(0)
> head(tab <- data.frame(Y=as.numeric(runif(100)>0.5), X=rnorm(100)))
> subs <- sample(c(1:nrow(tab)), round(nrow(tab)*0.66), replace=F)  #the
> 66% of data you want in one sample
> tab1 <- tab[subs, ] #the one sample
> tab2 <- tab[!c(1:nrow(tab)) %in% subs, ] #the other sample, which are
> the data that do not fall in the first sample
> 
> rlog1 <- glm(Y~X,family=binomial,data=tab1)
> summary(rlog1)
> tab2$pred <-predict(rlog1, newdata=tab2, type="response")
> hist(tab2$pred)
> 
> library(ROCR) #allows you to make easily ROC's which allows the
> assessment of your prediction
> pred <- prediction(tab2$pred, tab2$Y)
> perf <- performance(pred,"tpr","fpr")
> plot(perf); abline(0, 1, col="red")  #the proportional line shows that
> the prediction is as good as random, which you would expect in this
> example
> 
> Best,
> Bram
> 
> 
> > Hi,
> > I would like your help to validate my logistic regression. I know how to 
do
> > logistic regression.
> >
> > rlog<-glm(Y~X,family=binomial,data=tab)
> > summary(rlog)
> > HLgof.test(fit = fitted(rlog), obs=Y)
> >
> > However, I would like to validate my model. For example to divise my data 
in
> > a sample for training (66%) and a sample for validation (34%).
> > e.g for ma table
> > Area   Y     X
> > 1       1     135
> > 1       0     200
> > 1       1      97
> > 1       1     160
> > 1       0     201
> > 1       1     144
> > 1       0     100
> >
> > But I don't know how to validate it.
> > 1- My first problem: How to create my 2 samples from my variables Y and X
> > using pourcentage 66 ang 34 %?
> >
> > - How to have the pourcentage of good prediction and bad prediction?
> >
> > Thanks for your Help
> > Komine
> >
> 
> 
> -- 
> Bram Van Moorter
> Centre for Conservation Biology (NTNU),
> Norwegian Institute for Nature Research (NINA)
> Trondheim (Norway)
> email:  Bram.Van.Moorter at gmail.com
> website: http://ase-research.org/moorter
> phone: +47 73596060
> 
> _______________________________________________
> R-sig-Geo mailing list
> R-sig-Geo at r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-geo
> 


-- 
Dylan E. Beaudette
USDA-NRCS Soil Scientist
California Soil Resource Lab
http://casoilresource.lawr.ucdavis.edu/



More information about the R-sig-Geo mailing list