[R] Odp: nls, how to determine function?
Petr PIKAL
petr.pikal at precheza.cz
Tue Aug 9 10:38:35 CEST 2011
Hi
> Hi R help,
>
> I am trying to determine how nls() generates a function based on the
> self-starting SSlogis and what the formula for the function would be.
> I've scoured the help site, and other literature to try and figure
> this out but I still am unsure if I am correct in what I am coming up
> with.
Thanks for providing data and your code
>
>
>
**************************************************************************
> dat <- c(75.44855206,NA,NA,NA,82.70745342,82.5335019,88.56617647,80.
> 00128866,94.15418227,86.63987539,93.91052952,74.10612245,86.62289562,90.
> 47961047,NA,NA,82.45320197,72.14371257,NA,71.44104803,72.59742896,68.
> 36363636,NA,NA,61,NA,NA,71.26502909,NA,85.93333333,84.34248284,79.
> 00522193,79.64223058,97.2074017,88.43700548,96.40413877,95.13511869,92.
> 57379057,93.97498475,NA,97.55995131,89.53321146,97.21728545,93.21980198,
> 77.54054054,95.85392575,86.25684723,97.55325624,80.03950617,NA,91.
> 34023128,92.42906574,88.59433962,65.77272727,89.63772455,NA,NA,NA,NA,74.
> 86344239,83.57594937,70.22516556,65.30543319,NA,NA,67.84852294,60.
> 90909091,54.79303797,NA,52.18735363,33.47003155,NA,41.34693878,24.
> 5047043,NA,NA,NA,NA,9.944444444,13.6875,NA,11.90267176,84.14285714,3.
> 781456954,NA,1.432926829,4.26557377,1.823529412,0.444620253,4.
>
711155378,NA,6.320284698,0.581632653,0.144578313,3.666666667,0,0,0,0,0,NA,
> 0.032947462,0,0,10.54545455,0,NA,0.561007958,0.75,NA,0.048780488,0.
> 74137931,NA,2.023339318,0,0,0,NA,NA,0.156950673,NA,0.283769634,32.
>
81818182,NA,NA,0,NA,0,0,0,NA,0.212454212,3.120181406,NA,0.011811024,NA,0,
>
0.120430108,5.928571429,1.75,0.679292929,0.97,NA,0,NA,NA,1,0.38547486,NA,
> 1.460732984,0.007795889,0.05465288,0.004341534)
> dat.df.1 <- data.frame(dat)
unnecessary
> dat.df.2 <- data.frame(x=x.seq, dat.df=dat.df.1)
some correction
dat.df.2 <- data.frame(x=seq_along(dat), dat=dat)
> fit.dat <-nls(dat ~ SSlogis(x, Asym, xmid,scal), data = dat.df.2,
> start =list(Asym=90, xmid = 75, scal = -6))
> plot(dat.df.2, axes=FALSE, ann=FALSE, ylim=c(0,100))
> lines(dat.df.2$x[complete.cases(dat.df.2)], predict(fit.dat),
ylim=c(0,100))
>
> summary(fit.dat)
>
>
**************************************************************************
> Formula: dat ~ SSlogis(x, Asym, xmid, scal)
>
> Parameters:
> Estimate Std. Error t value Pr(>|t|)
> Asym 85.651 1.716 49.900 < 2e-16 ***
> xmid 72.214 1.036 69.697 < 2e-16 ***
> scal -6.150 0.850 -7.236 7.9e-11 ***
> ---
> Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
>
> Residual standard error: 10.33 on 105 degrees of freedom
>
> Number of iterations to convergence: 10
> Achieved convergence tolerance: 4.405e-06
> (45 observations deleted due to missingness)
>
**************************************************************************
>
> >From r-help, SSlogis parameters asym, xmid and scal are defined as:
>
> Asym: a numeric parameter representing the asymptote.
>
> xmid: a numeric parameter representing the x value at the inflection
> point of the curve. The value of SSlogis will be Asym/2 at xmid.
>
> scal: a numeric scale parameter on the input axis.
>
> and it states that the value of SSlogis "is a numeric vector of the
> same length as input. It is the value of the expression
> sym/(1+exp((xmid-input)/scal)). If all of the arguments Asym, xmid,
> and scal are names of objects the gradient matrix with respect to
> these names is attached as an attribute named gradient."
>
> However, how do I get the actual function for the curve that is
> generated? I don't think it can just be: y=
> asym/((1+e^((xmid-x)/scal)))?
Yes. I think that the best source of information about nonlinear
regression is book by Bates, Pinheiro - Mixed effect models with S and S+.
There you can find how to determine starting parameters, how to construct
and use your own function together with selfstart feature.
>
> Also, how do you determine the starting parameters to input in for
> asym, xmin, and scal?
>
> Perhaps I need to start at the beginning and define my own function,
> and not rely on SSlogis to provide it?
>
> What I want to be able to do is determine a local maximum for my curve
> (the x value at which this curve inflects (the upper inflection)), and
> the x value for the local minimum (the lower inflection curve), and
> the x value counts in between these values. I think in order to do
> this I need to differentiate the function.
Maybe I do not understand well but looking at the picture it seems to me
that logistic model is fitting your data quite well. You can use also four
parameter logistic model.
> fit.dat.2 <-nls(dat ~ SSfpl(x, A, B, xmid,scal), data = dat.df.2, start
=list(A=85.65, B=0, xmid = 72, scal = -6))
> summary(fit.dat.2)
Formula: dat ~ SSfpl(x, A, B, xmid, scal)
Parameters:
Estimate Std. Error t value Pr(>|t|)
A 1.6729 1.5927 1.050 0.296
B 85.5555 1.7065 50.134 < 2e-16 ***
xmid 71.7628 1.0762 66.679 < 2e-16 ***
scal -5.8051 0.9162 -6.336 6.13e-09 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 10.32 on 104 degrees of freedom
Number of iterations to convergence: 9
Achieved convergence tolerance: 7.629e-06
(45 observations deleted due to missingness)
As you can see parameter A is insignificant so simple logistic can be used
too. In that case upper asymptote is 85.6, lower asymptote is zero,
inflection point is 72 (x where y is in the middle between both
asymptotes) and scal is rate at which the curve is falling (growing).
There is however some wave in the beginning of your data
fit <-loess(dat ~ x, data = dat.df.2, span=0.3)
lines(dat.df.2$x[complete.cases(dat.df.2)], predict(fit), col=3)
So it is up to you to decide if you are satisfied with getting asymptotic
values from logistic model or you want to set something more elaborated.
Regards
Petr
>
> Any insight on this would be greatly appreciated.
>
> Sincerely,
>
> Katrina
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list