[R] Generalized linear models (GLM)
David Winsemius
dwinsemius at comcast.net
Tue Apr 28 21:31:01 CEST 2009
Although both are known now, there is a time element involved in
which one, max.loss was fixed at the time of underwriting and loss was
unknown at that time. This *is* an insurance question is it not?
Wouldn't the question be: Can one use the group variable to estimate
the proportion of max.loss actually lost following underwriting?
I would vote "no" on all of those offered solutions. If you believe
otherwise, you should write an explanation to a hypothetical professor
what the output of any of the models actually means.
--
David Winsemius
On Apr 28, 2009, at 2:38 PM, mathallan wrote:
>
> Actually both max.loss and loss are known values (in dollars). I'm
> very much
> doubt, what to choose.
>
>
> glm(max.loss~loss,family=gaussian(link="identity")
>
> or
>
> glm(formula = sum ~ claims * as.factor(grp), family = gaussian(link =
> "identity"))
There are no variables named claims or sum. "sum" would be a bad
choice in variable names because of the potential confusion with the
function of the same name.
>
>
> or
> glm(loss~max.loss,family=gaussian(link="identity")
>
> we have to look at gaussian and gamma, with link identity and log.
>
> But my problem is what is going to be between the ~
>
>
>
> David Winsemius wrote:
>>
>> I think you are off-track because max.loss does not sound like a
>> proper Y variable. Because max.loss is an amount that is known, in
>> the
>> insurance applications I have seen it would have been modeled within
>> an offset term. Many of the examples have used number of ships or
>> buildings or the person years of exposure but I do not see that the
>> general strategy is limited to only such considerations.
>>
>> I would also suggest that you consider links other than Gaussian,
>> perhaps negative binomial.
>>
>> The task for the analyst is then to translate output from the chosen
>> model into interpretable meaning on the scale of interest, but I
>> assume your course instructor will help with that.
>>
>> --
>> David Winsemius
>> On Apr 28, 2009, at 11:34 AM, mathallan wrote:
>>
>>>
>>> Hi
>>>
>>> I got a dataset
>>>
>>> loss max.loss grp
>>> 1 10 50 2
>>> 2 15 33 1
>>> 3 18 49 2
>>> 4 33 38 1
>>> 5 8 50 3
>>> 6 19 29 1
>>> 7 22 51 4
>>> 8 50 50 2
>>> 9 16 38 1
>>> 10 24 30 3
>>>
>>> were loss and max.loss are monetary values (in dollar). Grp is group
>>> number.
>>>
>>> By use of GLM, I have to determine the effect of max.loss and grp
>>> (and
>>> interactions between them) on loss. My question is how to do this.
>>>
>>> Is it something like
>>>
>>> glm(max.loss~loss,family=gaussian(link="identity")
>>>
>>> were ofcourse I can change gaussian with Gamma,... and link with
>>> log,...
>>>
>>> But am I on right track, or what should I change?
>>>
>>>
>>> Thanks
>>> --
>>> View this message in context:
>>> http://www.nabble.com/Generalized-linear-models-%28GLM%29-tp23279588p23279588.html
>>> Sent from the R help mailing list archive at Nabble.com.
>>>
>>> ______________________________________________
>>> R-help at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>>
>> David Winsemius, MD
>> Heritage Laboratories
>> West Hartford, CT
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>>
>
> --
> View this message in context: http://www.nabble.com/Generalized-linear-models-%28GLM%29-tp23279588p23283633.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
Heritage Laboratories
West Hartford, CT
More information about the R-help
mailing list