[R] Fw: Logistic regresion - Interpreting (SENS) and (SPEC)
Frank E Harrell Jr
f.harrell at vanderbilt.edu
Thu Oct 16 13:49:55 CEST 2008
Gad Abraham wrote:
> Frank E Harrell Jr wrote:
>> Gad Abraham wrote:
>>>> This approach leaves much to be desired. I hope that its
>>>> practitioners start gauging it by the mean squared error of
>>>> predicted probabilities.
>>>
>>> Is the logic here is that low MSE of predicted probabilities equals a
>>> better calibrated model? What about discrimination? Perfect calibration
>>
>> Almost. I was addressed more the wish for the use of strategies that
>> maximize precision while keeping bias to a minimim.
>>
>>> implies perfect discrimination, but I often find that you can have two
>>
>> That doesn't follow. You can have perfect calibration in the large
>> with no discrimination.
>
> I'm not sure I understand: if you have perfect calibration, so that you
> correctly assign the probability Pr(y=1|x) to each x, doesn't it follow
> that the x will also be ranked in correct order of probability, which is
> what the AUC is measuring?
You can have a prediction model that assigns Pr(y=1|x) to a range of
0.45 to 0.55 such that the probabilities are perfectly accurate, but the
ROC area is 0.6.
>
>>
>>> competing models, the first with higher discrimination (AUC) and
>>> worse calibration, and the the second the other way round. Which one
>>> is the better model?
>>
>> I judge models on the basis of both discrimination (best measured with
>> log likelihood measures, 2nd best AUC) and calibration. It's a
>> two-dimensional issue and we don't always know how to weigh the two.
>> For many purposes calibration is a must. In those we don't look at
>> discrimination until calibration-in-the-small is verified at high
>> resolution.
>
> By "log likelihood measures" do you mean likelihood-ratio tests?
I mean generalized R^2, log-likelihood, or the adequacy index in my book.
Frank
>
--
Frank E Harrell Jr Professor and Chair School of Medicine
Department of Biostatistics Vanderbilt University
More information about the R-help
mailing list