[R] normality testing with nortest
Duncan Murdoch
murdoch at stats.uwo.ca
Mon May 22 15:15:21 CEST 2006
On 5/22/2006 5:25 AM, Peter Dalgaard wrote:
> Uwe Ligges <ligges at statistik.uni-dortmund.de> writes:
>
>
>>> A = 0.1846, p-value = 0.9059
>>>
>>>> ad.test(rnorm(100,mean=5,sd=3))
>>> ...
>>> A = 0.5138, p-value = 0.1887
>>>
>>> I mistakenly had thought the p-values would be more stable since I
>>> am artificially creating a random normal distribution. Is this expected
>>> for a normality test or is this an issue with how rnorm is producing
>>> random numbers? I guess if I run it many times, I would find that I
>>> would get many large values for the p-value?
>> Well, as many large values as small values, 5% significant differences
>> for the 5% level....
>>
>> The following looks alright:
>>
>> hist(replicate(1000, ad.test(rnorm(100,mean=5,sd=3))$p.value))
>
> We see this misunderstanding worryingly often. Worrying because it
> reveals that a fundamental aspect of statistical inference has not
> been grasped: that p-values are designed to be (approximately)
> uniformly distributed and fall below any given level with the stated
> probability, when the null hypothesis is true.
I think it's the fallacious belief that the p-value measures the
probability that the null hypothesis is true. This is currently
misunderstanding #1 in the Wikipedia entry for P-values.
(Google had me worried: I searched for "probability that the null
hypothesis is true" and found
> P-value - Wikipedia, the free encyclopedia
> The p-value is the probability that the null hypothesis is true, justifying the "rule" of considering as significant p-values closer to 0 (zero). ...
> en.wikipedia.org/wiki/P-value - 17k - Cached - Similar pages
This quote is preceded by: "All of the following [...] statements are
false:" Context is important! :-)
The vast majority of hits to that search also pointed out that this
interpretation was incorrect. A couple of counterexamples were a
"research methods" page at a department of psychology, and
another at a medical school. I'll send a copy of this note to people there.
Duncan Murdoch
>
> There is no mechanism to give you "fewer significant" or "more stable"
> p-values, and a p-value close to one is no better an indication of a
> true null hypothesis than one of 0.5 or 0.25.
>
More information about the R-help
mailing list