[Rd] ks.test (PR#1004)

Peter Dalgaard BSA p.dalgaard@biostat.ku.dk
27 Oct 2001 00:43:43 +0200


charlie@muskrat.stat.umn.edu writes:

> For example, Example 5.4 in Hollander and Wolfe (Nonparametric Statistical,
> Methods, 2nd ed., Wiley, 1999, pp. 180-181)
> 
> R Version 1.3.1 (SuSE Linux 7.1)
> 
> > X <- read.table(url("http://www.stat.umn.edu/geyer/5601/hwdata/t5-7.txt"),
> +     header = TRUE)
> > names(X)
> [1] "x" "y"
> > attach(X)
> > ks.test(x, y)
>  
>         Two-sample Kolmogorov-Smirnov test
>  
> data:  x and y
> D = 0.6, p-value = 0.01234
> alternative hypothesis: two.sided
> 
> Not hardly.  Hollander and Wolfe say the exact P-value is 0.0524.  Note

Here's part of the problem:

Browse[1]> .C("psmirnov2x", p = as.double(STATISTIC), as.integer(n.x),
as.integer(n.y), PACKAGE = "ctest")$p
[1] 0.9876594
Browse[1]> .C("psmirnov2x", p = as.double(0.6), as.integer(n.x),
as.integer(n.y), PACKAGE = "ctest")$p
[1] 0.9475524
Browse[1]> STATISTIC
[1] 0.6

Perplexed? The reason is of course that

Browse[1]> STATISTIC-0.6
[1] 1.110223e-16

and the KS distribution is discontinuous, so the point probability at
0.6 didn't get counted. Looks like we need a fudge factor.

-- 
   O__  ---- Peter Dalgaard             Blegdamsvej 3  
  c/ /'_ --- Dept. of Biostatistics     2200 Cph. N   
 (*) \(*) -- University of Copenhagen   Denmark      Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard@biostat.ku.dk)             FAX: (+45) 35327907
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-devel mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-devel-request@stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._