[R] Difference Between R: wilcox.test and STATA: signrank
peter dalgaard
pdalgd at gmail.com
Mon Aug 9 15:52:39 CEST 2010
On Aug 9, 2010, at 3:03 PM, Alain Guillet wrote:
> Hi,
>
> Look at the output of the test made in R and you can see it is a Wilcoxon rank sum test and not a Wilcoxon signed rank test.
It might be helpful to add that paired=TRUE is needed in the call to get the signed-rank test.
> If there are ties, I know I prefer wilcox.exact from the exactRankTests.
>
(Not that much of an issue in larger sample sizes, I'd say. Even with binary data, the normal approximation works reasonably well under the usual assumptions of expected counts > 5, since the tie-adjustment for the variance is exact for the distribution of the ranks. The continuity correction doesn't quite work though. Anyways, wilcox.exact is of course a nice thing to have.)
> Alain
>
> On 09-Aug-10 12:43, Capasia wrote:
>> This is my first post to the mailing list and I guess it's a pretty stupid
>> question but I can't figure it out. I hope this is the right forum for these
>> kind of questions.
>>
>> Before I started using R I was using STATA to run a Wilcoxon signed-rank
>> test on two variables. See data below:
>>
>> https://spreadsheets.google.com/pub?key=0ApodAA2GAEP_dDZkdzZHSFBqX1JHOWJBX1dMQUZCVkE&hl=en&output=html<%20%20https://spreadsheets.google.com/pub?key=0ApodAA2GAEP_dDZkdzZHSFBqX1JHOWJBX1dMQUZCVkE&hl=en&output=html>
>>
>> STATA Output:
>> . signrank x=y
>>
>> Wilcoxon signed-rank test
>>
>> sign | obs sum ranks expected
>> -------------+---------------------------------
>> positive | 41 3101 2330.5
>> negative | 18 1560 2330.5
>> zero | 49 1225 1225
>> -------------+---------------------------------
>> all | 108 5886 5886
>>
>> unadjusted variance 106438.50
>> adjustment for ties -282.38
>> adjustment for zeros -10106.25
>> ----------
>> adjusted variance 96049.88
>>
>> Ho: transfer_2_a = transfer_2_b
>> z = 2.486
>> Prob> |z| = *0.0129*
>>
>> When running a Wilcoxon signed-rank test
>>
>>
>>> wilcox.test(datablatt$x, datablatt$y)
>> Wilcoxon rank sum test with continuity correction
>>
>> data: datablatt$x and datablatt$y
>> W = 7059.5, p-value = *0.09197*
>> alternative hypothesis: true location shift is not equal to 0
>>
>> As you can see the p Values are different (one with H0 rejection and the
>> other one not). I tested whether it could be that the STATA one isn't paired
>> but this doesn't seem to be the problem.
>>
>> I'm dumbfound what could lead to such a difference. I couldn't find any
>> seetings I have missed but I somehow I guess I'm using the function in the
>> wrong way...
>> Any ideas?
>> Thanks a lot in advance!
>>
>> [[alternative HTML version deleted]]
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>
> --
> Alain Guillet
> Statistician and Computer Scientist
>
> SMCS - IMMAQ - Université catholique de Louvain
> Bureau c.316
> Voie du Roman Pays, 20
> B-1348 Louvain-la-Neuve
> Belgium
>
> tel: +32 10 47 30 50
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
--
Peter Dalgaard
Center for Statistics, Copenhagen Business School
Solbjerg Plads 3, 2000 Frederiksberg, Denmark
Phone: (+45)38153501
Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com
More information about the R-help
mailing list