[R] Homework help: t test hypothesis testing with summarized data?
Zembower, Kevin
kzembowe at jhuccp.org
Thu Nov 8 16:47:52 CET 2007
Peter and Moshe, thank you both for your suggestions and hints. I'm proud to say that it took me less than an hour to find my mistake:
> s_pooled <- (((n-1)*(s_x^2)) + ((m-1)*(s_y^2))) / (n+m-2)
> s_pooled
[1] 1.939521
> t_obs <- (xbar - ybar) / (sqrt(s_pooled) * (sqrt(1/n + 1/m)))
> t_obs
[1] 2.15578
> qt(c(.025, .975), n+m-2)
[1] -1.987934 1.987934
> # Therefore, reject H0 at the 0.05 level of significance.
Just to be clear about the 'homework' aspect of my questions: my homework is to work the problems out 'longhand' with just a calculator and printed tables. (In fact, 10 weeks into a 14 week course, we haven't been asked yet to use a computer.) I do this before I ask any questions regarding homework on this forum. On my own, I'm trying to answer some of the questions and examples in my textbook using R. My 'Homework help:' subject may have been misleading. I may change it to 'Extra-credit help:' to acknowledge the academic aspect of my question but distinguish it from my homework. I used 'Homework help:' because I didn't want anyone to suspect from the nature of the questions that I was trying to sneak in a homework question without acknowledging it.
Thanks, again, for all your help for this statistics student.
-Kevin
-----Original Message-----
From: Peter Dalgaard [mailto:p.dalgaard at biostat.ku.dk]
Sent: Wednesday, November 07, 2007 6:50 PM
To: Zembower, Kevin
Cc: r-help at stat.math.ethz.ch
Subject: Re: [R] Homework help: t test hypothesis testing with summarized data?
Zembower, Kevin wrote:
> Is this how a t hypothesis test is done when I don't have the actual
> data, but just the summarized statistics:
>
>> #Homework 9.2.6 [1]
>> n<-31
>> xbar<-3.10
>> s_x<-1.469
>> m<-57
>> ybar<-2.43
>> s_y<-1.35
>> s_pooled<- (((n-1)*s_x^2) + ((m-1)*s_y^2)) / (n + m - 2)
>> s_pooled
>>
> [1] 1.939521
>
>> t_obs <- (xbar - ybar) / (s_pooled * (sqrt(1/n + 1/m)))
>> t_obs
>>
> [1] 1.547951
>
>> qt(c(.025, .975), n+m-2)
>>
> [1] -1.987934 1.987934
>
>> # Therefore, fail to reject H0 at the 0.05 level of significance
>>
>>
>
> Or am I again overlooking a canned procedure or an easier calculation
> using the t distribution.
>
I don't know if someone told you last time, but there's an Internet code
of honor about helping with homework.... Don't expect more than hints.
You're on track but there's a mistake.
Here's a way of testing your result:
> x <- scale(rnorm(31))*1.469+3.10
> y <- scale(rnorm(57))*1.35+2.43
> t.test(x,y, var.equal=TRUE)
> Thank you for your continued advice and help.
>
> -Kevin
>
> [1] An Introduction to Mathematical Statistics and its Applications,
> fourth ed., Larsen and Marx.
>
> Kevin Zembower
> Internet Services Group manager
> Center for Communication Programs
> Bloomberg School of Public Health
> Johns Hopkins University
> 111 Market Place, Suite 310
> Baltimore, Maryland 21202
> 410-659-6139
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
--
O__ ---- Peter Dalgaard Øster Farimagsgade 5, Entr.B
c/ /'_ --- Dept. of Biostatistics PO Box 2099, 1014 Cph. K
(*) \(*) -- University of Copenhagen Denmark Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk) FAX: (+45) 35327907
More information about the R-help
mailing list