[R-sig-teaching] Creating data

Scott F. Kiesling kiesling+ at pitt.edu
Tue Mar 24 21:57:44 CET 2009


Thanks-
Yes I'm using that as a text. I'm trying to have something fresh so
that I can add an example to demonstrations or give assessments with
new data.

I'm also using Keith Johnson's text. 

SFK

On Wed, Mar 25, 2009 at 09:33:34AM +1300, Murray Jorgensen wrote:
> From: Murray Jorgensen <maj at stats.waikato.ac.nz>
> Date: Wed, 25 Mar 2009 09:33:34 +1300
> To: "Scott F. Kiesling" <kiesling at pitt.edu>
> Cc: r-sig-teaching at r-project.org
> Subject: Re: [R-sig-teaching] Creating data

> I take it you have looked at the data sets in Baayen's book
> "Analyzing linguistic data. A practical introduction to statistics"  ?
>
> www.ualberta.ca/~baayen/publications/baayenCUPstats.pdf
>
> Cheers,  Murray Jorgensen
>
> Scott F. Kiesling wrote:
>> Hi everyone-

>> I'm currently teaching a graduate course in statistics for linguistics
>> using R. I have used up most of the 'authentic' data I have been able
>> to collect for homework and demonstrations. I can think of plenty more
>> possible data sets, but I am finding the creation of them challenging,
>> and my creations are often somewhat unlealistic (generally, too
>> 'neat' and obvious). 

>> So, I was wondering if anyone had any tips on creating 'realistic'
>> data sets, or links/books that describe it.

>> For a simple example, let's say I want to create a dataset with
>> students from different countries and academic departments who took an
>> English test. I want to make some differences (significant and not)
>> and possibly even interactions among the scores by country and
>> department. I have been doing this through various iterations of
>> sample() and rnorm(), and jitter() to get some randomness, but things
>> are still coming out pretty neatly.  Is this the right (or a good)
>> method? Advice?

>> Thanks in advance-

>> SFK
-- 
Scott F. Kiesling, PhD

Associate Professor 
Department Chair

Department of Linguistics
University of Pittsburgh, 2816 CL
Pittsburgh, PA 15260
http://www.linguistics.pitt.edu
Office: +1 412-624-5916




More information about the R-sig-teaching mailing list