[R] SAS "datalines" or "cards" statement equivalent in R?
Rolf Turner
r.turner at auckland.ac.nz
Mon Dec 7 20:22:14 CET 2009
On 8/12/2009, at 5:43 AM, Barry Rowlingson wrote:
> On Mon, Dec 7, 2009 at 3:53 PM, Marshall Feldman <marsh at uri.edu>
> wrote:
>> Regarding the various methods people have suggested, what if a
>> typical
>> tab-delimited data line looks like:
>>
>> SMS11000000000000001 1990 M01 688.0
>>
>> and the SAS INPUT statement is
>>
>> INPUT survey $ 1-2 seasonal $ 3 state $ 4-5 area $ 6-10
>> supersector $
>> 11-12 @13 industry $8. datatype $ 21-22 year period $ value
>> footnote $ ;
>>
>> Note that most data lines have no footnote item, as in the sample.
>>
>> Here (I think) we'd want all the character variables to be read as
>> factors,
>> possibly "year" as a date, and "value" as numeric.
>
> Actually I'm surprised that nobody has yet said what a clearly
> bonkers thing it is to mix up your data and your analysis code in a
> single file. Now suppose you have another set of data you want to
> analyse with the same code? Are you going to create a new file and
> paste the new data in? You've now got two copies of your analysis code
> - good luck keeping corrections to that code synchronised.
>
> This just seems like horrendously bad practice, which is one reason
> it's kludgy in R. If it was good practice, someone would surely have
> written a way to do it neatly.
>
> Keep your data in data files, and your functions in .R function
> files. You'll thank me later.
Indeed. The idea of keeping data and code in the same file is a symptom
of blinkered SAS orientation. Get out of the SAS way of thinking and
get into the R way of thinking. You'll be glad you did!
To paraphrase Eric Blair: ``SAS-thinkers unbellyfeel R-speak.''
cheers,
Rolf Turner
######################################################################
Attention:\ This e-mail message is privileged and confid...{{dropped:9}}
More information about the R-help
mailing list