[R] importing from Stata

Dimitri Joe dimitrijoe at gmail.com
Mon Jan 16 22:29:43 CET 2006


I have a new job, and everyone here uses Stata. I won't give up on R, 
but I must learn better how to exchange data between the two softwares. 
I am now focusing on importing data from Stata to R, and I must confess 
that I am a bit disappointed with the read.dta function from the foreign 
package because IT typically happens that

(i) I get a big R file (for example, a 15Mb Stata file became a 42Mb R 
file; after cleanup.import() from the Hmisc package, it drooped to 35Mb, 
but that's still more than 2x the original Stata file) which, in turn, I 
suspect is due the fact that

(ii) factors are created using Stata labels as levels.

I wonder if

(i) there isn't a way of forcing each variable to be numeric or integer, 
maintaining it's original values (instead of "Stata labels" as "R 
levels"). Or,

(ii) some one has written another function/s to carry this task.

I'd appreciate any suggestions on how to import from Stata to R more 
Thanks in advance,


More information about the R-help mailing list