[R-sig-DB] CSV input returns unexpected and unwanted numbers.
Markus Jäntti
Thu Apr 23 07:56:58 CEST 2009
Jim Burke wrote:
> I am having extreme trouble inputting a csv file.
> Previously this worked flawlessly.
>
> Resolution attempts tried. Still problems.
> . Rebooting the PC
> . Defrag the PC
> . Reboot with the shift key down (don't run
> startup items).
> . Using old R 2.8.1
>
> PROBLEM is the first column incorrectly comes
> out in scientific notation.
>
> My csv data file named h01369_short_blk.csv.
> The data file is below.
> ===================================
> "sctbkey","district"
> "480019501001000",8
> "480019501001001",8
> "480019501001002",8
> "480019501001003",8
> "480019501001004",8
> "480019501001005",8
> "480019501001006",8
>
> Using R 2.9.0 on Windows XP I read it as follows.
> The syntax is straight from help(read.csv).
> ====================================
> all_hd_df <- read.csv("h01369_short_blk.csv", header = TRUE,
> sep = ",", quote="\"", dec=".",
> fill = TRUE, comment.char="")
>
> Output
> WHY THE SCIENTIFIC NUMBERS? I expected
> 480019501001000 for the first one
> ====================================
> > all_hd_df
>        sctbkey district
> 1 4.800195e+14        8
> 2 4.800195e+14        8
> 3 4.800195e+14        8
> 4 4.800195e+14        8
> 5 4.800195e+14        8
> 6 4.800195e+14        8
> 7 4.800195e+14        8
>
> Any suggestions?
I can't answer the why part, but you can convert the column to strings by issuing

all_hd_df$sctbkey <- as.character(all_hd_df$sctbkey)

You should also be able to use the colClasses argument in the call to read.csv
so that the column is read in as a character string in the first place.
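
A minimal sketch of the colClasses approach, for illustration only. It writes a
shortened three-row copy of the posted data to a temporary file (the temporary
file and the variable csv_path are assumptions, not part of the original
session) and reads it back with the first column forced to character. As far as
I can tell, read.csv converts quoted fields to numbers unless colClasses says
otherwise, and the e+14 display is a matter of print formatting rather than
lost data, so the last line shows the same value as a number printed in full.

```r
# Recreate a shortened copy of the posted file so the sketch is self-contained.
# (csv_path is a hypothetical temporary file, not the original data file.)
csv_path <- tempfile(fileext = ".csv")
writeLines(c('"sctbkey","district"',
             '"480019501001000",8',
             '"480019501001001",8',
             '"480019501001002",8'), csv_path)

# One class per column, in file order: keep sctbkey as text, district as integer.
all_hd_df <- read.csv(csv_path, header = TRUE,
                      colClasses = c("character", "integer"))

str(all_hd_df)          # sctbkey is now chr, district is int
print(all_hd_df)        # keys print as the full 15-digit strings

# If the column has already been read as numeric, the value can still be
# displayed without scientific notation.
key_num <- 480019501001000
key_txt <- format(key_num, scientific = FALSE)
print(key_txt)
```

Keeping the key as a character string seems the safer choice here, since it is
an identifier rather than a quantity and would also preserve any leading zeros.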
Markus
>
> Thanks,
> Jim Burke
--
Markus Jantti
Professor of Economics
Swedish Institute for Social Research
Stockholm University