[Rd] read.csv

Duncan Murdoch murdoch@dunc@n @end|ng |rom gm@||@com
Tue Apr 16 14:11:40 CEST 2024


On 16/04/2024 7:36 a.m., Rui Barradas wrote:
> Às 11:46 de 16/04/2024, jing hua zhao escreveu:
>> Dear R-developers,
>>
>> I came to a somewhat unexpected behaviour of read.csv() which is trivial but worthwhile to note -- my data involves a protein named "1433E" but to save space I drop the quote so it becomes,
>>
>> Gene,SNP,prot,log10p
>> YWHAE,13:62129097_C_T,1433E,7.35
>> YWHAE,4:72617557_T_TA,1433E,7.73
>>
>> Both read.cv() and readr::read_csv() consider prot(ein) name as (possibly confused by scientific notation) numeric 1433 which only alerts me when I tried to combine data,
>>
>> all_data <- data.frame()
>> for (protein in proteins[1:7])
>> {
>>      cat(protein,":\n")
>>      f <- paste0(protein,".csv")
>>      if(file.exists(f))
>>      {
>>        p <- read.csv(f)
>>        print(p)
>>        if(nrow(p)>0) all_data  <- bind_rows(all_data,p)
>>      }
>> }
>>
>> proteins[1:7]
>> [1] "1433B" "1433E" "1433F" "1433G" "1433S" "1433T" "1433Z"
>>
>> dplyr::bind_rows() failed to work due to incompatible types nevertheless rbind() went ahead without warnings.
>>
>> Best wishes,
>>
>>
>> Jing Hua
>>
>> ______________________________________________
>> R-devel using r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel
> Hello,
> 
> I wrote a file with that content and read it back with
> 
> 
> read.csv("filename.csv", as.is = TRUE)
> 
> 
> There were no problems, it all worked as expected.

What platform are you on?  I got the same output as Jing Hua:

Input filename.csv:

Gene,SNP,prot,log10p
YWHAE,13:62129097_C_T,1433E,7.35
YWHAE,4:72617557_T_TA,1433E,7.73

Output:

 > read.csv("filename.csv")
    Gene             SNP prot log10p
1 YWHAE 13:62129097_C_T 1433   7.35
2 YWHAE 4:72617557_T_TA 1433   7.73

Duncan Murdoch



More information about the R-devel mailing list