[R] tidyverse: read_csv() misses column
jdnewm|| @end|ng |rom dcn@d@v|@@c@@u@
Mon Nov 1 18:21:21 CET 2021
More explicitly... look at rows past the first row. If your csv has 300 rows and column 1 has something non-numeric in row 299 then the whole column gets imported as character data. Try
cor_disc[[ 1 ]] |> as.numeric() |> is.na() |> where()
to find suspect rows. You may want to read about the na argument to read_csv in ?read_csv.
On November 1, 2021 9:50:23 AM PDT, Kevin Thorpe <kevin.thorpe using utoronto.ca> wrote:
>I do not have a specific answer to your particular problem. All I can say is when a CSV import doesn’t work, it can mean there is something in the CSV file that is unexpected. When read_csv() fails, I will try read.csv() to compare the results.
>> On Nov 1, 2021, at 12:40 PM, Rich Shepard <rshepard using appl-ecosys.com> wrote:
>> The data file, cor-disc.csv begins with:
>> The first 7 columns are character strings; the 8th column is an integer.
>> After loading library(tidyverse) I ran read_csv() with this result:
>>> cor_disc <- read_csv("../data/cor-disc.csv")
>> Rows: 415263 Columns: 8
>> ── Column specification ────────────────────────────────────────────────────────────────────────────
>> Delimiter: ","
>> chr (5): mon, day, hr, min, tz
>> dbl (2): site_nbr, year
>> ℹ Use `spec()` to retrieve the full column specification for this data.
>> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
>> 1. What happed to the values in column 'disc?'
>> 2. Why are site_nbr and year seen as doubles when they're character strings?
>> I've not found answers in the book or in ?read_csv.
>> What am I missing?
>> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
Sent from my phone. Please excuse my brevity.
More information about the R-help