[R] unable to remove NAs from a data frame

Eric Berger er|cjberger @end|ng |rom gm@||@com
Thu Sep 16 16:30:40 CEST 2021


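You are getting this because you asked for row 14509227, which is
beyond the number of rows in your data frame: dim(df) reports 14509225
rows. Indexing past the last row returns a row of NAs, so there is no
NA row in the data for na.omit() or drop_na() to remove.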
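
A minimal sketch of this behaviour, assuming a made-up three-row
data frame in base R (the names and values below are illustrative, not
your data; your object prints like a data.table, which, judging by your
output, also returns NAs for an out-of-range row):

```r
# Toy data frame standing in for the real data (values are invented).
df <- data.frame(SNP = c("rs1", "rs2", "rs3"),
                 b   = c(0.16, 0.15, -0.02),
                 stringsAsFactors = FALSE)

nrow(df)                             # number of rows that actually exist

# Ask for a row two past the end: R returns a single row of NAs.
out_of_range <- df[nrow(df) + 2, ]
print(out_of_range)

# The NA row comes from the indexing itself; it is not stored in df.
any(is.na(df))                       # are there any NAs in the data?
nrow(na.omit(df)) == nrow(df)        # does na.omit() change the row count?
```

Comparing the index you request with nrow(df) before subsetting is a
quick way to tell a genuine NA row from an out-of-range request.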

On Thu, Sep 16, 2021 at 5:12 PM Ana Marija <sokovic.anamarija using gmail.com> wrote:
>
> Hi All,
>
> I have lines in a file that look like this:
>
> > df[14509227,]
>     SNP   A1   A2 freq  b se  p  N
> 1: <NA> <NA> <NA>   NA NA NA NA NA
>
> data looks like this:
> > head(df)
>            SNP A1 A2      freq       b     se      p      N
> 1:  rs74337086  G  A 0.0024460  0.1627 0.1231 0.1865 218792
> 2:  rs76388980  G  A 0.0034150  0.1451 0.1047 0.1660 218792
> ...
> > sapply(df,class)
>         SNP          A1          A2        freq           b          se
> "character" "character" "character"   "numeric"   "numeric"   "numeric"
>           p           N
>   "numeric"   "integer"
>
> > dim(df)
> [1] 14509225        8
>
> Tried:
> > df=na.omit(df)
> > dim(df)
> [1] 14509225        8
>
> and:
> > library(tidyr)
> > d=df %>% drop_na()
> > dim(d)
> [1] 14509225        8
>
>
> Please advise,
>
> Thanks
> Ana


