[R] unable to remove NAs from a data frame
PIKAL Petr
petr@p|k@| @end|ng |rom prechez@@cz
Thu Sep 16 16:24:26 CEST 2021
Hi
You should consult either complete.cases function or to remove only rows in
which are only NAs you could use something like (untested)
df[!(colSums(is.na(df))==8),]
Cheers
Petr
> -----Original Message-----
> From: R-help <r-help-bounces using r-project.org> On Behalf Of Ana Marija
> Sent: Thursday, September 16, 2021 4:12 PM
> To: r-help <r-help using r-project.org>
> Subject: [R] unable to remove NAs from a data frame
>
> Hi All,
>
> I have lines in file that look like this:
>
> > df[14509227,]
> SNP A1 A2 freq b se p N
> 1: <NA> <NA> <NA> NA NA NA NA NA
>
> data looks like this:
> > head(df)
> SNP A1 A2 freq b se p N
> 1: rs74337086 G A 0.0024460 0.1627 0.1231 0.1865 218792
> 2: rs76388980 G A 0.0034150 0.1451 0.1047 0.1660 218792 ...
> > sapply(df,class)
> SNP A1 A2 freq b se
> "character" "character" "character" "numeric" "numeric" "numeric"
> p N
> "numeric" "integer"
>
> > dim(df)
> [1] 14509225 8
>
> Tried:
> > df=na.omit(df)
> > dim(df)
> [1] 14509225 8
>
> and:
> > library(tidyr)
> > d=df %>% drop_na()
> > dim(d)
> [1] 14509225 8
>
>
> Please advise,
>
> Thanks
> Ana
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-
> guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list