[R] unable to remove NAs from a data frame
Ana Marija
Thu Sep 16 19:39:53 CEST 2021
Completely true. Thank you for your help.
On Thu, Sep 16, 2021 at 12:37 PM Rui Barradas <ruipbarradas using sapo.pt> wrote:
> Hello,
>
> You are trying to access elements that do not exist; see the example below:
>
>
> x <- 1:3
> x[5] # beyond the last element
> #[1] NA
>
> dim(df)
> #[1] 14509225 8
>
> df[14509227,] # beyond nrow(df) by 2
>
>
> Hope this helps,
>
> Rui Barradas
>
>
> At 15:12 on 16/09/21, Ana Marija wrote:
> > Hi All,
> >
> > I have lines in a file that look like this:
> >
> >> df[14509227,]
> >     SNP   A1   A2 freq  b se  p  N
> > 1: <NA> <NA> <NA>   NA NA NA NA NA
> >
> > data looks like this:
> >> head(df)
> >           SNP A1 A2      freq      b     se      p      N
> > 1: rs74337086  G  A 0.0024460 0.1627 0.1231 0.1865 218792
> > 2: rs76388980  G  A 0.0034150 0.1451 0.1047 0.1660 218792
> > ...
> >> sapply(df,class)
> > SNP A1 A2 freq b se
> > "character" "character" "character" "numeric" "numeric" "numeric"
> > p N
> > "numeric" "integer"
> >
> >> dim(df)
> > [1] 14509225 8
> >
> > Tried:
> >> df=na.omit(df)
> >> dim(df)
> > [1] 14509225 8
> >
> > and:
> >> library(tidyr)
> >> d=df %>% drop_na()
> >> dim(d)
> > [1] 14509225 8
> >
> >
> > Please advise,
> >
> > Thanks
> > Ana
> >
>
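
The point made in Rui's reply can be checked with a small self-contained
example. This is only a sketch: it uses a made-up base R data frame named
toy (hypothetical values, not the poster's data, which prints like a
data.table), and it assumes nothing beyond base R. It shows that indexing a
row past nrow() yields a row of NAs without that row being stored in the
data, which is why na.omit() leaves the dimensions unchanged.

```r
# Toy data frame standing in for the poster's data (values are made up).
toy <- data.frame(
  SNP  = c("rs1", "rs2", "rs3"),
  freq = c(0.10, 0.20, 0.30),
  stringsAsFactors = FALSE
)

# Indexing a row past nrow(toy) does not raise an error;
# it returns a one-row data frame filled with NA.
beyond <- toy[nrow(toy) + 2, ]
beyond_is_all_na <- all(is.na(beyond))

# The stored data contain no missing values, so na.omit() has nothing to drop.
has_na  <- any(is.na(toy))
cleaned <- na.omit(toy)

# Count the rows that really contain at least one NA.
n_incomplete <- sum(!complete.cases(toy))

print(beyond)
print(dim(cleaned))
print(n_incomplete)
```

To look for real missing values, it is safer to count them with
complete.cases() or is.na() than to index a specific row number, since an
out-of-range index always comes back as NA.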