[R] how to load data frame where numeric will be numeric instead of character

David Carlson dcarlson using tamu.edu
Tue Jun 2 05:21:32 CEST 2020


It might be easier to diagnose if you can show us what the first ten lines
in your original file look like.

readLines("gokind.nephropathy.fin", n=10)
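
If the first ten lines look clean, the offending entries may sit further down the file. Here is a minimal sketch of one way to locate them. The helper name find_non_numeric and the demo data are made up for illustration (they are not from your file), and the stray "0,216027" is only a guess at the kind of value that could be causing this:

```r
# Hypothetical helper: return the distinct entries of a character vector
# that cannot be converted to numeric (entries that are already NA are ignored).
find_non_numeric <- function(x) {
  converted <- suppressWarnings(as.numeric(x))
  bad <- is.na(converted) & !is.na(x)
  unique(x[bad])
}

# Made-up demo data standing in for the real file: a single value with a
# decimal comma is enough for read.table() to leave MAF as character.
demo <- read.table(text = "Chr BP MAF
10 100000625 0.4156
10 100000645 0,216027
10 100000700 0.3", header = TRUE, stringsAsFactors = FALSE)

# Column classes as read.table() inferred them.
sapply(demo, class)

# For every column, list the values that block numeric conversion.
bad_values <- lapply(demo, function(col) find_non_numeric(as.character(col)))
bad_values$MAF
```

On the real file, one option would be to read everything as text first, for example with colClasses="character", and then apply the helper to Chr, BP, MAF, pValue and N. If the culprit turns out to be a missing-value symbol, the na.strings argument of read.table() may be all that is needed.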

David L Carlson


On Mon, Jun 1, 2020 at 6:36 PM Bert Gunter <bgunter.4567 using gmail.com> wrote:

> Agreed!
>
> However, there may still be a problem, as read.table() ordinarily would
> read numeric columns correctly (via type.convert()) without the colClasses
> specification.
> So I would suspect that her "numeric" columns contain some non-numeric
> detritus (perhaps "," or some NA symbol). But of course, who knows? -- and
> she should follow David's advice to read the docs anyway.
>
>
> Bert Gunter
>
> "The trouble with having an open mind is that people keep coming along and
> sticking things into it."
> -- Opus (aka Berkeley Breathed in his "Bloom County" comic strip )
>
>
> On Mon, Jun 1, 2020 at 3:19 PM David Winsemius <dwinsemius using comcast.net>
> wrote:
>
> >
> > On 6/1/20 2:17 PM, Ana Marija wrote:
> > > Hi David,
> > >
> > > this is the problem:
> > >
> > > > NEP <- read.table("gokind.nephropathy.fin",
> > > header=T,stringsAsFactors=FALSE)
> > > > sapply(NEP,class)
> > >         Chr          BP      Marker         MAF          A1        A2
> > > "character" "character" "character" "character" "character" "character"
> > >   Direction      pValue           N
> > >
> > > So even columns like Chr, BP, MAF ... are character while they should
> > > be numeric.
> > > > head(NEP)
> > >   Chr        BP           Marker      MAF A1 A2 Direction pValue    N
> > > 1  10 100000625 10:100000625:A:G   0.4156  G  A         + 0.484813 1641
> > > 2  10 100000645 10:100000645:A:C 0.216027  C  A         +  0.73597 1641
> > >
> > >
> > > Can you please tell me what colClasses=colClassvec is supposed to do?
> >
> >
> > I could tell you, but I think instead that you should read the
> > documentation for the `read.table` function.
> >
> >
> > --
> >
> > David
> >
> > >
> > > Thanks
> > > Ana
> > >
> > > On Mon, Jun 1, 2020 at 4:13 PM David Winsemius <dwinsemius using comcast.net> wrote:
> > >
> > >
> > >     On 6/1/20 1:37 PM, Ana Marija wrote:
> > >     > Hello,
> > >     >
> > >     > I have a dataframe like this:
> > >     >
> > >     >    Chr        BP           Marker      MAF A1 A2 Direction   pValue    N
> > >     > 1  10 100000625 10:100000625:A:G 0.416562  G  A         - 0.558228 1594
> > >     > 2  10 100000645 10:100000645:A:C 0.215182  C  A         - 0.880622 1594
> > >     > ...
> > >     >
> > >     > which I load with:
> > >     > NEU <- read.table("gokind.neuropathy.fin",
> > >     header=T,stringsAsFactors=FALSE)
> > >     >
> > >     > and every column is numeric. How to say have all numeric ones
> > >     stay numeric
> > >     > like: Chr, BP, MAF, pValue, N
> > >
> > >
> > >     I cannot figure out what the problem is. You say every column is
> > >     numeric. It's not possible to have a column that contains the value
> > >     "10:100000625:A:G" be numeric.
> > >
> > >
> > >     If you meant to say that every column was character, then the answer
> > >     might be:
> > >
> > >
> > >     # Columns 3 and 5:7 (Marker, A1, A2, Direction) are text; the rest are numeric.
> > >     colClassvec <- rep("numeric", 9)
> > >     colClassvec[c(3, 5:7)] <- "character"
> > >
> > >     NEU <- read.table("gokind.neuropathy.fin",
> > >                       header=TRUE, stringsAsFactors=FALSE, colClasses=colClassvec)
> > >
> > >     --
> > >     David.
> > >
> > >     >
> > >     > Thanks
> > >     > Ana
> > >     >
> > >     >       [[alternative HTML version deleted]]
> > >     >
> > >     > ______________________________________________
> > >     > R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> > >     > https://stat.ethz.ch/mailman/listinfo/r-help
> > >     > PLEASE do read the posting guide
> > >     > http://www.R-project.org/posting-guide.html
> > >     > and provide commented, minimal, self-contained, reproducible code.
> > >
