[R] Correct subsetting in R

Eric Berger ericjberger at gmail.com
Wed Nov 1 21:05:00 CET 2017


training$TrainingRownum <- 1:nrow(training)
data$DataRownum <- 1:nrow(data)
matches <- merge(training,data,by=intersect(names(training),names(data)))

The data frame 'matches' now has two additional columns, TrainingRownum and
DataRownum, giving the row in each data frame that corresponds to each
matched item. It also carries the ID column from 'data'.
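
A minimal sketch of this approach on made-up data, in case it helps to see it
end to end. The toy values and the name 'training_ids' are assumptions for
illustration only; the real data frames have about 600 columns.

```r
# Toy stand-ins for the two data frames (hypothetical values).
data <- data.frame(ID      = 1:5,
                   alright = c(1L, 0L, 0L, 2L, 1L),
                   bad     = c(1L, 0L, 1L, 0L, 0L),
                   boy     = c(1L, 2L, 1L, 4L, 0L))

# 'training' holds a subset of the rows of 'data', without ID,
# in a different order and stored as numeric rather than integer.
training <- data.frame(alright = c(2, 1, 0),
                       bad     = c(0, 1, 0),
                       boy     = c(4, 1, 2))

# Record the original row positions before merging.
training$TrainingRownum <- seq_len(nrow(training))
data$DataRownum <- seq_len(nrow(data))

# Match on every column the two data frames share.
matches <- merge(training, data, by = intersect(names(training), names(data)))

# merge() sorts its result by the key columns, so restore training order.
matches <- matches[order(matches$TrainingRownum), ]

# IDs from 'data', one per row of 'training' (hypothetical variable name).
training_ids <- matches$ID
```

This assumes every row of 'training' matches exactly one row of 'data'. If
'data' contains duplicated rows, merge() returns one result row per matching
pair, so it may be worth checking that nrow(matches) equals nrow(training).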

Regards,
Eric

On Wed, Nov 1, 2017 at 9:29 PM, Elahe chalabi <chalabi.elahe at yahoo.de>
wrote:

>
> It's not what I want. The first data frame has 499 observations, and the
> second is a subset of the first with 375 observations. I want something
> that returns the IDs for the training data frame.
>
>
> On Wednesday, November 1, 2017 10:18 AM, Eric Berger <
> ericjberger at gmail.com> wrote:
>
>
>
> matches <- merge(training,data,by=intersect(names(training),names(data)))
>
> HTH,
> Eric
>
>
>
> On Wed, Nov 1, 2017 at 6:13 PM, Elahe chalabi via R-help <
> r-help at r-project.org> wrote:
>
> Hi all,
> >I have two data frames, one of which does not have the column ID:
> >
> >    > str(data)
> >    'data.frame':       499 obs. of  608 variables:
> >    $ ID           : int  1 2 3 4 5 6 7 8 9 10 ...
> >    $ alright      : int  1 0 0 0 0 0 0 1 2 1 ...
> >    $ bad          : int  1 0 0 0 0 0 0 0 0 0 ...
> >    $ boy          : int  1 2 1 1 0 2 2 4 2 1 ...
> >    $ cooki        : int  1 2 2 1 0 1 1 4 2 3 ...
> >    $ curtain      : int  1 0 0 0 0 2 0 2 0 0 ...
> >    $ dish         : int  2 1 0 1 0 0 1 2 2 2 ...
> >    $ doesnt       : int  1 0 0 0 0 0 0 0 1 0 ...
> >    $ dont         : int  2 1 4 2 0 0 2 1 2 0 ...
> >    $ fall         : int  3 1 0 0 1 0 1 2 3 2 ...
> >    $ fell         : int  1 0 0 0 0 0 0 0 0 0 ...
> >
> >and the other one is:
> >
> >    > str(training)
> >    'data.frame':       375 obs. of  607 variables:
> >    $ alright      : num  1 0 0 0 1 2 1 0 0 0 ...
> >    $ bad          : num  1 0 0 0 0 0 0 0 0 0 ...
> >    $ boy          : num  1 1 2 2 4 2 1 0 1 0 ...
> >    $ cooki        : num  1 1 1 1 4 2 3 1 2 2 ...
> >    $ curtain      : num  1 0 2 0 2 0 0 0 0 0 ...
> >    $ dish         : num  2 1 0 1 2 2 2 1 4 1 ...
> >    $ doesnt       : num  1 0 0 0 0 1 0 0 0 0 ...
> >    $ dont         : num  2 2 0 2 1 2 0 0 1 0 ...
> >    $ fall         : num  3 0 0 1 2 3 2 0 2 0 ...
> >    $ fell         : num  1 0 0 0 0 0 0 0 0 0 ...
> >Does anyone know how I can get the IDs for training from data?
> >thanks for any help!
> >Elahe
> >
>
