[R] Combining data.frames
Bert Gunter
bgunter@4567 @end|ng |rom gm@||@com
Sun Mar 20 04:44:42 CET 2022
Merge by the common keys/column names is the default. Te question is likely
what to do with rows that don't match. That's determined by 'all'
settings, which the OP may already have figured out.
On Sat, Mar 19, 2022, 7:16 PM Tom Woolman <twoolman using ontargettek.com> wrote:
> I'm trying hard to take tonight off and avoid booting up the laptop and
> launching R... :) but you need to merge by the primary key(s), e.g.
> the common columns (common IVs) shared between the two dataframes.
>
>
> On 2022-03-19 21:38, Jeff Reichman wrote:
> > Tom
> >
> > Looks like I figured it out. Syntax issue - wrong "all" argument (I
> > think)
> >
> > -----Original Message-----
> > From: Tom Woolman <twoolman using ontargettek.com>
> > Sent: Saturday, March 19, 2022 8:27 PM
> > To: reichmanj using sbcglobal.net
> > Cc: r-help using r-project.org
> > Subject: Re: [R] Combining data.frames
> >
> > You can also do "SQL-like" joins in the tidyverse with dplyr.
> >
> >
> > On 2022-03-19 21:23, Jeff Reichman wrote:
> >> Evening Tom
> >>
> >> Yest I've been playing with the merge function. But haven't been able
> >> to achieve what I need. Could maybe the way to to and it might be my
> >> syntax
> >>
> >> -----Original Message-----
> >> From: Tom Woolman <twoolman using ontargettek.com>
> >> Sent: Saturday, March 19, 2022 8:20 PM
> >> To: reichmanj using sbcglobal.net
> >> Cc: r-help using r-project.org
> >> Subject: Re: [R] Combining data.frames
> >>
> >> Have you looked at the merge function in base R?
> >>
> >> https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/mer
> >> ge
> >>
> >>
> >> On 2022-03-19 21:15, Jeff Reichman wrote:
> >>> R-Help Community
> >>>
> >>> I'm trying to combine two data.frames which each containing 10
> >>> columns of which they each share two common fields. Here are two
> >>> small test datasets.
> >>>
> >>> df1 <- data.frame(date =
> >>> c("2021-1-1","2021-1-1","2021-1-1","2021-1-1","2021-1-1",
> >>>
> >>> "2021-1-2","2021-1-2","2021-1-3","2021-1-3","2021-1-3"),
> >>> geo_hash =
> >>> c("abc123","abc123","abc456","abc789","abc246","abc123",
> >>> "asd123","abc789","abc890","abc123"),
> >>> ad_id =
> >>> c("a12345","b12345","a12345","a12345","c12345",
> >>>
> >>> "b12345","b12345","a12345","b12345","a12345"))
> >>> df2 <- data.frame(date =
> >>> c("2021-1-1","2021-1-1","2021-1-2","2021-1-3","2021-1-3"),
> >>> geo_hash =
> >>> c("abc123","abc456","abc123","abc789","abc890"),
> >>> event =
> >>> c("shoting","ied","protest","riot","protest"))
> >>>
> >>> I'm trying to combine them such that I get a combined data.frames
> >>> such as
> >>>
> >>> date geo_hash ad_id event
> >>> 1/1/2021 abc123 a12345 shoting
> >>> 1/1/2021 abc123 b12345
> >>> 1/1/2021 abc456 a12345 ied
> >>> 1/1/2021 abc789 a12345
> >>> 1/1/2021 abc246 c12345
> >>>
> >>> Jeff
> >>>
> >>> ______________________________________________
> >>> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> >>> https://stat.ethz.ch/mailman/listinfo/r-help
> >>> PLEASE do read the posting guide
> >>> http://www.R-project.org/posting-guide.html
> >>> and provide commented, minimal, self-contained, reproducible code.
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
[[alternative HTML version deleted]]
More information about the R-help
mailing list