[R] Combining data.frames
Tom Woolman
twoolman using ontargettek.com
Sun Mar 20 02:48:26 CET 2022
I'm trying hard to take tonight off and avoid booting up the laptop and
launching R... :) but you need to merge by the primary key(s), i.e.
the common columns (common IVs) shared between the two data frames.
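
For the archive, a minimal sketch of what that could look like with the test data from the original question, assuming the two shared columns date and geo_hash are the keys and that every row of df1 should be kept (a left join, all.x = TRUE). The name "combined" is only illustrative. Because the match is on date and geo_hash alone, both 2021-1-1 / abc123 rows pick up the same event, which differs slightly from the hand-written target table in the question.

```r
# Sample data from the original question (values copied as posted).
df1 <- data.frame(
  date     = c("2021-1-1", "2021-1-1", "2021-1-1", "2021-1-1", "2021-1-1",
               "2021-1-2", "2021-1-2", "2021-1-3", "2021-1-3", "2021-1-3"),
  geo_hash = c("abc123", "abc123", "abc456", "abc789", "abc246",
               "abc123", "asd123", "abc789", "abc890", "abc123"),
  ad_id    = c("a12345", "b12345", "a12345", "a12345", "c12345",
               "b12345", "b12345", "a12345", "b12345", "a12345")
)
df2 <- data.frame(
  date     = c("2021-1-1", "2021-1-1", "2021-1-2", "2021-1-3", "2021-1-3"),
  geo_hash = c("abc123", "abc456", "abc123", "abc789", "abc890"),
  event    = c("shoting", "ied", "protest", "riot", "protest")
)

# Left join on both shared columns: keep every row of df1 and attach
# the event from df2 where date AND geo_hash match; otherwise event is NA.
# (Assumption: a left join is what is wanted; all = TRUE would instead
# give a full outer join, and the default all = FALSE an inner join.)
combined <- merge(df1, df2, by = c("date", "geo_hash"), all.x = TRUE)

print(combined)
```

Note that merge() sorts the result by the key columns by default, so the row order will not necessarily match df1.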
On 2022-03-19 21:38, Jeff Reichman wrote:
> Tom
>
> Looks like I figured it out. Syntax issue - wrong "all" argument (I
> think)
>
> -----Original Message-----
> From: Tom Woolman <twoolman using ontargettek.com>
> Sent: Saturday, March 19, 2022 8:27 PM
> To: reichmanj using sbcglobal.net
> Cc: r-help using r-project.org
> Subject: Re: [R] Combining data.frames
>
> You can also do "SQL-like" joins in the tidyverse with dplyr.
>
>
> On 2022-03-19 21:23, Jeff Reichman wrote:
>> Evening Tom
>>
>> Yes, I've been playing with the merge function, but haven't been able
>> to achieve what I need. It could well be the way to go, and the
>> problem might be my syntax.
>>
>> -----Original Message-----
>> From: Tom Woolman <twoolman using ontargettek.com>
>> Sent: Saturday, March 19, 2022 8:20 PM
>> To: reichmanj using sbcglobal.net
>> Cc: r-help using r-project.org
>> Subject: Re: [R] Combining data.frames
>>
>> Have you looked at the merge function in base R?
>>
>> https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/merge
>>
>>
>> On 2022-03-19 21:15, Jeff Reichman wrote:
>>> R-Help Community
>>>
>>> I'm trying to combine two data.frames, each containing 10 columns,
>>> of which two fields are common to both. Here are two small test
>>> datasets.
>>>
>>> df1 <- data.frame(
>>>   date     = c("2021-1-1","2021-1-1","2021-1-1","2021-1-1","2021-1-1",
>>>                "2021-1-2","2021-1-2","2021-1-3","2021-1-3","2021-1-3"),
>>>   geo_hash = c("abc123","abc123","abc456","abc789","abc246",
>>>                "abc123","asd123","abc789","abc890","abc123"),
>>>   ad_id    = c("a12345","b12345","a12345","a12345","c12345",
>>>                "b12345","b12345","a12345","b12345","a12345"))
>>> df2 <- data.frame(
>>>   date     = c("2021-1-1","2021-1-1","2021-1-2","2021-1-3","2021-1-3"),
>>>   geo_hash = c("abc123","abc456","abc123","abc789","abc890"),
>>>   event    = c("shoting","ied","protest","riot","protest"))
>>>
>>> I'm trying to combine them such that I get a combined data.frame
>>> such as
>>>
>>> date      geo_hash  ad_id   event
>>> 1/1/2021  abc123    a12345  shoting
>>> 1/1/2021  abc123    b12345
>>> 1/1/2021  abc456    a12345  ied
>>> 1/1/2021  abc789    a12345
>>> 1/1/2021  abc246    c12345
>>>
>>> Jeff
>>>
>>> ______________________________________________
>>> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.