[R] Combining data.frames
Jeff Reichman
re|chm@nj @end|ng |rom @bcg|ob@|@net
Sun Mar 20 03:05:53 CET 2022
Ok this seems to work correctly
df1 <- data.frame(date = as.factor(c("2021-1-1","2021-1-1","2021-1-1","2021-1-1","2021-1-1",
"2021-1-2","2021-1-2","2021-1-3","2021-1-3","2021-1-3",
"2021-1-4")),
geo_hash = as.factor(c("abc123","abc123","abc456","abc789","abc246","abc123",
"asd123","abc789","abc890","abc123","z12345")),
ad_id = as.factor(c("a12345","b12345","a12345","a12345","c12345",
"b12345","b12345","a12345","b12345","a12345","a12345")))
df2 <- data.frame(date = as.factor(c("2021-1-1","2021-1-1","2021-1-2","2021-1-3","2021-1-3","2021-1-4")),
geo_hash = as.factor(c("abc123","abc456","abc123","abc789","abc890","w12345")),
event = as.factor(c("shoting","ied","protest","riot","protest","killing")))
df1
df2
#df3 <- merge(df1, df2, all = TRUE)
df3 <- merge(df1, df2, by = c("date", "geo_hash" ), all = TRUE)
df3
-----Original Message-----
From: Jeff Newmiller <jdnewmil using dcn.davis.ca.us>
Sent: Saturday, March 19, 2022 8:55 PM
To: reichmanj using sbcglobal.net; Jeff Reichman <reichmanj using sbcglobal.net>; 'Tom Woolman' <twoolman using ontargettek.com>
Cc: r-help using r-project.org
Subject: Re: [R] Combining data.frames
by = c("date", "geo_hash" )
On March 19, 2022 6:31:19 PM PDT, Jeff Reichman <reichmanj using sbcglobal.net> wrote:
>Yes I'm reading that presently
>
>The closest I've gotten has been
>
>df3 <- merge(df1, df2, all = TRUE)
>
>-----Original Message-----
>From: Tom Woolman <twoolman using ontargettek.com>
>Sent: Saturday, March 19, 2022 8:27 PM
>To: reichmanj using sbcglobal.net
>Cc: r-help using r-project.org
>Subject: Re: [R] Combining data.frames
>
>You can also do "SQL-like" joins in the tidyverse with dplyr.
>
>
>On 2022-03-19 21:23, Jeff Reichman wrote:
>> Evening Tom
>>
>> Yest I've been playing with the merge function. But haven't been
>> able to achieve what I need. Could maybe the way to to and it might
>> be my syntax
>>
>> -----Original Message-----
>> From: Tom Woolman <twoolman using ontargettek.com>
>> Sent: Saturday, March 19, 2022 8:20 PM
>> To: reichmanj using sbcglobal.net
>> Cc: r-help using r-project.org
>> Subject: Re: [R] Combining data.frames
>>
>> Have you looked at the merge function in base R?
>>
>> https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/me
>> r
>> ge
>>
>>
>> On 2022-03-19 21:15, Jeff Reichman wrote:
>>> R-Help Community
>>>
>>> I'm trying to combine two data.frames which each containing 10
>>> columns of which they each share two common fields. Here are two
>>> small test datasets.
>>>
>>> df1 <- data.frame(date =
>>> c("2021-1-1","2021-1-1","2021-1-1","2021-1-1","2021-1-1",
>>>
>>> "2021-1-2","2021-1-2","2021-1-3","2021-1-3","2021-1-3"),
>>> geo_hash =
>>> c("abc123","abc123","abc456","abc789","abc246","abc123",
>>> "asd123","abc789","abc890","abc123"),
>>> ad_id =
>>> c("a12345","b12345","a12345","a12345","c12345",
>>>
>>> "b12345","b12345","a12345","b12345","a12345"))
>>> df2 <- data.frame(date =
>>> c("2021-1-1","2021-1-1","2021-1-2","2021-1-3","2021-1-3"),
>>> geo_hash =
>>> c("abc123","abc456","abc123","abc789","abc890"),
>>> event =
>>> c("shoting","ied","protest","riot","protest"))
>>>
>>> I'm trying to combine them such that I get a combined data.frames
>>> such as
>>>
>>> date geo_hash ad_id event
>>> 1/1/2021 abc123 a12345 shoting
>>> 1/1/2021 abc123 b12345
>>> 1/1/2021 abc456 a12345 ied
>>> 1/1/2021 abc789 a12345
>>> 1/1/2021 abc246 c12345
>>>
>>> Jeff
>>>
>>> ______________________________________________
>>> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>
>______________________________________________
>R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
>https://stat.ethz.ch/mailman/listinfo/r-help
>PLEASE do read the posting guide
>http://www.R-project.org/posting-guide.html
>and provide commented, minimal, self-contained, reproducible code.
--
Sent from my phone. Please excuse my brevity.
More information about the R-help
mailing list