[R] Combining two data frames

Uwe Ligges ligges at statistik.tu-dortmund.de
Fri Sep 19 23:03:00 CEST 2014


See ?merge

Best,
Uwe Ligges

On 19.09.2014 22:58, Adrian Johnson wrote:
> Hi:
> Appreciate if I could get some help.
>
> I have two data frames.
>
> I want to combine these two dfs bases on first column.
>
>
> df1 :
>
> Subject    G1     G2    G3
>
> A              10     1       0
>
> B              20      2        20
>
>
> df2:
>
> Subject     m1    m2   m3
>
> A              20      9     30
>
> B                0      1    10
>
>
> Resulting df :
>
> Subject        G1    G2   G3     m1   m2  m3
> A                  10      1    0       20     9    30
> B                   20      2  20      0       1   10
>
>
> Sometimes subjects in both df don't match.  since the data is large,
> if I do cbind I am not sure about order and common subjects.
>
>
>
> Thanks a lot for your help.
>
>
> Following is dput of real data:
>
>
> df1 <- structure(list(Subject = c("A-04-1332", "A-04-1336", "A-04-1337",
> "A-04-1341", "A-04-1342", "A-04-1343", "A-04-1346",
> "A-04-1347", "A-04-1348", "A-04-1349"), hsa.let.7a.1 = c(8788L,
> 2745L, 11447L, 8081L, 9291L, 44912L, 72521L, 2401L, 61251L, 134859L
> ), hsa.let.7a.2 = c(17806L, 5517L, 22864L, 16271L, 19194L, 89333L,
> 146114L, 4992L, 122469L, 270005L), hsa.let.7a.3 = c(8865L, 2743L,
> 11669L, 8235L, 9710L, 44633L, 73427L, 2540L, 61533L, 136403L),
>      hsa.let.7b = c(72280L, 56297L, 70053L, 42974L, 56710L, 283102L,
>      236254L, 5966L, 310874L, 596434L), hsa.let.7c = c(20743L,
>      4848L, 10001L, 4070L, 10773L, 45697L, 60397L, 2313L, 35303L,
>      51923L), hsa.let.7d = c(8187L, 12983L, 14028L, 14611L, 14971L,
>      17719L, 7028L, 610L, 13165L, 20890L), hsa.let.7e = c(2745L,
>      1068L, 4637L, 3747L, 4716L, 17202L, 25589L, 2147L, 8244L,
>      21777L), hsa.let.7f.1 = c(1L, 0L, 5L, 3L, 5L, 11L, 5L, 0L,
>      24L, 30L), hsa.let.7f.2 = c(404L, 176L, 1081L, 1130L, 1055L,
>      3097L, 3998L, 119L, 5612L, 14844L)), .Names = c("Subject",
> "hsa.let.7a.1", "hsa.let.7a.2", "hsa.let.7a.3", "hsa.let.7b",
> "hsa.let.7c", "hsa.let.7d", "hsa.let.7e", "hsa.let.7f.1", "hsa.let.7f.2"
> ), row.names = c(NA, 10L), class = "data.frame")
>
>
> df2 <- structure(list(Subject = c("A-04-1332", "A-04-1337", "A-04-1338",
> "A-04-1341", "A-04-1343", "A-04-1347", "A-04-1348",
> "A-04-1350", "A-04-1356", "A-04-1357"), A1BG = c(404L,
> 1387L, 436L, 2225L, 2444L, 838L, 618L, 1035L, 2812L, 309L), A1CF = c(0L,
> 1L, 3L, 1L, 2L, 0L, 0L, 0L, 1L, 0L), A2BP1 = c(454L, 7L, 18L,
> 117L, 9L, 1L, 3L, 3193L, 123L, 2L), A2LD1 = c(413L, 1100L, 334L,
> 882L, 990L, 697L, 2151L, 1102L, 1088L, 578L), A2M = c(44670L,
> 40872L, 8162L, 8183L, 29555L, 5252L, 46763L, 2099L, 11868L, 31205L
> ), A2ML1 = c(667L, 6L, 420L, 82L, 33L, 303L, 88L, 225L, 181L,
> 244L), A4GALT = c(2384L, 2821L, 491L, 1588L, 3685L, 151L, 702L,
> 528L, 2564L, 1005L), A4GNT = c(13L, 15L, 2L, 0L, 2L, 2L, 4L,
> 1L, 0L, 0L), AAA1 = c(0L, 2L, 0L, 0L, 2L, 0L, 0L, 4L, 2L, 0L)), .Names
> = c("Subject",
> "A1BG", "A1CF", "A2BP1", "A2LD1", "A2M", "A2ML1", "A4GALT", "A4GNT",
> "AAA1"), row.names = c(NA, 10L), class = "data.frame")
>
>
>
> Thanks
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



More information about the R-help mailing list