[R] Merging data

Chuck Cleland ccleland at optonline.net
Wed Feb 22 19:21:20 CET 2006


Brian Perron wrote:
> Hello all,
> 
> I am fairly new to R and am trying to bring together data from multiple sources.  Here is one problem that I cannot seem to crack – I hope somebody can help.  Let me simplify the problem:  Let’s say I have two datasets:  DATA1 and DATA2.  I would like to work with all the cases in DATA2.  I have additional variables on these cases in DATA1, which is a larger data set with many additional cases.  I know how to merge data sets if the datasets contain the same cases.  However, I want to eliminate all the cases from DATA1 that are not present in DATA2 and then merge.  The CASEID is my matching variable, and there are no duplicate variable names. 
> Any guidance would be greatly appreciated. 

Take closer look at the all.x and all.y arguments in ?merge.  Does this 
give what you want?

merge(DATA1, DATA2, by="CASEID", all.x=FALSE, all.y=TRUE)

-- 
Chuck Cleland, Ph.D.
NDRI, Inc.
71 West 23rd Street, 8th floor
New York, NY 10010
tel: (212) 845-4495 (Tu, Th)
tel: (732) 452-1424 (M, W, F)
fax: (917) 438-0894




More information about the R-help mailing list