[R] merge on non-identical names
j daniel
jdlecy at maxwell.syr.edu
Tue Nov 24 22:11:47 CET 2009
Greetings,
I need to conduct a merge on two databases containing information on
organizations, but the organization names are often non-identical and there
is no common unique identifier. Does anyone know a good way to calculate a
similarity measure on two names, or even better is there a natural language
matching function in an R package? I did some searches on this but must not
know the right keywords to search.
As an example, here are some possible non-identical names:
Oxfam, Oxfam USA
American Services, Americam Services - (just mis-spelled)
Global Alliance for Action, Global Alliance for the Environment - (a
non-match)
Any suggestions are welcome!
--
View this message in context: http://old.nabble.com/merge-on-non-identical-names-tp26503346p26503346.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help
mailing list