[R] cosine similarity tf-idf

Indhira, Anusha Anusha.Indhira at controlsdata.com
Fri Oct 28 12:21:08 CEST 2016


To find similar documents in a corpus using cosine similarity, is it necessary to compute tf-idf weights when creating the term-document matrix, or is raw term frequency enough? Can anyone tell me the advantages and disadvantages of each approach?
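
To make the two options concrete, here is a minimal sketch in plain Python (standard library only) that computes cosine similarity on raw term counts and on tf-idf weights. The toy corpus, the helper names (`tf_vectors`, `tfidf_vectors`, `cosine`), whitespace tokenization, and the unsmoothed idf = log(N / df) are all assumptions made for illustration. Real packages often use smoothed or normalized variants, so their numbers will differ.

```python
import math
from collections import Counter

# Hypothetical toy corpus, for illustration only.
docs = [
    "the engine failed the test",
    "the engine passed the test",
    "the report was late",
]

def tf_vectors(docs):
    """Return the sorted vocabulary and one raw term-count vector per document."""
    tokenized = [d.split() for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    counts = [Counter(toks) for toks in tokenized]
    return vocab, [[c[t] for t in vocab] for c in counts]

def tfidf_vectors(tf):
    """Weight each count by idf = log(N / df); this unsmoothed form is an assumption."""
    n_docs = len(tf)
    n_terms = len(tf[0])
    # df[j] = number of documents that contain term j
    df = [sum(1 for row in tf if row[j] > 0) for j in range(n_terms)]
    idf = [math.log(n_docs / d) for d in df]
    return [[row[j] * idf[j] for j in range(n_terms)] for row in tf]

def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

vocab, tf = tf_vectors(docs)
tfidf = tfidf_vectors(tf)

# Documents 0 and 2 share only the word "the".
print("tf     sim(0, 2):", cosine(tf[0], tf[2]))
print("tf-idf sim(0, 2):", cosine(tfidf[0], tfidf[2]))

# Documents 0 and 1 differ in a single content word.
print("tf     sim(0, 1):", cosine(tf[0], tf[1]))
print("tf-idf sim(0, 1):", cosine(tfidf[0], tfidf[1]))
```

In this sketch, documents 0 and 2 share only "the". With raw counts they still get a nonzero similarity, whereas with tf-idf the term appears in every document, its idf is log(1) = 0, and the similarity drops to zero. That is the usual trade-off: raw term frequency is simpler and does not depend on the rest of the corpus, but common words can dominate the score; tf-idf down-weights them, at the cost of weights that change whenever the corpus changes.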



