[R] What is the correct way to define __hash__?
Peng Yu
pengyu.ut at gmail.com
Tue Oct 13 00:12:12 CEST 2009
On Mon, Oct 12, 2009 at 4:03 PM, Robert Kern <robert.kern at gmail.com> wrote:
> On 2009-10-12 15:45 PM, Peng Yu wrote:
>>
>> Hi,
>>
>> I'm wondering what is the general way to define __hash__. I could add
>> up all the members. But I am wondering if this would cause a
>> performance issue for certain classes.
>
> Unless you are very familiar with the math of hash functions, I don't
> recommend that you try to implement one directly. Instead, make a tuple of
> the hashable content of your class and return the result of calling hash()
> on that tuple. Be sure to make your equality comparison do the right thing.
>
> class A(object):
>     def __init__(self, a, b):
>         self.a = a
>         self.b = b
>
>     def _key(self):
>         # I include the name of the class so as to differentiate between other
>         # classes that might also have a _key() method. If you have several
>         # classes or subclasses that are allowed to compare equal to each
>         # other, use some other common string here.
>         return (type(self).__name__, self.a, self.b)
>
>     def __hash__(self):
>         return hash(self._key())
>
>     # Coincidentally, the _key() method can usually be reused for comparisons.
>     # I recommend doing this for the equality comparisons, at least, when you
>     # can because of the requirement that two items that compare equal must
>     # have the same hash value.
>     def __eq__(self, other):
>         return self._key() == other._key()
>
>     def __ne__(self, other):
>         return not (self == other)
>
> ...
Do I need to define the other four comparison operators (__lt__, __le__,
__gt__, __ge__) besides __eq__ and __ne__?
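
For reference, here is a minimal sketch of how the pieces fit together, not a definitive answer. As far as I know, sets and dicts only call __hash__ and __eq__, so the ordering operators matter only if instances need to be sorted or compared with <, <=, > or >=. The sketch assumes a Python version that ships functools.total_ordering (newer than this thread), which fills in the remaining ordering methods from __eq__ and a single __lt__. The isinstance checks and the __lt__ method are my additions for illustration and are not part of the quoted class.

```python
import functools


@functools.total_ordering
class A(object):
    def __init__(self, a, b):
        self.a = a
        self.b = b

    def _key(self):
        # Same key as in the quoted class: class name plus hashable members.
        return (type(self).__name__, self.a, self.b)

    def __hash__(self):
        return hash(self._key())

    def __eq__(self, other):
        # Assumption: only instances of A are comparable with each other.
        if not isinstance(other, A):
            return NotImplemented
        return self._key() == other._key()

    def __ne__(self, other):
        return not (self == other)

    # Only needed when instances must be ordered; total_ordering derives
    # __le__, __gt__ and __ge__ from this method and __eq__.
    def __lt__(self, other):
        if not isinstance(other, A):
            return NotImplemented
        return self._key() < other._key()


x, y = A(1, 2), A(1, 2)
seen = {x}
in_set = y in seen                      # relies on __hash__ and __eq__ only
ordered = sorted([A(2, 0), A(1, 5)])    # relies on __lt__
```

If ordering is never needed, the decorator and __lt__ can simply be left out; the set lookup above would still work.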
More information about the R-help
mailing list