MD5 hash for url and utf unicode converting to ascii

joe shoemaker joemystery123 at gmail.com
Mon Jun 23 19:43:05 EDT 2008


I would like to convert url into md5 hash. My question is that md5
hash will create collision at 2^64. If you do long(value,16), where
value is the md5 hash string, would value returned from long(value,
16) be unique as long as md5 hashed string is unique? when you move
md5 hashed string to long, where will the collision occur, at anything
>= 2^64?

      hash = md5.new()
      hash.update("some_url_")
      value = hash.digest()
      value_in_int = long(value, 16) #would this be unique as long as
hashed string is unique(i.e < 2^64)
      hash = md5.new() hash.update("some_url_") value = hash.digest()
value_in_int = long(value, 16) #would this be unique as long as hashed
string is unique(i.e < 2^64)

Do I need to also convert the value to base64.encodestring(value)?
What is the purpose of base64.encodestring?

For unicode encoding, I can do, md5.update(value.encode('utf-8')) to
give me ascii values.


Thank you,
j



More information about the Python-list mailing list