Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2F502D418 for ; Fri, 20 Jul 2012 20:35:44 +0000 (UTC) Received: (qmail 258 invoked by uid 500); 20 Jul 2012 20:35:42 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 191 invoked by uid 500); 20 Jul 2012 20:35:42 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 183 invoked by uid 99); 20 Jul 2012 20:35:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Jul 2012 20:35:42 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of joe.pallas@gmail.com designates 209.85.161.169 as permitted sender) Received: from [209.85.161.169] (HELO mail-gg0-f169.google.com) (209.85.161.169) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Jul 2012 20:35:36 +0000 Received: by ggm4 with SMTP id 4so5406260ggm.14 for ; Fri, 20 Jul 2012 13:35:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=sender:content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=JZZePEai/kfhyRGtTAq7P8F1PcFWm4o2vx4LAlOew3k=; b=hV326OYh2/PKbi9zdeUuraLTNomK7Y7xXv8ty+hzgl7Y9FGcj34EVy81PJr+a8lKcw JNopHAeRA+92k8V6Fh3WMHAHqlzhO9osOVnKRIA4AnjRs0t2GNI16zkdzmM+uwVDwU4/ 5M5swy46abfge6Sy74C+0bq9zq4l11LZeHA3Xh0IiKy73GrcRgB9nRi69YG0DpJ+Asqh Hf0mXKst7zYVtYElyhX+h441uUtHN3Vo/96SuP00+gvCDjHnvxf/Z/rp7Rx265EZr2Mv TPk3EkQBI7rB5hyjloXIBXLYzJFGg9DhnLal5+MtwNRdmf36OEJxgbD1eMaa89GShzeq beDA== Received: by 10.50.154.225 with SMTP id vr1mr9155581igb.70.1342816515592; Fri, 20 Jul 2012 13:35:15 -0700 (PDT) Received: from [127.0.0.1] (inet-hqmc01-o.oracle.com. [148.87.19.194]) by mx.google.com with ESMTPS id nh1sm20238036igc.11.2012.07.20.13.35.14 (version=TLSv1/SSLv3 cipher=OTHER); Fri, 20 Jul 2012 13:35:14 -0700 (PDT) Sender: Joe Pallas Content-Type: text/plain; charset=windows-1252 Mime-Version: 1.0 (Apple Message framework v1278) Subject: Re: Use of MD5 as row keys - is this safe? From: Joe Pallas In-Reply-To: Date: Fri, 20 Jul 2012 13:35:10 -0700 Content-Transfer-Encoding: quoted-printable Message-Id: References: <500987F5.7090609@figarocms.fr> To: user@hbase.apache.org X-Mailer: Apple Mail (2.1278) X-Virus-Checked: Checked by ClamAV on apache.org On Jul 20, 2012, at 12:16 PM, Michel Segel wrote: > I don't believe that there has been any reports of collisions, but if. = You are concerned you could use the SHA-1 for generating the hash. = Relatively speaking, SHA-1is slower, but still fast enough for most = applications. Every hash function can have collisions, by definition. If the = correctness of your design depends on collisions being impossible, = rather than very rare, then your design is faulty. Cryptographic hash functions have the property that it is = computationally hard to create inputs that match a given output. That = doesn=92t in itself make cryptographic hash functions better than other = hash functions for avoiding hot-spotting. (But it does usually make = cryptographic hash functions more expensive to compute than other hash = functions.) You may want to look at and = . Hope this helps, joe