From: Apache Wiki
To: solr-commits@lucene.apache.org
Reply-To: solr-dev@lucene.apache.org
Date: Fri, 10 Jul 2009 00:50:10 -0000
Message-ID: <20090710005010.6719.78289@eos.apache.org>
Subject: [Solr Wiki] Update of "UniqueKey" by Lance Norskog

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by Lance Norskog:
http://wiki.apache.org/solr/UniqueKey

------------------------------------------------------------------------------
  == Text field in the document ==
  In the blog RSS example above, the URL of each article. The field must be single-valued.

  == UUID techniques ==
- UUID is short for Universal Unique IDentifier. The UUID standard [http://www.ietf.org/rfc/rfc4122.txt RFC-4122] includes several types of UUID with different input formats. There is a UUID field type in Solr 1.4 which implements version 4. Also, the ExtractingRequestHandler automatically creates UUID version 4. You can also implement a UUID string from a cryptographic hash.
+ UUID is short for Universal Unique IDentifier. The UUID standard [http://www.ietf.org/rfc/rfc4122.txt RFC-4122] includes several types of UUID with different input formats. There is a UUID field type (called {{{UUIDField}}}) in Solr 1.4 which implements version 4. Fields are defined in the schema.xml file with:
+ {{{
+ }}}
+ and used by
+ {{{
+ }}}
+ Also, the ExtractingRequestHandler automatically creates UUID version 4. You can also implement a UUID string from a cryptographic hash.

  == Cryptographic hash ==
  A cryptographic hashing algorithm can be thought of as creating N very random bits from the input data. The MD5 algorithm creates 128 bits. This means that 2 input data sets have a chance of 1 in 2^128 of creating the same MD5. There is a standard expression of this as 32 hexadecimal characters. [http://www.ietf.org/rfc/rfc1321.txt RFC-1321]. Several MD5 digest algorithm packages for various languages do not follow this standard. The UUID standard always includes the time at the creation of the UUID, which precludes some of the above use cases. You can cheat and ignore the clock requirement. It is best to use the UUID text format: ''550e8400-e29b-41d4-a716-446655440000'' instead of ''550e8400e29b41d4a716446655440000''. (You will read many of these keys.)
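
The cryptographic-hash approach described above can be sketched roughly as follows. This is a minimal illustration in Python, not code from the wiki page: the helper names `hex_to_uuid_text` and `key_from_text` and the example URL are hypothetical. It assumes the key is the MD5 of the UTF-8 bytes of some text (for instance an article URL), written in the hyphenated UUID text layout.

```python
import hashlib
import uuid


def hex_to_uuid_text(hex32: str) -> str:
    """Format 32 hex characters in the hyphenated 8-4-4-4-12 UUID text layout."""
    # uuid.UUID(hex=...) only regroups the digits; with no version argument
    # it leaves every bit as given (no version or variant bits are forced).
    return str(uuid.UUID(hex=hex32))


def key_from_text(text: str) -> str:
    """Hypothetical helper: MD5 the UTF-8 bytes of `text`, return UUID-style text."""
    digest = hashlib.md5(text.encode("utf-8")).hexdigest()  # 32 hex chars = 128 bits
    return hex_to_uuid_text(digest)


if __name__ == "__main__":
    # Deterministic key derived from content (the URL is an illustrative assumption).
    print(key_from_text("http://example.com/blog/post-1"))
    # For comparison, a random version 4 UUID from the standard library.
    print(uuid.uuid4())
```

A hash-derived key is deterministic, so re-indexing the same input yields the same key, whereas `uuid.uuid4()` produces a new value on every call. Note that the hash-derived string only borrows the UUID text layout; it does not set the version bits that RFC 4122 defines.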