From: "Cornish, Duane C."
To: "user@accumulo.apache.org"
Date: Wed, 8 Jan 2014 12:38:57 -0500
Subject: RE: Accumulo lexicographical order

Great! Thanks for all the help!

 

From: Keith Turner [mailto:keith@deenlo.com]
Sent: Wednesday, January 08, 2014 12:04 PM
To: user@accumulo.apache.org
Subject: Re: Accumulo lexicographical order

 

 

 

On Wed, Jan 8, 2014 at 11:50 AM, Mike Drob <mdrob@mdrob.com> wrote:

Duane,

Most API methods for inserting values take byte arrays or byte sequences directly. The lexicographic order is based on the natural ordering of the bytes, i.e. \x00 sorts before \x01. The methods that take strings will assume UTF-8 encoding and convert for you. If you find a situation where this is not the case, please let us know!
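
To illustrate the byte ordering described above, here is a minimal sketch in plain Python. It does not use the Accumulo API; `sort_like_bytes` and the sample rows are hypothetical, and it assumes bytes are compared as unsigned values (0 to 255), with a shorter sequence sorting before any longer one it prefixes.

```python
# Illustrative sketch in plain Python; no Accumulo client is involved.
# sort_like_bytes and the sample rows are hypothetical names and data.

def sort_like_bytes(strings):
    """Sort strings by their UTF-8 bytes, compared as unsigned values."""
    # Python compares bytes objects element by element as integers 0..255,
    # and a strict prefix sorts before any longer sequence that extends it.
    return sorted(strings, key=lambda s: s.encode("utf-8"))

rows = ["b", "B", "a", "10", "9", "é", "~", "a\x00"]
ordered = sort_like_bytes(rows)

# Show each row next to the hex form of the bytes that decide its position.
for row in ordered:
    print(row.encode("utf-8").hex(), repr(row))
```

In this sketch digits sort before uppercase letters, uppercase before lowercase, "10" sorts before "9" because the comparison is byte by byte rather than numeric, and the two-byte UTF-8 encoding of "é" sorts after every ASCII character.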

The one exception to all of this is the timestamp part of the key, which is stored in numeric order. I want to say that they are kept in reverse order, but don't remember the exact details offhand.

That's correct. The most recent timestamps are sorted first. Sorted as signed long.
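
The timestamp rule can be sketched the same way in plain Python. This is not Accumulo code: a key is simplified here to a hypothetical (row, timestamp) pair, whereas a real key has more parts, and `sort_key` is a made-up helper name.

```python
# Illustrative sketch in plain Python; not Accumulo code.
# A key is simplified here to a hypothetical (row, timestamp) pair.

def sort_key(entry):
    row, timestamp = entry
    # Rows ascend by their UTF-8 bytes; negating the timestamp makes larger
    # (more recent) values sort first within the same row.
    return (row.encode("utf-8"), -timestamp)

entries = [("row1", 5), ("row2", 1), ("row1", 20), ("row1", -3)]
ordered = sorted(entries, key=sort_key)
print(ordered)
```

Because the timestamp is treated as a signed number, the negative value in this sample sorts after every non-negative timestamp in the same row.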

 

 

I'm on my phone, so finding the exact place where this is documented will be a challenge, but I would expect it to be part of our user manual on accumulo.apache.org.

Mike

On Jan 8, 2014 8:26 AM, "Cornish, Duane C." <Duane.Cornish@jhuapl.edu> wrote:

Accumulo Users,

 

I know Accumulo keys are sorted in lexicographical order in the tables. Where can I find the specification defining that order? For example, what lexicographical order are symbols in? Is the order the same as the numerical order of UTF-8 encoding, ASCII encoding, or some other encoding scheme?

 

Thanks in advance,

Duane Cornish

 
