From cassandra-user-return-1702-apmail-incubator-cassandra-user-archive=incubator.apache.org@incubator.apache.org Mon Dec 07 22:50:19 2009 Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 89287 invoked from network); 7 Dec 2009 22:50:19 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 7 Dec 2009 22:50:19 -0000 Received: (qmail 31101 invoked by uid 500); 7 Dec 2009 22:50:18 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 31063 invoked by uid 500); 7 Dec 2009 22:50:18 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 31054 invoked by uid 99); 7 Dec 2009 22:50:18 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Dec 2009 22:50:18 +0000 X-ASF-Spam-Status: No, hits=-2.6 required=5.0 tests=AWL,BAYES_00 X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jbellis@gmail.com designates 209.85.219.220 as permitted sender) Received: from [209.85.219.220] (HELO mail-ew0-f220.google.com) (209.85.219.220) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Dec 2009 22:50:16 +0000 Received: by ewy20 with SMTP id 20so3885785ewy.0 for ; Mon, 07 Dec 2009 14:49:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=QeeCwzGFevNB9O48UibOCLm2gB29G5QwgF/bwYzaMlw=; b=Xoa1lqvjoikEWwXVsWz/sn8+HsJiu14LVyHMglt6Mt9Ndc9AkgGMtfiaznjorvGLOw WEDmNy84te5RNd7fgPrv99yUygjN6roM1AjObzALTrk8q9M8k9D6wE+wa6/jF+w6EOBh rXbMNsFzybUHtOtuUhNSRBDcEn8YFGN37Roo4= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=aGhhjf07Owdhlto4D8d4wE0KQlZe3ntOvIwZzLrnmTqqKnQx3HOyim+eMk7SYnfh55 9r/NX92sX31MVTg8iZ6aNk4v7yIHAAU/MxuhvSBgF+Y5c8EdYhEya3VdOWQ3VqSqR9uB x0VR52Uo6ESAYd72g/zvr39ZWvHo+fpqwPJOk= MIME-Version: 1.0 Received: by 10.216.88.21 with SMTP id z21mr2584455wee.60.1260226195165; Mon, 07 Dec 2009 14:49:55 -0800 (PST) In-Reply-To: References: From: Jonathan Ellis Date: Mon, 7 Dec 2009 16:49:35 -0600 Message-ID: Subject: Re: cassandra mangling non-ascii keys To: cassandra-user@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable (bugs in thrift, that is) On Mon, Dec 7, 2009 at 4:49 PM, Jonathan Ellis wrote: > what language are your clients in? =A0there are definitely some bugs > there when communicating b/t client and server of different languages. > :( > > On Mon, Dec 7, 2009 at 4:43 PM, Edmond Lau wrote: >> I'm using non-ascii keys on Cassandra, relatively close to trunk at >> r880926, and my some of my keys get mangled. >> >> As a simple test case, if I insert a one-byte key anywhere between >> \200 and \377 (octal for 128 to 255) through the thrift interface, and >> then query back my data with multi get, I get a hash back that has >> "\357\277\275" as the key. =A0All those one-byte keys get mapped to the >> same bucket, so if I insert with the key \205, I get the data back >> when querying for \300. =A0So either a) there's a bug in thrift, b) >> Cassandra doesn't support non-ascii keys, or c) Cassandra is mangling >> my key somewhere. >> >> Has anyone else run into this issue? >> >> Edmond >> >