From: Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com>
To: user@pig.apache.org, user@cassandra.apache.org
Date: Wed, 28 Aug 2013 10:02:54 +0200
Subject: Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Hi,

I cannot understand why the schema is defined as "id:chararray,age:int,title:chararray" and not as tuples or a bag of tuples, given that the columns come back as key-value pairs.

I tried changing the schema again, but it does not work.

Any ideas? Perhaps the issue is in the definition of the CQL3 tables?

Regards


2013/8/28 Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com>

Hi all,

I still cannot resolve this issue.

Has anybody else run into it, or tried to test this simple example?

I am stumped and cannot find a working solution.

I would appreciate any comments or help.

Regards


2013/8/22 Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com>

Hi all,

I'm testing the new CqlStorage() with Cassandra 1.2.8 and Pig 0.11.1.

I am using this sample test data:

http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html

I can load and dump the data correctly with this script:

rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();

dump rows;
describe rows;
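
A note on the location string in the script above: everything after the `?` is an ordinary URL query string, so the `where_clause` value is percent-encoded and `%3D` stands for `=`. The sketch below decodes that exact string with Python's standard library. It only illustrates the encoding; it makes no claim about how CqlStorage itself parses the location, and the variable names are illustrative.

```python
from urllib.parse import urlsplit, parse_qs, quote

# The location string copied from the LOAD statement above.
location = "cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30"

parts = urlsplit(location)
keyspace = parts.netloc            # host position of the URL
table = parts.path.lstrip("/")     # path position of the URL

# parse_qs percent-decodes each value and returns lists; keep the first item.
params = {key: values[0] for key, values in parse_qs(parts.query).items()}

# Going the other way: encode a clause before placing it in the URL.
encoded_clause = quote("age=30", safe="")

print(keyspace, table, params, encoded_clause)
```

Decoded this way, the clause reads `age=30`, which matches the two rows returned.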

Results:

((id,6),(age,30),(title,QA))
((id,5),(age,30),(title,QA))
rows: {id: chararray,age: int,title: chararray}
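
The mismatch between the two outputs is easier to see when both are written down as data. The sketch below models one row the way dump printed it (a tuple of (name, value) pairs) next to the flat schema that describe reported. It is plain Python, not Pig or CqlStorage code; `dumped_row`, `described_fields`, and `column_values` are illustrative names, and storing the id as a string is an assumption taken from the chararray type in the schema.

```python
# One row as DUMP printed it: every field is a (name, value) pair.
# Assumption: id is a string because DESCRIBE reports it as chararray.
dumped_row = (("id", "6"), ("age", 30), ("title", "QA"))

# Field names and types as DESCRIBE reported them: a flat schema.
described_fields = [("id", "chararray"), ("age", "int"), ("title", "chararray")]


def column_values(row):
    """Map each column name to its value for a row of (name, value) pairs."""
    return {name: value for name, value in row}


values = column_values(dumped_row)

# The row that the flat schema describes would hold the bare values only.
flat_row = tuple(values[name] for name, _type in described_fields)

print(values["title"])
print(flat_row)
```

In this model the value `QA` is the second element of the third pair, while the schema describes a row in which `QA` would be the third field itself.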

But I cannot get the column values.

I tried declaring a different schema in the LOAD statement, as I did with CassandraStorage():

http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html

Example:

rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage() AS (columns: bag {T: tuple(name, value)});

and I get this error:

2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is "id:chararray,age:int,title:chararray"



I tried using FLATTEN, SUBSTRING, and SPLIT, but did not get a good result.

Example:

  • When I flatten, I get a set of tuples like:

(title,QA)
(title,QA)
2013-08-22 12:42:20,673 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
A: {title: chararray}

but I cannot get the value QA.

SUBSTRING only works on title:



Example:

B = FOREACH A GENERATE SUBSTRING(title,2,5);

dump B;
describe B;

Results:

(tle)
(tle)
B: {chararray}
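
The `(tle)` output above is what one would expect if SUBSTRING received the literal string `title` (the column name) rather than the value `QA`. Pig documents SUBSTRING as taking a zero-based start index and a stop index one past the last character, which is the same convention as Python slicing, so the arithmetic can be checked with a short sketch. The `substring` helper is an illustrative stand-in written for this note, not Pig's implementation.

```python
def substring(text, start, stop):
    """Zero-based start, exclusive stop, mirroring SUBSTRING(title,2,5)."""
    return text[start:stop]


on_name = substring("title", 2, 5)   # applied to the column name
on_value = substring("QA", 2, 5)     # applied to the column value

print(repr(on_name))
print(repr(on_value))
```

Applied to the name, the slice gives `tle`, matching the dump; applied to the two-character value, the same Python slice is empty.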



I tried the same things as Eric Lee in the other mail and got the same results:

Anyways, what I really want is the column value, not the name. Is there a way to do that? I listed all of the failed attempts I made below.
  • colnames = FOREACH cols GENERATE $1 and was told $1 was out of bounds.
  • casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but all I got back were empty tuples
  • values = FOREACH cols GENERATE $0.$1; but I got an error telling me data byte array can't be casted to tuple

I would appreciate any help.

Regards









--
Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com
Tel. / Fax: (+34) 91 485 56 66
http://www.brainsins.com
Smart eCommerce
Madrid: http://goo.gl/4B5kv
London: http://goo.gl/uIXdv
Barcelona: http://goo.gl/NZslW




