pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robbie Strickland (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1849) Pig cannot dereference Cassandra subcolumns in a Super Column Family
Date Mon, 14 Feb 2011 14:44:57 GMT

    [ https://issues.apache.org/jira/browse/PIG-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12994320#comment-12994320
] 

Robbie Strickland commented on PIG-1849:
----------------------------------------

I am using the input format directly, with this sample data:

(6B108476-1C40-4847-A1B0-9DA4B0B0BF83,{(12345,{(TestColumn,This is a test),(TestColumn2,This
is a test 2)}),(12346,{(TestColumn1,This is a test 1),(TestColumn2,This is a test 2)})})

and this load statement:

rows = LOAD 'cassandra://E3/StreamByProfile' USING CassandraStorage() AS (objectid, scolumns:
bag {ST: tuple(timestamp, columns: bag {T: tuple(name:chararray, value)})});

I have tried quite a number of different schema possibilities, but all produce effectively
the same result.  They don't produce an error; when you attempt to reference individual items
in a bag you still get the full bag (even though it allows the syntax).  Attempts to flatten
create the same issue.

> Pig cannot dereference Cassandra subcolumns in a Super Column Family
> --------------------------------------------------------------------
>
>                 Key: PIG-1849
>                 URL: https://issues.apache.org/jira/browse/PIG-1849
>             Project: Pig
>          Issue Type: Bug
>          Components: data
>    Affects Versions: 0.8.0
>         Environment: Ubuntu 10, Cassandra 0.7, Hadoop
>            Reporter: Robbie Strickland
>              Labels: cassandra
>
> When using the ColumnFamilyInputFormat to load data from a Cassandra Super Column Family,
the subcolumns always return in a bag where individual values cannot be dereferenced, no matter
what schema is used.  Flattening does not solve the issue.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message