cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Brandon Williams (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-7241) Pig test fails on 2.1 branch
Date Tue, 27 May 2014 21:28:04 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-7241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010299#comment-14010299
] 

Brandon Williams commented on CASSANDRA-7241:
---------------------------------------------

So, I took a different tack and ignored these errors, and instead focused on ThriftColumnFamilyDataTypeTest,
which fails with:

{noformat}
    [junit] Testcase: testCassandraStorageDataType(org.apache.cassandra.pig.ThriftColumnFamilyDataTypeTest):
   Caused an ERROR
    [junit] org.apache.pig.data.DefaultDataBag cannot be cast to org.apache.pig.data.Tuple
    [junit] java.lang.ClassCastException: org.apache.pig.data.DefaultDataBag cannot be cast
to org.apache.pig.data.Tuple
    [junit]     at org.apache.cassandra.pig.ThriftColumnFamilyDataTypeTest.testCassandraStorageDataType(ThriftColumnFamilyDataTypeTest.java:150)
{noformat}

After a lengthy, tricky, painful bisect, I land back at CASSANDRA-5417.  This test fails 100%
of the time, and given the error I don't see how it can possibly be a timing issue.  So I
recreated this test using the cli and pig so I could run it manually, and I get this:

{noformat}
org.apache.cassandra.serializers.MarshalException: Invalid UTF-8 bytes deadbeef
        at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:43)
        at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:26)
        at org.apache.cassandra.db.marshal.AbstractType.compose(AbstractType.java:142)
        at org.apache.cassandra.hadoop.pig.AbstractCassandraStorage.columnToTuple(AbstractCassandraStorage.java:131)
        at org.apache.cassandra.hadoop.pig.CassandraStorage.getNext(CassandraStorage.java:256)
        at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader.nextKeyValue(PigRecordReader.java:194)
        at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:532)
        at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
{noformat}

Which is interesting, since the deadbeef column is BytesType (verified in the cli), and the
line in ACS that throws is also from CASSANDRA-5417.

I'm left to conclude that, if the problem is in pig, it's still CASSANDRA-5417's fault :)
 I can attach the cli-ified script and very simple pig script to run against if needed.

> Pig test fails on 2.1 branch
> ----------------------------
>
>                 Key: CASSANDRA-7241
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7241
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Alex Liu
>            Assignee: Brandon Williams
>             Fix For: 2.1 rc1
>
>
> run ant pig-test on cassandra-2.1 branch. There are many tests failed. I trace it a little
and find out Pig test fails starts from https://github.com/apache/cassandra/commit/362cc05352ec67e707e0ac790732e96a15e63f6b
> commit.
> It looks like storage changes break Pig tests.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message