cassandra-commits mailing list archives

From "Ala' Alkhaldi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-7200) word count broken
Date Thu, 10 Jul 2014 20:11:04 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-7200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14057879#comment-14057879 ]

Ala' Alkhaldi commented on CASSANDRA-7200:
------------------------------------------

The slowness is caused by using vnodes on the Cassandra server, which is not recommended for
the Hadoop use case. For instance, WordCountSetup relies on the number of returned vnodes to determine
the sleep time after creating the keyspace, which turns out to be 256 seconds in the current default
installation of Cassandra. I updated the README file to mention that disabling vnodes is
recommended.
The attached 7200_v2.txt includes the README update as well as the changes in 7200_v1.txt.
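
To illustrate why the wait grows with vnodes, here is a minimal sketch. It assumes the setup code sleeps one second per returned range, which is consistent with the 256-second figure on a single node with the default 256 vnodes. The class, method, and constant names are hypothetical and are not taken from WordCountSetup.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical sketch, not the actual WordCountSetup code.
// Assumption: the setup waits one second for every range the server returns.
public class SetupSleepSketch {
    // Default vnode count per node in the installation described above.
    static final int DEFAULT_NUM_TOKENS = 256;

    // Sleep time in milliseconds for a given number of returned ranges.
    static long sleepMillis(int returnedRanges) {
        return TimeUnit.SECONDS.toMillis(returnedRanges);
    }

    public static void main(String[] args) {
        long withVnodes = sleepMillis(DEFAULT_NUM_TOKENS);
        long withoutVnodes = sleepMillis(1); // one range on a single-token node
        System.out.println("vnodes enabled:  " + withVnodes / 1000 + " s");
        System.out.println("vnodes disabled: " + withoutVnodes / 1000 + " s");
    }
}
```

Under that assumption, a single node with vnodes disabled waits about one second instead of several minutes, which is why the README recommendation helps.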

I also checked Cassandra 2.0. The hadoop_word_count example works fine, but the CQL3 word
count has a bug that is already fixed, as I mentioned in my comment above.

> word count broken
> -----------------
>
>                 Key: CASSANDRA-7200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Examples
>            Reporter: Brandon Williams
>            Assignee: Ala' Alkhaldi
>             Fix For: 2.0.10
>
>         Attachments: 7200_v1.txt, 7200_v2.txt
>
>
> word_count_setup hangs forever, and word_count loops forever with this exception:
> {noformat}
> DEBUG 17:52:42,875 java.io.IOException: config(config)
>         at org.apache.hadoop.conf.Configuration.<init>(Configuration.java:260)
>         at org.apache.hadoop.mapred.JobConf.<init>(JobConf.java:341)
>         at org.apache.hadoop.mapreduce.JobContext.<init>(JobContext.java:76)
>         at org.apache.hadoop.mapreduce.TaskAttemptContext.<init>(TaskAttemptContext.java:35)
>         at org.apache.hadoop.mapreduce.TaskInputOutputContext.<init>(TaskInputOutputContext.java:44)
>         at org.apache.hadoop.mapreduce.MapContext.<init>(MapContext.java:43)
>         at org.apache.hadoop.mapreduce.Mapper$Context.<init>(Mapper.java:105)
>         at sun.reflect.GeneratedConstructorAccessor14.newInstance(Unknown Source)
>         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>         at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
>         at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:759)
>         at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)
