cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jim Witschey (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-8367) Clash between Cassandra and Crunch mapreduce config
Date Thu, 17 Sep 2015 16:22:04 GMT


Jim Witschey commented on CASSANDRA-8367:

[~zvo] I'm closing this for now -- if this hasn't been resolved to your satisfaction, could
you follow the contribution instructions Philip linked to and reopen? Thanks.

> Clash between Cassandra and Crunch mapreduce config
> ---------------------------------------------------
>                 Key: CASSANDRA-8367
>                 URL:
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Hadoop
>            Reporter: Radovan Zvoncek
>            Priority: Minor
> We would like to use Cassandra's (Cql)BulkOutputFormats to implement Resource IOs for
Crunch. We want to do this to allow Crunch users write results of their jobs directly to Cassandra
(thus skipping writing them to file system).
> In the process of doing this, we found out there is a clash in the mapreduce job config.
The affected config key is 'mapreduce.output.basename'. Cassandra is using it [1] for something
different than Crunch [2]. This is resulting in some obscure behavior I personally don't understand,
but it causes the jobs to fail.
> We went ahead and re-implemented the output format classes to use different config key,
but we'd very much like to stop using them.
> [1]
> [2]

This message was sent by Atlassian JIRA

View raw message