crunch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Micah Whitacre (JIRA)" <>
Subject [jira] [Commented] (CRUNCH-556) Fix total sorts in Crunch-on-Spark
Date Thu, 13 Aug 2015 18:58:52 GMT


Micah Whitacre commented on CRUNCH-556:

Initial thought crunch-spark has test dependency on crunch-hbase.  The second thought is would
integration tests against other sources like Hive be worthwhile to blow out as well in which
case it could be a test dependency or maybe worth creating a crunch-spark-it project.  

> Fix total sorts in Crunch-on-Spark
> ----------------------------------
>                 Key: CRUNCH-556
>                 URL:
>             Project: Crunch
>          Issue Type: Bug
>          Components: Spark
>    Affects Versions: 0.13.0
>            Reporter: Josh Wills
>             Fix For: 0.14.0
>         Attachments: CRUNCH-556.patch
> From the user mailing list, trying to perform a total sort to create an HFile w/Crunch
on Spark throws the following exception:
> The problem can be traced to not properly configuring the partitioner w/the path to the
partition file that is stored in the GroupingOptions extra configuration settings. These settings
get passed correctly for the MR job, but not for the Spark ones.

This message was sent by Atlassian JIRA

View raw message