crunch-dev mailing list archives

From "Micah Whitacre (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-556) Fix total sorts in Crunch-on-Spark
Date Thu, 13 Aug 2015 18:58:52 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14695763#comment-14695763 ]

Micah Whitacre commented on CRUNCH-556:
---------------------------------------

Initial thought: give crunch-spark a test dependency on crunch-hbase.  Second thought: would
integration tests against other sources, like Hive, be worthwhile to build out as well? In that
case it could be another test dependency, or it may be worth creating a crunch-spark-it project.

> Fix total sorts in Crunch-on-Spark
> ----------------------------------
>
>                 Key: CRUNCH-556
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-556
>             Project: Crunch
>          Issue Type: Bug
>          Components: Spark
>    Affects Versions: 0.13.0
>            Reporter: Josh Wills
>             Fix For: 0.14.0
>
>         Attachments: CRUNCH-556.patch
>
>
> Reported on the user mailing list: trying to perform a total sort to create an HFile with
> Crunch on Spark throws the following exception: https://gist.github.com/anonymous/920c000f20229eaa76d8
> The problem can be traced to the partitioner not being configured with the path to the
> partition file, which is stored in the GroupingOptions extra configuration settings. These
> settings get passed correctly for MR jobs, but not for Spark ones.
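
The failure mode described above can be illustrated with a minimal, dependency-free sketch. It does not use the Crunch, Hadoop, or Spark APIs, and it is not the attached patch: the class, the methods, and the key `sketch.partition.file` are all hypothetical stand-ins. The only point it makes is that a partitioner which reads its partition file path from configuration fails if the extra settings are never copied into that configuration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch; none of these names come from Crunch, Hadoop, or Spark.
final class PartitionerConfigSketch {

    // Hypothetical key standing in for the property a total-order partitioner reads.
    static final String PARTITION_FILE_KEY = "sketch.partition.file";

    // Stand-in for a partitioner that needs a partition file path at construction time.
    static final class SketchPartitioner {
        private final String partitionFile;

        SketchPartitioner(Map<String, String> conf) {
            String path = conf.get(PARTITION_FILE_KEY);
            if (path == null) {
                throw new IllegalStateException("partition file path not set: " + PARTITION_FILE_KEY);
            }
            this.partitionFile = path;
        }

        String partitionFile() {
            return partitionFile;
        }
    }

    // Mirrors the reported bug: the extra settings are available but never applied.
    static SketchPartitioner createIgnoringExtras(Map<String, String> base, Map<String, String> extras) {
        return new SketchPartitioner(base);
    }

    // Mirrors the intended behavior: extra settings are merged into a copy of the
    // base configuration before the partitioner is constructed.
    static SketchPartitioner createWithExtras(Map<String, String> base, Map<String, String> extras) {
        Map<String, String> merged = new HashMap<>(base);
        merged.putAll(extras);
        return new SketchPartitioner(merged);
    }

    public static void main(String[] args) {
        Map<String, String> base = new HashMap<>();
        Map<String, String> extras = new HashMap<>();
        extras.put(PARTITION_FILE_KEY, "/tmp/partitions"); // illustrative path only

        System.out.println(createWithExtras(base, extras).partitionFile());
        try {
            createIgnoringExtras(base, extras);
        } catch (IllegalStateException e) {
            System.out.println("failed as expected: " + e.getMessage());
        }
    }
}
```

In this sketch the fix amounts to applying the extra settings before the partitioner is built; how the actual patch does this for the Spark code path is not shown here.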



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
