hadoop-common-dev mailing list archives

From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-6039) Computing Input Splits on the MR Cluster
Date Mon, 15 Jun 2009 16:50:07 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-6039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12719638#action_12719638 ]

Owen O'Malley commented on HADOOP-6039:

This patch should reintroduce checkInputSplits into org.apache.hadoop.mapreduce.InputFormat.
The method should be documented as *optional*. It will be invoked only when Java code is doing
the submission, so that errors in the user's job configuration, such as a missing or read-protected
input directory, are detected before the job is submitted to the cluster.
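
To make the intent concrete, here is a minimal sketch of the kind of client-side check described above. It is not the signature proposed for this patch: the class InputCheckSketch and the method checkInputPaths are hypothetical names, and it uses java.nio.file on the local filesystem as a stand-in for the Hadoop FileSystem API so that it runs on its own. It reports missing or unreadable input directories before anything would be submitted.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for an InputFormat that offers an optional pre-submission check. */
class InputCheckSketch {

    /**
     * Returns one message per problem found; an empty list means the check passed.
     * Illustrative only: the name and signature are assumptions, not the HADOOP-6039 API.
     */
    static List<String> checkInputPaths(List<Path> inputDirs) {
        List<String> problems = new ArrayList<>();
        for (Path dir : inputDirs) {
            if (!Files.exists(dir)) {
                problems.add("Input path does not exist: " + dir);
            } else if (!Files.isReadable(dir)) {
                problems.add("Input path is not readable: " + dir);
            }
        }
        return problems;
    }

    public static void main(String[] args) throws IOException {
        Path present = Files.createTempDirectory("input");
        Path missing = present.resolve("no-such-dir");

        // A Java submitter would run the check first and refuse to submit on any problem.
        List<String> problems = checkInputPaths(Arrays.asList(present, missing));
        problems.forEach(System.out::println);

        Files.delete(present);
    }
}
```

Because the check is optional, a submitter that is not Java code would simply skip it, and the same errors would then surface later, on the cluster.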

> Computing Input Splits on the MR Cluster
> ----------------------------------------
>                 Key: HADOOP-6039
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6039
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: mapred
>            Reporter: Philip Zeyliger
> Instead of computing the input splits as part of job submission, Hadoop could have a
> separate "job task type" that computes the input splits, therefore allowing that computation
> to happen on the cluster.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
