hadoop-mapreduce-issues mailing list archives

From "Nathan Roberts (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-5569) FloatSplitter is not generating correct splits
Date Mon, 07 Oct 2013 16:34:43 GMT
Nathan Roberts created MAPREDUCE-5569:
-----------------------------------------

             Summary: FloatSplitter is not generating correct splits
                 Key: MAPREDUCE-5569
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5569
             Project: Hadoop Map/Reduce
          Issue Type: Bug
    Affects Versions: 2.1.0-beta, trunk, 1.3.0
            Reporter: Nathan Roberts
            Assignee: Nathan Roberts


The closing split is not calculated correctly. Its lower bound is built from curUpper instead of curLower:
{code}
     // Catch any overage and create the closed interval for the last split.
     if (curLower <= maxVal || splits.size() == 1) {
       splits.add(new DataDrivenDBInputFormat.DataDrivenDBInputSplit(
-          lowClausePrefix + Double.toString(curUpper),
+          lowClausePrefix + Double.toString(curLower),
           colName + " <= " + Double.toString(maxVal)));
     }
{code}
For min=5.0, max=7.0, and 2 splits, the current code returns the splits (column1 >= 5.0,
column1 < 6.0) and (column1 >= 7.0, column1 <= 7.0). The second split is incorrect: its lower
bound should be 6.0, so rows with 6.0 <= column1 < 7.0 are never read.
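
The min=5.0, max=7.0 case can be reproduced without Hadoop using a minimal standalone sketch. The class name FloatSplitSketch, the split() helper, and its patched flag are hypothetical and exist only for illustration. The loop ahead of the closing block is not quoted in this report, so it is an assumption about how curLower and curUpper advance. Only the closing block mirrors the diff above.

```java
import java.util.ArrayList;
import java.util.List;

public class FloatSplitSketch {

    // Hypothetical helper: returns each split as "lower AND upper" text.
    // patched=false builds the closing lower bound from curUpper (current code);
    // patched=true builds it from curLower (proposed fix).
    static List<String> split(String colName, double minVal, double maxVal,
                              int numSplits, boolean patched) {
        List<String> splits = new ArrayList<>();
        double splitSize = (maxVal - minVal) / numSplits;

        String lowClausePrefix = colName + " >= ";
        String highClausePrefix = colName + " < ";

        double curLower = minVal;
        double curUpper = curLower + splitSize;

        // Assumed shape of the main loop: half-open intervals [curLower, curUpper).
        while (curUpper < maxVal) {
            splits.add(lowClausePrefix + Double.toString(curLower)
                    + " AND " + highClausePrefix + Double.toString(curUpper));
            curLower = curUpper;
            curUpper += splitSize;
        }

        // Catch any overage and create the closed interval for the last split.
        if (curLower <= maxVal || splits.size() == 1) {
            double closingLower = patched ? curLower : curUpper;
            splits.add(lowClausePrefix + Double.toString(closingLower)
                    + " AND " + colName + " <= " + Double.toString(maxVal));
        }
        return splits;
    }

    public static void main(String[] args) {
        System.out.println("current: " + split("column1", 5.0, 7.0, 2, false));
        System.out.println("patched: " + split("column1", 5.0, 7.0, 2, true));
    }
}
```

Under these assumptions, the unpatched call ends with a split covering only column1 = 7.0, while the patched call ends with a split whose lower bound is 6.0, so the two splits together cover the whole range [5.0, 7.0].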



--
This message was sent by Atlassian JIRA
(v6.1#6144)
