hive-dev mailing list archives

From "Navis Ryu" <>
Subject Re: Review Request 24688: parallel order by clause on a string column fails with IOException: Split points are out of order
Date Fri, 29 Aug 2014 09:08:01 GMT

This is an automatically generated e-mail. To reply, visit:

(Updated Aug. 29, 2014, 9:08 a.m.)

Review request for hive.


Removed the conf, as commented

Bugs: HIVE-7669

Repository: hive-git


The source table has 600 million rows and a string column, "l_shipinstruct", with only
4 unique values (i.e. these 4 values are repeated across the 600 million rows).

We sort on this string column, "l_shipinstruct", as shown in the HiveQL below,
with the following parameters:
set hive.optimize.sampling.orderby=true;
set hive.optimize.sampling.orderby.number=10000000;
set hive.optimize.sampling.orderby.percent=0.1f;

insert overwrite table lineitem_temp_report 
  l_orderkey, l_partkey, l_suppkey, l_linenumber, l_quantity, l_extendedprice, l_discount,
l_tax, l_returnflag, l_linestatus, l_shipdate, l_commitdate, l_receiptdate, l_shipinstruct,
l_shipmode, l_comment
order by l_shipinstruct;
Stack Trace
Diagnostic Messages for this Task:
Error: java.lang.RuntimeException: Error in configuring object
        at org.apache.hadoop.util.ReflectionUtils.setJobConf(
        at org.apache.hadoop.util.ReflectionUtils.setConf(
        at org.apache.hadoop.util.ReflectionUtils.newInstance(
        at org.apache.hadoop.mapred.MapTask$OldOutputCollector.<init>(
        at org.apache.hadoop.mapred.MapTask.runOldMapper(
        at org.apache.hadoop.mapred.YarnChild$
        at Method)
        at org.apache.hadoop.mapred.YarnChild.main(
Caused by: java.lang.reflect.InvocationTargetException
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(
        at java.lang.reflect.Method.invoke(
        at org.apache.hadoop.util.ReflectionUtils.setJobConf(
        ... 10 more
Caused by: java.lang.IllegalArgumentException: Can't read partitions file
        at org.apache.hadoop.mapreduce.lib.partition.TotalOrderPartitioner.setConf(
        at org.apache.hadoop.mapred.lib.TotalOrderPartitioner.configure(
        at org.apache.hadoop.hive.ql.exec.HiveTotalOrderPartitioner.configure(
        ... 15 more
Caused by: java.io.IOException: Split points are out of order
        at org.apache.hadoop.mapreduce.lib.partition.TotalOrderPartitioner.setConf(
        ... 17 more
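
A minimal sketch of how this failure can arise, assuming split points are taken at evenly spaced positions in a sorted sample and that TotalOrderPartitioner requires each split point to be strictly less than the next. This is not Hive's sampling code; the class, the method names and the values "A" to "D" are hypothetical stand-ins for the 4 distinct strings.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class SplitPointSketch {

    // Build a sorted sample in which each value appears `copies` times.
    public static List<String> repeatedSample(String[] values, int copies) {
        List<String> sample = new ArrayList<>();
        for (String value : values) {
            for (int i = 0; i < copies; i++) {
                sample.add(value);
            }
        }
        Collections.sort(sample);
        return sample;
    }

    // Pick (numReducers - 1) keys at evenly spaced positions in the sorted sample.
    // Assumption: this only approximates how a sampler would choose split points.
    public static List<String> pickSplitPoints(List<String> sortedSample, int numReducers) {
        List<String> points = new ArrayList<>();
        for (int i = 1; i < numReducers; i++) {
            int index = (int) ((long) i * sortedSample.size() / numReducers);
            points.add(sortedSample.get(index));
        }
        return points;
    }

    // True only if every split point is strictly less than the one after it.
    // Equal neighbours count as "out of order" under this check.
    public static boolean strictlyIncreasing(List<String> points) {
        for (int i = 0; i < points.size() - 1; i++) {
            if (points.get(i).compareTo(points.get(i + 1)) >= 0) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Four distinct values, each repeated many times (hypothetical data).
        List<String> sample = repeatedSample(new String[] {"A", "B", "C", "D"}, 250);

        // More reducers than distinct keys: some split points repeat.
        List<String> tenWay = pickSplitPoints(sample, 10);
        System.out.println(tenWay + " strictly increasing: " + strictlyIncreasing(tenWay));

        // No more reducers than distinct keys: split points stay distinct.
        List<String> fourWay = pickSplitPoints(sample, 4);
        System.out.println(fourWay + " strictly increasing: " + strictlyIncreasing(fourWay));
    }
}
```

With only 4 distinct keys, any request for more than 3 split points has to repeat a key, so the strict ordering check fails no matter how large the sample is.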

Diffs (updated)

  common/src/java/org/apache/hadoop/hive/conf/ 74bb863 
  common/src/java/org/apache/hadoop/hive/conf/ cea9c41 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ 6c22362 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ 166461a 
  ql/src/java/org/apache/hadoop/hive/ql/exec/mr/ ef72039 
  ql/src/test/org/apache/hadoop/hive/ql/exec/ PRE-CREATION 




Navis Ryu
