hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Daniel Haviv <daniel.ha...@veracity-group.com>
Subject Re: Concatenating ORC files
Date Wed, 10 Dec 2014 18:35:40 GMT
I've made a little experiment and recreated the table
with 'orc.create.index'='FALSE' and now it fails on something else:
Error: java.io.IOException:
org.apache.hadoop.hive.ql.metadata.HiveException:
java.lang.ClassCastException:
org.apache.hadoop.hive.ql.io.orc.ColumnStatisticsImpl$StringStatisticsImpl
cannot be cast to
org.apache.hadoop.hive.ql.io.orc.ColumnStatisticsImpl$BooleanStatisticsImpl
        at
org.apache.hadoop.hive.ql.io.merge.MergeFileMapper.map(MergeFileMapper.java:115)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:450)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
        at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:163)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
        at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)

It seems that the concatenation feature needs more work..

Daniel

On Wed, Dec 10, 2014 at 4:54 PM, Daniel Haviv <
daniel.haviv@veracity-group.com> wrote:

> Hi,
> I'm trying to use the new concatenate command merge small ORC files and
> file right away:
>
>  alter table requests partition(day_ts=1418083200, hour_ts=1418151600)
> concatenate;
>
> Diagnostic Messages for this Task:
> Error: java.lang.IllegalArgumentException: Column has wrong number of
> index entries found: 0 expected: 1
>         at
> org.apache.hadoop.hive.ql.io.orc.WriterImpl$TreeWriter.writeStripe(WriterImpl.java:726)
>         at
> org.apache.hadoop.hive.ql.io.orc.WriterImpl$StructTreeWriter.writeStripe(WriterImpl.java:1614)
>         at
> org.apache.hadoop.hive.ql.io.orc.WriterImpl.flushStripe(WriterImpl.java:1996)
>         at
> org.apache.hadoop.hive.ql.io.orc.WriterImpl.close(WriterImpl.java:2288)
>         at
> org.apache.hadoop.hive.ql.exec.OrcFileMergeOperator.closeOp(OrcFileMergeOperator.java:215)
>         at
> org.apache.hadoop.hive.ql.io.merge.MergeFileMapper.close(MergeFileMapper.java:98)
>         at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:61)
>         at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:450)
>         at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
>         at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:163)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:415)
>         at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
>         at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
>
>
> Is there some property I need to set for ORC to be able to support
> concatenation?
>
> Thanks,
> Daniel
>
>
>

Mime
View raw message