hive-user mailing list archives

From patcharee <>
Subject Re: hive on Tez - merging orc files
Date Fri, 24 Apr 2015 11:35:58 GMT

I generated the new hive-exec.jar as you suggested. On the sandbox, Hive 
0.14 with the new jar file now uses Tez for alter table ... concatenate, and 
it concatenates the files correctly on Tez. Thanks!
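
For anyone repeating the jar swap from the quoted instructions below (back up
hive-exec.jar, then replace it with the patched build), here is a minimal
sketch. The function name replace_hive_exec_jar, its arguments, and the .bak
suffix are hypothetical, and it assumes the jar in the lib directory is named
hive-exec.jar; adjust to your install.

```shell
#!/bin/sh
# Hypothetical helper: back up the existing hive-exec jar, then install a patched one.
# Usage: replace_hive_exec_jar NEW_JAR LIB_DIR [TARGET_NAME]
replace_hive_exec_jar() {
    new_jar=$1
    lib_dir=$2
    target_name=${3:-hive-exec.jar}   # assumed jar name inside LIB_DIR

    # Refuse to continue if either input is missing.
    [ -f "$new_jar" ] || { echo "patched jar not found: $new_jar" >&2; return 1; }
    [ -d "$lib_dir" ] || { echo "lib directory not found: $lib_dir" >&2; return 1; }

    # Keep a backup of the current jar, if there is one.
    if [ -f "$lib_dir/$target_name" ]; then
        cp -p "$lib_dir/$target_name" "$lib_dir/$target_name.bak" || return 1
    fi

    # Put the patched jar in place under the expected name.
    cp "$new_jar" "$lib_dir/$target_name"
}
```

It could be called as
replace_hive_exec_jar /path/to/patched/hive-exec-x.x.x.jar /path/to/hive/lib
by a user with write access to the lib directory. The backup keeps a non-.jar
suffix so it is easy to tell apart from the active jar.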

However, I also tested on the production cluster, which runs Hive 0.14 as 
well. There, merging did not work and produced a different exception, shown below:

2015-04-24 13:01:52,259 INFO [main] app.DAGAppMaster: Running DAG: alter 
table orc_merge5a partit...concatenate
2015-04-24 13:01:52,355 INFO [IPC Server handler 0 on 46526] ipc.Server: 
IPC Server handler 0 on 46526, call 
from Call#361 Retry#0
org.apache.tez.dag.api.TezException: No running dag at present
         at org.apache.hadoop.ipc.RPC$
         at org.apache.hadoop.ipc.Server$Handler$
         at org.apache.hadoop.ipc.Server$Handler$
         at Method)
         at org.apache.hadoop.ipc.Server$

What could be the cause of this exception? Any ideas?


On 24. april 2015 10:27, Prasanth Jayachandran wrote:
> You can download the branch-0.14 source code from,
> apply HIVE-9529-branch-1.0.0.patch from and
> compile it using “mvn clean install -DskipTests -Phadoop-2,dist”. This will generate a tar
> file under hive/packaging/target. You can extract the tar file and copy the hive-exec-x.x.x.jar
> into /usr/hdp/2.2.*.*/hive/lib/ (take a backup of hive-exec.jar and replace it with the new one).
> Rerunning the hive cli should then use the new hive-exec jar with the patch.
> Thanks
> Prasanth
>> On Apr 24, 2015, at 1:15 AM, patcharee <> wrote:
>> Hi,
>> The sandbox 2.2 comes with hive 0.14. Does it also have the bug? If so, how can I
>> patch hive on sandbox?
>> BR,
>> Patcharee
>> On 24. april 2015 09:42, Prasanth Jayachandran wrote:
>>> Hi
>>> This has been fixed recently
>>> Merging is triggered in two different ways: INSERT/CTAS can trigger merging of small files,
>>> and CONCATENATE can trigger merging of small files. The latter had a bug that generated an MR
>>> task instead of a Tez task, which was fixed recently. The former always uses a Tez task.
>>> Thanks
>>> Prasanth
>>>> On Apr 24, 2015, at 12:33 AM, patcharee <>
>>>> Hi,
>>>> Is there anyone using hortonworks sandbox 2.2? I am trying to use hive on
>>>> Tez on the sandbox. I set the running engine in hive-site.xml to Tez.
>>>>     <property>
>>>>       <name>hive.execution.engine</name>
>>>>       <value>tez</value>
>>>>     </property>
>>>> Then I ran the script that alters a table to merge small orc files (alter
>>>> table orc_merge5a partition(st=0.8) concatenate;). The merging feature worked, but Hive did
>>>> not use Tez, it used MapReduce instead. So weird!
>>>> Another point, I tried to run the same script on the production cluster which
>>>> always runs on Tez. There the merging feature sometimes worked and sometimes did not.
>>>> I would appreciate any suggestions.
>>>> BR,
>>>> Patcharee

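The hive-site.xml setting and the alter table statement in the original
message can also be combined into a single command line, overriding the engine
for one invocation only. A minimal sketch follows: the function name concat_cmd
is hypothetical, and it only prints the command rather than running it, so the
result can be inspected first.

```shell
#!/bin/sh
# Hypothetical helper: print a Hive CLI invocation that runs an ORC concatenate
# with the execution engine set to Tez for that invocation only.
# Usage: concat_cmd TABLE PARTITION_SPEC
concat_cmd() {
    table=$1
    partition_spec=$2
    printf 'hive --hiveconf hive.execution.engine=tez -e "alter table %s partition(%s) concatenate;"\n' \
        "$table" "$partition_spec"
}
```

Calling concat_cmd orc_merge5a st=0.8 prints the command for the table and
partition used in this thread.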