Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0C14518F47 for ; Fri, 24 Apr 2015 11:37:43 +0000 (UTC) Received: (qmail 72595 invoked by uid 500); 24 Apr 2015 11:37:41 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 72361 invoked by uid 500); 24 Apr 2015 11:37:41 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 72342 invoked by uid 99); 24 Apr 2015 11:37:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Apr 2015 11:37:40 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: message received from 54.76.25.247 which is an MX secondary for user@hive.apache.org) Received: from [54.76.25.247] (HELO mx1-eu-west.apache.org) (54.76.25.247) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Apr 2015 11:37:15 +0000 Received: from alfons.uib.no (alfons.uib.no [129.177.30.141]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id E96E821766 for ; Fri, 24 Apr 2015 11:37:03 +0000 (UTC) Received: from alfux.uib.no (smtp.uib.no) [2001:700:200:6::a:1f0c] by alfons.uib.no with esmtp (Exim 4.84) id 1Ylbto-000405-0u; Fri, 24 Apr 2015 13:36:00 +0200 Received: from 241-25.neasonline.no ([192.168.0.57]) [85.93.241.25]:42973 by smtp.uib.no with esmtpsa (Exim 4.84) id 1Ylbtm-0001Sp-If; Fri, 24 Apr 2015 13:35:58 +0200 Message-ID: <553A2A9E.70005@uib.no> Date: Fri, 24 Apr 2015 13:35:58 +0200 From: patcharee User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.5.0 MIME-Version: 1.0 To: user@hive.apache.org, pjayachandran@hortonworks.com Subject: Re: hive on Tez - merging orc files References: <5539F1AD.2010807@uib.no> <154CF9A2-E582-454D-A5A7-FEDD126D886E@hortonworks.com> <5539FB8A.9050304@uib.no> <6DE75112-DB39-4094-A787-47DC6778F305@hortonworks.com> In-Reply-To: <6DE75112-DB39-4094-A787-47DC6778F305@hortonworks.com> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 8bit X-UiB-submit: pth001 X-checked-clean: by exiscan on alfons.uib.no X-Scanner: b8b2f8caa57be217252a5a376b01a833 http://tjinfo.uib.no/virus.html X-UiB-SpamFlag: NO UIB: -18 hits, 8.0 required X-UiB-SpamReport: spamassassin found; -15 From is listed in 'whitelist_SA' -3.0 Message received from Norway X-Virus-Checked: Checked by ClamAV on apache.org Hi, I generated the new hive-exec.jar as you suggested. On the sandbox, Hive 0.14 with the new jar file is now using Tez to alter table concate, and It concatenates files correctly on Tez. Thanks! However I also tested on the production cluster using Hive 0.14 as well, merging did not work and generated another exception below 2015-04-24 13:01:52,259 INFO [main] app.DAGAppMaster: Running DAG: alter table orc_merge5a partit...concatenate 2015-04-24 13:01:52,355 INFO [IPC Server handler 0 on 46526] ipc.Server: IPC Server handler 0 on 46526, call org.apache.tez.dag.api.client.rpc.DAGClientAMProtocolBlockingPB.getDAGStatus from 10.2.1.254:39356 Call#361 Retry#0 org.apache.tez.dag.api.TezException: No running dag at present at org.apache.tez.dag.api.client.DAGClientHandler.getDAG(DAGClientHandler.java:84) at org.apache.tez.dag.api.client.DAGClientHandler.getACLManager(DAGClientHandler.java:151) at org.apache.tez.dag.api.client.rpc.DAGClientAMProtocolBlockingPBServerImpl.getDAGStatus(DAGClientAMProtocolBlockingPBServerImpl.java:94) at org.apache.tez.dag.api.client.rpc.DAGClientAMProtocolRPC$DAGClientAMProtocol$2.callBlockingMethod(DAGClientAMProtocolRPC.java:7375) at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:962) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2039) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2035) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2033) What could be the cause of this exception? Any ideas? BR, Patcharee On 24. april 2015 10:27, Prasanth Jayachandran wrote: > You can download the branch-0.14 source code from https://github.com/apache/hive/tree/branch-0.14, apply HIVE-9529-branch-1.0.0.patch from https://issues.apache.org/jira/browse/HIVE-9529 and compile it using �mvn clean install -DskipTests -Phadoop-2,dist�. This will generate tar file under hive/packaging/target. You can extract the tar file, copy the hive-exec-x.x.x.jar into /usr/hdp/2.2.*.*/hive/lib/ (take backup of hive-exec.jar and replace with the new one). Rerunning hive cli should use the new hive-exec jar with the patch. > > Thanks > Prasanth > >> On Apr 24, 2015, at 1:15 AM, patcharee wrote: >> >> Hi, >> >> The sandbox 2.2 comes with hive 0.14. Does it also have the bug? If so, how can I patch hive on sandbox? >> >> BR, >> Patcharee >> >> On 24. april 2015 09:42, Prasanth Jayachandran wrote: >>> Hi >>> >>> This has been fixed recently https://issues.apache.org/jira/browse/HIVE-9529. Merging is triggered in two different ways. INSERT/CTAS can trigger merging of small files and CONCATENATE can trigger merging of small files. The later had a bug which generated MR task instead of TEZ task which was fixed recently. Earlier one will use TEZ task always. >>> >>> Thanks >>> Prasanth >>> >>>> On Apr 24, 2015, at 12:33 AM, patcharee wrote: >>>> >>>> Hi, >>>> >>>> Is there anyone using hortonworks sandbox 2.2? I am trying to use hive on Tez on the sandbox. I set the running engine in hive-site.xml to Tez. >>>> >>>> >>>> hive.execution.engine >>>> tez >>>> >>>> >>>> Then I ran the script that alters a table to merge small orc files (alter table orc_merge5a partition(st=0.8) concatenate;). The merging feature worked, but Hive does not use Tez, it used MapReduce, so weird! >>>> >>>> Another point, I tried to run the same script on the production cluster which is on always Tez, the merging feature sometimes worked, sometimes did not. >>>> >>>> I would appreciate any suggestions. >>>> >>>> BR, >>>> Patcharee