Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3068C1942F for ; Tue, 1 Mar 2016 07:23:10 +0000 (UTC) Received: (qmail 18978 invoked by uid 500); 1 Mar 2016 07:23:08 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 18889 invoked by uid 500); 1 Mar 2016 07:23:08 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 18879 invoked by uid 99); 1 Mar 2016 07:23:08 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 01 Mar 2016 07:23:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 3221FC00ED for ; Tue, 1 Mar 2016 07:23:08 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.002 X-Spam-Level: X-Spam-Status: No, score=-0.002 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx2-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id nwcSePPyxRrj for ; Tue, 1 Mar 2016 07:23:06 +0000 (UTC) Received: from h19.dbweb.ee (h19.dbweb.ee [90.190.106.29]) by mx2-lw-us.apache.org (ASF Mail Server at mx2-lw-us.apache.org) with ESMTPS id 3E6A25FB16 for ; Tue, 1 Mar 2016 07:23:06 +0000 (UTC) Received: from tallinn.webmedia.ee ([88.196.5.77] helo=iRack.local) by h19.dbweb.ee with esmtpsa (TLSv1:DHE-RSA-AES128-SHA:128) (Exim 4.83) (envelope-from ) id 1aaedx-0000er-OA for user@hive.apache.org; Tue, 01 Mar 2016 09:22:53 +0200 Subject: Re: Creating hive external table gives GC pool 'PS MarkSweep' had collection(s) To: user@hive.apache.org References: <56D063DA.6030402@roo.ee> <56D06EDF.2030001@roo.ee> <56D3F2FC.6030302@roo.ee> From: Margus Roo Message-ID: <56D5434E.8010301@roo.ee> Date: Tue, 1 Mar 2016 09:22:54 +0200 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:38.0) Gecko/20100101 Thunderbird/38.6.0 MIME-Version: 1.0 In-Reply-To: <56D3F2FC.6030302@roo.ee> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Hi Found that in my config there was hive.exec.dynamic.partition = true; I turned it false and most of the times I can create table now but not every time. Margus (margusja) Roo http://margus.roo.ee skype: margusja +372 51 48 780 On 29/02/16 09:27, Margus Roo wrote: > Hi > > Can someone confirm that Hive checks files in destination directory > before creating external table? > At the moment in Hive 1.2.1 end user can just easily kill whole Hive > server creating external table and pointing to directory where are > loads of files. > > Margus (margusja) Roo > http://margus.roo.ee > skype: margusja > +372 51 48 780 > > On 26/02/16 17:27, Margus Roo wrote: >> Basically the question is: >> Does Hive checks files in location before creating table? >> Because if I move files away before creating tables it works and >> after table is created I can move files back and all works :) >> >> Margus (margusja) Roo >> http://margus.roo.ee >> skype: margusja >> +372 51 48 780 >> >> On 26/02/16 16:40, Margus Roo wrote: >>> Hi >>> >>> I try to create external table and in the location there are 8960 >>> small files >>> >>> And I am getting every time something like that: >>> GC pool 'PS MarkSweep' had collection(s): count=1 time=1672ms >>> GC pool 'PS Scavenge' had collection(s): count=1 time=45ms >>> 2016-02-26 15:18:29,721 INFO >>> [org.apache.hadoop.util.JvmPauseMonitor$Monitor@3af36922]: >>> util.JvmPauseMonitor (JvmPauseMonitor.java:run(195)) - Detected >>> pause in JVM or host machine (eg GC): pause of approximately 1995ms >>> GC pool 'PS MarkSweep' had collection(s): count=1 time=1936ms >>> >>> And all 16 cpu cores are in 100% and all 16G memory almost gone. >>> >>> Helps only if I restart hive server. >>> >>> I use Hive 1.2.1 from HDP-2.3 by packaged by Hortonworks >>> >>> hive.tez.java.opts=-server -Djava.net.preferIPv4Stack=true >>> -XX:NewRatio=8 -XX:+UseNUMA -XX:+UseG1GC -XX:+ResizeTLAB >>> -XX:+PrintGCDetails -verbose:gc -XX:+PrintGCTimeStamps >>> >>> >> >