Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2B144172B1 for ; Fri, 10 Apr 2015 07:46:13 +0000 (UTC) Received: (qmail 50663 invoked by uid 500); 10 Apr 2015 07:46:12 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 50587 invoked by uid 500); 10 Apr 2015 07:46:12 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 50574 invoked by uid 99); 10 Apr 2015 07:46:12 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Apr 2015 07:46:12 +0000 Date: Fri, 10 Apr 2015 07:46:12 +0000 (UTC) From: "Gopal V (JIRA)" To: dev@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (HIVE-10298) LLAP: Cache cleaner throws Negative sleep exception MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Gopal V created HIVE-10298: ------------------------------ Summary: LLAP: Cache cleaner throws Negative sleep exception Key: HIVE-10298 URL: https://issues.apache.org/jira/browse/HIVE-10298 Project: Hive Issue Type: Sub-task Affects Versions: llap Reporter: Gopal V Assignee: Sergey Shelukhin Priority: Minor {code} 2015-04-10 00:33:57,791 [org.apache.hadoop.util.JvmPauseMonitor$Monitor@5cee5251()] WARN org.apache.hadoop.util.JvmPauseMonitor: Detected pause in JVM or host machine (eg GC): pause of approximately 10838ms GC pool 'PS MarkSweep' had collection(s): count=74 time=75850ms ... 2015-04-10 00:34:33,265 [IPC Server handler 3 on 15001()] WARN org.apache.hadoop.ipc.Server: IPC Server handler 3 on 15001, call org.apache.hadoop.hive.llap.daemon.LlapDaemonProtocolBlockingPB.submitWork from 172.19.128.59:47617 Call#2788 Retry#0: output error 2015-04-10 00:34:34,454 [Llap low level cache cleanup thread()] ERROR org.apache.hadoop.hive.llap.io.api.impl.LlapIoImpl: Cleanup has failed; the thread will now exit java.lang.IllegalArgumentException: timeout value is negative at java.lang.Thread.sleep(Native Method) at org.apache.hadoop.hive.llap.cache.LowLevelCacheImpl$CleanupThread.doOneCleanupRound(LowLevelCacheImpl.java:426) at org.apache.hadoop.hive.llap.cache.LowLevelCacheImpl$CleanupThread.run(LowLevelCacheImpl.java:387) {code} This seems to be due to a rather long GC pause {code} // Duration is an estimate; if the size of the map changes, it can be very different. long endTime = System.nanoTime() + approxCleanupIntervalSec * 1000000000L; ... while (subIter.hasNext()) { Thread.sleep((leftToCheck <= 0) ? 1 : (endTime - System.nanoTime()) / (1000000L * leftToCheck)); if (subIter.next().getValue().isInvalid()) { {code} We spent more than 5 minutes in a GC pause causing this error. -- This message was sent by Atlassian JIRA (v6.3.4#6332)