Date: Fri, 7 Nov 2014 05:48:34 +0000 (UTC)
From: "Chengxiang Li (JIRA)"
To: hive-dev@hadoop.apache.org
Reply-To: dev@hive.apache.org
Subject: [jira] [Commented] (HIVE-8780) insert1.q and ppd_join4.q hangs with hadoop-1 [Spark Branch]

[ https://issues.apache.org/jira/browse/HIVE-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14201649#comment-14201649 ]

Chengxiang Li commented on HIVE-8780:
-------------------------------------

Hi [~jxiang], SparkJobMonitor checks the job state in a loop until the job finishes or fails, and sleeps for the check interval after each check, so it is normal to see this stack trace. SparkJobMonitor should not block Spark job execution, because the job is submitted asynchronously before we call SparkJobMonitor::startMonitor.
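To illustrate the polling pattern described above, here is a minimal sketch. It is not the actual SparkJobMonitor code: JobMonitorSketch, JobHandle, JobState, submitAsync and the timeout parameter are all hypothetical names introduced for this example, and the timeout in particular is an assumption that the comment above does not mention.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;

public class JobMonitorSketch {

    /** Hypothetical job states; not the enum used by Hive or Spark. */
    enum JobState { RUNNING, SUCCEEDED, FAILED }

    /** Hypothetical handle for a job that was already submitted asynchronously. */
    interface JobHandle {
        JobState getState();
    }

    /**
     * Polls the job state until it is terminal, sleeping between checks.
     * Returns 0 on success, 1 on failure, 2 on timeout, 3 if interrupted.
     * The timeout is an assumption added for this sketch.
     */
    static int monitor(JobHandle job, long checkIntervalMs, long timeoutMs) {
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMs);
        while (true) {
            JobState state = job.getState();
            if (state == JobState.SUCCEEDED) {
                return 0;
            }
            if (state == JobState.FAILED) {
                return 1;
            }
            if (System.nanoTime() - deadline >= 0) {
                return 2;
            }
            try {
                // The monitoring thread spends most of its time here.
                Thread.sleep(checkIntervalMs);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt flag
                return 3;
            }
        }
    }

    /** Simulates asynchronous submission: the work runs on another thread. */
    static JobHandle submitAsync(long workMs) {
        CompletableFuture<Void> future = CompletableFuture.runAsync(() -> {
            try {
                Thread.sleep(workMs); // stand-in for the real job
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
        });
        return () -> {
            if (!future.isDone()) {
                return JobState.RUNNING;
            }
            return future.isCompletedExceptionally() ? JobState.FAILED : JobState.SUCCEEDED;
        };
    }

    public static void main(String[] args) {
        JobHandle job = submitAsync(200);  // job starts running immediately
        int rc = monitor(job, 50, 5000);   // monitor only observes it
        System.out.println("monitor returned " + rc);
    }
}
```

A thread dump taken while this loop is sleeping shows the monitoring thread in TIMED_WAITING (sleeping), as in the trace quoted below, whether or not the job is making progress. That is why the dump of this one thread cannot tell a slow job from a hung one.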
Could you test with a single qtest file and upload a full JVM stack trace once it has run long enough to be sure the job is hanging?

> insert1.q and ppd_join4.q hangs with hadoop-1 [Spark Branch]
> ------------------------------------------------------------
>
>                 Key: HIVE-8780
>                 URL: https://issues.apache.org/jira/browse/HIVE-8780
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Jimmy Xiang
>
> In working on HIVE-8758, found these tests hang at
> {noformat}
> java.lang.Thread.State: TIMED_WAITING (sleeping)
>         at java.lang.Thread.sleep(Native Method)
>         at org.apache.hadoop.hive.ql.exec.spark.status.SparkJobMonitor.startMonitor(SparkJobMonitor.java:129)
>         at org.apache.hadoop.hive.ql.exec.spark.SparkTask.execute(SparkTask.java:111)
>         at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:161)
>         at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
>         at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1644)
>         at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1404)
>         at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1216)
>         at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1043)
>         at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1033)
>         at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:247)
>         at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:199)
>         at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:410)
>         at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:345)
>         at org.apache.hadoop.hive.ql.QTestUtil.executeClient(QTestUtil.java:832)
>         at org.apache.hadoop.hive.cli.TestSparkCliDriver.runTest(TestSparkCliDriver.java:3706)
>         at org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_ppd_join4(TestSparkCliDriver.java:2790)
> {noformat}
> Both tests hang at the same place. There could be other hanging tests.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)