Return-Path: X-Original-To: apmail-hive-issues-archive@minotaur.apache.org Delivered-To: apmail-hive-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DBDFF17CE3 for ; Tue, 19 May 2015 08:03:01 +0000 (UTC) Received: (qmail 32597 invoked by uid 500); 19 May 2015 08:03:00 -0000 Delivered-To: apmail-hive-issues-archive@hive.apache.org Received: (qmail 32486 invoked by uid 500); 19 May 2015 08:03:00 -0000 Mailing-List: contact issues-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list issues@hive.apache.org Received: (qmail 32216 invoked by uid 99); 19 May 2015 08:03:00 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 19 May 2015 08:03:00 +0000 Date: Tue, 19 May 2015 08:03:00 +0000 (UTC) From: "Prasanth Jayachandran (JIRA)" To: issues@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HIVE-10744) LLAP: dags get stuck in yet another way MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-10744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HIVE-10744: ----------------------------------------- Attachment: (was: HIVE-10744.patch) > LLAP: dags get stuck in yet another way > --------------------------------------- > > Key: HIVE-10744 > URL: https://issues.apache.org/jira/browse/HIVE-10744 > Project: Hive > Issue Type: Sub-task > Reporter: Sergey Shelukhin > Assignee: Prasanth Jayachandran > Attachments: HIVE-10744.patch > > > DAG gets stuck when number of tasks that is multiple of number of containers on machine (6, 12, ... in my case) fails to finish at the end of the stage (I am running a job with 500-1000 maps). Status just hangs forever (beyond 5 min timeout) with some tasks shown as running. Happened twice on 3rd DAG with 1000-map job (TPCH Q1), then when I reduced to 500 happened on 7th DAG so far. [~sseth] has the details. -- This message was sent by Atlassian JIRA (v6.3.4#6332)