Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 17F37CBFA for ; Thu, 26 Jul 2012 21:42:37 +0000 (UTC) Received: (qmail 1090 invoked by uid 500); 26 Jul 2012 21:42:35 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 1017 invoked by uid 500); 26 Jul 2012 21:42:35 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 950 invoked by uid 500); 26 Jul 2012 21:42:35 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 926 invoked by uid 99); 26 Jul 2012 21:42:35 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Jul 2012 21:42:35 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id B72D91427F2 for ; Thu, 26 Jul 2012 21:42:34 +0000 (UTC) Date: Thu, 26 Jul 2012 21:42:34 +0000 (UTC) From: "Kevin Wilfong (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: <847619141.108229.1343338954752.JavaMail.jiratomcat@issues-vm> In-Reply-To: <764979986.104116.1343263774564.JavaMail.jiratomcat@issues-vm> Subject: [jira] [Updated] (HIVE-3302) Race condition in query plan for merging at the end of a query MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-3302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Wilfong updated HIVE-3302: -------------------------------- Status: Patch Available (was: Open) > Race condition in query plan for merging at the end of a query > -------------------------------------------------------------- > > Key: HIVE-3302 > URL: https://issues.apache.org/jira/browse/HIVE-3302 > Project: Hive > Issue Type: Bug > Components: Query Processor > Affects Versions: 0.10.0 > Reporter: Kevin Wilfong > Assignee: Kevin Wilfong > Priority: Critical > Attachments: HIVE-3302.1.patch.txt > > > In the query plan that's used to merge files at the end of a query, the dependency tree looks something like: > MoveTask(1) > / \ > ...ConditionalTask MoveTask(2)... > \ / > MergeTask > Here MoveTask(1) moves the partition data to a temporary location, and MoveTask(2) moves it to the final location. > However if there are dynamic partitions generated and some of these partitions are merged and others are moved, the dependency tree is changed at runtime to: > ...ConditionalTask MoveTask(2)... > \ / > MergeTask > \ > MoveTask(1) > This produces a race condition between the two MoveTasks where if MoveTask(2) runs before MoveTask(1) the partitions moved by MoveTask(1) will get moved to an intermediate location and never moved to the final location. In this case those partitions are quietly lost. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira