Date: Thu, 31 Jul 2014 20:43:39 +0000 (UTC)
From: "Xuefu Zhang (JIRA)"
To: hive-dev@hadoop.apache.org
Reply-To: dev@hive.apache.org
Subject: [jira] [Commented] (HIVE-7334) Create SparkShuffler, shuffling data between map-side data processing and reduce-side processing

    [ https://issues.apache.org/jira/browse/HIVE-7334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14081430#comment-14081430 ]

Xuefu Zhang commented on HIVE-7334:
-----------------------------------

[~lirui] Please feel free to create smaller JIRAs to enable sorting in Hive on Spark. Here are some ideas:
1. Complete SortByShuffler.
2. Add logic in SparkCompiler to generate SparkEdgeProperty with the right sorting property.
3. Add logic in SparkPlanGenerator to generate a plan with the right shuffle type.
4. Test Hive's sorting-related queries to make sure they work. File JIRAs for problems found.

Also, please take a look at the link [~rxin] pointed out above to see if we can benefit in any way.

> Create SparkShuffler, shuffling data between map-side data processing and reduce-side processing
> ------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-7334
>                 URL: https://issues.apache.org/jira/browse/HIVE-7334
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Xuefu Zhang
>            Assignee: Rui Li
>         Attachments: HIVE-7334.patch
>
>
> Please refer to the design spec.

--
This message was sent by Atlassian JIRA
(v6.2#6252)