Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8AE1811E56 for ; Wed, 21 May 2014 10:56:39 +0000 (UTC) Received: (qmail 15281 invoked by uid 500); 21 May 2014 10:56:39 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 15225 invoked by uid 500); 21 May 2014 10:56:38 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 15175 invoked by uid 500); 21 May 2014 10:56:38 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 15157 invoked by uid 99); 21 May 2014 10:56:38 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 May 2014 10:56:38 +0000 Date: Wed, 21 May 2014 10:56:38 +0000 (UTC) From: "Remus Rusanu (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14004572#comment-14004572 ] Remus Rusanu commented on HIVE-7105: ------------------------------------ Extending the vectorized processing to the reduce side is a complex undertaking. None of the vector mode operators are implemented in reduce side. The thinking is that the bulk of the CPU intensive processing occurs on the map side and our goal was to provide maximum feature coverage (ie. implement as many operators as needed to cover the most queries) but atm vectorization only works for map side of first stage. I'm not sure whether at this stage we can call the map side effort stable/mature/complete enough to warrant a focus shift to reduce side. > Enable ReduceRecordProcessor to generate VectorizedRowBatches > ------------------------------------------------------------- > > Key: HIVE-7105 > URL: https://issues.apache.org/jira/browse/HIVE-7105 > Project: Hive > Issue Type: Bug > Components: Vectorization > Reporter: Rajesh Balamohan > Assignee: Jitendra Nath Pandey > > Currently, ReduceRecordProcessor sends one key,value pair at a time to its operator pipeline. It would be beneficial to send VectorizedRowBatch to downstream operators. -- This message was sent by Atlassian JIRA (v6.2#6252)