Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C9C8A200CC1 for ; Mon, 10 Jul 2017 13:14:29 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id C84921628C3; Mon, 10 Jul 2017 11:14:29 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 4598F162953 for ; Mon, 10 Jul 2017 13:14:29 +0200 (CEST) Received: (qmail 35952 invoked by uid 500); 10 Jul 2017 11:14:28 -0000 Mailing-List: contact issues-help@carbondata.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@carbondata.apache.org Delivered-To: mailing list issues@carbondata.apache.org Received: (qmail 35943 invoked by uid 99); 10 Jul 2017 11:14:28 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 10 Jul 2017 11:14:28 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 551BDE102F; Mon, 10 Jul 2017 11:14:27 +0000 (UTC) From: bhavya411 To: issues@carbondata.apache.org Reply-To: issues@carbondata.apache.org References: In-Reply-To: Subject: [GitHub] carbondata pull request #1142: [CARBONDATA-1271] Enhanced Performance for Hi... Content-Type: text/plain Message-Id: <20170710111427.551BDE102F@git1-us-west.apache.org> Date: Mon, 10 Jul 2017 11:14:27 +0000 (UTC) archived-at: Mon, 10 Jul 2017 11:14:30 -0000 Github user bhavya411 commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1142#discussion_r126392544 --- Diff: integration/hive/src/main/java/org/apache/carbondata/hive/CarbonHiveRecordReader.java --- @@ -111,58 +108,46 @@ private void initialize(InputSplit inputSplit, Configuration conf) throws IOExce } else { columnTypes = TypeInfoUtils.getTypeInfosFromTypeString(columnTypeProperty); } + + if (valueObj == null) { + valueObj = new ArrayWritable(Writable.class, new Writable[columnTypes.size()]); + } --- End diff -- Actually the data structure should be consistent, initially we were just returning the project columns and the Arraywritable was having variable length but in Parquet and ORC both implementation the ArrayWritable length was equivalent to the number of columns in table . It was causing issues in TPCh queries so that's why made changes to have Arraywritable size equivalent to number of columns and then populate the data at respective position --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---