Date: Tue, 13 Sep 2016 12:08:20 +0000 (UTC)
From: "Amruth S (JIRA)"
To: issues@hive.apache.org
Reply-To: dev@hive.apache.org
Subject: [jira] [Updated] (HIVE-14741) Incorrect results on boolean col when vectorization is ON

     [ https://issues.apache.org/jira/browse/HIVE-14741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Amruth S updated HIVE-14741:
----------------------------
    Labels: orc vectorization  (was: )

> Incorrect results on boolean col when vectorization is ON
> ---------------------------------------------------------
>
>                 Key: HIVE-14741
>                 URL: https://issues.apache.org/jira/browse/HIVE-14741
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 2.0.0, 2.1.0
>            Reporter: Amruth S
>              Labels: orc, vectorization
>         Attachments: 000000_0
>
>
> I have attached the ORC part file on which the issue manifests. It has just one boolean column (a lot of nulls and 231 trues, verified using the ORC file dump utility).
> 1) Create an external table on the attached part file:
> CREATE EXTERNAL TABLE bool_vect_issue (
>   `bool_col` BOOLEAN)
> ROW FORMAT SERDE
>   'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
> STORED AS INPUTFORMAT
>   'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
> OUTPUTFORMAT
>   'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
> LOCATION
>   '';
> 2) With vectorization enabled:
> set hive.vectorized.execution.enabled = true;
> select sum(if((bool_col) , 1, 0)) from bool_vect_issue;
> gives
> 708206
> 3) With vectorization disabled:
> set hive.vectorized.execution.enabled = false;
> select sum(if((bool_col) , 1, 0)) from bool_vect_issue;
> gives
> 231
> The issue seems to have the same impact as HIVE-12435.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)