Return-Path: X-Original-To: apmail-drill-issues-archive@minotaur.apache.org Delivered-To: apmail-drill-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 469E8177F0 for ; Fri, 7 Nov 2014 21:55:55 +0000 (UTC) Received: (qmail 76379 invoked by uid 500); 7 Nov 2014 21:55:55 -0000 Delivered-To: apmail-drill-issues-archive@drill.apache.org Received: (qmail 76351 invoked by uid 500); 7 Nov 2014 21:55:55 -0000 Mailing-List: contact issues-help@drill.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.incubator.apache.org Delivered-To: mailing list issues@drill.incubator.apache.org Received: (qmail 76342 invoked by uid 99); 7 Nov 2014 21:55:55 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Nov 2014 21:55:55 +0000 X-ASF-Spam-Status: No, hits=-2000.6 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.3] (HELO mail.apache.org) (140.211.11.3) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 07 Nov 2014 21:55:54 +0000 Received: (qmail 75558 invoked by uid 99); 7 Nov 2014 21:55:33 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Nov 2014 21:55:33 +0000 Date: Fri, 7 Nov 2014 21:55:33 +0000 (UTC) From: "Venkata krishnan Sowrirajan (JIRA)" To: issues@drill.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (DRILL-1664) Drill gives wrong count on a parquet file which is created as a table by drill MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/DRILL-1664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14202761#comment-14202761 ] Venkata krishnan Sowrirajan commented on DRILL-1664: ---------------------------------------------------- If I do "create table `t2-csv` as select * from `t2.csv`;", then the parquet file created looks like below: columns = 9711942 columns = HX362083 columns = 9707867 columns = HX357851 Here all the columns are named as `columns`, this is why the count is showing up as 4. If I do "create table `t2-parq` as select columns[0] as a, columns[1] as b from `t2.csv`;", then the parquet file created looks like below: a = 9711942 b = HX362083 a = 9707867 b = HX357851 This way if create the table, the count showing correctly as 2. > Drill gives wrong count on a parquet file which is created as a table by drill > ------------------------------------------------------------------------------ > > Key: DRILL-1664 > URL: https://issues.apache.org/jira/browse/DRILL-1664 > Project: Apache Drill > Issue Type: Bug > Reporter: Venkata krishnan Sowrirajan > > Steps carried out: > 0: jdbc:drill:> select * from `t2.csv`; > +------------+ > | columns | > +------------+ > | ["9711942","HX362083"] | > | ["9707867","HX357851"] | > +------------+ > 2 rows selected (0.123 seconds) > 0: jdbc:drill:> create table `t2-csv` as select * from `t2.csv`; > +------------+---------------------------+ > | Fragment | Number of records written | > +------------+---------------------------+ > | 0_0 | 2 | > +------------+---------------------------+ > 1 row selected (0.252 seconds) > 0: jdbc:drill:> select * from `t2-csv`; > +------------+ > | columns | > +------------+ > | ["9711942","HX362083"] | > | ["9707867","HX357851"] | > +------------+ > 2 rows selected (0.116 seconds) > 0: jdbc:drill:> select count(*) from `t2-csv` > . . . . . . . > ; > +------------+ > | EXPR$0 | > +------------+ > | 4 | > +------------+ > 1 row selected (0.128 seconds) > Is there a similar bug for this already filed? If there is a similar bug for this, mark this as duplicate as I couldn't find that. -- This message was sent by Atlassian JIRA (v6.3.4#6332)