Date: Wed, 20 Apr 2016 12:22:25 +0000 (UTC)
From: "Vitalii Diravka (JIRA)"
To: issues@drill.apache.org
Reply-To: dev@drill.apache.org
Subject: [jira] [Closed] (DRILL-3577) Counting nested fields on CTAS-created-parquet file/s reports inaccurate results

     [ https://issues.apache.org/jira/browse/DRILL-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vitalii Diravka closed DRILL-3577.
----------------------------------
    Resolution: Resolved

> Counting nested fields on CTAS-created-parquet file/s reports inaccurate results
> --------------------------------------------------------------------------------
>
>                 Key: DRILL-3577
>                 URL: https://issues.apache.org/jira/browse/DRILL-3577
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Functions - Drill
>    Affects Versions: 1.1.0
>            Reporter: Hanifi Gunes
>            Assignee: Vitalii Diravka
>            Priority: Critical
>             Fix For: 1.7.0
>
>
> I have not tried this at a smaller scale, nor on the JSON file directly, but the following seems to reproduce the issue.
> 1. Create an input file as follows
> 20K rows with the following -
> {"some":"yes","others":{"other":"true","all":"false","sometimes":"yes"}}
> 200 rows with the following -
> {"some":"yes","others":{"other":"true","all":"false","sometimes":"yes","additional":"last entries only"}}
> 2. CTAS as follows
> {code:sql}
> CREATE TABLE dfs.`tmp`.`tp` as select * from dfs.`data.json` t
> {code}
> This should report
> {code}
> Fragment    Number of records written
> 0_0         20200
> {code}
> 3. Count on nested fields via
> {code:sql}
> select count(t.others.additional) from dfs.`tmp`.`tp` t
> OR
> select count(t.others.other) from dfs.`tmp`.`tp` t
> {code}
> reports no rows, as follows
> {code}
> EXPR$0
> 0
> {code}
> While
> {code:sql}
> select count(t.`some`) from dfs.`tmp`.`tp` t where t.others.additional is not null
> {code}
> reports the expected 200 rows
> {code}
> EXPR$0
> 200
> {code}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
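
For anyone wanting to rebuild the input file from step 1, the following is a minimal sketch in Python, not part of the original report. The function name `write_repro_input` and its parameters are hypothetical. It also assumes one JSON object per line and that the 200 rows carrying `additional` come last, as the field's value ("last entries only") suggests.

```python
import json


def write_repro_input(path, base_rows=20_000, extra_rows=200):
    """Write the step 1 input file: one JSON object per line.

    Hypothetical helper; the row counts and field values are taken from
    the report, the layout (newline-delimited, extra rows last) is assumed.
    """
    base = {
        "some": "yes",
        "others": {"other": "true", "all": "false", "sometimes": "yes"},
    }
    # Same record, with the one extra nested field that only the final rows have.
    extra = {
        "some": "yes",
        "others": {**base["others"], "additional": "last entries only"},
    }
    with open(path, "w", encoding="utf-8") as f:
        for _ in range(base_rows):
            f.write(json.dumps(base) + "\n")
        for _ in range(extra_rows):
            f.write(json.dumps(extra) + "\n")
    # Total number of rows written.
    return base_rows + extra_rows
```

Calling `write_repro_input("data.json")` in the directory that the `dfs` storage plugin reads from should give a file suitable for the CTAS in step 2. Where that directory is depends on the local Drill configuration.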