Return-Path: X-Original-To: apmail-drill-issues-archive@minotaur.apache.org Delivered-To: apmail-drill-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ADE9F17F12 for ; Wed, 11 Feb 2015 19:20:15 +0000 (UTC) Received: (qmail 70467 invoked by uid 500); 11 Feb 2015 19:20:11 -0000 Delivered-To: apmail-drill-issues-archive@drill.apache.org Received: (qmail 70437 invoked by uid 500); 11 Feb 2015 19:20:11 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 70421 invoked by uid 99); 11 Feb 2015 19:20:11 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 11 Feb 2015 19:20:11 +0000 Date: Wed, 11 Feb 2015 19:20:11 +0000 (UTC) From: "Parth Chandra (JIRA)" To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (DRILL-2082) nested arrays of strings returned wrong results MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/DRILL-2082?page=3Dcom.atlassia= n.jira.plugin.system.issuetabpanels:all-tabpanel ] Parth Chandra updated DRILL-2082: --------------------------------- Fix Version/s: 0.8.0 > nested arrays of strings returned wrong results > ----------------------------------------------- > > Key: DRILL-2082 > URL: https://issues.apache.org/jira/browse/DRILL-2082 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Data Types > Affects Versions: 0.8.0 > Reporter: Chun Chang > Assignee: Mehant Baid > Priority: Critical > Fix For: 0.8.0 > > > #Mon Jan 26 14:10:51 PST 2015 > git.commit.id.abbrev=3D3c6d0ef > Querying Complex JSON data type nested array of strings returned wrong re= sults when data size is large (1 million row). Smaller data size (a few row= s) returned correct results. Test data can be accessed at http://apache-dri= ll.s3.amazonaws.com/files/complex.json.gz > For small data size, I got correct results: > {code} > 0: jdbc:drill:schema=3Ddfs.drillTestDirComplexJ> select t.id, t.aaa from = `aaa.json` t; > +------------+------------+ > | id | aaa | > +------------+------------+ > | 1 | [[["aa0 1"],["ab0 1"]],[["ba0 1"],["bb0 1"]],[["ca0 1","ca= 1 1"],["cb0 1","cb1 1","cb2 1"]]] | > | 2 | [[["aa0 2"],["ab0 2"]],[["ba0 2"],["bb0 2"]],[["ca0 2","ca= 1 2"],["cb0 2","cb1 2","cb2 2"]]] | > +------------+------------+ > {code} > But large data size returned wrong results: > {code} > 0: jdbc:drill:schema=3Ddfs.drillTestDirComplexJ> select t.id, t.aaa from = `complex.json` t where t.id=3D1 limit 1; > +------------+------------+ > | id | aaa | > +------------+------------+ > | 1 | [[["ba0 56"],["bb0 56"],["ca0 56","ca1 56"],["cb0 56","cb1= 56","cb2 56"],["aa0 91"],["ab0 91"],["aa0 125"],["ab0 125"],["aa0 140"],["= ab0 140"],["aa0 142"],["ab0 142"],["aa0 146"],["ab0 146"],["ba0 402"],["bb0= 402"],["ca0 402","ca1 402"],["cb0 402","cb1 402","cb2 402"],["aa0 403"],["= ab0 403"],["ba0 403"],["bb0 403"],["ca0 403","ca1 403"],["cb0 403","cb1 403= ","cb2 403"],["aa0 404"],["ab0 404"],["ba0 404"],["bb0 404"],["ca0 404","ca= 1 404"],["cb0 404","cb1 404","cb2 404"],["aa0 405"],["ab0 405"],["ba0 405"]= ,["bb0 405"],["ca0 405","ca1 405"],["cb0 405","cb1 405","cb2 405"],["aa0 43= 7"],["ab0 437"],["aa0 485"],["ab0 485"],["aa0 503"],["ab0 503"],["aa0 569"]= ,["ab0 569"],["aa0 581"],["ab0 581"],["aa0 620"],["ab0 620"],["aa0 632"],["= ab0 632"],["aa0 640"],["ab0 640"],["aa0 650"],["ab0 650"],["aa0 669"],["ab0= 669"],["aa0 671"],["ab0 671"],["aa0 728"],["ab0 728"],["aa0 735"],["ab0 73= 5"],["aa0 772"],["ab0 772"],["aa0 784"],["ab0 784"],["aa0 811"],["ab0 811"]= ,["aa0 817"],["ab0 817"],["aa0 836"],["ab0 836"],["aa0 881"],["ab0 881"],["= aa0 891"],["ab0 891"],["aa0 924"],["ab0 924"],["aa0 1005"],["ab0 1005"],["a= a0 1057"],["ab0 1057"],["aa0 1086"],["ab0 1086"],["aa0 1089"],["ab0 1089"],= ["aa0 1097"],["ab0 1097"],["aa0 1133"],["ab0 1133"],["aa0 1136"],["ab0 1136= "],["aa0 1146"],["ab0 1146"],["aa0 1169"],["ab0 1169"],["aa0 1178"],["ab0 1= 178"],["aa0 1184"],["ab0 1184"],["aa0 1189"],["ab0 1189"],["aa0 1223"],["ab= 0 1223"],["aa0 1275"],["ab0 1275"],["aa0 1290"],["ab0 1290"],["aa0 1295"],[= "ab0 1295"],["aa0 1320"],["ab0 1320"],["aa0 1343"],["ab0 1343"],["aa0 1400"= ],["ab0 1400"],["aa0 1426"],["ab0 1426"],["aa0 1442"],["ab0 1442"],["aa0 14= 55"],["ab0 1455"],["aa0 1499"],["ab0 1499"],["aa0 1521"],["ab0 1521"],["aa0= 1541"],["ab0 1541"],["aa0 1557"],["ab0 1557"],["aa0 1578"],["ab0 1578"],["= aa0 1633"],["ab0 1633"],["aa0 1635"],["ab0 1635"],["aa0 1651"],["ab0 1651"]= ,["aa0 1665"],["ab0 1665"],["aa0 1689"],["ab0 1689"],["aa0 1760"],["ab0 176= 0"],["aa0 1784"],["ab0 1784"],["aa0 1796"],["ab0 1796"],["aa0 1801"],["ab0 = 1801"],["aa0 1817"],["ab0 1817"],["aa0 1861"],["ab0 1861"],["aa0 1872"],["a= b0 1872"],["aa0 1895"],["ab0 1895"],["aa0 1897"],["ab0 1897"],["aa0 1911"],= ["ab0 1911"],["aa0 1975"],["ab0 1975"],["aa0 1983"],["ab0 1983"],["aa0 1996= "],["ab0 1996"],["aa0 2005"],["ab0 2005"],["aa0 2048"],["ab0 2048"],["aa0 2= 063"],["ab0 2063"],["aa0 2150"],["ab0 2150"],["aa0 2159"],["ab0 2159"],["aa= 0 2214"],["ab0 2214"],["aa0 2218"],["ab0 2218"],["aa0 2220"],["ab0 2220"],[= "aa0 2250"],["ab0 2250"],["aa0 2256"],["ab0 2256"],["aa0 2265"],["ab0 2265"= ],["aa0 2296"],["ab0 2296"],["aa0 2319"],["ab0 2319"],["aa0 2327"],["ab0 23= 27"],["aa0 2333"],["ab0 2333"],["aa0 2361"],["ab0 2361"],["aa0 2392"],["ab0= 2392"],["aa0 2399"],["ab0 2399"],["aa0 2424"],["ab0 2424"],["aa0 2466"],["= ab0 2466"],["aa0 2473"],["ab0 2473"],["aa0 2508"],["ab0 2508"],["aa0 2524"]= ,["ab0 2524"],["aa0 2550"],["ab0 2550"],["aa0 2553"],["ab0 2553"],["aa0 256= 0"],["ab0 2560"],["aa0 2563"],["ab0 2563"],["aa0 2574"],["ab0 2574"],["aa0 = 2592"],["ab0 2592"],["aa0 2600"],["ab0 2600"],["aa0 2606"],["ab0 2606"],["a= a0 2639"],["ab0 2639"],["aa0 2670"],["ab0 2670"],["aa0 2684"],["ab0 2684"],= ["aa0 2720"],["ab0 2720"],["aa0 2745"],["ab0 2745"],["aa0 2763"],["ab0 2763= "],["aa0 2786"],["ab0 2786"],["aa0 2831"],["ab0 2831"],["aa0 2834"],["ab0 2= 834"],["aa0 2838"],["ab0 2838"],["aa0 2842"],["ab0 2842"],["aa0 2909"],["ab= 0 2909"],["aa0 2982"],["ab0 2982"],["aa0 2989"],["ab0 2989"],["aa0 2992"],[= "ab0 2992"],["aa0 3027"],["ab0 3027"],["aa0 3033"],["ab0 3033"],["aa0 3052"= ],["ab0 3052"],["aa0 3072"],["ab0 3072"],["aa0 3078"],["ab0 3078"],["aa0 31= 04"],["ab0 3104"],["aa0 3116"],["ab0 3116"],["aa0 3152"],["ab0 3152"],["aa0= 3168"],["ab0 3168"],["aa0 3195"],["ab0 3195"],["aa0 3202"],["ab0 3202"],["= aa0 3212"],["ab0 3212"],["aa0 3227"],["ab0 3227"],["aa0 3252"],["ab0 3252"]= ,["aa0 3258"],["ab0 3258"],["aa0 3269"],["ab0 3269"],["aa0 3308"],["ab0 330= 8"],["aa0 3332"],["ab0 3332"],["aa0 3351"],["ab0 3351"],["aa0 3359"],["ab0 = 3359"],["aa0 3382"],["ab0 3382"],["aa0 3400"],["ab0 3400"],["aa0 3450"],["a= b0 3450"],["aa0 3455"],["ab0 3455"],["aa0 3478"],["ab0 3478"],["aa0 3484"],= ["ab0 3484"],["aa0 3504"],["ab0 3504"],["aa0 3531"],["ab0 3531"],["aa0 3557= "],["ab0 3557"],["aa0 3582"],["ab0 3582"],["aa0 3631"],["ab0 3631"],["aa0 3= 658"],["ab0 3658"],["aa0 3703"],["ab0 3703"],["aa0 3710"],["ab0 3710"],["aa= 0 3716"],["ab0 3716"],["aa0 3741"],["ab0 3741"],["aa0 3759"],["ab0 3759"],[= "aa0 3803"],["ab0 3803"],["aa0 3852"],["ab0 3852"],["aa0 3874"],["ab0 3874"= ],["aa0 3884"],["ab0 3884"],["aa0 3887"],["ab0 3887"],["aa0 3889"],["ab0 38= 89"],["aa0 3981"],["ab0 3981"],["aa0 3993"],["ab0 3993"],["aa0 4012"],["ab0= 4012"],["aa0 4024"],["ab0 4024"],["aa0 4032"],["ab0 4032"],["aa0 4042"],["= ab0 4042"],["aa0 4066"],["ab0 4066"],["aa0 4088"],["ab0 4088"],["aa0 4095"]= ,["ab0 4095"]],[[""],["bb0 3741"],["ba0 3759"],["bb0 3759"],["ba0 3803"],["= bb0 3803"],["ba0 3814"],["bb0 3814"],["ba0 3852"],["bb0 3852"],["ba0 3874"]= ,["bb0 3874"],["ba0 3884"],["bb0 3884"],["ba0 3887"],["bb0 3887"],["ba0 388= 9"],["bb0 3889"],["ba0 3957"],["bb0 3957"],["ba0 3981"],["bb0 3981"],["ba0 = 3993"],["bb0 3993"],["ba0 4012"],["bb0 4012"],["ba0 4024"],["bb0 4024"],["b= a0 4032"],["bb0 4032"],["ba0 4042"],["bb0 4042"],["ba0 4066"],["bb0 4066"],= ["ba0 4088"],["bb0 4088"],["ba0 4095"],["bb0 4095"]],[["ca0 4095","ca1 4095= "],["cb0 4095","cb1 4095","cb2 4095"]]] | > +------------+------------+ > {code} > physical plan > {code} > 0: jdbc:drill:schema=3Ddfs.drillTestDirComplexJ> explain plan for select = t.id, t.aaa from `complex.json` t where t.id=3D1 limit 1; > +------------+------------+ > | text | json | > +------------+------------+ > | 00-00 Screen > 00-01 Project(id=3D[$0], aaa=3D[$1]) > 00-02 SelectionVectorRemover > 00-03 Limit(fetch=3D[1]) > 00-04 Filter(condition=3D[=3D($0, 1)]) > 00-05 Project(id=3D[$1], aaa=3D[$0]) > 00-06 Scan(groupscan=3D[EasyGroupScan [selectionRoot=3D/dr= ill/testdata/complex_type/json/complex.json, numFiles=3D1, columns=3D[`id`,= `aaa`], files=3D[maprfs:/drill/testdata/complex_type/json/complex.json]]]) > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)