Mailing-List: contact issues-help@drill.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@drill.apache.org
Date: Fri, 24 Jul 2015 04:25:04 +0000 (UTC)
From: "Parth Chandra (JIRA)" <jira@apache.org>
To: issues@drill.apache.org
Message-ID: <JIRA.12848784.1437711800000.277665.1437711904548@Atlassian.JIRA>
In-Reply-To: <JIRA.12848784.1437711800000@Atlassian.JIRA>
References: <JIRA.12848784.1437711800000@Atlassian.JIRA>
 <JIRA.12848784.1437711800180@arcas>
Subject: [jira] [Updated] (DRILL-3551) CTAS from complex Json source with
 schema change  is not written (and hence not read back ) correctly
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


     [ https://issues.apache.org/jira/browse/DRILL-3551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Parth Chandra updated DRILL-3551:
---------------------------------
    Attachment: sb.json

> CTAS from complex Json source with schema change  is not written (and hence not read back ) correctly
> -----------------------------------------------------------------------------------------------------
>
>                 Key: DRILL-3551
>                 URL: https://issues.apache.org/jira/browse/DRILL-3551
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Data Types
>    Affects Versions: 1.1.0
>            Reporter: Parth Chandra
>            Assignee: Hanifi Gunes
>            Priority: Critical
>             Fix For: 1.2.0
>
>
> The source data contains - 
> 20K rows with the following - 
> {"some":"yes","others":{"other":"true","all":"false","sometimes":"yes"}}   
> 200 rows with the following - 
> {"some":"yes","others":{"other":"true","all":"false","sometimes":"yes","additional":"last
> entries only"}}
> Creating a table and reading it back returns incorrect data - 
> CREATE TABLE testparquet as select * from `test.json`;
> SELECT * from testparquet;
> Yields 
> | yes  | {"other":"true","all":"false","sometimes":"yes"}  |
> | yes  | {"other":"true","all":"false","sometimes":"yes"}  |
> | yes  | {"other":"true","all":"false","sometimes":"yes"}  |
> | yes  | {"other":"true","all":"false","sometimes":"yes"}  |
> The "additional" field is missing in all records
> Parquet metadata for the created file does not have the 'additional' field 


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)