Date: Tue, 21 Oct 2014 00:15:35 +0000 (UTC)
From: "Abhishek Girish (JIRA)"
To: issues@drill.incubator.apache.org
Reply-To: dev@drill.incubator.apache.org
Subject: [jira] [Updated] (DRILL-1559) Writing to JSON from Parquet throws error when the Parquet file is created from JSON

[
https://issues.apache.org/jira/browse/DRILL-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Abhishek Girish updated DRILL-1559:
-----------------------------------
    Summary: Writing to JSON from Parquet throws error when the Parquet file is created from JSON  (was: Writing to JSON from Parquet fails when the Parquet file is created from JSON)

> Writing to JSON from Parquet throws error when the Parquet file is created from JSON
> ------------------------------------------------------------------------------------
>
>                 Key: DRILL-1559
>                 URL: https://issues.apache.org/jira/browse/DRILL-1559
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - JSON
>            Reporter: Abhishek Girish
>            Assignee: Steven Phillips
>
> Succeeds:
> > alter session set `store.format` = 'parquet';
> +------------+------------+
> | ok         | summary    |
> +------------+------------+
> | true       | store.format updated. |
> +------------+------------+
> 1 row selected (0.038 seconds)
> > create table `yelp_academic_dataset_review_parquet` as select * from `yelp_academic_dataset_review.json`;
> +------------+---------------------------+
> | Fragment   | Number of records written |
> +------------+---------------------------+
> | 0_0        | 1125458                   |
> +------------+---------------------------+
> 1 row selected (163.893 seconds)
> $ hadoop fs -ls /jsondata/yelp_academic_dataset_review_parquet
> Found 2 items
> -rwxr-xr-x   3 mapr mapr  535544902 2014-10-20 17:08 /jsondata/yelp_academic_dataset_review_parquet/0_0_0.parquet
> -rwxr-xr-x   3 mapr mapr   29696406 2014-10-20 17:09 /jsondata/yelp_academic_dataset_review_parquet/0_0_1.parquet
>
> Fails:
> > alter session set `store.format` = 'json';
> +------------+------------+
> | ok         | summary    |
> +------------+------------+
> | true       | store.format updated. |
> +------------+------------+
> 1 row selected (0.033 seconds)
> > create table `yelp_academic_dataset_review_json` as select * from yelp_academic_dataset_review_parquet;
> Query failed: Failure while running fragment. Schema is currently null.  You must call buildSchema(SelectionVectorMode) before this container can return a schema. [b96dc570-77f2-46db-b9e6-8215e2062b15]
> $ hadoop fs -ls /jsondata/yelp_academic_dataset_review_json
> Found 2 items
> -rwxr-xr-x   3 root root   54493549 2014-10-20 17:10 /jsondata/yelp_academic_dataset_review_json/1_0_0.json
> -rwxr-xr-x   3 mapr mapr   37305528 2014-10-20 17:10 /jsondata/yelp_academic_dataset_review_json/1_1_0.json
>
> LOG entry:
> 2014-10-20 17:10:47,785 [cbccfeb9-a235-4ea7-9bcc-56d35daf4827:frag:1:0] ERROR o.a.d.e.w.f.AbstractStatusReporter - Error de3eb523-3924-4941-8cf4-eb7a71a2df2d: Failure while running fragment.
> java.lang.NullPointerException: Schema is currently null.  You must call buildSchema(SelectionVectorMode) before this container can return a schema.
>         at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:208) ~[guava-14.0.1.jar:na]
>         at org.apache.drill.exec.record.VectorContainer.getSchema(VectorContainer.java:220) ~[drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at org.apache.drill.exec.record.AbstractRecordBatch.getSchema(AbstractRecordBatch.java:115) ~[drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.getSchema(IteratorValidatorBatchIterator.java:74) ~[drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at org.apache.drill.exec.physical.impl.SingleSenderCreator$SingleSenderRootExec.innerNext(SingleSenderCreator.java:101) ~[drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:57) ~[drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:104) ~[drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at org.apache.drill.exec.work.WorkManager$RunnableWrapper.run(WorkManager.java:250) [drill-java-exec-0.6.0-incubating-SNAPSHOT-rebuffed.jar:0.6.0-incubating-SNAPSHOT]
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_65]
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_65]
>         at java.lang.Thread.run(Thread.java:745) [na:1.7.0_65]

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)