Return-Path: X-Original-To: apmail-drill-issues-archive@minotaur.apache.org Delivered-To: apmail-drill-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 21C0B182A0 for ; Fri, 22 May 2015 19:50:18 +0000 (UTC) Received: (qmail 96485 invoked by uid 500); 22 May 2015 19:50:18 -0000 Delivered-To: apmail-drill-issues-archive@drill.apache.org Received: (qmail 96375 invoked by uid 500); 22 May 2015 19:50:18 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 96195 invoked by uid 99); 22 May 2015 19:50:17 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 22 May 2015 19:50:17 +0000 Date: Fri, 22 May 2015 19:50:17 +0000 (UTC) From: "Steven Phillips (JIRA)" To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (DRILL-3169) gz files cannot be accessed without gz formats extension definition MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/DRILL-3169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14556721#comment-14556721 ] Steven Phillips commented on DRILL-3169: ---------------------------------------- The way the compression extension works is it is added in addition to the format extendsion. So, for example, googlebooks.tsv.gz should work. You shouldn't define the compression extension as one of the format extensions. I might add code to prevent a user from doing this. > gz files cannot be accessed without gz formats extension definition > ------------------------------------------------------------------- > > Key: DRILL-3169 > URL: https://issues.apache.org/jira/browse/DRILL-3169 > Project: Apache Drill > Issue Type: Bug > Affects Versions: 1.0.0 > Environment: Mac OS X > Reporter: Kristine Hahn > > To reproduce the problem: > 1. Put a gz file on the file system. > 2. Define a plugin with and without a gz extension. For example: > {noformat} > { > "type": "file", > "enabled": true, > "connection": "file:///", > "workspaces": { > "ngram": { > "location": "/Users/khahn/drill/apache-drill-1.0.0", > "writable": false, > "defaultInputFormat": null > } > }, > "formats": { > "tsv": { > "type": "text", > "extensions": [ > "tsv", > "gz" > ], > "delimiter": "\t" > } > } > } > {noformat} > 3. Try to query the gz file. > Expected results: success with and without the gz extension > Actual results: error without the gz extension defined in formats. > *Output--no gz extension in formats* > {noformat} > 0: jdbc:drill:zk=local> SELECT * FROM ngram.`/googlebooks.gz`; > May 22, 2015 6:06:51 AM org.apache.calcite.sql.validate.SqlValidatorException > SEVERE: org.apache.calcite.sql.validate.SqlValidatorException: Table 'ngram./googlebooks.gz' not found > May 22, 2015 6:06:51 AM org.apache.calcite.runtime.CalciteException > SEVERE: org.apache.calcite.runtime.CalciteContextException: From line 1, column 16 to line 1, column 20: Table 'ngram./googlebooks.gz' not found > Error: PARSE ERROR: From line 1, column 16 to line 1, column 20: Table 'ngram./googlebooks.gz' not found > [Error Id: 28f38441-81a0-4167-afad-86a8169d383b on 172.30.1.90:31010] (state=,code=0) > {noformat} > *Output with gz extension defined in formats* > {noformat} > 0: jdbc:drill:zk=local> SELECT * FROM ngram.`/googlebooks.gz`; > +-----------------------------------------------------------------------+ > | columns | > +-----------------------------------------------------------------------+ > | ["ZOCOR should be taken with","2002","7","5"] | > | ["ZOCOR should be taken with","2003","12","12"] | > . . . > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)