From: Jason Altekruse
Date: Fri, 12 Dec 2014 13:21:52 -0800
Subject: Re: Drill plugin for zipped files
To: dev@drill.apache.org

Is this feature documented? A quick search of the Drill wiki for 'compressed', 'zip' and 'gzip' does not seem to show any useful results.

- Jason

On Fri, Dec 12, 2014 at 10:36 AM, Steven Phillips wrote:
>
> You want to keep the .gz extension:
>
> 0: jdbc:drill:zk=local> select * from dfs.`/drill/nation.csv.gz` limit 2;
> +------------+
> |  columns   |
> +------------+
> | ["0","ALGERIA","0"," haggle. carefully final deposits detect slyly agai"] |
> | ["1","ARGENTINA","1","al foxes promise slyly according to the regular accounts. bold requests alon"] |
> +------------+
>
> bzip2 is also supported:
>
> 0: jdbc:drill:zk=local> select * from dfs.`/drill/nation.csv.bz2` limit 2;
> +------------+
> |  columns   |
> +------------+
> | ["0","ALGERIA","0"," haggle. carefully final deposits detect slyly agai"] |
> | ["1","ARGENTINA","1","al foxes promise slyly according to the regular accounts. bold requests alon"] |
> +------------+
>
> On Fri, Dec 12, 2014 at 6:01 AM, Hmxxyy wrote:
> >
> > Could you please show an example of that? In particular, how do I define the file suffix: should I rename the data files from .csv.gz to .csv? Does it support bz2 as well?
> >
> > Thanks.
> >
> > Sent from my iPhone
> >
> > > On Dec 12, 2014, at 5:49 AM, Jim Scott wrote:
> > >
> > > With csv files (as an example) it will automatically decompress them.
> > >
> > >> On Thu, Dec 11, 2014 at 1:12 AM, ДП wrote:
> > >>
> > >> Hi!
> > >> Apache Drill is a most interesting project!
> > >> I think Apache Drill is the best project for sql-developers and managers to view and analyze logs!
> > >> But I have one disappointment. =(
> > >> Most logs are stored in zip, gzip, tar or other archive and packed formats.
> > >> How can I configure dfs storage to run the unzip/gunzip command before getting data?
> > >> Or can you add a plugin for zipped files?
> > >> ---------------
> > >> Best regards,
> > >> Dima Pl
> > >
> > >
> > > --
> > > *Jim Scott*
> > > Director, Enterprise Strategy & Architecture
> >
>
> --
> Steven Phillips
> Software Engineer
>
> mapr.com
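
Steven's advice to keep the .gz or .bz2 extension suggests that the reader picks a decompressor from the file suffix. The Python sketch below illustrates that general idea only. It is not Drill's code: the OPENERS table and the read_rows helper are hypothetical names introduced for illustration, and the assumption that the final suffix alone selects the codec is mine.

```python
import bz2
import csv
import gzip
from pathlib import Path

# Hypothetical suffix-to-opener table (an assumption for illustration,
# not taken from Drill). Unknown suffixes fall back to plain open().
OPENERS = {".gz": gzip.open, ".bz2": bz2.open}


def read_rows(path, limit=None):
    """Return CSV rows from path, decompressing based on the final suffix."""
    path = Path(path)
    opener = OPENERS.get(path.suffix, open)
    rows = []
    # All three openers accept text mode with encoding and newline arguments.
    with opener(path, "rt", encoding="utf-8", newline="") as handle:
        for row in csv.reader(handle):
            if limit is not None and len(rows) >= limit:
                break
            rows.append(row)
    return rows
```

Under these assumptions, read_rows("nation.csv.gz", limit=2) plays the role of the `limit 2` queries in the thread. Renaming the file to nation.csv would make the sketch fall back to plain open() and fail on the compressed bytes, which is why the suffix has to stay.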