Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 84512F118 for ; Tue, 9 Apr 2013 16:52:20 +0000 (UTC) Received: (qmail 36110 invoked by uid 500); 9 Apr 2013 16:52:15 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 36013 invoked by uid 500); 9 Apr 2013 16:52:15 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 36006 invoked by uid 99); 9 Apr 2013 16:52:15 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Apr 2013 16:52:14 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of static.void.dev@gmail.com designates 209.85.192.180 as permitted sender) Received: from [209.85.192.180] (HELO mail-pd0-f180.google.com) (209.85.192.180) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Apr 2013 16:52:10 +0000 Received: by mail-pd0-f180.google.com with SMTP id q11so3858960pdj.25 for ; Tue, 09 Apr 2013 09:51:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=xAhQoLSlx+T4YIkuDry4BEMQzrYR//z6Mec0CpVQuuo=; b=U7ydEuswM4SGqvJoJtB8VMMDElJLzl8ErOiyz1uydhkEqrTt3DYurnBmf/kYMuQfSn Q5iJBjyh2iS1l6VUYDOiG4jnKlP2GiKs1/IL2Imx0jI6vE9/QiGoZzKEwxsNoM2wzEY4 mHRmz79Yv0xieRX0sWJjL7bwLQ9v/RiJcT7a2wrWacn12TVHr/jsuHUSZPHedNRVMgFA 6D2bpTQk3awf6WIPhDUKneVh3SEGGjRDkw4Jk7gWPQ4tlJ84lAN2Ef5YqMewYCqyCBnj n9vW64pjFe86tQN2GPdxgdGG6y1/hgqNZdXv7nlv3skCk6G2+R5wO8zkT56xHWbh0ACD p5rw== X-Received: by 10.68.105.193 with SMTP id go1mr3354921pbb.13.1365526309917; Tue, 09 Apr 2013 09:51:49 -0700 (PDT) Received: from [172.16.1.74] (206-15-64-66.static.twtelecom.net. [206.15.64.66]) by mx.google.com with ESMTPS id mm9sm2843664pbc.43.2013.04.09.09.51.48 (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 09 Apr 2013 09:51:48 -0700 (PDT) Content-Type: text/plain; charset=iso-8859-1 Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\)) Subject: Re: Best format to use From: Mark In-Reply-To: <09674432-2891-4729-979A-3E41D0A7F361@gmail.com> Date: Tue, 9 Apr 2013 09:51:47 -0700 Content-Transfer-Encoding: quoted-printable Message-Id: <5FD6A36D-4F5D-4ECB-83A2-CFBFFB321FA7@gmail.com> References: <3F80E783-79EE-4D10-8E84-C5881B9D56EF@gmail.com> <09674432-2891-4729-979A-3E41D0A7F361@gmail.com> To: user@hadoop.apache.org X-Mailer: Apple Mail (2.1499) X-Virus-Checked: Checked by ClamAV on apache.org Actually, compressed sequence files may not work with Pig or Hive then = right? On Apr 9, 2013, at 9:50 AM, Mark wrote: > Forgetting Impala, what format would be best to use with daily logs?=20= >=20 > Block-compressed sequence files? >=20 > On Apr 8, 2013, at 8:12 PM, Harsh J wrote: >=20 >> Hey Mark, >>=20 >> Gzip codec creates extension .gzip, not .deflate (which is >> DeflateCodec). You may want to re-check your settings. >>=20 >> Impala questions are best resolved at its current user and developer >> community at = https://groups.google.com/a/cloudera.org/forum/#!forum/impala-user. >> Impala does currently support LZO (and also Indexed LZO) compressed >> text files however, so you may want to try that as its splittable >> (compared to Gzip ones). >>=20 >> On Tue, Apr 9, 2013 at 5:18 AM, Mark = wrote: >>> Trying to determine what the best format to use for storing daily = logs. We recently switch from snappy (.snappy) to gzip (.deflate) but = I'm wondering if there is something better? Our main clients for these = daily logs are pig and hive using an external table. We were thinking = about testing out impala but we see that it doesn't work with compressed = text files. Any suggestions? >>>=20 >>> Thanks >>=20 >>=20 >>=20 >> --=20 >> Harsh J >=20