Date: Sat, 11 Jul 2015 11:42:47 +0800
Subject: How to use the index in Parquet to improve the query
From: Yue Liu
To: user@hive.apache.org

Hi all,

I am using Hive 1.2.1 and storing a table as Parquet. I have the following query:

select count(1)
from lineitem
where l_quantity=1.0;

I read in the Parquet documentation that Parquet keeps min and max statistics, similar to ORC, which can be used to filter out unrelated data.

However, the RECORDS_IN counter shows the same number of records as the whole table. That is, the index in Parquet does not seem to take effect.

What could be the reason?

Thanks!
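
The min/max filtering mentioned above can be sketched conceptually as follows. This is only an illustration in plain Python, not Hive or Parquet code: the RowGroup class, the count_equal function, and the sample values are all hypothetical, and the sketch assumes a reader that consults per-row-group statistics before reading any rows.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class RowGroup:
    """Hypothetical stand-in for one row group and its column statistics."""
    min_value: float
    max_value: float
    values: List[float]


def count_equal(row_groups: List[RowGroup], target: float) -> Tuple[int, int]:
    """Count rows equal to target, skipping groups whose [min, max] excludes it.

    Returns (matching_rows, rows_actually_read).
    """
    matches = 0
    rows_read = 0
    for group in row_groups:
        # The statistics prove that no row in this group can match, so skip it.
        if target < group.min_value or target > group.max_value:
            continue
        rows_read += len(group.values)
        matches += sum(1 for v in group.values if v == target)
    return matches, rows_read


# Made-up data: three row groups holding ten rows in total.
groups = [
    RowGroup(1.0, 5.0, [1.0, 2.0, 5.0, 1.0]),
    RowGroup(10.0, 20.0, [10.0, 15.0, 20.0]),
    RowGroup(30.0, 50.0, [30.0, 42.0, 50.0]),
]

matches, rows_read = count_equal(groups, 1.0)
print(matches, rows_read)
```

In this sketch, rows_read plays the role of RECORDS_IN: it drops below the table's row count only when some row group's range excludes the target. If the values of l_quantity are spread across every row group, each group's range contains 1.0 and nothing can be skipped, even with the filter applied.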