Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E9EEA1158A for ; Fri, 1 Aug 2014 10:51:24 +0000 (UTC) Received: (qmail 3931 invoked by uid 500); 1 Aug 2014 10:51:23 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 3861 invoked by uid 500); 1 Aug 2014 10:51:23 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 3844 invoked by uid 99); 1 Aug 2014 10:51:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Aug 2014 10:51:23 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [80.237.132.70] (HELO wp063.webpack.hosteurope.de) (80.237.132.70) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Aug 2014 10:51:21 +0000 Received: from app03.ox.hosteurope.de ([92.51.170.10]); authenticated by wp063.webpack.hosteurope.de running ExIM with esmtpsa (TLS1.0:RSA_ARCFOUR_MD5:16) id 1XDAQK-00038u-Co; Fri, 01 Aug 2014 12:50:56 +0200 Date: Fri, 1 Aug 2014 11:50:56 +0100 (IST) From: Uli Bethke Reply-To: Uli Bethke To: user@hive.apache.org Message-ID: <414362556.104588.1406890256389.open-xchange@app03.ox.hosteurope.de> In-Reply-To: References: <1096099950.104351.1406889791005.open-xchange@app03.ox.hosteurope.de> Subject: Re: Hive: Centralized HDFS Caching MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_104587_1369284670.1406890256329" X-Priority: 3 Importance: Medium X-Mailer: Open-Xchange Mailer v7.4.2-Rev30 X-bounce-key: webpack.hosteurope.de;uli.bethke@sonra.io;1406890277;1aecdb78; X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_104587_1369284670.1406890256329 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit I am already using tez as the execution engine and used hdfs cacheadmin to pin a file to memroy. However querying that file through Hive still goes to disk. Any ideas? > On 01 August 2014 at 11:46 Nitin Pawar wrote: > > Please take a look at hive with tez as execution engine on hadoop 2.3. > > it may help you compare it with what you want to achieve > > > On Fri, Aug 1, 2014 at 4:13 PM, Uli Bethke > wrote: > > > Hi. > > > > in Hive can I make use of the centralized cache management introduced in > > Hadoop 2.3 ( > > http://hadoop.apache.org/docs/r2.3.0/hadoop-project-dist/hadoop-hdfs/CentralizedCacheManagement.html)? > > If not implemented yet, is this on the roadmap? > > > > My use case is that I want to pin a fact table that needs to be queried > > frequently into memory. > > > > Impala already supports this as per the Cloudera documentation > > http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Impala/ciiu_perf_hdfs_caching.html > > > > Thanks > > uli > > > > > > -- > Nitin Pawar > ------------------------------ Uli Bethke Sonra. Unleash the Value of your Data. Web: http://www.sonra.io Skype: uli.bethke ODI Training. Now available! http://www.odi-training.com Our ODI book on Amazon Kindle http://amzn.to/1kDMFor ------=_Part_104587_1369284670.1406890256329 MIME-Version: 1.0 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: 7bit
I am already using tez as the execution engine and used hdfs cacheadmin to pin a file to memroy. However querying that file through Hive still goes to disk.
 
Any ideas?
 
On 01 August 2014 at 11:46 Nitin Pawar <nitinpawar432@gmail.com> wrote:

Please take a look at hive with tez as execution engine on hadoop 2.3.
 
it may help you compare it with what you want to achieve 


On Fri, Aug 1, 2014 at 4:13 PM, Uli Bethke <uli.bethke@sonra.io> wrote:
Hi.
 
in Hive can I make use of the centralized cache management introduced in Hadoop 2.3 ( http://hadoop.apache.org/docs/r2.3.0/hadoop-project-dist/hadoop-hdfs/CentralizedCacheManagement.html)? If not implemented yet, is this on the roadmap?
 
My use case is that I want to pin a fact table that needs to be queried frequently into memory.
 
 
Thanks
uli


 
--
Nitin Pawar

 
------------------------------
Uli Bethke
Sonra. Unleash the Value of your Data.
Web: http://www.sonra.io
Skype: uli.bethke

ODI Training. Now available!
http://www.odi-training.com
Our ODI book on Amazon Kindle
http://amzn.to/1kDMFor
------=_Part_104587_1369284670.1406890256329--