Return-Path: X-Original-To: apmail-incubator-drill-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-drill-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9EBD9D950 for ; Sat, 15 Sep 2012 02:27:00 +0000 (UTC) Received: (qmail 45930 invoked by uid 500); 15 Sep 2012 02:27:00 -0000 Delivered-To: apmail-incubator-drill-dev-archive@incubator.apache.org Received: (qmail 45900 invoked by uid 500); 15 Sep 2012 02:27:00 -0000 Mailing-List: contact drill-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: drill-dev@incubator.apache.org Delivered-To: mailing list drill-dev@incubator.apache.org Received: (qmail 45892 invoked by uid 99); 15 Sep 2012 02:27:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 15 Sep 2012 02:27:00 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of yangzhuoluo@gmail.com designates 209.85.216.54 as permitted sender) Received: from [209.85.216.54] (HELO mail-qa0-f54.google.com) (209.85.216.54) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 15 Sep 2012 02:26:56 +0000 Received: by qatn12 with SMTP id n12so435270qat.6 for ; Fri, 14 Sep 2012 19:26:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=EJAIxMYbiD4nY8vJTC3zEDDMMoSFRFsOQDXzofwkYcM=; b=Bz1M8l0JGOxfl8NCZZ7PoiuAllSumCwRvutDHLnjNuZUKuKckbG85fRWohNUSxLVvg 3j/tmi7YMn0neEg9t3xAs2HnnXGLG8bVWQdxBcrQwTTTQmG785i7pr/bDBWg115gVxtX 9hLZeMMjcdTuJlSfDtn/BaHh90LiGe7dSZ85hDDZ727FxkM61IbF9egIIboRg8joIuVA ekaBCQ8/8YY4dNSQ9898+RRxERi98DOqr/s/TxFXaFGuMOHqaAM4uDl600qIs3Ejn0Q7 ilW2sT/rlTg2tiPQ4il0Lk38g6mp/aoXm5kwVryhgvUJNyh/P7+JVfgjrW5u8P7SR9BT 4DVg== Received: by 10.224.186.130 with SMTP id cs2mr11825851qab.22.1347675995010; Fri, 14 Sep 2012 19:26:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.49.26.227 with HTTP; Fri, 14 Sep 2012 19:26:14 -0700 (PDT) In-Reply-To: References: <4DD0EF35-E83C-4520-BAF3-1A29E8F76D2A@gmail.com> From: =?UTF-8?B?Q2xhcmsgWWFuZyAo5p2o5Y2T6I2mKQ==?= Date: Sat, 15 Sep 2012 10:26:14 +0800 Message-ID: Subject: Re: Drill native format To: drill-dev@incubator.apache.org Content-Type: multipart/alternative; boundary=485b397dd68df476a004c9b4441f X-Virus-Checked: Checked by ClamAV on apache.org --485b397dd68df476a004c9b4441f Content-Type: text/plain; charset=ISO-8859-1 Hi I have been working on the column storage for a while. I think the most important thing for the distributed column storage is data locality on MapReduce (See the paper 4.1). That means how each horizontal partition stores in the same node to compute locally and reduce data transfer. To achieve this, the big data is usually horizontally partitioned and distributed first and vertically partitioned second. There need some strategies to do this, HDFS use "block placement policy" Cheers, Zhuoluo (Clark) Yang 2012/9/15 karthik tunga > Hi, > > This paper (http://arxiv.org/pdf/1105.4252.pdf) has column oriented (one > file per column) vs RCFile. > They use skip list and lazy record construction. > > Cheers, > Karthik > > On 14 September 2012 17:15, Amir Youssefi wrote: > > > "Nested data is not yet implemented" in BigQuery (if I recall exact words > > correctly). Quoting speaker at the BigQuery presentation at Google > > Technology User Group last week in Googleplex (intentionally not citing > > speaker's name). > > > > -ay > > > > On Sep 14, 2012, at 1:28 PM, David Gruzman > wrote: > > > > > I assume that evolution of BigQuery reflects resolution of Dremel... If > > > somebody have information on it it would be great. > > > Storage system should understand that all file comprising the > horizontal > > > partition of the table are one logical entity, and should store them > > > together / in some proximity. I agree that PAX will be much more > > > convinient. The question is - is there performance penalty of PAX vs > file > > > per column? > > > David > > > > > > On Fri, Sep 14, 2012 at 11:21 PM, Tomer Shiran > > wrote: > > > > > >> Is there any public information suggesting that Google moved away from > > >> supporting nested data? Clearly BigQuery doesn't yet allow nested > data, > > but > > >> not sure that applies to Dremel. > > >> > > >> There are challenges with one file per column. How do you ensure that > a > > >> single record is located on a single machine to avoid costly record > > >> reconstruction? > > >> > > >> On Fri, Sep 14, 2012 at 1:05 PM, David Gruzman < > david@bigdatacraft.com > > >>> wrote: > > >> > > >>> Hi All, > > >>> I would like to discuss the question of what will be native format > for > > >>> drill. Original Google dremel paper defined their hierarchical > columnar > > >>> data format. Since then > > >>> google shifted from hierarchical data format... So it is a question > if > > it > > >>> makes sense to stick with it? > > >>> If we are also moving to simple flat format we need our own format we > > >> have > > >>> to support "native". In case of Drill I would define that native > > support > > >> as > > >>> "high performance". > > >>> I think we can go to some kind of PAX format with comprehensive > > metadata > > >> in > > >>> the header, so each file is completely self contained and can be > > >> understood > > >>> and processed without any external data. > > >>> Alternative is to have single file per column. As far as I remember > > from > > >>> our OpenDremel work the main decision point is - if we can read one > > >> column > > >>> from the file without loading into node memory unnecessary data from > > >> other > > >>> columns. > > >>> With best regards, > > >>> David > > >>> > > >> > > >> > > >> > > >> -- > > >> Tomer Shiran > > >> Director of Product Management | MapR Technologies | 650-804-8657 > > >> > > > --485b397dd68df476a004c9b4441f--