Return-Path: X-Original-To: apmail-incubator-drill-user-archive@minotaur.apache.org Delivered-To: apmail-incubator-drill-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5B5BA10664 for ; Wed, 30 Oct 2013 09:43:29 +0000 (UTC) Received: (qmail 6514 invoked by uid 500); 30 Oct 2013 09:43:28 -0000 Delivered-To: apmail-incubator-drill-user-archive@incubator.apache.org Received: (qmail 6480 invoked by uid 500); 30 Oct 2013 09:43:28 -0000 Mailing-List: contact drill-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: drill-user@incubator.apache.org Delivered-To: mailing list drill-user@incubator.apache.org Received: (qmail 6472 invoked by uid 99); 30 Oct 2013 09:43:28 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Oct 2013 09:43:28 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mr.tom.seddon@gmail.com designates 209.85.214.46 as permitted sender) Received: from [209.85.214.46] (HELO mail-bk0-f46.google.com) (209.85.214.46) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Oct 2013 09:43:21 +0000 Received: by mail-bk0-f46.google.com with SMTP id w17so342313bkz.5 for ; Wed, 30 Oct 2013 02:43:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=XKKiyaj3sktXoOuVJZOj/03uhrgvfRIXFiZ0Apr1o9A=; b=HtxY4Esn1qJUAKdi9MEVk3V1l4ER9QhXNTv4+9dU0Ci0kIPEElzlWHypwhqOIAAlPU s7+TRdwODNwiWSIpLA7rfkFRxiYYsyBYGKmXouT2/okFsSawTshzbHRtLSwiNTnEnF0X A7P5Lbc6ROFjPUZCOcKqOoTxmRV1m0y0//QUQ32evyXkaDR/ScVTO6/WhOH7J9d9bNZT qi1jshRQTy3780vwHKQeru4ntq1NmJKUNSnKkiaK09eJWRrlcbYLCNzMAnM1t63Wg6HF 2WnVLzj+zpW5DxYmipLT2vReS8e6cOEd+pNuJWZcTJg5h+Tt+V/4ZGsj5CygDUByhRmr x7Cw== MIME-Version: 1.0 X-Received: by 10.205.40.136 with SMTP id tq8mr640574bkb.56.1383126181039; Wed, 30 Oct 2013 02:43:01 -0700 (PDT) Received: by 10.204.102.195 with HTTP; Wed, 30 Oct 2013 02:43:00 -0700 (PDT) Date: Wed, 30 Oct 2013 09:43:00 +0000 Message-ID: Subject: Distributed Drill question From: Tom Seddon To: drill-user@incubator.apache.org Content-Type: multipart/alternative; boundary=bcaec52997d5b3830e04e9f2281b X-Virus-Checked: Checked by ClamAV on apache.org --bcaec52997d5b3830e04e9f2281b Content-Type: text/plain; charset=ISO-8859-1 Hi, I would like to know more about how Drill's parallel processing of queries relates, if at all, to the parallel nature of a data source such as Hadeoop. Am I correct in thinking that if a Drill cluster is querying data from a Hadoop cluster, that the drillbits are unaware of where the data resides in HDFS, as their interaction is through the NameNode. If this is the case, how does scaling Drill out help performance if it's always having to route through the NameNode? Sorry if this is a silly question. I've tried to find the answer by reading the documentation and the mailing list, but I'm still not clear on it. Thanks, Tom --bcaec52997d5b3830e04e9f2281b--