Return-Path: X-Original-To: apmail-incubator-drill-user-archive@minotaur.apache.org Delivered-To: apmail-incubator-drill-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9FFC61008B for ; Sun, 25 Aug 2013 20:25:31 +0000 (UTC) Received: (qmail 76915 invoked by uid 500); 25 Aug 2013 20:25:31 -0000 Delivered-To: apmail-incubator-drill-user-archive@incubator.apache.org Received: (qmail 76746 invoked by uid 500); 25 Aug 2013 20:25:30 -0000 Mailing-List: contact drill-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: drill-user@incubator.apache.org Delivered-To: mailing list drill-user@incubator.apache.org Received: (qmail 76738 invoked by uid 99); 25 Aug 2013 20:25:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Aug 2013 20:25:30 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mr.tom.seddon@gmail.com designates 209.85.214.42 as permitted sender) Received: from [209.85.214.42] (HELO mail-bk0-f42.google.com) (209.85.214.42) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Aug 2013 20:25:23 +0000 Received: by mail-bk0-f42.google.com with SMTP id my10so886538bkb.29 for ; Sun, 25 Aug 2013 13:25:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=3pjx6R5ZmsSr/ulD/yHg4tc72V6paBVXaoS1QmaSkBw=; b=by8427ABSebMQbomeRM8w5PLAh87DAPBpV/67W4m2KPW4PSiuvHptt5a19aGG5G1n7 6LufQjjEJ0wxawCLU6Eid/VDFLEBkyljDxYdliksvozXUoYBRZvX95dsd5bWWlbRa8i+ 5tCvTmXTHTf/N26tB2SjM2hIEgHmfpY9uABrGuiFThx82fz+YYYM3hZojTyFG6F8XaEx wn53QU4uNjPi4ENMLN96zeqBwINCkmaD9iM9bDgEUwhvp1GaR42DFi3rLyZtX8vmLahu UCNkeQMStoUja7ZKEjA1CtD/0hBK+c6rscI5jBE6C8Q3WNWgtOER2kxVfTU34hJ5yOWs WoTg== MIME-Version: 1.0 X-Received: by 10.204.224.77 with SMTP id in13mr2597581bkb.24.1377462303474; Sun, 25 Aug 2013 13:25:03 -0700 (PDT) Received: by 10.204.122.131 with HTTP; Sun, 25 Aug 2013 13:25:03 -0700 (PDT) Date: Sun, 25 Aug 2013 21:25:03 +0100 Message-ID: Subject: Drill Masters Project From: Tom Seddon To: drill-user@incubator.apache.org Content-Type: multipart/alternative; boundary=485b3970ceac4a878904e4cb6fc1 X-Virus-Checked: Checked by ClamAV on apache.org --485b3970ceac4a878904e4cb6fc1 Content-Type: text/plain; charset=ISO-8859-1 Hi, I'm looking to do a dissertation on Drill, as part of masters degree in Data Science. I'm hoping to set up a cluster to run it and then analyse its efficiency with different datasets, as well as make recommendations for its usage. I know Drill is in a fairly early stage of development but I have around 18 months until the project is due, so I'm hoping the timing will work as Drill is developed further. I'd be grateful for any advice on how I could get started on this. Would a Hadoop cluster be a good back-end to base my project on or would something more suited to nested data like MongoDB be more appropriate? Also, I haven't found much documentation on configuring Drill in a distributed environment, so any help on this would be appreciated. I'd also be willing to contribute but not sure if I have enough Java experience. My background is mainly in BI and database technologies. Thanks, Tom --485b3970ceac4a878904e4cb6fc1--