Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id F39DD200BCE for ; Fri, 2 Dec 2016 22:34:52 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id F2520160B24; Fri, 2 Dec 2016 21:34:52 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 48DD9160B08 for ; Fri, 2 Dec 2016 22:34:52 +0100 (CET) Received: (qmail 29776 invoked by uid 500); 2 Dec 2016 21:34:50 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 29763 invoked by uid 99); 2 Dec 2016 21:34:50 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Dec 2016 21:34:50 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 20718CEF4F for ; Fri, 2 Dec 2016 21:34:50 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -3.021 X-Spam-Level: X-Spam-Status: No, score=-3.021 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=2, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 5usdEjEJLqtJ for ; Fri, 2 Dec 2016 21:34:49 +0000 (UTC) Received: from mail16.uhc.com (sp-internet-ext-ply.uhc.com [198.203.175.175]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id D567B5FD92 for ; Fri, 2 Dec 2016 21:34:48 +0000 (UTC) X-IronPort-AV: E=Sophos;i="5.33,288,1477976400"; d="scan'208,217";a="685658941" Received: from irmelr24.uhc.com (HELO mail25.uhc.com) ([10.177.104.170]) by mail18.uhc.com with ESMTP; 02 Dec 2016 15:34:32 -0600 X-IronPort-AV: E=Sophos;i="5.33,288,1477976400"; d="scan'208,217";a="699289553" X-CONF-FOOTER: True Received: from apsep0922.ms.ds.uhc.com ([10.114.192.99]) by mail25.uhc.com with ESMTP; 02 Dec 2016 15:34:32 -0600 Received: from APSWP0832.ms.ds.uhc.com ([fe80::914f:81e7:980c:51db]) by APSEP0922.ms.ds.uhc.com ([fe80::19b9:b26:5561:ed68%20]) with mapi id 14.03.0279.002; Fri, 2 Dec 2016 15:34:32 -0600 From: "Boudreau, Carl" To: "general@hadoop.apache.org" Subject: How do I limit queries made to my Hadoop Cluster Thread-Topic: How do I limit queries made to my Hadoop Cluster Thread-Index: AdJM4vSeoPh9MyeLQ/qlLMdAyGKndwAALCUw Date: Fri, 2 Dec 2016 21:34:33 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.175.224.122] Content-Type: multipart/alternative; boundary="_000_CD06D96EB5A22F4899649F7CB34B43AEF1E6BCAPSWP0832msdsuhcc_" MIME-Version: 1.0 X-IP-VFilter-R: R archived-at: Fri, 02 Dec 2016 21:34:53 -0000 --_000_CD06D96EB5A22F4899649F7CB34B43AEF1E6BCAPSWP0832msdsuhcc_ Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Dear Hadoop Expert, This is my first post to this group, and I am new to Hadoop, so if this is = not the correct list please excuse me. If you have a better group please l= et me know by replying directly to me. I have a challenge before me. In my Hadoop system I have data from three c= ompanies called ABC, XYZ, and 123. And because of my Business Need; all th= e records from these three companies are in the same data store. The recor= ds are randomly mixed so one record could be a record from ABC and the next= could be from XYX or 123. When I query my Hadoop system, for all records t= hat have the last name of Boudreau for data analytical work; I get all 3000= records that have the last name Boudreau. However, I also have a contract with ABC, that says I cannot aggregate thei= r records. So I need a way to apply these contract rules when the data is = queried. Please note: I have given 20 other developers access to my Hadoop= system, but I am responsible to mirage the contractual obligations for my = customers What is the best way about going about this? Can or do I write a Plug-in or modify YARN to have it check my contract rul= es prior to returning a dataset? Can or do I write a plug in for each and = every Gateway Application such as Pig, Elastic Search, MapR, etc (about 10 = applications that have access to my Hadoop system) What are other options? I have installed, configured and running Hadoop onto my local machine. I h= ave the source code also downloaded onto my machine, and I am able to dig i= nto it and compile it. Regards Carl This e-mail, including attachments, may include confidential and/or proprietary information, and may be used only by the person or entity to which it is addressed. If the reader of this e-mail is not the intended recipient or his or her authorized agent, the reader is hereby notified that any dissemination, distribution or copying of this e-mail is prohibited. If you have received this e-mail in error, please notify the sender by replying to this message and delete this e-mail immediately. --_000_CD06D96EB5A22F4899649F7CB34B43AEF1E6BCAPSWP0832msdsuhcc_--