Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 1A42A200B38 for ; Thu, 23 Jun 2016 19:43:19 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 19187160A35; Thu, 23 Jun 2016 17:43:19 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 87900160A59 for ; Thu, 23 Jun 2016 19:43:18 +0200 (CEST) Received: (qmail 59088 invoked by uid 500); 23 Jun 2016 17:43:16 -0000 Mailing-List: contact dev-help@pig.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@pig.apache.org Delivered-To: mailing list dev@pig.apache.org Received: (qmail 59062 invoked by uid 500); 23 Jun 2016 17:43:16 -0000 Delivered-To: apmail-hadoop-pig-dev@hadoop.apache.org Received: (qmail 59058 invoked by uid 99); 23 Jun 2016 17:43:16 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Jun 2016 17:43:16 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 5F3252C1F68 for ; Thu, 23 Jun 2016 17:43:16 +0000 (UTC) Date: Thu, 23 Jun 2016 17:43:16 +0000 (UTC) From: "Daniel Dai (JIRA)" To: pig-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (PIG-4925) Support for passing the bloom filter to the Bloom UDF MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 23 Jun 2016 17:43:19 -0000 [ https://issues.apache.org/jira/browse/PIG-4925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15346845#comment-15346845 ] Daniel Dai commented on PIG-4925: --------------------------------- +1 > Support for passing the bloom filter to the Bloom UDF > ----------------------------------------------------- > > Key: PIG-4925 > URL: https://issues.apache.org/jira/browse/PIG-4925 > Project: Pig > Issue Type: New Feature > Reporter: Rohini Palaniswamy > Assignee: Rohini Palaniswamy > Fix For: 0.17.0 > > Attachments: PIG-4925-1.patch > > > Currently the Bloom Filter from BuildBloom has to be stored to HDFS to be able to be used in Bloom UDF. Most of the time the bloom filter is not reused and so have to be deleted after the end of the script. The load/store also forces multiple DAGs. If it was passed as a scalar, then it would be simpler and more efficient. -- This message was sent by Atlassian JIRA (v6.3.4#6332)