pig-dev mailing list archives

From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-4713) Document Bloom UDF
Date Sat, 31 Oct 2015 00:32:27 GMT

    [ https://issues.apache.org/jira/browse/PIG-4713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14983689#comment-14983689 ]

Daniel Dai commented on PIG-4713:
---------------------------------

It should be in the Eval Functions section.

> Document Bloom UDF
> ------------------
>
>                 Key: PIG-4713
>                 URL: https://issues.apache.org/jira/browse/PIG-4713
>             Project: Pig
>          Issue Type: Task
>            Reporter: Rohini Palaniswamy
>              Labels: newbie
>
> Release notes of https://issues.apache.org/jira/browse/PIG-2328 should go into the Builtin
> Functions page (https://pig.apache.org/docs/r0.15.0/func.html) of the Apache Pig documentation.
> Saw one user trying to use a Bloom filter to filter data on a column other than the join
> column. That should not be done, because Bloom filters give false positives and can include
> records that don't actually match the filter criteria. This should be documented as well and
> highlighted, so that users don't try to use Bloom filters for regular filtering.
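
The false-positive concern can be illustrated with a minimal sketch. This is not Pig's Bloom or BuildBloom implementation; the TinyBloom class, the key names, and the deliberately small bit vector are all assumptions made up for illustration. It shows that a Bloom filter never drops a key that was added to it, that it may let unrelated keys through, and that a join on the same key afterwards discards those extras, whereas plain filtering would keep them.

```python
import hashlib


class TinyBloom:
    """Toy Bloom filter for illustration only (not Pig's implementation)."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = [False] * num_bits

    def _positions(self, key):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = True

    def might_contain(self, key):
        # True means "possibly present"; False means "definitely absent".
        return all(self.bits[pos] for pos in self._positions(key))


# Hypothetical data: join keys from the small relation, and a larger
# relation of (key, value) records.
small_keys = {"k1", "k2", "k3"}
large = [(f"k{i}", i) for i in range(1, 201)]

# Build the filter on the join key of the small relation. The bit vector is
# kept tiny on purpose so that false positives are likely to show up.
bloom = TinyBloom(num_bits=16, num_hashes=2)
for key in small_keys:
    bloom.add(key)

# Pre-filter the large relation on the same join key.
passed = [rec for rec in large if bloom.might_contain(rec[0])]

# Records that passed the filter but are not real matches.
false_positives = [rec for rec in passed if rec[0] not in small_keys]

# The join compares actual keys, so it removes the false positives.
joined = [rec for rec in passed if rec[0] in small_keys]
exact = [rec for rec in large if rec[0] in small_keys]

print("passed filter:", len(passed))
print("false positives:", len(false_positives))
print("rows after join:", len(joined))
```

If the filtered records were used directly as a result, without the join, whatever ends up in false_positives would be kept. That is the misuse the issue description warns about.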



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
