pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Satish Subhashrao Saley (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (PIG-5342) Add setting to turn off combiner
Date Wed, 13 Jun 2018 19:10:00 GMT

     [ https://issues.apache.org/jira/browse/PIG-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Satish Subhashrao Saley updated PIG-5342:
-----------------------------------------
    Description: 
1) Need a new setting pig.bloomjoin.nocombiner to turn off combiner for bloom join. When the
keys are all unique, the combiner is unnecessary overhead.
2) Mention in documentation that bloom join is also ideal in cases of right outer join with
smaller dataset on the right. Replicate join only supports left outer join.

 

> Add setting to turn off combiner
> --------------------------------
>
>                 Key: PIG-5342
>                 URL: https://issues.apache.org/jira/browse/PIG-5342
>             Project: Pig
>          Issue Type: Sub-task
>            Reporter: Satish Subhashrao Saley
>            Assignee: Satish Subhashrao Saley
>            Priority: Major
>
> 1) Need a new setting pig.bloomjoin.nocombiner to turn off combiner for bloom join. When
the keys are all unique, the combiner is unnecessary overhead.
> 2) Mention in documentation that bloom join is also ideal in cases of right outer join
with smaller dataset on the right. Replicate join only supports left outer join.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message