apex-users mailing list archives

From Guilherme Hott <guilhermeh...@gmail.com>
Subject Re: BoundedDedup or TimeBasedDedup
Date Fri, 26 May 2017 04:46:56 GMT
Thank you, Bhupesh. I think this is the best thing to do.

On Thu, May 25, 2017 at 7:54 PM, Bhupesh Chawda <bhupesh@datatorrent.com>
wrote:

> Hi,
>
> If you are just de-duplicating based on a key and have a limited batch of
> transactions, then you should go with BoundedDedup.
>
> TimeBasedDedup is for cases where you want to dedup within a stream with
> expiry based on the time in your tuples.
>
> ~ Bhupesh
>
>
> _______________________________________________________
>
> Bhupesh Chawda
>
> E: bhupesh@datatorrent.com | Twitter: @bhupeshsc
>
> www.datatorrent.com  |  apex.apache.org
>
>
>
> On Thu, May 25, 2017 at 7:39 PM, Guilherme Hott <guilhermehott@gmail.com>
> wrote:
>
>> Hi everyone,
>>
>> Messages from my Kafka operator arrive on my input port, and I have to
>> process them and emit a batch of transactions to a Dedup operator.
>> Should I use BoundedDedup or TimeBasedDedup?
>>
>> Thanks
>>
>> --
>> *Guilherme Hott*
>> *Software Engineer*
>> Skype: guilhermehott
>> @guilhermehott
>> https://www.linkedin.com/in/guilhermehott
>>
>>
>
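
To illustrate the distinction drawn in the quoted reply, here is a minimal sketch in plain Java. It is not the Malhar operator code: the names DedupSketch, BoundedDedup, TimeBasedDedup, isUnique, process, Decision and expiryMillis are hypothetical and exist only for this example. The bounded variant remembers every key in a finite batch; the time-based variant forgets keys that fall more than an expiry period behind the latest tuple time it has seen, and flags tuples that arrive older than that cutoff.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Illustrative sketch only; all names here are hypothetical, not the Malhar API.
public class DedupSketch {

    /** Bounded case: the batch is finite, so every key can be remembered. */
    static class BoundedDedup {
        private final Set<String> seen = new HashSet<>();

        /** Returns true the first time a key is seen, false for a duplicate. */
        boolean isUnique(String key) {
            return seen.add(key);
        }
    }

    /** Time-based case: state expires based on the time carried in each tuple. */
    static class TimeBasedDedup {
        enum Decision { UNIQUE, DUPLICATE, EXPIRED }

        private final long expiryMillis;
        private final Map<String, Long> firstSeen = new HashMap<>();
        private long latestTime = Long.MIN_VALUE;

        TimeBasedDedup(long expiryMillis) {
            this.expiryMillis = expiryMillis;
        }

        Decision process(String key, long tupleTime) {
            // Track the most recent tuple time seen so far.
            latestTime = Math.max(latestTime, tupleTime);
            long cutoff = latestTime - expiryMillis;

            // Forget keys whose recorded time is older than the cutoff.
            firstSeen.values().removeIf(t -> t < cutoff);

            if (tupleTime < cutoff) {
                return Decision.EXPIRED;      // too old to be checked reliably
            }
            if (firstSeen.containsKey(key)) {
                return Decision.DUPLICATE;    // same key within the expiry window
            }
            firstSeen.put(key, tupleTime);
            return Decision.UNIQUE;
        }
    }

    public static void main(String[] args) {
        BoundedDedup bounded = new BoundedDedup();
        System.out.println(bounded.isUnique("txn-1"));    // true
        System.out.println(bounded.isUnique("txn-1"));    // false

        TimeBasedDedup timed = new TimeBasedDedup(1000);
        System.out.println(timed.process("txn-1", 0));    // UNIQUE
        System.out.println(timed.process("txn-1", 500));  // DUPLICATE
        System.out.println(timed.process("txn-1", 5000)); // UNIQUE, earlier state expired
        System.out.println(timed.process("txn-2", 100));  // EXPIRED
    }
}
```

The bounded variant's memory grows with the number of distinct keys, which is acceptable for a limited batch. The time-based variant keeps its state bounded on an unbounded stream, at the cost of no longer detecting duplicates that arrive further apart than the expiry period.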


-- 
*Guilherme Hott*
*Software Engineer*
Skype: guilhermehott
@guilhermehott
https://www.linkedin.com/in/guilhermehott
