beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jean-Baptiste Onofré (JIRA) <j...@apache.org>
Subject [jira] [Resolved] (BEAM-2488) Elasticsearch IO should read also in replica shards
Date Wed, 28 Jun 2017 08:12:00 GMT

     [ https://issues.apache.org/jira/browse/BEAM-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jean-Baptiste Onofré resolved BEAM-2488.
----------------------------------------
       Resolution: Fixed
    Fix Version/s: 2.1.0

> Elasticsearch IO should read also in replica shards
> ---------------------------------------------------
>
>                 Key: BEAM-2488
>                 URL: https://issues.apache.org/jira/browse/BEAM-2488
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-extensions
>            Reporter: Etienne Chauchot
>            Assignee: Etienne Chauchot
>             Fix For: 2.1.0
>
>
> To avoid duplication of data ElasticsearchIO reads from primary shards only and filters
out replica shards. But in reality, even if _shard-preference:shardId is set in scroll request,
ES internally load balances requests between primary and replica shards and ensures that there
will be no duplicates. Targeting all the shards and letting ES deal with replicas is better
in some corner cases like failover.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message