Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id ED093200CC6 for ; Tue, 18 Jul 2017 17:45:04 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id EB8B21634FF; Tue, 18 Jul 2017 15:45:04 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3D225162EF7 for ; Tue, 18 Jul 2017 17:45:04 +0200 (CEST) Received: (qmail 79650 invoked by uid 500); 18 Jul 2017 15:45:03 -0000 Mailing-List: contact commits-help@beam.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@beam.apache.org Delivered-To: mailing list commits@beam.apache.org Received: (qmail 79641 invoked by uid 99); 18 Jul 2017 15:45:03 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Jul 2017 15:45:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id DC9671805B5 for ; Tue, 18 Jul 2017 15:45:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id o29mD7BLQtrZ for ; Tue, 18 Jul 2017 15:45:02 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 5079260CD9 for ; Tue, 18 Jul 2017 15:45:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 8EE5AE0D85 for ; Tue, 18 Jul 2017 15:45:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 4760421E92 for ; Tue, 18 Jul 2017 15:45:00 +0000 (UTC) Date: Tue, 18 Jul 2017 15:45:00 +0000 (UTC) From: "ASF GitHub Bot (JIRA)" To: commits@beam.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (BEAM-2439) Datastore writer can fail to progress if Datastore is slow MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 18 Jul 2017 15:45:05 -0000 [ https://issues.apache.org/jira/browse/BEAM-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091737#comment-16091737 ] ASF GitHub Bot commented on BEAM-2439: -------------------------------------- GitHub user cph6 opened a pull request: https://github.com/apache/beam/pull/3585 [BEAM-2439] Dynamic sizing of Datastore write RPCs. This implements the same behaviour recently added to Java SDK: - start at 200 entities per RPC; - size subsequent requests based on observed latency of previous requests. Includes a MovingSum class to track recent latency. Report RPC success & failure counts as metrics (again, as in the Java SDK). R: @vikkyrk R: @ssisk You can merge this pull request into a Git repository by running: $ git pull https://github.com/cph6/beam datastore_batching_py Alternatively you can review and apply these changes as the patch at: https://github.com/apache/beam/pull/3585.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #3585 ---- ---- > Datastore writer can fail to progress if Datastore is slow > ---------------------------------------------------------- > > Key: BEAM-2439 > URL: https://issues.apache.org/jira/browse/BEAM-2439 > Project: Beam > Issue Type: Bug > Components: sdk-java-gcp > Reporter: Colin Phipps > Assignee: Colin Phipps > Priority: Minor > Labels: datastore > Fix For: 2.1.0 > > > When writing to Datastore, Beam groups writes into large batches (usually 500 entities per write, the maximum permitted by the API). If these writes are slow to commit on the serving side, the request may time out before all of the entities are written. > When this happens, it loses any progress that has been made on those entities (the connector uses non-transactional writes, so some entities might have been written, but partial results are not returned to the connector so it has to assume that all entities need rewriting). It will retry the write with the same set of entities, which may time out in the same way repeatedly. This can be influenced by factors on the Datastore serving side, some of which are transient (hotspots) but some of which are not. > We (Datastore) are developing a fix for this. -- This message was sent by Atlassian JIRA (v6.4.14#64029)