From dev-return-5867-archive-asf-public=cust-asf.ponee.io@gobblin.incubator.apache.org Fri Sep 13 21:45:07 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 1A67A180652 for ; Fri, 13 Sep 2019 23:45:07 +0200 (CEST) Received: (qmail 34175 invoked by uid 500); 13 Sep 2019 21:45:06 -0000 Mailing-List: contact dev-help@gobblin.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@gobblin.incubator.apache.org Delivered-To: mailing list dev@gobblin.incubator.apache.org Received: (qmail 34163 invoked by uid 99); 13 Sep 2019 21:45:06 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Sep 2019 21:45:06 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id F2D161A4986 for ; Fri, 13 Sep 2019 21:45:05 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -112.199 X-Spam-Level: X-Spam-Status: No, score=-112.199 tagged_above=-999 required=6.31 tests=[ENV_AND_HDR_SPF_MATCH=-0.5, KAM_ASCII_DIVIDERS=0.8, RCVD_IN_DNSWL_HI=-5, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_SPF_WL=-7.5, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-he-de.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id dVZ_caLtw7cf for ; Fri, 13 Sep 2019 21:45:03 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=207.244.88.153; helo=mail.apache.org; envelope-from=jira@apache.org; receiver= Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with SMTP id BB1157DD67 for ; Fri, 13 Sep 2019 21:45:02 +0000 (UTC) Received: (qmail 34114 invoked by uid 99); 13 Sep 2019 21:45:02 -0000 Received: from mailrelay1-us-west.apache.org (HELO mailrelay1-us-west.apache.org) (209.188.14.139) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Sep 2019 21:45:02 +0000 Received: from jira-he-de.apache.org (static.172.67.40.188.clients.your-server.de [188.40.67.172]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 5516AE305B for ; Fri, 13 Sep 2019 21:45:01 +0000 (UTC) Received: from jira-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira-he-de.apache.org (ASF Mail Server at jira-he-de.apache.org) with ESMTP id 627897803B2 for ; Fri, 13 Sep 2019 21:45:00 +0000 (UTC) Date: Fri, 13 Sep 2019 21:45:00 +0000 (UTC) From: "ASF GitHub Bot (Jira)" To: dev@gobblin.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Work logged] (GOBBLIN-865) Add feature that enables PK-chunking in partition MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/GOBBLIN-865?focusedWorklogId= =3D312359&page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:worklog-ta= bpanel#worklog-312359 ] ASF GitHub Bot logged work on GOBBLIN-865: ------------------------------------------ Author: ASF GitHub Bot Created on: 13/Sep/19 21:44 Start Date: 13/Sep/19 21:44 Worklog Time Spent: 10m=20 Work Description: codecov-io commented on issue #2722: GOBBLIN-865: A= dd feature that enables PK-chunking in partition URL: https://github.com/apache/incubator-gobblin/pull/2722#issuecomment-531= 069100 =20 =20 # [Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2722?src= =3Dpr&el=3Dh1) Report > Merging [#2722](https://codecov.io/gh/apache/incubator-gobblin/pull/27= 22?src=3Dpr&el=3Ddesc) into [master](https://codecov.io/gh/apache/incubator= -gobblin/commit/9bf9a882427e98e7f4ef089c4ca1bde42f4b36a3?src=3Dpr&el=3Ddesc= ) will **decrease** coverage by `0.07%`. > The diff coverage is `1.78%`. =20 [![Impacted file tree graph](https://codecov.io/gh/apache/incubator-gobb= lin/pull/2722/graphs/tree.svg?width=3D650&token=3D4MgURJ0bGc&height=3D150&s= rc=3Dpr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2722?src=3Dpr= &el=3Dtree) =20 ```diff @@ Coverage Diff @@ ## master #2722 +/- ## =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D - Coverage 45.04% 44.96% -0.08% =20 - Complexity 8739 8753 +14 =20 =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Files 1880 1884 +4 =20 Lines 70205 70454 +249 =20 Branches 7707 7730 +23 =20 =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D + Hits 31623 31680 +57 =20 - Misses 35651 35831 +180 =20 - Partials 2931 2943 +12 ``` =20 =20 | [Impacted Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2= 722?src=3Dpr&el=3Dtree) | Coverage =CE=94 | Complexity =CE=94 | | |---|---|---|---| | [...obblin/salesforce/SalesforceConfigurationKeys.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1zYWxlc2ZvcmNlL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NhbGVzZm= 9yY2UvU2FsZXNmb3JjZUNvbmZpZ3VyYXRpb25LZXlzLmphdmE=3D) | `0% <=C3=B8> (=C3= =B8)` | `0 <0> (=C3=B8)` | :arrow_down: | | [...apache/gobblin/salesforce/SalesforceExtractor.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1zYWxlc2ZvcmNlL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NhbGVzZm= 9yY2UvU2FsZXNmb3JjZUV4dHJhY3Rvci5qYXZh) | `0% <0%> (=C3=B8)` | `0 <0> (=C3= =B8)` | :arrow_down: | | [...rg/apache/gobblin/salesforce/SalesforceSource.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1zYWxlc2ZvcmNlL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NhbGVzZm= 9yY2UvU2FsZXNmb3JjZVNvdXJjZS5qYXZh) | `19.74% <5.66%> (-3.02%)` | `12 <1> (= +1)` | | | [...obblin/service/monitoring/FlowStatusGenerator.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NlcnZpY2UvbW= 9uaXRvcmluZy9GbG93U3RhdHVzR2VuZXJhdG9yLmphdmE=3D) | `82.14% <0%> (-7.15%)` = | `11% <0%> (-1%)` | | | [...apache/gobblin/runtime/local/LocalJobLauncher.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvbG= 9jYWwvTG9jYWxKb2JMYXVuY2hlci5qYXZh) | `61.81% <0%> (-2.34%)` | `5% <0%> (= =C3=B8)` | | | [...e/gobblin/runtime/locks/ZookeeperBasedJobLock.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvbG= 9ja3MvWm9va2VlcGVyQmFzZWRKb2JMb2NrLmphdmE=3D) | `63.33% <0%> (-0.72%)` | `1= 5% <0%> (=C3=B8)` | | | [...che/gobblin/hive/metastore/HiveMetaStoreUtils.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1oaXZlLXJlZ2lzdHJhdGlvbi9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvZ29iYmxpbi= 9oaXZlL21ldGFzdG9yZS9IaXZlTWV0YVN0b3JlVXRpbHMuamF2YQ=3D=3D) | `31.69% <0%> = (-0.15%)` | `12% <0%> (=C3=B8)` | | | [...e/modules/flowgraph/datanodes/fs/AdlsDataNode.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1zZXJ2aWNlL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NlcnZpY2UvbW= 9kdWxlcy9mbG93Z3JhcGgvZGF0YW5vZGVzL2ZzL0FkbHNEYXRhTm9kZS5qYXZh) | `50% <0%>= (=C3=B8)` | `2% <0%> (=C3=B8)` | :arrow_down: | | [...in/source/extractor/extract/kafka/KafkaSource.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYW= NoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYVNvdXJjZS5qY= XZh) | `0% <0%> (=C3=B8)` | `0% <0%> (=C3=B8)` | :arrow_down: | | [...org/apache/gobblin/service/FlowStatusResource.java](https://codeco= v.io/gh/apache/incubator-gobblin/pull/2722/diff?src=3Dpr&el=3Dtree#diff-Z29= iYmxpbi1yZXN0bGkvZ29iYmxpbi1mbG93LWNvbmZpZy1zZXJ2aWNlL2dvYmJsaW4tZmxvdy1jb2= 5maWctc2VydmljZS1zZXJ2ZXIvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vc2Vyd= mljZS9GbG93U3RhdHVzUmVzb3VyY2UuamF2YQ=3D=3D) | `0% <0%> (=C3=B8)` | `0% <0%= > (=C3=B8)` | :arrow_down: | | ... and [20 more](https://codecov.io/gh/apache/incubator-gobblin/pull/= 2722/diff?src=3Dpr&el=3Dtree-more) | | =20 ------ =20 [Continue to review full report at Codecov](https://codecov.io/gh/apache= /incubator-gobblin/pull/2722?src=3Dpr&el=3Dcontinue). > **Legend** - [Click here to learn more](https://docs.codecov.io/docs/c= odecov-delta) > `=CE=94 =3D absolute (impact)`, `=C3=B8 =3D not affected`, = `? =3D missing data` > Powered by [Codecov](https://codecov.io/gh/apache/incubator-gobblin/pu= ll/2722?src=3Dpr&el=3Dfooter). Last update [9bf9a88...3eb4f4d](https://code= cov.io/gh/apache/incubator-gobblin/pull/2722?src=3Dpr&el=3Dlastupdated). Re= ad the [comment docs](https://docs.codecov.io/docs/pull-request-comments). =20 =20 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. =20 For queries about this service, please contact Infrastructure at: users@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 312359) Time Spent: 4h (was: 3h 50m) > Add feature that enables PK-chunking in partition=20 > -------------------------------------------------- > > Key: GOBBLIN-865 > URL: https://issues.apache.org/jira/browse/GOBBLIN-865 > Project: Apache Gobblin > Issue Type: Task > Reporter: Alex Li > Priority: Major > Labels: salesforce > Time Spent: 4h > Remaining Estimate: 0h > > In SFDC(salesforce) connector, we have partitioning mechanisms to split a= giant query to multiple sub queries. There are 3 mechanisms: > * simple partition (equally split by time) > * dynamic pre-partition (generate=C2=A0histogram and split by row number= s) > * user specified partition (set up time range in job file) > However there are tables like Task and Contract are failing time to time = to fetch full data. > We may want to utilize PK-chunking to partition the query. > =C2=A0 > The pk-chunking doc from=C2=A0SFDC -=C2=A0[https://developer.salesforce.c= om/docs/atlas.en-us.api_asynch.meta/api_asynch/async_api_headers_enable_pk_= chunking.htm] -- This message was sent by Atlassian Jira (v8.3.2#803003)