Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 54C4F2009F4 for ; Thu, 26 May 2016 21:42:24 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 53541160A18; Thu, 26 May 2016 19:42:24 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 9B607160A17 for ; Thu, 26 May 2016 21:42:23 +0200 (CEST) Received: (qmail 66129 invoked by uid 500); 26 May 2016 19:42:22 -0000 Mailing-List: contact dev-help@apex.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@apex.apache.org Delivered-To: mailing list dev@apex.apache.org Received: (qmail 66118 invoked by uid 99); 26 May 2016 19:42:22 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 May 2016 19:42:22 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 35E93CED0D for ; Thu, 26 May 2016 19:42:22 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -1.346 X-Spam-Level: X-Spam-Status: No, score=-1.346 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-1.426, URIBL_SBL=4, URIBL_SBL_A=0.1] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 7sprRnBI0miq for ; Thu, 26 May 2016 19:42:20 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with SMTP id 2A6C65F36C for ; Thu, 26 May 2016 19:42:19 +0000 (UTC) Received: (qmail 66111 invoked by uid 99); 26 May 2016 19:42:19 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 May 2016 19:42:19 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 4B5CFDFBDE; Thu, 26 May 2016 19:42:19 +0000 (UTC) From: ilganeli To: dev@apex.incubator.apache.org Reply-To: dev@apex.incubator.apache.org Message-ID: Subject: [GitHub] incubator-apex-malhar pull request: [APEXCORE-146] Add ClonePartit... Content-Type: text/plain Date: Thu, 26 May 2016 19:42:19 +0000 (UTC) archived-at: Thu, 26 May 2016 19:42:24 -0000 GitHub user ilganeli opened a pull request: https://github.com/apache/incubator-apex-malhar/pull/297 [APEXCORE-146] Add ClonePartitioner **Note:** This is still marked as an Apex issue even though it should live in Malhar. Once the JIRA is updated I'll update title and reference. * Created a Clone partitioner similar to the StatelessPartitioner that assigns all data to all partitions * Added a simple unit test suite to test scale up and scale down **Questions** Is there a good place/way to add a test to verify that data is partitioned appropriately? For example, the stateless partitioner assigns keys based on the nearest power of two - is that validated anywhere? How is the serialVersionUUID generated? Does the manual assignment of ports to PartitionKeys need to happen for each Partition? There is a comment in the Partition class that by default all data is sent to all partitions. However, digging into the implementation in depth doesn't show that behavior. Instead, it seems data is assigned based on the mask associated with each Partition. This references: https://issues.apache.org/jira/browse/APEXCORE-146 You can merge this pull request into a Git repository by running: $ git pull https://github.com/ilganeli/incubator-apex-malhar APEXCORE-146 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/incubator-apex-malhar/pull/297.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #297 ---- commit 74f6fd5113a3d488015a9817790018dd63b5e8cb Author: Ilya Ganelin Date: 2016-05-26T19:39:24Z Added a new partitioner that replicates data across all partitions by default. ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---