Return-Path: X-Original-To: apmail-flink-issues-archive@minotaur.apache.org Delivered-To: apmail-flink-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D48F918272 for ; Mon, 2 Nov 2015 17:42:18 +0000 (UTC) Received: (qmail 3737 invoked by uid 500); 2 Nov 2015 17:42:18 -0000 Delivered-To: apmail-flink-issues-archive@flink.apache.org Received: (qmail 3689 invoked by uid 500); 2 Nov 2015 17:42:18 -0000 Mailing-List: contact issues-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list issues@flink.apache.org Received: (qmail 3680 invoked by uid 99); 2 Nov 2015 17:42:18 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Nov 2015 17:42:18 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 5C2AFC094C for ; Mon, 2 Nov 2015 17:42:18 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.971 X-Spam-Level: X-Spam-Status: No, score=0.971 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, T_RP_MATCHES_RCVD=-0.01, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id EHykj8TvZukO for ; Mon, 2 Nov 2015 17:42:10 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with SMTP id A013023123 for ; Mon, 2 Nov 2015 17:42:09 +0000 (UTC) Received: (qmail 3555 invoked by uid 99); 2 Nov 2015 17:42:08 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Nov 2015 17:42:08 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 9D959E0544; Mon, 2 Nov 2015 17:42:08 +0000 (UTC) From: chiwanpark To: issues@flink.incubator.apache.org Reply-To: issues@flink.incubator.apache.org References: In-Reply-To: Subject: [GitHub] flink pull request: [FLINK-1745] Add exact k-nearest-neighbours al... Content-Type: text/plain Message-Id: <20151102174208.9D959E0544@git1-us-west.apache.org> Date: Mon, 2 Nov 2015 17:42:08 +0000 (UTC) Github user chiwanpark commented on the pull request: https://github.com/apache/flink/pull/1220#issuecomment-153096308 I would suggest logic like following: ```scala val useQuadTree = ~~~ if (useQuadTree) { knnQueryWithQuadTree(training, testing, out) } else { knnQueryBasic(training, testing, out) } ``` Or to reduce duplicated code in L257-L266, we can use following: ```scala val useQuadTree = ~~~ val quadTree: Option[QuadTree] = if (useQuadTree) { Some(buildQuadTree(training, testing)) } else { None } for (a <- testing.values) { val trainingFiltered: Seq[Vector] = quadTree match { case Some(tree) => getSibilingsFromQuadTree(a, tree) case None => training.values } for (b <- trainingFiltered) { // (training vector, input vector, input key, distance) queue.enqueue((b, a._2, a._1, metric.distance(b, a._2))) if (queue.size > k) { queue.dequeue() } } for (v <- queue) { out.collect(v) } } ``` In this case, we create methods about quadtree operation. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---