Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 31E6C18567 for ; Mon, 1 Jun 2015 15:24:28 +0000 (UTC) Received: (qmail 68523 invoked by uid 500); 1 Jun 2015 15:24:20 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 68412 invoked by uid 500); 1 Jun 2015 15:24:20 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 68402 invoked by uid 99); 1 Jun 2015 15:24:20 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2015 15:24:20 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id EAEFACA6CC for ; Mon, 1 Jun 2015 15:24:19 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.001 X-Spam-Level: * X-Spam-Status: No, score=1.001 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id aFnUpTkaqr0f for ; Mon, 1 Jun 2015 15:24:08 +0000 (UTC) Received: from gateway32.websitewelcome.com (gateway32.websitewelcome.com [192.185.145.111]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 90C33428E3 for ; Mon, 1 Jun 2015 15:24:08 +0000 (UTC) Received: by gateway32.websitewelcome.com (Postfix, from userid 500) id 45CDA212DF2B9; Mon, 1 Jun 2015 10:24:02 -0500 (CDT) Received: from gator4106.hostgator.com (gator4106.hostgator.com [192.185.4.118]) by gateway32.websitewelcome.com (Postfix) with ESMTP id 43888212DF29F for ; Mon, 1 Jun 2015 10:24:02 -0500 (CDT) Received: from [89.216.116.44] (port=56099 helo=[192.168.1.156]) by gator4106.hostgator.com with esmtpsa (TLSv1:DHE-RSA-AES128-SHA:128) (Exim 4.82) (envelope-from ) id 1YzRZJ-0003K2-Ol for user@hadoop.apache.org; Mon, 01 Jun 2015 10:24:02 -0500 Message-ID: <556C790E.5090901@nissatech.com> Date: Mon, 01 Jun 2015 17:23:58 +0200 From: Marko Dinic User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.4.0 MIME-Version: 1.0 To: user@hadoop.apache.org Subject: Streaming K-medoids Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - gator4106.hostgator.com X-AntiAbuse: Original Domain - hadoop.apache.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - nissatech.com X-BWhitelist: no X-Source-IP: 89.216.116.44 X-Exim-ID: 1YzRZJ-0003K2-Ol X-Source: X-Source-Args: X-Source-Dir: X-Source-Sender: ([192.168.1.156]) [89.216.116.44]:56099 X-Source-Auth: marko.dinic@nissatech.com X-Email-Count: 1 X-Source-Cap: bmlzc2E7bmlzc2E7Z2F0b3I0MTA2Lmhvc3RnYXRvci5jb20= Hello everyone, I have an idea and I would like to get a validation from community about it. In Mahout there is an implementation of Streaming K-means. I'm interested in your opinion would it make sense to make a similar implementation of Streaming K-medoids? K-medoids has even bigger problems than K-means because it's not scalable, but can be useful in some cases (e.g. It allows more sophisticated distance measures). What is your opinion about such an approach? Does anyone see problems with it? I already implemented K-medoids using this approach https://seer.lcc.ufmg.br/index.php/jidm/article/viewFile/99/82 but I now have a problem with a distance measure that does not allow projections, so I came up to the idea to implement it in a similar way as Streaming K-medoids. Best regards, Marko