Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 61789D76E for ; Sun, 2 Dec 2012 21:12:31 +0000 (UTC) Received: (qmail 52949 invoked by uid 500); 2 Dec 2012 21:12:29 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 52825 invoked by uid 500); 2 Dec 2012 21:12:29 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 52814 invoked by uid 99); 2 Dec 2012 21:12:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 02 Dec 2012 21:12:29 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of koobas@gmail.com designates 209.85.215.42 as permitted sender) Received: from [209.85.215.42] (HELO mail-la0-f42.google.com) (209.85.215.42) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 02 Dec 2012 21:12:22 +0000 Received: by mail-la0-f42.google.com with SMTP id s15so3481890lag.1 for ; Sun, 02 Dec 2012 13:12:01 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=GyfCPkl88IQCciDlbLiN31ow1AzkKS3VlSuLfGKiceg=; b=zEpubDg1Md+dDuQ2pZlbwOIYJg0FgZXzeSiu6BwNclLvsIhC5F9/A0//K8FXVnoB4H IVzHE6vBFDkWwwYiQOBmtBgLOFTPaPWbz/c3zxtevjj3npf6DD5yxurajxNCxLH4TNkO zhlHafkFEblWJYZjLM7bJHL0uixjHMoBaSazmRKgBrLcI7ox8DdG1AFkIvgIKDDTcTZp GC4Rg8vz4QCL8ht7vE+lWcpRCaJKV2y0kDz/2MDpJ7tngklAwD7HAXbZahs0PUs3lWij Po3WBCIs3zEgnWxghHDdRIi39JiIajewVUmplpp8Bi4FhWrI3rwOy0+F/2U81RU5YQXk 2R6w== MIME-Version: 1.0 Received: by 10.152.144.164 with SMTP id sn4mr7356952lab.57.1354482721036; Sun, 02 Dec 2012 13:12:01 -0800 (PST) Received: by 10.112.20.2 with HTTP; Sun, 2 Dec 2012 13:12:00 -0800 (PST) Date: Sun, 2 Dec 2012 16:12:00 -0500 Message-ID: Subject: Mahout Amazon EMR usage cost From: Koobas To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=e89a8f22c4af711acb04cfe515e6 X-Virus-Checked: Checked by ClamAV on apache.org --e89a8f22c4af711acb04cfe515e6 Content-Type: text/plain; charset=ISO-8859-1 I was wondering if somebody could give me a rough estimate of the cost of running Mahout on Amazon's Elastic MapReduce for a specific problem. I am working with a common case of implicit feedback. I have a simple, boolean input, i.e., user-item pairs (userID, itemID). I would like to find 50 nearest neighbors for each item. I have 10M users, 10K items, and 500M records. If anybody has any ballpark idea of the kind of cost it would take to solve the problem using EMR, I would appreciate it very much. Jacob --e89a8f22c4af711acb04cfe515e6--