Return-Path: X-Original-To: apmail-mahout-dev-archive@www.apache.org Delivered-To: apmail-mahout-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C7B5ADB81 for ; Wed, 13 Feb 2013 11:13:44 +0000 (UTC) Received: (qmail 32689 invoked by uid 500); 13 Feb 2013 11:13:44 -0000 Delivered-To: apmail-mahout-dev-archive@mahout.apache.org Received: (qmail 32631 invoked by uid 500); 13 Feb 2013 11:13:43 -0000 Mailing-List: contact dev-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mahout.apache.org Delivered-To: mailing list dev@mahout.apache.org Received: (qmail 32610 invoked by uid 99); 13 Feb 2013 11:13:42 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Feb 2013 11:13:42 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dangeorge.filimon@gmail.com designates 74.125.82.51 as permitted sender) Received: from [74.125.82.51] (HELO mail-wg0-f51.google.com) (74.125.82.51) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Feb 2013 11:13:34 +0000 Received: by mail-wg0-f51.google.com with SMTP id 8so848362wgl.6 for ; Wed, 13 Feb 2013 03:13:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=ix5mnJji2AQ/dlSKeTJl28gTv8gK/i2zHbSrZ9FFG58=; b=BthKtfhw/rhE/TkQqvP8SSKHBHGE/aWph8iJogBolA+pwD2DndLJWX3pI8vp7Fe20a CdX6qkb/Ntf5a7lLjgH9FD8h2E5Z3djUpFt8F5APGlpfQ4XLHkWnkVlfaxbexv7PljZ4 3C78NaGmtMGPob+q6lp4lmEpWRd3ORrOAwgmRiF5ca6VYbqo/fHn2V/5Mg9BAdg2Ppdv mzK86zC23DvFSJq9nWUAe+cJsYfoCLPQoGO/tQ2aa9qEL8PNul8Ctv4sa5GGLsTVssFd 8qe/yEE6pQ4ki0kvC6kfMFnejkx1LbRsFXxvfcvYtSP4X4j+JwH5eMZ1An+xpqDRPb4u qFnA== X-Received: by 10.180.14.166 with SMTP id q6mr9210114wic.22.1360753994450; Wed, 13 Feb 2013 03:13:14 -0800 (PST) MIME-Version: 1.0 Received: by 10.194.138.146 with HTTP; Wed, 13 Feb 2013 03:12:34 -0800 (PST) In-Reply-To: References: From: Dan Filimon Date: Wed, 13 Feb 2013 13:12:34 +0200 Message-ID: Subject: Re: Accessing the local filesystem from AbstractJob To: dev@mahout.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org I see. Well, my use case was wanting to run the job on one machine, being lazy and not wanting to put the files on HDFS. :) On Tue, Feb 12, 2013 at 8:27 PM, Sean Owen wrote: > Yes because the input path is something processed by the jobtracker and > later the tasktrackers themselves, which won't be on your machine > (necessarily). > > Mappers can read the local file system but it's not clear what may or may > not be there. Consider the distributed cache for smallish data. > > > On Tue, Feb 12, 2013 at 7:05 PM, Dan Filimon wrote: > >> When creating my own job driver, I'm unable to give it any inputs from >> the local file system. An exception gets thrown when starting the job >> (and trying to get the splits). >> Apparently the files have to be on HDFS. >> >> Is there any way around this (ideally, I'd like it to first look for >> the file on the local file system and if no file is found, look at >> HDFS)? >>