Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3C62FD188 for ; Thu, 11 Oct 2012 19:53:32 +0000 (UTC) Received: (qmail 21742 invoked by uid 500); 11 Oct 2012 19:53:27 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 21670 invoked by uid 500); 11 Oct 2012 19:53:27 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 21663 invoked by uid 99); 11 Oct 2012 19:53:27 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Oct 2012 19:53:27 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of markq2011@gmail.com designates 209.85.212.182 as permitted sender) Received: from [209.85.212.182] (HELO mail-wi0-f182.google.com) (209.85.212.182) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Oct 2012 19:53:20 +0000 Received: by mail-wi0-f182.google.com with SMTP id hm2so2251709wib.11 for ; Thu, 11 Oct 2012 12:53:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=oHmKL9ckiWp5zL2nVZ28O1XKWf66rtyl/8467gF/fag=; b=lk6AnV41a7YVi/UTfcCeMqyl4ZM0UeEcXYwWOOO4+iUHYF5khfJp13H8XKv/Ec6KJM heVoDqe/894B+zDOTJT9PhvMZMsxMpLczTeC9E/t8lfVKpUFpL2PIPBeDGIBo7gGFaj1 OPx+pQMxvaTrlcWdNyrQeElpbdVJGEcuCd0B+I2JAS6P/+zHN63IGb5rUmWytvIlz0qO iqfI6jvnSdpHOIAm40uHs2TYyB7f7EPnRrYGhN+Zk15HXAZQYlwhsK9x/1wwrYboIwCu GMjJRc8TO9vJFl5eQoLArttmoaklZsqHbdYnw5KRgDmdkSzQ/oaxlO8AiGl128fBtX40 X/Cw== MIME-Version: 1.0 Received: by 10.180.81.37 with SMTP id w5mr322353wix.10.1349985180750; Thu, 11 Oct 2012 12:53:00 -0700 (PDT) Received: by 10.223.170.198 with HTTP; Thu, 11 Oct 2012 12:53:00 -0700 (PDT) In-Reply-To: References: Date: Thu, 11 Oct 2012 12:53:00 -0700 Message-ID: Subject: Re: Reading Sequence File from Hadoop Distributed Cache .. From: Mark Olimpiati To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=f46d04428dc02692ec04cbcdebdc --f46d04428dc02692ec04cbcdebdc Content-Type: text/plain; charset=ISO-8859-1 Thanks for the reply Harsh, but as I said I tried locally too by using the following: FileSystem localFs = cachedFiles[0].getFileSystem(new Configuration()); Isn't the above supposed to give me the local file system ?? If yes, I tried that but didn't work, if not, can you show one example? Thank you, Mark On Wed, Oct 10, 2012 at 7:33 PM, Harsh J wrote: > Hi Mark, > > DistributedCache files, when accessed from a Task, exist on the local > file system. You should make sure the SequenceFile.Reader tries to > read it with a LocalFS than a HDFS instance. > > On Thu, Oct 11, 2012 at 5:15 AM, Mark Olimpiati > wrote: > > Hi, > > > > I'm storing sequence files in the distributed cache which seems to be > > stored somewher under each node's /tmp .../local/archive/ ... path. > > > > In mapper code, I tried using SequenceFile.Reader with all possible > > configurations (locally, distribtued) however, it can't find it. Are > > sequence files supported in the distribued cache?? > > > > Thank you, > > Mark > > > > -- > Harsh J > --f46d04428dc02692ec04cbcdebdc Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Thanks for the reply Harsh, but as I said I tried locally too by using the = following:

=A0=A0=A0=A0 FileSystem localFs =3D cachedFiles[0].getFil= eSystem(new Configuration());


Isn't the above supposed to g= ive me the local file system ?? If yes, I tried that but didn't work, i= f not, can you show one example?

Thank you,
Mark

On Wed, Oct 10, 20= 12 at 7:33 PM, Harsh J <harsh@cloudera.com> wrote:
Hi Mark,

DistributedCache files, when accessed from a Task, exist on the local
file system. You should make sure the SequenceFile.Reader tries to
read it with a LocalFS than a HDFS instance.

On Thu, Oct 11, 2012 at 5:15 AM, Mark Olimpiati <markq2011@gmail.com> wrote:
> Hi,
>
> =A0 I'm storing sequence files in the distributed cache which seem= s to be
> stored somewher under each node's /tmp .../local/archive/ ... path= .
>
> In mapper code, I tried using SequenceFile.Reader with all possible > configurations (locally, distribtued) however, it can't find it. A= re
> sequence files supported in the distribued cache??
>
> Thank you,
> Mark



--
Harsh J

--f46d04428dc02692ec04cbcdebdc--