Return-Path: Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: (qmail 73810 invoked from network); 12 Apr 2010 15:36:10 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 12 Apr 2010 15:36:10 -0000 Received: (qmail 15664 invoked by uid 500); 12 Apr 2010 15:36:09 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 15633 invoked by uid 500); 12 Apr 2010 15:36:09 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 15625 invoked by uid 99); 12 Apr 2010 15:36:09 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 12 Apr 2010 15:36:09 +0000 X-ASF-Spam-Status: No, hits=1.8 required=10.0 tests=AWL,FREEMAIL_FROM,HK_RANDOM_ENVFROM,HK_RANDOM_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of zjffdu@gmail.com designates 209.85.223.198 as permitted sender) Received: from [209.85.223.198] (HELO mail-iw0-f198.google.com) (209.85.223.198) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 12 Apr 2010 15:36:03 +0000 Received: by iwn36 with SMTP id 36so4684554iwn.29 for ; Mon, 12 Apr 2010 08:35:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:received:message-id:subject:from:to:content-type; bh=g8lwPWbqum0F/sLLOJHygt6HjlYRsUwz5mSRSTzOceA=; b=BtsUic0yXPlFKdd6TVXTxe5txXoTeAjkTI/RgHnNs+mHhAbeD3/FTb1suftsnwhKrd /nxEwvcC1ZeR0nWxsqTp///psJ1E61eOvBPtX2evQfVakt4cDZxW/Uk9NEVs1lunkLQL 534bdRx9lna1VGaLSb0uKJLlZCB2KMYICcSZo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=aMuGiE2TdqRBGbxn9fJNojpRh6ktXlbcPB0bOrwWIWjFezD7ylgj7zZBmfz7wx/8rU 9yxf8zyhed3So0RUP1Yfmk35b+uaoyAPwP/XcmohIfmCFhSvkd56vVzMrTbzirRskysm xohmLw2LarSgN4VXzMVZt04LwHlETdL+dMEzQ= MIME-Version: 1.0 Received: by 10.142.212.19 with HTTP; Mon, 12 Apr 2010 08:35:42 -0700 (PDT) In-Reply-To: References: Date: Mon, 12 Apr 2010 08:35:42 -0700 Received: by 10.142.2.22 with SMTP id 22mr1803865wfb.13.1271086542117; Mon, 12 Apr 2010 08:35:42 -0700 (PDT) Message-ID: Subject: Re: Configured & PathFilter From: Jeff Zhang To: mapreduce-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=00504502ad41d2a2c104840be5c4 --00504502ad41d2a2c104840be5c4 Content-Type: text/plain; charset=UTF-8 Hi Kris, Do you mean you want to use the PathFilter in map or reduce task ? Or you mean using the PathFilter in InputFormat ? I guess you mean the second case, if so you only need to call FileInputFormat.setInputPathFilter(,) to provide the filter information. On Mon, Apr 12, 2010 at 8:13 AM, Kris Nuttycombe wrote: > Hi, all, quick question about using PathFilter. > > Is there any way to provide information from the job configuration to > a PathFilter instance? In my case, I want to limit the date range of > the files being selected by the filter, and don't want to have to > hard-code a separate PathFilter instance for each date range I'm > interested in, obviously. If I make my PathFilter extend Configured, > will it do the right thing? > > Thanks! > > Kris > -- Best Regards Jeff Zhang --00504502ad41d2a2c104840be5c4 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable =C2=A0Hi Kris,

Do you mean you want to use the PathFilter in map or = reduce task ? Or you mean using the PathFilter in InputFormat ?
I guess = you mean the second case, if so you only need to call FileInputFormat.setIn= putPathFilter(,) to provide the filter information.


On Mon, Apr 12, 2010 at 8:13 AM, Kris Nu= ttycombe <kris.nuttycombe@gmail.com> wrote:
Hi, all, quick question about using PathFilter.

Is there any way to provide information from the job configuration to
a PathFilter instance? In my case, I want to =C2=A0limit the date range of<= br> the files being selected by the filter, and don't want to have to
hard-code a separate PathFilter instance for each date range I'm
interested in, obviously. If I make my PathFilter extend Configured,
will it do the right thing?

Thanks!

Kris



--
Best Regards
=
Jeff Zhang
--00504502ad41d2a2c104840be5c4--