Return-Path: Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: (qmail 28743 invoked from network); 4 Mar 2011 18:13:28 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Mar 2011 18:13:28 -0000 Received: (qmail 36263 invoked by uid 500); 4 Mar 2011 18:13:28 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 36204 invoked by uid 500); 4 Mar 2011 18:13:27 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 36195 invoked by uid 99); 4 Mar 2011 18:13:27 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Mar 2011 18:13:27 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of qwertymaniac@gmail.com designates 209.85.161.48 as permitted sender) Received: from [209.85.161.48] (HELO mail-fx0-f48.google.com) (209.85.161.48) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Mar 2011 18:13:23 +0000 Received: by fxm2 with SMTP id 2so3166051fxm.35 for ; Fri, 04 Mar 2011 10:13:01 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type:content-transfer-encoding; bh=YdceVWq3pMfU6cvXbu6HQvngeSMnXL5Dph641OkLqsU=; b=qM18LMi6mQZ2T7V4PjkD2G+5m5KSwb8qyDmrbXdQrASLHdh6hOM9UmSOD4GCStJW5j caq//AO7xBEYNU6KS5KEGg7dNWHpS/6tGm64nkcY7LjuSii+TvGwO6ev3eD36B1JnQfU OfuYy6ZjpV9I6lWY7ZigEoKDTvueWghYJ8n2o= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=gRvQJPw/CKKKI+6ktDO3kwWsUaErx1m0q1c1bI1EtK2Y+kBs2w/Usm0/vtG1yG4DyI TeCyHi2iTXmhjKchcxKhSVF1vQ7MYxF51DBk5Dhg9jCZD6L965vRk7UubaJX/OnYzZDm HyNv1wY8O5ueRgLyvbIxCDoGI1iaJHF8iEJXU= Received: by 10.223.2.205 with SMTP id 13mr1157685fak.138.1299262380860; Fri, 04 Mar 2011 10:13:00 -0800 (PST) MIME-Version: 1.0 Received: by 10.223.122.83 with HTTP; Fri, 4 Mar 2011 10:11:31 -0800 (PST) In-Reply-To: References: From: Harsh J Date: Fri, 4 Mar 2011 23:41:31 +0530 Message-ID: Subject: Re: custom InputFormat class To: mapreduce-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable It is worth reading some implementations of already existing InputFormat classes, such as the simple TextInputFormat, or the SequenceFileInputFormat which also has a RecordReader implementation in it. You may find these source files in your downloaded Hadoop distribution's src/ directory itself (in their appropriate packages). I do not know of an article that has a complete, tutorial-based approach to this (yet). Perhaps others would know! On Fri, Mar 4, 2011 at 11:27 PM, souri datta wro= te: > Hi, > =A0Is there a good tutorial for writing custom InputFormat classes? > Any help would be greatly appreciated. > > Thanks, > Souri > --=20 Harsh J www.harshj.com