From: Siddharth Tiwari
Subject: Customized input format
Date: Thu, 23 Aug 2012 09:56:48 +0000
Reply-To: user@hadoop.apache.org

Hi,

I want to read a file paragraph by paragraph: the reader should collect lines until it encounters an empty line, then pass that content to the mapper. Please guide me on how I can achieve this. An example would be a great help.
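To make the requirement concrete, here is a minimal sketch of the paragraph-wise splitting described above, written in plain Java rather than against the Hadoop API. The class `ParagraphSplitter` and its `split` method are hypothetical names used only for illustration. The sketch also assumes that a whitespace-only line counts as an empty line and that several consecutive empty lines act as a single separator. Each returned string stands for what one call to the mapper should receive.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not part of Hadoop): groups lines into paragraphs,
// where one or more empty lines end the current paragraph.
final class ParagraphSplitter {

    static List<String> split(BufferedReader reader) throws IOException {
        List<String> paragraphs = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        String line;
        while ((line = reader.readLine()) != null) {
            // Assumption: a whitespace-only line is treated as empty.
            if (line.trim().isEmpty()) {
                // Empty line: emit the paragraph collected so far, if any.
                if (current.length() > 0) {
                    paragraphs.add(current.toString());
                    current.setLength(0);
                }
            } else {
                // Keep the original line breaks inside a paragraph.
                if (current.length() > 0) {
                    current.append('\n');
                }
                current.append(line);
            }
        }
        // The last paragraph may not be followed by an empty line.
        if (current.length() > 0) {
            paragraphs.add(current.toString());
        }
        return paragraphs;
    }

    public static void main(String[] args) throws IOException {
        String sample = "first line\nsecond line\n\nnext paragraph\n";
        for (String p : split(new BufferedReader(new StringReader(sample)))) {
            System.out.println("[" + p + "]");
        }
    }
}
```

Wiring this behaviour into a MapReduce job (for example through a custom record reader) is not shown here; the sketch only pins down what "one paragraph per record" is meant to produce.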