Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6E7E4D7D5 for ; Mon, 3 Sep 2012 11:01:18 +0000 (UTC) Received: (qmail 74042 invoked by uid 500); 3 Sep 2012 11:01:13 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 73750 invoked by uid 500); 3 Sep 2012 11:01:12 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 72804 invoked by uid 99); 3 Sep 2012 11:01:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Sep 2012 11:01:10 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of anitamalex@gmail.com designates 209.85.212.48 as permitted sender) Received: from [209.85.212.48] (HELO mail-vb0-f48.google.com) (209.85.212.48) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Sep 2012 11:01:04 +0000 Received: by vbme21 with SMTP id e21so6500434vbm.35 for ; Mon, 03 Sep 2012 04:00:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=t2uFAX44sDucy+eP2I0rF7qHW0HnIb3uD4oFglTvJTI=; b=AnV82DGBolgNWSlLOkmA0s9+Hf29xY0q78drqDGLlH6l690eB+XtMt4V+yux2JA3SJ SEeUoOOlXzHG6W42GvxNvA1gAc/eSR91uAGaaPIQzjWilOgtaeKOfA1cdHBNe3y85AKl 23kuJ30/b+3jCA+Oxit8U4i865Vg/1V6wf18kvKYyHAYJR6MN10UKMN73i0ym78W1O4U 9gUTaJnxVghqLbWnZjJMTqxc8ju5juDjK8MubROckU7bbtadOt/uRgUgcKiFEQWqCzpt C2fZBUJnLT90PL7DCW64TkcPt9b8RjxHUSxlAzcZC7Jecv75dqAGMk2UR5XlLKiZK7h/ r8/g== MIME-Version: 1.0 Received: by 10.58.23.100 with SMTP id l4mr4626701vef.46.1346670043331; Mon, 03 Sep 2012 04:00:43 -0700 (PDT) Received: by 10.58.211.232 with HTTP; Mon, 3 Sep 2012 04:00:43 -0700 (PDT) Date: Mon, 3 Sep 2012 16:30:43 +0530 Message-ID: Subject: custom format From: Anit Alexander To: user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org hello user, I am trying to create a map reduce program which will have splits based on a specific length. The content has to be extracted in a way such that the newline(\n) or tab(\t) etc characters will be considered as a byte and not as a mapper instance. is this possible through custom input? if yes, how will i create a custom file split based on a specific length value. Any suggestions? Regards, Anit