Date: Thu, 24 May 2012 01:17:27 +0530
Subject: Right way to implement MR ?
From: samir das mohapatra
To: common-user@hadoop.apache.org

Hi All,

How do I compare two input files in an M/R job? Log file A is around 30 GB and log file B is around 60 GB. I would like to know how to define this comparison inside the mapper.

Thanks,
samir