Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DB528185F0 for ; Fri, 26 Jun 2015 03:39:25 +0000 (UTC) Received: (qmail 17485 invoked by uid 500); 26 Jun 2015 03:39:20 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 17358 invoked by uid 500); 26 Jun 2015 03:39:20 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 17348 invoked by uid 99); 26 Jun 2015 03:39:20 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Jun 2015 03:39:20 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id C065B1A614B for ; Fri, 26 Jun 2015 03:39:19 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.791 X-Spam-Level: ** X-Spam-Status: No, score=2.791 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_REPLY=1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H2=-1.108, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id YQ-o5Bp8o8Uv for ; Fri, 26 Jun 2015 03:39:19 +0000 (UTC) Received: from mail-lb0-f172.google.com (mail-lb0-f172.google.com [209.85.217.172]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id C0C9F43DA9 for ; Fri, 26 Jun 2015 03:39:18 +0000 (UTC) Received: by lbbvz5 with SMTP id vz5so56834842lbb.0 for ; Thu, 25 Jun 2015 20:39:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=+CHauQn2WzauQTLvF5hHBeQ+dSt8J2ZQDmv5cH+jmCA=; b=lv5lesJzF4Gt0fgNOPNEhsP6rqvh7Gi8duaJyW51vxs9k/GjUKVTLqFvp7viOP5R03 RUnV3jAd21qoqWhl90+KyEAbcOItujy6gxTzhccMVd3rNma5mz8fcIGLkbAc89MC/JGG GTEfZgrOfbEDxma0HClFkbZO5ogsGmW3D2dj0kQlfeu1AmKt8pA8MRTgDR+qEFGOZqt3 Ha5K1l7ReAnKPUSePYpxBDCNmjp0iewCIPGhgaRUsp734RiuapuKnBvQ36mOn82rgQPq eGUJYqCtK1/hWGiI84V3dZ9pbUiXRVk5A0KCftfQN8KDBz0DIjTrAeowwKeEk/6BZglx 9Znw== MIME-Version: 1.0 X-Received: by 10.112.72.164 with SMTP id e4mr47170842lbv.113.1435289951766; Thu, 25 Jun 2015 20:39:11 -0700 (PDT) Received: by 10.112.158.167 with HTTP; Thu, 25 Jun 2015 20:39:11 -0700 (PDT) In-Reply-To: References: Date: Fri, 26 Jun 2015 09:09:11 +0530 Message-ID: Subject: Re: how to assign unique ID (Long Value) in mapper From: Harshit Mathur To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=001a11c2b616b98e060519637b49 --001a11c2b616b98e060519637b49 Content-Type: text/plain; charset=UTF-8 Are there loops in your graph? On Thu, Jun 25, 2015 at 10:39 PM, Ravikant Dindokar wrote: > Hi Hadoop user, > > I have a file containing one line for each edge in the graph with two > vertex ids (source & sink). > sample: > 1 2 (here 1 is source and 2 is sink node for the edge) > 1 5 > 2 3 > 4 2 > 4 3 > I want to assign a unique Id (Long value )to each edge i.e for each line > of the file. > > How to ensure assignment of unique value in distributed mapper process? > > Note : File size is large, so using only one reducer is not feasible. > > Thanks > Ravikant > -- Harshit Mathur --001a11c2b616b98e060519637b49 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Are there loops in your graph?


On Thu, Jun 25, 2015 at 10:39 P= M, Ravikant Dindokar <ravikant.iisc@gmail.com> wrote:<= br>
Hi Hadoop user,

I have a file containing one line for ea= ch edge in the graph with two vertex ids (source & sink).
sample:
1=C2=A0=C2=A0=C2=A0 2 (here 1 is source and 2 is sink node for the ed= ge)
1=C2=A0=C2=A0=C2=A0 5
2=C2=A0=C2=A0=C2=A0 3
4=C2=A0=C2= =A0=C2=A0 2
4=C2=A0=C2=A0=C2=A0 3
I want to assign a unique Id = (Long value )to each edge i.e for each line of the file.

How t= o ensure assignment of unique value in distributed mapper process?

<= /div>Note : File size is large, so using only one reducer is not feasible.<= br>
Thanks
Ravikant
<= /font>



--
Harshit Mathur
--001a11c2b616b98e060519637b49--