Return-Path: X-Original-To: apmail-giraph-dev-archive@www.apache.org Delivered-To: apmail-giraph-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A0BC010795 for ; Fri, 5 Dec 2014 00:12:34 +0000 (UTC) Received: (qmail 66839 invoked by uid 500); 5 Dec 2014 00:12:34 -0000 Delivered-To: apmail-giraph-dev-archive@giraph.apache.org Received: (qmail 66805 invoked by uid 500); 5 Dec 2014 00:12:34 -0000 Mailing-List: contact dev-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@giraph.apache.org Delivered-To: mailing list dev@giraph.apache.org Received: (qmail 66786 invoked by uid 99); 5 Dec 2014 00:12:33 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Dec 2014 00:12:33 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of arghyakusumdas2266@gmail.com designates 209.85.218.41 as permitted sender) Received: from [209.85.218.41] (HELO mail-oi0-f41.google.com) (209.85.218.41) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Dec 2014 00:12:28 +0000 Received: by mail-oi0-f41.google.com with SMTP id a3so13367730oib.0 for ; Thu, 04 Dec 2014 16:12:07 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=+U1rmzaVbHKwPfEheQ8ycdiq8XtubsiHfSd2XiNYUws=; b=pkIZQbQKw3woHzJN/8IYvwu9e42znvRXrjCNSqd2mGU7ZFW5sVUC37yVwfs+Adggv0 xOgmRDSmSITVcOj5fJhUeS/PfoATSWXBoQSLxyh+9gdeUx6aSnEMVuZnWhKK5NMG0667 N2VwzoACOvZXNvwyYTnVyhITd1mZEKDPzpJclT/T9Y3KqPpcmj+flapnSWQbh31TR/mW UfnC8qCr8ZvD5z1wFJpTarOuNCDAUxiezt908wpNU9QWcwVVS8mGs7EMyE053vA58YEh yCbBw40OafMgNakY+0YVelSVwMdq17FDsyRZgPDIScbA1CGBsRr0q404pyWJa86nEowf XD0w== MIME-Version: 1.0 X-Received: by 10.60.112.72 with SMTP id io8mr8462472oeb.67.1417738327565; Thu, 04 Dec 2014 16:12:07 -0800 (PST) Received: by 10.202.82.11 with HTTP; Thu, 4 Dec 2014 16:12:07 -0800 (PST) In-Reply-To: References: Date: Thu, 4 Dec 2014 18:12:07 -0600 Message-ID: Subject: Re: Need help on simple giraph input format From: Arghya Kusum Das To: dev Content-Type: multipart/alternative; boundary=001a11332adc662c6605096ced4f X-Virus-Checked: Checked by ClamAV on apache.org --001a11332adc662c6605096ced4f Content-Type: text/plain; charset=UTF-8 Hi Matthew, Thanks for your quick reply. I sent a subscription request to user list also. Regarding the graph: No it is not possible to come up with different integer ids for different nodes. Because the vertex id can be even more than 50 characters (It is just a simple excerpt). So I need a very simple text based vertex input format where each line represents a vertex and its outgoing edges (adjacency list). I thought since it is pretty generic and simple there should be something already present Thansk and Regards, Arghya Kusum Das On Thu, Dec 4, 2014 at 5:50 PM, Matthew Saltz wrote: > Hi, > > If you can simply come up with long values to represent your vertex ids > instead of text (AAA -> 1, AAC -> 2, etc.), you can just use this class > < > https://giraph.apache.org/apidocs/org/apache/giraph/io/formats/LongLongNullTextInputFormat.html > > > directly. > If not, take a look at that code to give you some ideas on how to implement > it with text ids instead. There might be another one already available to > do this but this is the best I can think of at the moment. > > Best, > Matthew > > > PS I think this is a question that would be better directed towards the > user > mailing list. > > On Fri, Dec 5, 2014 at 12:20 AM, Arghya Kusum Das < > arghyakusumdas2266@gmail.com> wrote: > > > Hi, > > This is the first time I am trying to deal with giraph input format class > > and need some help. > > > > I have a simple Graph like following: > > > > AAA AAT AAG > > AAC ACG ACT > > AAG AGT AGA > > AAT ATT > > ACG CGA > > ACT CTG CTT CTC > > > > The first column is the vertex id and the following are the edges > > (Adjacency list). > > > > Is there any predefined input-format in giraph for this? > > If so what is that? And if not can anybody provide a simple class for > that? > > > > Any help is appreciated > > > > -- > > Thanks and regards, > > Arghya Kusum Das > > (225-270-6163) > > > -- Thanks and regards, Arghya Kusum Das (225-270-6163) --001a11332adc662c6605096ced4f--