Date: Sat, 2 Mar 2013 12:31:33 +0000
Subject: Several Queries
From: Peter Morgan
To: user@giraph.apache.org

Hi,

I have several questions/comments relating to various parts of Giraph. I'm not sure whether they belong here or on the dev mailing list, and they might already have been answered somewhere.

1) When is there likely to be a new release of Giraph? It's been a while since the last one, and I know there has been a lot of work on it.

2) I've been using the AccumuloVertexInputFormat and have not found it very useful. Graph data held in Accumulo is most likely an edge list, so would it not be better for the Accumulo input format for Giraph to extend EdgeInputFormat instead? The current example for use with Accumulo is called AccumuloEdgeInputFormat (which extends AccumuloVertexInputFormat) - that seems to be badly named. I was also wondering what the behaviour is when using a VertexInputFormat with an edge list: do vertices created later in the input override previous ones, so that edges get lost or are never added?

3) Is there a way to ensure that if the input file doesn't exist, the Giraph job (and hence the MR job) will exit rather than hang?

4) The SequenceFileReader currently assumes that the key type is going to be the same as the vertex name type, but in my case this isn't true. Is there, or can there be, a version which allows the sequence file key and value types to differ from the Giraph vertex name, state, edge and message types?

5) I've been having a problem where a Giraph job finishes successfully, but the process/JVM on each compute node is not killed properly: it just sits there idle, keeping the RAM that the job used. This doesn't seem to be a problem when we run normal MR. We then have to manually kill each process to release the RAM. Any ideas why this might be happening?

Thanks in advance for any help.

Peter
