Return-Path: X-Original-To: apmail-giraph-user-archive@www.apache.org Delivered-To: apmail-giraph-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E3F04112B5 for ; Thu, 19 Jun 2014 12:03:07 +0000 (UTC) Received: (qmail 53002 invoked by uid 500); 19 Jun 2014 12:03:07 -0000 Delivered-To: apmail-giraph-user-archive@giraph.apache.org Received: (qmail 52949 invoked by uid 500); 19 Jun 2014 12:03:07 -0000 Mailing-List: contact user-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@giraph.apache.org Delivered-To: mailing list user@giraph.apache.org Received: (qmail 52939 invoked by uid 99); 19 Jun 2014 12:03:07 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jun 2014 12:03:07 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of soozandjohnyost@gmail.com designates 209.85.219.49 as permitted sender) Received: from [209.85.219.49] (HELO mail-oa0-f49.google.com) (209.85.219.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jun 2014 12:03:00 +0000 Received: by mail-oa0-f49.google.com with SMTP id i7so4833940oag.36 for ; Thu, 19 Jun 2014 05:02:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=N+wvGN9GYxiwIns/uv1AnoC03TLxKqe1ovibp6L2FYU=; b=y0GRqv3BMNSyge8qzU7ufa8UVpMKzyyUUXE0fWbTj5wSNglJWwqgIfXQ6hvkz7sZnG ybDsnNZt9qcXrRe/7+yh4ljMlNGhMG4Kf6gCyZ5TW+BxLCUX1CDKhDh9/0rfykqhCBKf 0Bn4kiZZofJEuio1ojry5r2BMOMtmgviWkryORbUdZAqPA1/esou0U4m5C1NEVb/HD2s P2oeTZ5AVNj81dJkXpfgn+8fVQvMCU+mRKYHDI7JkQnQ80WN+3N5Elys6jhdHGycHrAF pl0qIbsxQWH4iXC8Feeqohy8z7O+dTiFto0dA4O9GWtWMcm0bGS7fX3d+M81JUzjvqEl QiUg== MIME-Version: 1.0 X-Received: by 10.60.160.4 with SMTP id xg4mr4208503oeb.4.1403179359544; Thu, 19 Jun 2014 05:02:39 -0700 (PDT) Received: by 10.202.94.69 with HTTP; Thu, 19 Jun 2014 05:02:39 -0700 (PDT) In-Reply-To: References: Date: Thu, 19 Jun 2014 08:02:39 -0400 Message-ID: Subject: Re: How to output into multiple files through a GiraphJob From: John Yost To: user@giraph.apache.org Content-Type: multipart/alternative; boundary=089e0112cb30482aea04fc2f27da X-Virus-Checked: Checked by ClamAV on apache.org --089e0112cb30482aea04fc2f27da Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi Ferenc, I have an Giraph job that outputs from the Computation class as opposed to the MasterCompute because I need to maintain alot of state within VertexValues as opposed to Aggregators. This is one way of outputting results as multiple files. I am assuming that you want to scope output files per sub-graph groupings of vertices, of course. :) --John On Thu, Jun 19, 2014 at 4:02 AM, Ferenc B=C3=A9res wr= ote: > Hi Everyone, > > Currently I'm working on an ALS implementation in giraph 1.1.0 and I woul= d > like to output the values of the vertices into multiple output files, but= I > could not figure it out how to do it. > > I found that in Hadoop it can be done by using *org.apache.hadoop.mapredu= ce.lib.output.MultipleOutputs, > *but it didn't work with the GiraphJob. > > Is it possible to output into multiple files by configuring the GiraphJob= , > or there is an other way? > > I would appreciate any idea in this matter. > > Thank you, > Ferenc B=C3=A9res > --089e0112cb30482aea04fc2f27da Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi Ferenc,

I have an Giraph job tha= t outputs from the Computation class as opposed to the MasterCompute becaus= e I need to maintain alot of state within VertexValues as opposed to Aggreg= ators.=C2=A0 This is one way of outputting results as multiple files.=C2=A0= I am assuming that you want to scope output files per sub-graph groupings = of vertices, of course. :)

--John


On Thu, Jun 19, 2014 at 4:02 AM, Ferenc B=C3=A9res <ferdz= so05@gmail.com> wrote:
Hi Everyone,

Cu= rrently I'm working on an ALS implementation in giraph=20 1.1.0 and I would like to output the values of the vertices into=20 multiple output files, but I could not figure it out how to do it.

I= found that in Hadoop it can be done by using=C2=A0org.apache.hadoop.map= reduce.lib.output.MultipleOutputs<KEYOUT,VALUEOUT>, but it didn&#= 39;t work with the GiraphJob.

Is it possible to output into multiple files by configuring the GiraphJ= ob, or there is an other way?

I would appreciate any idea in this ma= tter.

Thank you,
Ferenc B=C3=A9res

--089e0112cb30482aea04fc2f27da--