Mailing-List: contact user-help@giraph.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@giraph.apache.org
Received-SPF: pass (athena.apache.org: domain of claudio.martella@gmail.com
 designates 74.125.82.177 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CACahaSL=NQXqOPf=PSxTuy8Vog15m7s0Wt4PMNXcYCbr5aHj5Q@mail.gmail.com>
References: 
 <CACahaSJniyjEMjCH4OPRr1WkwmH2mZiyQmf-9ok96HkNSz8j-Q@mail.gmail.com>
 <CAFJOoJcGO17Ha7TNoFZ42NB=VAK-8hrq0gLruowESj=OnWZF2A@mail.gmail.com>
 <CACahaSL=NQXqOPf=PSxTuy8Vog15m7s0Wt4PMNXcYCbr5aHj5Q@mail.gmail.com>
From: Claudio Martella <claudio.martella@gmail.com>
Date: Fri, 22 Nov 2013 00:43:37 +0100
Message-ID: 
 <CAFJOoJfYhki0q-Do9szOfiX7C6YfVc5jxmJQUG=fWd=KKwBU3w@mail.gmail.com>
Subject: Re: Waking up all the vertices after every vertex calls vote to halt
To: "user@giraph.apache.org" <user@giraph.apache.org>
Content-Type: multipart/alternative; boundary=f46d04430638a793c604ebb878b7

--f46d04430638a793c604ebb878b7
Content-Type: text/plain; charset=ISO-8859-1

The simplest thing, is that you get a flag for each vertex to signal
whether they are really active. If not, they return. This means that
vertices never really vote to halt. Computationally, it does not cost you
much more than this check. You can play the rest of the logics with some
aggregators and the master compute.


On Thu, Nov 21, 2013 at 11:57 PM, Ameya Vilankar
<ameya.vilankar@gmail.com>wrote:

> Hi,
> I have implemented Alternating Least Squares on top apache giraph. On the
> edge, I store the type of the edge. Edges can be either a training edge or
> testing edge. When I run the algorithm, I use only the ratings on the
> training edge to tune the vectors on the vertices.
> The algorithm ends in one of the two scenarios:
> 1. All the vertices have tuned their vector with in the tolerable error.
> At this point there are no active vertices since everyone has called vote
> to halt.
> 2. We reached the maximum number of supersteps. At this point, some
> vertices are active since they received messages from the last superstep.
>
> I have written an Aggregator that counts the training error along this
> process. But now, I want to calculate the prediction/testing error which is
> along the testing labelled edges. But there are either no active vertices
> or few active vertices at this point in my algorithm. I need all the
> vertices to send their vectors along all of their testing edges to compute
> the testing error and send it to a error sum aggregator. For this I need to
> activate all the vertices.
> Hope it is clear to you now.
>
> Thanks,
> Ameya.
> Zynga
>
>
> On Thu, Nov 21, 2013 at 2:45 PM, Claudio Martella <
> claudio.martella@gmail.com> wrote:
>
>> Hi Ameya,
>>
>> I'm not sure I understand the problem correctly. The maximum number of
>> supersteps allows you to halt the computation when that threshold is
>> reached. The RMSE can be computed within the master compute.
>>
>> What do you want to achieve exactly?
>>
>>
>> On Thu, Nov 21, 2013 at 10:47 PM, Ameya Vilankar <
>> ameya.vilankar@gmail.com> wrote:
>>
>>> Hi,
>>> I am implementing a machine learning algorithm on top giraph. The
>>> algorithm converges when all the vertices call voteToHalt or some max
>>> number of supersteps have completed.
>>> I want to calculate the RMSE error  after the algorithm has converged.
>>> But the problem is either all the vertices have called vote to halt or (in
>>> the case where we reach max supersteps) only some of them are still active.
>>> I need to reactivate or wake up all the vertices. Is there any way in
>>> giraph that I could do this?
>>>
>>> Thanks,
>>> Ameya Vilankar
>>> Zynga
>>>
>>
>>
>>
>> --
>>    Claudio Martella
>>    claudio.martella@gmail.com
>>
>
>


-- 
   Claudio Martella
   claudio.martella@gmail.com

--f46d04430638a793c604ebb878b7
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">The simplest thing, is that you get a flag for each vertex=
 to signal whether they are really active. If not, they return. This means =
that vertices never really vote to halt. Computationally, it does not cost =
you much more than this check. You can play the rest of the logics with som=
e aggregators and the master compute.<div class=3D"gmail_extra">

<br><br><div class=3D"gmail_quote">On Thu, Nov 21, 2013 at 11:57 PM, Ameya =
Vilankar <span dir=3D"ltr">&lt;<a href=3D"mailto:ameya.vilankar@gmail.com" =
target=3D"_blank">ameya.vilankar@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">Hi,<div>I have implemented =
Alternating Least Squares on top apache giraph. On the edge, I store the ty=
pe of the edge. Edges can be either a training edge or testing edge. When I=
 run the algorithm, I use only the ratings on the training edge to tune the=
 vectors on the vertices.=A0</div>


<div>The algorithm ends in one of the two scenarios:</div><div>1. All the v=
ertices have tuned their vector with in the tolerable error. At this point =
there are no active vertices since everyone has called vote to halt.</div>


<div>2. We reached the maximum number of supersteps. At this point, some ve=
rtices are active since they received messages from the last superstep.</di=
v><div><br></div><div>I have written an Aggregator that counts the training=
 error along this process. But now, I want to calculate the prediction/test=
ing error which is along the testing labelled edges. But there are either n=
o active vertices or few active vertices at this point in my algorithm. I n=
eed all the vertices to send their vectors along all of their testing edges=
 to compute the testing error and send it to a error sum aggregator. For th=
is I need to activate all the vertices.</div>


<div>Hope it is clear to you now.</div><div><br></div><div>Thanks,</div><di=
v>Ameya.</div><div>Zynga</div></div><div><div><div class=3D"gmail_extra"><b=
r><br><div class=3D"gmail_quote">On Thu, Nov 21, 2013 at 2:45 PM, Claudio M=
artella <span dir=3D"ltr">&lt;<a href=3D"mailto:claudio.martella@gmail.com"=
 target=3D"_blank">claudio.martella@gmail.com</a>&gt;</span> wrote:<br>


<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">Hi Ameya,<div><br></div><di=
v>I&#39;m not sure I understand the problem correctly. The maximum number o=
f supersteps allows you to halt the computation when that threshold is reac=
hed. The RMSE can be computed within the master compute.</div>


<div><br></div><div>What do you want to achieve exactly?</div></div><div cl=
ass=3D"gmail_extra"><div><div><br><br><div class=3D"gmail_quote">On Thu, No=
v 21, 2013 at 10:47 PM, Ameya Vilankar <span dir=3D"ltr">&lt;<a href=3D"mai=
lto:ameya.vilankar@gmail.com" target=3D"_blank">ameya.vilankar@gmail.com</a=
>&gt;</span> wrote:<br>


<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">Hi,<div>I am implementing a=
 machine learning algorithm on top giraph. The algorithm converges when all=
 the vertices call voteToHalt or some max number of supersteps have complet=
ed.=A0</div>


<div>I want to calculate the RMSE error =A0after the algorithm has converge=
d. But the problem is either all the vertices have called vote to halt or (=
in the case where we reach max supersteps) only some of them are still acti=
ve.</div>


<div>I need to reactivate or wake up all the vertices. Is there any way in =
giraph that I could do this?</div><div><br></div><div>Thanks,</div><div>Ame=
ya Vilankar</div><div>Zynga</div></div>
</blockquote></div><br><br clear=3D"all"><div><br></div></div></div><span><=
font color=3D"#888888">-- <br> =A0 =A0Claudio Martella<br> =A0 =A0<a href=
=3D"mailto:claudio.martella@gmail.com" target=3D"_blank">claudio.martella@g=
mail.com</a>=A0 =A0
</font></span></div>
</blockquote></div><br></div>
</div></div></blockquote></div><br><br clear=3D"all"><div><br></div>-- <br>=
 =A0 =A0Claudio Martella<br> =A0 =A0<a href=3D"mailto:claudio.martella@gmai=
l.com" target=3D"_blank">claudio.martella@gmail.com</a>=A0 =A0
</div></div>

--f46d04430638a793c604ebb878b7--