Subject: Re: Hama status /
From: "Edward J. Yoon"
To: hama-user@incubator.apache.org
Date: Thu, 4 Jun 2009 11:24:57 +0900

Yes, the goal is to handle really huge matrices: for example, matrix
operations for large-scale statistical processing, or matrix decomposition
of a huge web link graph or social graph.

Those benchmarks were run on 5 nodes and 10 nodes. In the future, I'll try
them on a thousand nodes.

On Thu, Jun 4, 2009 at 1:19 AM, tog wrote:
> Hi Edward,
>
> I had a look at the benchmarks ...
> Well, a 5000 x 5000 dense matrix multiply takes roughly 30 seconds on my
> laptop. I have been doing out-of-core parallel matrix factorization and
> solves with dense systems up to 350000, so I guess it is probably for
> larger matrices that Hama could be interesting.
> Do you plan to do such tests with really huge matrices?
> Otherwise, what is your business case?
>
> Cheers
> Guillaume
>
> On Wed, Jun 3, 2009 at 6:14 PM, Edward J. Yoon wrote:
>
>> FYI, I ran some benchmarks -
>> http://wiki.apache.org/hama/PerformanceEvaluation
>>
>> If you need any help, please let us know.
>>
>> Thanks.
>>
>> On Wed, Jun 3, 2009 at 6:55 PM, tog wrote:
>> > Yes, I understand the difference between MPI and Hadoop - I was using
>> > MPI before it actually existed :)
>> > But as you phrased it, I had the impression that Hama was working on
>> > a 1 node/core cluster !!
>> >
>> > Regards
>> > Guillaume
>> >
>> > On Wed, Jun 3, 2009 at 5:44 PM, Edward J. Yoon wrote:
>> >
>> >> Hi,
>> >>
>> >> There is some difference between Map/Reduce and MPI programming. MPI
>> >> is designed for fast parallel computing using network communication
>> >> on a small cluster. Since MPI relies on network communication, the
>> >> network cost grows linearly as the number of nodes increases.
>> >> On the contrary, Map/Reduce is designed for distributed processing
>> >> by connecting many commodity computers together. Therefore, the
>> >> algorithms should avoid large amounts of communication for best
>> >> performance, and the key to that is the 'sequential process'.
>> >>
>> >> Thanks.
>> >>
>> >> On Wed, Jun 3, 2009 at 6:07 PM, tog wrote:
>> >> > Hi Edward,
>> >> >
>> >> > I have a test to do which is basically sparse mat-vec
>> >> > multiplication and mat norm computation, so that should be possible
>> >> > with Hama in its current state, I guess.
>> >> > What do you mean by "sequentially executed"?
>> >> >
>> >> > Cheers
>> >> > Guillaume
>> >> >
>> >> > On Wed, Jun 3, 2009 at 5:00 PM, Edward J. Yoon wrote:
>> >> >
>> >> >> Hi,
>> >> >>
>> >> >> Currently, the basic matrix operations are implemented based on
>> >> >> the map/reduce programming model: for example, the matrix get/set
>> >> >> methods, the matrix norms, matrix-matrix multiplication/addition,
>> >> >> and matrix transpose. In the near future, SVD, eigenvalue
>> >> >> decomposition, and some graph algorithms will be implemented. All
>> >> >> the operations are sequentially executed.
>> >> >>
>> >> >> Thanks.
>> >> >>
>> >> >> On Wed, Jun 3, 2009 at 5:45 PM, tog wrote:
>> >> >> >
>> >> >> > Hi,
>> >> >> >
>> >> >> > I would like to know: what is the status of Hama?
>> >> >> > What am I able to do with it?
>> >> >> > What are the future directions?
>> >> >> >
>> >> >> > Cheers
>> >> >> > Guillaume
>> >> >>
>> >> >>
>> >> >> --
>> >> >> Best Regards, Edward J. Yoon @ NHN, corp.
>> >> >> edwardyoon@apache.org
>> >> >> http://blog.udanax.org
>> >> >
>> >> >
>> >> > --
>> >> > PGP KeyID: 1024D/47172155
>> >> > FingerPrint: C739 8B3C 5ABF 127F CCFA 5835 F673 370B 4717 2155
>> >> >
>> >> > http://cheztog.blogspot.com
>> >>
>> >>
>> >> --
>> >> Best Regards, Edward J. Yoon @ NHN, corp.
>> >> edwardyoon@apache.org
>> >> http://blog.udanax.org
>> >
>> >
>> > --
>> > PGP KeyID: 1024D/47172155
>> > FingerPrint: C739 8B3C 5ABF 127F CCFA 5835 F673 370B 4717 2155
>> >
>> > http://cheztog.blogspot.com
>>
>>
>> --
>> Best Regards, Edward J. Yoon @ NHN, corp.
>> edwardyoon@apache.org
>> http://blog.udanax.org
>
>
> --
> PGP KeyID: 1024D/47172155
> FingerPrint: C739 8B3C 5ABF 127F CCFA 5835 F673 370B 4717 2155
>
> http://cheztog.blogspot.com

--
Best Regards, Edward J. Yoon @ NHN, corp.
edwardyoon@apache.org
http://blog.udanax.org
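To make the discussion concrete: the sparse matrix-vector product Guillaume
mentions maps naturally onto the map/reduce model Edward describes. The
sketch below is a minimal, hypothetical Hadoop MapReduce version of it, not
Hama's actual implementation; the class names, the "row col value" input
format, and the hard-coded vector are assumptions made purely for
illustration. Each mapper emits partial products keyed by row index, so the
only inter-node communication is the shuffle that groups those partials for
the per-row sums.

import java.io.IOException;

import org.apache.hadoop.io.DoubleWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

/** Mapper: each input line holds one nonzero of A as "row col value". */
public class SpMVMapper
    extends Mapper<LongWritable, Text, LongWritable, DoubleWritable> {

  // Assumption: the vector x is small enough to hold in memory on every
  // node; a real job would load it in setup() from HDFS or a side channel.
  private final double[] x = {1.0, 2.0, 3.0};

  @Override
  protected void map(LongWritable offset, Text line, Context context)
      throws IOException, InterruptedException {
    String[] parts = line.toString().trim().split("\\s+");
    long row = Long.parseLong(parts[0]);
    int col = Integer.parseInt(parts[1]);
    double a = Double.parseDouble(parts[2]);
    // Emit the partial product A[row][col] * x[col], keyed by row.
    context.write(new LongWritable(row), new DoubleWritable(a * x[col]));
  }
}

/** Reducer: sums the partial products for one row, giving (A*x)[row]. */
class SpMVReducer
    extends Reducer<LongWritable, DoubleWritable, LongWritable, DoubleWritable> {

  @Override
  protected void reduce(LongWritable row, Iterable<DoubleWritable> partials,
      Context context) throws IOException, InterruptedException {
    double sum = 0.0;
    for (DoubleWritable p : partials) {
      sum += p.get();
    }
    context.write(row, new DoubleWritable(sum));
  }
}

A matrix norm or a blocked matrix-matrix multiply follows the same pattern
with different keys, which is why the per-node work can stay a local,
sequential scan while the framework pays only for the shuffle.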