Return-Path: Delivered-To: apmail-mahout-user-archive@www.apache.org Received: (qmail 128 invoked from network); 24 Nov 2010 19:33:40 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 24 Nov 2010 19:33:40 -0000 Received: (qmail 68221 invoked by uid 500); 24 Nov 2010 19:34:11 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 68179 invoked by uid 500); 24 Nov 2010 19:34:11 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 68171 invoked by uid 99); 24 Nov 2010 19:34:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Nov 2010 19:34:11 +0000 X-ASF-Spam-Status: No, hits=4.7 required=10.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,FREEMAIL_REPLY,RCVD_IN_DNSWL_NONE,RFC_ABUSE_POST,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of pmjimenez1983@hotmail.com designates 65.55.90.37 as permitted sender) Received: from [65.55.90.37] (HELO snt0-omc1-s26.snt0.hotmail.com) (65.55.90.37) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Nov 2010 19:34:01 +0000 Received: from SNT112-W11 ([65.55.90.7]) by snt0-omc1-s26.snt0.hotmail.com with Microsoft SMTPSVC(6.0.3790.4675); Wed, 24 Nov 2010 11:33:40 -0800 Message-ID: X-Originating-IP: [85.53.20.174] From: PEDRO MANUEL JIMENEZ RODRIGUEZ To: Subject: RE: Lanczos Algorithm Date: Wed, 24 Nov 2010 20:33:39 +0100 Importance: Normal In-Reply-To: References: ,,, , , ,, Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginalArrivalTime: 24 Nov 2010 19:33:40.0013 (UTC) FILETIME=[7F1B7DD0:01CB8C0E] X-Virus-Checked: Checked by ClamAV on apache.org Hi Ted=2C This is in the worst case. I=A0 haved asked for the amount of data and that= 's what somebody told me. Anyway I'm not going to make any test with that a= mount of data.=20 I'm doing my final year project and I just have to make this works with a r= easonable amount of data. So I just need to know how the algorithm it's wor= king and why I'm getting different values when I compare the results with a= nother implementation in R. =A0 =A0 ---------------------------------------- > From: ted.dunning@gmail.com > Date: Tue=2C 23 Nov 2010 14:55:00 -0800 > Subject: Re: Lanczos Algorithm > To: user@mahout.apache.org > > I seriously doubt that=2C actually. 10^14 is a very large number. > > As far as I know=2C the record for computing an SVD of a large sparse mat= rix > started with about 2-3 x 10^9 non-zero elements. You are saying that your > problem is 100=2C000 times larger than this. I think that you are going t= o > have to > wait for another 15 compute speed doubling times before this becomes a > feasible computation. > > On Tue=2C Nov 23=2C 2010 at 11:55 AM=2C PEDRO MANUEL JIMENEZ RODRIGUEZ < > pmjimenez1983@hotmail.com> wrote: > > > Well=2C this is in the worst case but it could be possible. > > > > I'm not going to make any tests with this amount of data because for me= is > > impossible but this project is part of a bigger one and they would have > > enough space to deal with this amount of data. > > > > > > ---------------------------------------- > > > From: ted.dunning@gmail.com > > > Date: Mon=2C 22 Nov 2010 14:46:20 -0800 > > > Subject: Re: Lanczos Algorithm > > > To: user@mahout.apache.org > > > > > > That seems like a lot. That would mean that have 10^14 =3D 100 trilli= on > > > nonzero elements which would take 10PB to store with one bit per non-= zero > > > element. > > > > > > Are there many totally zero rows? > > > > > > Can you estimate how many non-zero elements you have in all? > > > > > =