From: Dag Lem
Organization: Nimrod
To: user@lucy.apache.org
Date: 23 Oct 2012 14:35:10 +0200
Subject: Re: [lucy-user] Running out of memory while loading
Message-ID: <8662614b9d.fsf@sid.nimrod.no>
In-Reply-To: <8CC8A0E7-1CE0-40BE-9127-7966E45162F6@gmail.com>

Ken Youens-Clark writes:

> Hi,
>
> I'm just experimenting with the use of Lucy to index the data I
> formerly stored in relational (MySQL) databases. I'm just taking the
> text from the db and putting it into Lucy stores. Each database gets
> its own directory so that it's easy for me to update just part of my
> search index when a db changes. So far, I've processed about 30 dbs
> into a total of about 2.3GB of Lucy indexes. The problem is that my
> machine (a pretty decent, dual-core Linux host) keeps running out of
> memory, esp. when indexing a large db with 100K+ records. My sysadmin
> keeps killing it as it will take down the machine.
>
> I'm using the latest Perl and Lucy source. Any ideas?
>
> Ken

One possible cause of your problems is that your database driver
attempts to blast the complete result set from your SQL query into
client memory in one go, so you are likely to run out of memory
before indexing even starts.

I don't really know DBD::mysql, assuming that's what you're using, but
I know from first-hand experience that you have to take special care
with DBD::Pg.

-- 
Best regards,
Dag Lem
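
To illustrate the point about result sets, here is a minimal sketch of pulling rows in bounded batches and handing them to an indexer one at a time, rather than materialising the whole result first. It is written in Python with the standard-library sqlite3 module purely as a runnable stand-in; it is not Lucy or DBD code. The table `docs`, the helpers `iter_rows`, `build_demo_db` and `index_all`, and the `add_doc` callback are all hypothetical names chosen for the example. Whether batch fetching actually bounds client memory depends on the driver: some drivers buffer the complete result at execute time regardless of how you fetch, which is exactly the situation described above, and then you need the driver's own unbuffered or cursor-based mode.

```python
import sqlite3


def iter_rows(conn, query, batch_size=1000):
    """Yield rows one at a time, fetching at most batch_size rows per call."""
    cur = conn.cursor()
    cur.execute(query)
    while True:
        batch = cur.fetchmany(batch_size)
        if not batch:
            break
        for row in batch:
            yield row


def build_demo_db(n_rows):
    """Create an in-memory database with a hypothetical 'docs' table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE docs (id INTEGER PRIMARY KEY, body TEXT)")
    conn.executemany(
        "INSERT INTO docs (id, body) VALUES (?, ?)",
        ((i, f"record {i}") for i in range(n_rows)),
    )
    conn.commit()
    return conn


def index_all(conn, add_doc, batch_size=1000):
    """Feed every row to add_doc (a stand-in for an indexer) and return the count."""
    count = 0
    query = "SELECT id, body FROM docs ORDER BY id"
    for doc_id, body in iter_rows(conn, query, batch_size):
        add_doc({"id": doc_id, "body": body})
        count += 1
    return count


if __name__ == "__main__":
    demo = build_demo_db(2500)
    total = index_all(demo, lambda doc: None, batch_size=1000)
    print(f"indexed {total} rows")
```

The design choice is simply that the consumer never holds more than one batch of rows at a time, so memory use is governed by `batch_size` and not by the size of the table, provided the driver itself streams rows.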