From java-user-return-36436-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Wed Oct 01 12:13:22 2008 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 26165 invoked from network); 1 Oct 2008 12:13:21 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 1 Oct 2008 12:13:21 -0000 Received: (qmail 44646 invoked by uid 500); 1 Oct 2008 12:13:13 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 44319 invoked by uid 500); 1 Oct 2008 12:13:12 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 44308 invoked by uid 99); 1 Oct 2008 12:13:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Oct 2008 05:13:12 -0700 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=SPF_PASS,URIBL_GREY,WHOIS_MYPRIVREG X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of marcelo.ochoa@gmail.com designates 209.85.217.13 as permitted sender) Received: from [209.85.217.13] (HELO mail-gx0-f13.google.com) (209.85.217.13) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Oct 2008 12:12:07 +0000 Received: by gxk6 with SMTP id 6so39506gxk.5 for ; Wed, 01 Oct 2008 05:12:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type :content-transfer-encoding:content-disposition:references; bh=cq/KVBcsTI2MpQ7SR+In7hR+H/OdUoTmMQffdOBfvLM=; b=dIIMdExUMVBQLMnpViOcMTbjtn8QEBxsCuIhyHLUyEQlpstH4BsGjoiEzl2Osl/opC G5DWNT08YkDwmBExb/XuE6aG3vcvKRZWXSsVu1ExQP55tvNfI5s9rnJUi7n1oOI4wIF+ z4Rtz138r3C9SgK2lr95ixgP2Y2/LoeTvgNwc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:content-transfer-encoding:content-disposition :references; b=sTOtIawEtlNfzfGEN93wdoFL+sRlq5+s2A2CV2ystZ++hGhbbAfS6av1YMGXR8np1b 0xxfpb5efB7qf2XvTaH6R1AxUZ6k/uy5vHLP60F1YlaamilV9juXIBSfnZ3f0DHD3bAI MYHcJkB671dg5xlVXxLDEQNgx0qidGuI8xk6o= Received: by 10.100.96.9 with SMTP id t9mr7460288anb.66.1222863161677; Wed, 01 Oct 2008 05:12:41 -0700 (PDT) Received: by 10.100.8.18 with HTTP; Wed, 1 Oct 2008 05:12:41 -0700 (PDT) Message-ID: <126142c0810010512w188f379cqbb3fa307bf6a2dbd@mail.gmail.com> Date: Wed, 1 Oct 2008 09:12:41 -0300 From: "Marcelo Ochoa" To: java-user@lucene.apache.org Subject: Re: Lucene vs. Database In-Reply-To: <19755932.post@talk.nabble.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <19755932.post@talk.nabble.com> X-Virus-Checked: Checked by ClamAV on apache.org Hi Zoran: One of the biggest issues with Lucene DB integration is the network traffic consumed as consequence of indexing or updating operation, apart from transactionalbilty which can be relaxed in some application. During our Oracle Open World presentation we present some of these issues comparing performance during indexing time (integrated solution versus middle tier solution), network traffic, executions plans and others. Obviously its based on Oracle databases but the concepts are similar for any other solutions. You can download the complete presentation from: http://www28.cplan.com/cbo_export/208/PS_S298820_298820_208-1_v1.pdf http://www28.cplan.com/cbo_export/208/PS_S298820_298820_208-1_v2.pdf To download presentations, please enter the following when prompted. Note the Username and Password are case sensitive. Username: cboracle Password: oraclec6 Or you can see on-line only the Oracle Lucene integration details using Google docs at: http://docs.google.com/Presentation?id=ddgw7sjp_156gf9hczxv Best regards, Marcelo. On Wed, Oct 1, 2008 at 4:43 AM, agatone wrote: > > Hi, > I asked this question already on "lucene-general" list but also got advised > to ask here too. > > I'm working on a project that has big database in the background (some > tables have about 1500000 rows). We decided to use Lucene for "faster" > search. Our search works similar as all searches: you write search string, > get list of hits with detail link. But there is dilemma if we should store > more data into index than it's needed. > > One side of developing team insists that we should use lucene index as > somekind of storage for data so when you get hit, you go onto details and > then again use lucene to find document that matches the selected ID and take > the data from Lucene index. So in the end you end with copying complete > database tables into the lucene index. > > Other side insists on storing to index only data that is displayed directly > to the user when showing the search results list and needed for search > criteria. When you go onto details, you have the matching ID so you can > pickup that row from database by that ID rather than search it inside Lucene > index. > > Can someone please describe drawbacks and advantages of both approaches. > Actually can someone write down what's the actual profit, where and when of > the Lucene itself in real production env. > > IT would be great if there is anyone who could write his experience with > indexing and searching large amount of data. > > > Thank you > -- > View this message in context: http://www.nabble.com/Lucene-vs.-Database-tp19755932p19755932.html > Sent from the Lucene - Java Users mailing list archive at Nabble.com. > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > -- Marcelo F. Ochoa http://marceloochoa.blogspot.com/ http://marcelo.ochoa.googlepages.com/home ______________ Want to integrate Lucene and Oracle? http://marceloochoa.blogspot.com/2007/09/running-lucene-inside-your-oracle-jvm.html Is Oracle 11g REST ready? http://marceloochoa.blogspot.com/2008/02/is-oracle-11g-rest-ready.html --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org