Return-Path: X-Original-To: apmail-lucene-general-archive@www.apache.org Delivered-To: apmail-lucene-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 46D3518B1A for ; Mon, 15 Jun 2015 20:18:53 +0000 (UTC) Received: (qmail 56178 invoked by uid 500); 15 Jun 2015 20:18:52 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 56124 invoked by uid 500); 15 Jun 2015 20:18:52 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 56113 invoked by uid 99); 15 Jun 2015 20:18:52 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 15 Jun 2015 20:18:52 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW X-Spam-Check-By: apache.org Received-SPF: error (nike.apache.org: local policy) Received: from [64.34.111.239] (HELO barmail6.idig.net) (64.34.111.239) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 15 Jun 2015 20:16:36 +0000 X-ASG-Debug-ID: 1434399467-0538de3ffb611850001-nmAKgu Received: from cwh901.canadianwebhosting.com (cwh901.canadianwebhosting.com [67.231.20.52]) by barmail6.idig.net with ESMTP id mncXAb8C4Tf21G56 for ; Mon, 15 Jun 2015 13:17:47 -0700 (PDT) X-Barracuda-Envelope-From: fuad@efendi.ca X-Barracuda-Apparent-Source-IP: 67.231.20.52 Received: from 216-191-42-62.dedicated.allstream.net ([216.191.42.62]:64489 helo=FEFENDIHP) by cwh901.canadianwebhosting.com with esmtpa (Exim 4.85) (envelope-from ) id 1Z4apH-003I9q-5x for general@lucene.apache.org; Mon, 15 Jun 2015 13:17:47 -0700 From: "Fuad Efendi" To: References: <1434394561.2590356.296201153.2C09E40F@webmail.messagingengine.com> In-Reply-To: Subject: RE: Lucene vs Solr Indexing Speed on Sample data Issue!!! Date: Mon, 15 Jun 2015 16:17:47 -0400 X-ASG-Orig-Subj: RE: Lucene vs Solr Indexing Speed on Sample data Issue!!! Message-ID: <001a01d0a7a8$5895eff0$09c1cfd0$@efendi.ca> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-Mailer: Microsoft Outlook 14.0 Thread-Index: AQGzxs9OX+O7ujPMr8ym38MWIgBTjQHuNZz6Akqjceydxfqz8A== Content-Language: en-ca X-Barracuda-Connect: cwh901.canadianwebhosting.com[67.231.20.52] X-Barracuda-Start-Time: 1434399467 X-Barracuda-URL: https://64.34.111.239:443/cgi-mod/mark.cgi X-Virus-Scanned: by bsmtpd at idig.net X-Barracuda-BRTS-Status: 1 X-Barracuda-Spam-Score: 0.59 X-Barracuda-Spam-Status: No, SCORE=0.59 using global scores of TAG_LEVEL=1000.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=3.5 tests=CN_BODY_332, PLING_PLING, THREAD_INDEX X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.3.19876 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- 0.01 THREAD_INDEX thread-index: AcO7Y8iR61tzADqsRmmc5wNiFHEOig== 0.12 CN_BODY_332 BODY: CN_BODY_332 0.46 PLING_PLING Subject has lots of exclamation marks X-Virus-Checked: Checked by ClamAV on apache.org >From my experience, "high throughput" example: Using single-thread SolrJ client, I can index (for example) 1000 = documents per second. And this is maximum "speed". Using 12 Threads, I can index 12000 documents per second, just because = we have 8-core SOLR, and 75% of processing is CPU-bound. You can do it with SOLR + SolrJ easily; with Lucene you will need much = more development efforts, but it is the same. Thanks, http://www.tokenizer.ca -----Original Message----- From: Ted Dunning [mailto:ted.dunning@gmail.com]=20 Sent: June-15-15 3:17 PM To: general@lucene.apache.org Subject: Re: Lucene vs Solr Indexing Speed on Sample data Issue!!! And what does high throughput actually mean in terms of number of = documents per second and bytes (or terms) per document? On Mon, Jun 15, 2015 at 11:56 AM, Upayavira wrote: > Please post the original question here, so that everything people need = > to review your question is included within this thread! > > Oh, and for a high-throughput system, 8Gb RAM doesn't sound like much. = > A Lucene index, whether inside Solr or not, benefits from a lot of = RAM. > > Thanks! > > Upayavira > > On Mon, Jun 15, 2015, at 05:19 AM, Argho Chatterjee wrote: > > Hello Everyone, > > > > I had posted a question on stackoverflow.com after performing a few=20 > > POCs > > > > My hadrware consist of a single i-3 intel processor (4 CPU as per=20 > > "dxdiag" > > on run ), 8GB Ram, Laptop machine. > > > > My Question Link : > > > http://stackoverflow.com/questions/30823314/lucene-vs-solr-indexning-s > peed-for-sampe-data > > > > but no one could solve it as of now.. > > I hope the question I posted is undertandable. > > > > Please if anyone could help me out with the indexing speed of Solr=20 > > (way > > slower) vs Lucene (way faster).. > > > > I am trying to build a module for real time indexing and querying,=20 > > and the traffic is high, POC pass with Lucene for handling High=20 > > Traffic for Indexing, for Solr It is not able to do so.. > > > > Again My Machine Spec : > > HP, intel core i3, 8GB ram, TB HDD. > > > > Please let me know if there is a problem with Solr or am I doing=20 > > anything wrong. > > > > Thanks > > Argho >