Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C94EFE47E for ; Fri, 1 Feb 2013 05:49:13 +0000 (UTC) Received: (qmail 43622 invoked by uid 500); 1 Feb 2013 05:49:11 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 43559 invoked by uid 500); 1 Feb 2013 05:49:10 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 43534 invoked by uid 99); 1 Feb 2013 05:49:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Feb 2013 05:49:09 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of arunk786@gmail.com designates 209.85.210.177 as permitted sender) Received: from [209.85.210.177] (HELO mail-ia0-f177.google.com) (209.85.210.177) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Feb 2013 05:49:03 +0000 Received: by mail-ia0-f177.google.com with SMTP id h8so4739416iaa.8 for ; Thu, 31 Jan 2013 21:48:42 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=IstiBk8SAx6a95953mZINPhqje18Z1vikKs6PgicbSs=; b=FWn+B3Obhql/um/uJtpfJVABfIWnL2oPn79/zHwFR/W6V3NOxen0i65Kg9hSIG3n63 7beflbX3YrGY+yF6nwzV1gdxrR2a6p0GHykl5LgE3pCUI/gAsIT0g3FPCL2X8Y/gJ8+W W8Cif3+TrDxZ/clCI26fewG1XJ6NGfssaGl7ppHfJs4UuYn8hFH2G4P94JsHGjBkG+Zd TldnZ+3t4YJNzB7VBezG5WsXJnz4zhSRkfKiVtuA5aOWQ+w3ksBGzp8U8Z5BjjCzFLoY mIX775DoE4RozCRSNIs0ThvwCHRsvTMIidKaLv63SCY5t8NC4oyaYgzHr8ySuGWAWxmp ClaA== X-Received: by 10.50.195.232 with SMTP id ih8mr300740igc.17.1359697722176; Thu, 31 Jan 2013 21:48:42 -0800 (PST) MIME-Version: 1.0 Received: by 10.42.254.136 with HTTP; Thu, 31 Jan 2013 21:48:22 -0800 (PST) In-Reply-To: References: <005101cdfee1$6d99dff0$48cd9fd0$@thetaphi.de> From: arun k Date: Fri, 1 Feb 2013 11:18:22 +0530 Message-ID: Subject: Re: CompressingStoredFieldsFormat doesn't show improvement To: java-user Content-Type: multipart/alternative; boundary=14dae9341255bb75a204d4a34b0a X-Virus-Checked: Checked by ClamAV on apache.org --14dae9341255bb75a204d4a34b0a Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Hi, Random data was indexed. I wanted to see the worst case where little data is same across documents and which in most of my cases is. So, i guess in these scenarios compression becomes an overhead. Arun On Thu, Jan 31, 2013 at 8:00 PM, Robert Muir wrote: > The top method here is your random string generation. > > are you indexing random data? > > On Thu, Jan 31, 2013 at 12:46 AM, arun k wrote: > > Hi, > > > > Please find the snapshots here. > > http://picpaste.com/Lucene3.0.2-G00Z5FfX.png > > http://picpaste.com/Lucene4.1-LsxpcQk0.png > > > > Arun > > > > > > On Wed, Jan 30, 2013 at 5:30 PM, Uwe Schindler wrote: > > > >> Hi, > >> > >> > >> > >> there is nothing attached to your mail; maybe the mailing list softwar= e > >> removed it. Can you place it somewhere on the web (e.g. pastebin,=85)? > >> > >> > >> > >> Uwe > >> > >> > >> > >> ----- > >> > >> Uwe Schindler > >> > >> H.-H.-Meier-Allee 63, D-28213 Bremen > >> > >> http://www.thetaphi.de > >> > >> eMail: uwe@thetaphi.de > >> > >> > >> > >> From: arun k [mailto:arunk786@gmail.com] > >> Sent: Wednesday, January 30, 2013 12:04 PM > >> To: java-user > >> Subject: Re: CompressingStoredFieldsFormat doesn't show improvement > >> > >> > >> > >> Adrein, > >> Please find the attached profilers report. > >> > >> > >> > >> On Wed, Jan 30, 2013 at 3:35 PM, Adrien Grand > wrote: > >> > >> On Wed, Jan 30, 2013 at 8:08 AM, arun k wrote: > >> > Adrein, > >> > > >> > I have created an index of size 370M of 1 million docs of 40 fields > of 40 > >> > chars and did the profiling. > >> > I see that the indexing and in particular the addDocument & > >> > ConcurrentMergeScheduler in 4.1 takes double the time compared to > 3.0.2. > >> > >> Can you provide me with the detailed profiles? > >> > >> > >> > Looks like CompressionStoredFieldsFormat is of little use in my > scenario. > >> > >> You can can disable stored fields compression and use another > >> StoredFieldsFormat by defining a custom codec > >> ( > >> > http://lucene.apache.org/core/4_1_0/core/org/apache/lucene/codecs/package= -summary.html#package_description > >> ). > >> > >> > >> -- > >> Adrien > >> > >> --------------------------------------------------------------------- > >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > >> For additional commands, e-mail: java-user-help@lucene.apache.org > >> > >> > >> > >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --14dae9341255bb75a204d4a34b0a--