Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 98C04D310 for ; Fri, 14 Dec 2012 23:11:58 +0000 (UTC) Received: (qmail 80245 invoked by uid 500); 14 Dec 2012 23:11:56 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 80202 invoked by uid 500); 14 Dec 2012 23:11:56 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 80194 invoked by uid 99); 14 Dec 2012 23:11:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Dec 2012 23:11:56 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of SRS0=QmKHBr=KI=basetechnology.com=jack@yourhostingaccount.com designates 65.254.253.130 as permitted sender) Received: from [65.254.253.130] (HELO mailout16.yourhostingaccount.com) (65.254.253.130) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Dec 2012 23:11:49 +0000 Received: from mailscan11.yourhostingaccount.com ([10.1.15.11] helo=mailscan11.yourhostingaccount.com) by mailout16.yourhostingaccount.com with esmtp (Exim) id 1TjePg-0002be-9G for java-user@lucene.apache.org; Fri, 14 Dec 2012 18:11:28 -0500 Received: from impout02.yourhostingaccount.com ([10.1.55.2] helo=impout02.yourhostingaccount.com) by mailscan11.yourhostingaccount.com with esmtp (Exim) id 1TjePf-0007Ey-RC for java-user@lucene.apache.org; Fri, 14 Dec 2012 18:11:27 -0500 Received: from authsmtp12.yourhostingaccount.com ([10.1.18.12]) by impout02.yourhostingaccount.com with NO UCE id bbBT1k0060FdXoS01bBT7N; Fri, 14 Dec 2012 18:11:27 -0500 X-Authority-Analysis: v=2.0 cv=HIVB5/Rv c=1 sm=1 a=yH02RjTyxywMAIqhn74x1Q==:17 a=aQzbgH187woA:10 a=a7ci0OKSP-kA:10 a=3jZET7lWBKwA:10 a=IkcTkHD0fZMA:10 a=jvYhGVW7AAAA:8 a=lrZXOINqSxEA:10 a=mV9VRH-2AAAA:8 a=h8Ubxvp8AAAA:20 a=CDBuLP3tAAAA:20 a=-meOZ-mNthXv93CZ2EAA:9 a=QEXdDO2ut3YA:10 a=bY6mUefMC8EA:10 a=t1ijpx9AV50gTBtUFlM2vg==:117 X-EN-OrigOutIP: 10.1.18.12 X-EN-IMPSID: bbBT1k0060FdXoS01bBT7N Received: from 207-237-113-14.c3-0.nyr-ubr1.nyr.ny.cable.rcn.com ([207.237.113.14] helo=JackKrupansky) by authsmtp12.yourhostingaccount.com with esmtpa (Exim) id 1TjePf-0007FV-K1 for java-user@lucene.apache.org; Fri, 14 Dec 2012 18:11:27 -0500 Message-ID: <4D2978C9E2864D018F7C7B3A025AABA2@JackKrupansky> From: "Jack Krupansky" To: References: <019F8C89858B498B9DE68F34634CFFF5@JackKrupansky> <032301cdda4e$942ace70$bc806b50$@thetaphi.de> In-Reply-To: <032301cdda4e$942ace70$bc806b50$@thetaphi.de> Subject: Re: precisionStep for days in TrieDate Date: Fri, 14 Dec 2012 18:11:21 -0500 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="UTF-8"; reply-type=original Content-Transfer-Encoding: 8bit X-Priority: 3 X-MSMail-Priority: Normal Importance: Normal X-Mailer: Microsoft Windows Live Mail 15.4.3555.308 X-MimeOLE: Produced By Microsoft MimeOLE V15.4.3555.308 X-EN-UserInfo: e0a4b55451ed9f27313ebf02e3d4348d:fc4a93e1349e680c52bdd723c0ab3ef6 X-EN-AuthUser: jack@basetechnology.com Sender: "Jack Krupansky" X-EN-OrigIP: 207.237.113.14 X-EN-OrigHost: 207-237-113-14.c3-0.nyr-ubr1.nyr.ny.cable.rcn.com X-Virus-Checked: Checked by ClamAV on apache.org Thanks, you answered the main question - 26 doesn't simply lop off the time of day. Although, I still don't completely follow how trie works (without reading the paper itself.) -- Jack Krupansky -----Original Message----- From: Uwe Schindler Sent: Friday, December 14, 2012 5:58 PM To: java-user@lucene.apache.org Subject: RE: precisionStep for days in TrieDate Hi, > If I specify a precisionStep of 26 for a TrieDate field, what rough impact > should this have on both performance and index size? This value is mostly useless, everything > 8 does slowdown the queries tot he speed of TermRangeQuery. > The input data has time in it, but the milliseconds per day is not needed > for > the app. Will Lucene store only the top 64 minus 26 bits of data and > discard > the low 26 bits? No, you may need to read the Javadocs of NumericRangeQuery, now updated with formulas: http://goo.gl/nyXQR The precisionStep is a count, after how many bits of the indexed value a new term starts. The original value is always indexed in full precision. Precision step of 4 for a 32 bit value(integer) means terms with these bit counts: All 32, left 28, left 24, left 20, left 16, left 12, left 8, left 4 bits of the value (total 8 terms/value). A precision step of 26 would index 2 terms: all 32 bits and one single term with the remaining 6 bits from the left. > I’ve read that a higher precisionStep will lower performance. Will a > precisionStep of 26 have dramatically lower performance when referencing > days (without time of day)? See above. The assumption that 26 will limit precision to days is wrong. > I suppose that the piece of information I am missing is whether trie > precisionStep simply affects some extra index table that trie keeps beyond > the raw data values or the data values themselves. It only affects how the value is indexed (how many terms), but not the value. Uwe --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org