Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 41383 invoked from network); 2 Apr 2010 19:08:52 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 2 Apr 2010 19:08:52 -0000 Received: (qmail 98883 invoked by uid 500); 2 Apr 2010 19:08:51 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 98828 invoked by uid 500); 2 Apr 2010 19:08:51 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 98818 invoked by uid 99); 2 Apr 2010 19:08:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Apr 2010 19:08:51 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Apr 2010 19:08:49 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id D6992234C4CC for ; Fri, 2 Apr 2010 19:08:27 +0000 (UTC) Message-ID: <2071895735.664691270235307877.JavaMail.jira@brutus.apache.org> Date: Fri, 2 Apr 2010 19:08:27 +0000 (UTC) From: "Paul Elschot (JIRA)" To: java-dev@lucene.apache.org Subject: [jira] Commented: (LUCENE-1410) PFOR implementation In-Reply-To: <521761508.1222889624375.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852924#action_12852924 ] Paul Elschot commented on LUCENE-1410: -------------------------------------- I think the mixed performance results for decompression and query times may be caused by the use of only a single method. For very short sequences (1 to 2 or 3 integers), I would expect VInt (actually VByte) to perform best. For long sequences (from about 25 integers) , (P)FOR should do best. In between the two, (a variant of) S9. The problem will be to find the optimal bordering sequence sizes to change the compression method. The fact that S9 is already doing better than VInt is encouraging. Since (P)FOR can do even better than S9, when using (P)FOR only for longer sequences, I'd expect a real performance boost for queries using frequently occurring terms in the index. Also, I'd recommend to verify query results for each method. S9 as I implemented it is only tested by its own test cases. When the query results are incorrect, measuring performance is not really useful, and this has happened already for the PFOR implementation here, see above in early October 2008. > PFOR implementation > ------------------- > > Key: LUCENE-1410 > URL: https://issues.apache.org/jira/browse/LUCENE-1410 > Project: Lucene - Java > Issue Type: New Feature > Components: Other > Reporter: Paul Elschot > Priority: Minor > Attachments: autogen.tgz, for-summary.txt, LUCENE-1410-codecs.tar.bz2, LUCENE-1410b.patch, LUCENE-1410c.patch, LUCENE-1410d.patch, LUCENE-1410e.patch, TermQueryTests.tgz, TestPFor2.java, TestPFor2.java, TestPFor2.java > > Original Estimate: 21840h > Remaining Estimate: 21840h > > Implementation of Patched Frame of Reference. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org