From java-user-return-45230-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Sat Feb 27 15:04:25 2010 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 5501 invoked from network); 27 Feb 2010 15:04:24 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 27 Feb 2010 15:04:24 -0000 Received: (qmail 92228 invoked by uid 500); 27 Feb 2010 15:04:22 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 92193 invoked by uid 500); 27 Feb 2010 15:04:22 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 92185 invoked by uid 99); 27 Feb 2010 15:04:22 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 27 Feb 2010 15:04:22 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of glen.newton@gmail.com designates 209.85.160.176 as permitted sender) Received: from [209.85.160.176] (HELO mail-gy0-f176.google.com) (209.85.160.176) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 27 Feb 2010 15:04:12 +0000 Received: by gyb13 with SMTP id 13so488406gyb.35 for ; Sat, 27 Feb 2010 07:03:51 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=vbe5kfsi3/hmqVxoxD1dVldyHRFfeqBl0McHrrYBOvk=; b=xScGZHvOatAtGfiDbFehauC+cRNQ/JD/ranxq3eLHNnocRXini8fHn13LPp/GOUK+R SZB39VWvReZlT/vAP+7QIyyRpen0onVmttyWhs+kACAHz/2SmXSrfneSPHvNy6l9Vd5G gbsQ+fKm7acnQMDD/aGEwURf2dSY8Zb8DGn0o= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=vOKYtGs+3g3DZnWmAmxGhZ0l6N3qNbAXIcpNd1q40nDsr9qTe0I76YlMMPPFdkLKt0 YCqxQb48GhNFPZDbm0bsb9Xd4ID5Vygs3OA/zmakVsx7loPLStzbKXqzzPV5KT4KsM7Y A+aox+TA7FmYZejtSq4WmFIer8iD7itLVPAK8= MIME-Version: 1.0 Received: by 10.101.58.5 with SMTP id l5mr2782484ank.73.1267283031571; Sat, 27 Feb 2010 07:03:51 -0800 (PST) In-Reply-To: References: Date: Sat, 27 Feb 2010 10:03:51 -0500 Message-ID: <5e76f3841002270703r92535b1u2016f2f2591fa9ee@mail.gmail.com> Subject: Re: If you could have one feature in Lucene... From: Glen Newton To: java-user@lucene.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org Pluggable compression allowing for alternatives to gzip for text compression for storing. Specifically I am interested in bzip2[1] as implemented in Apache Commons Compress[2]. While bzip2 compression is considerable slower than gzip (although decompression is not too much slower than gzip) it compresses much better than gzip (especially text). Having the choice would be helpful, and for Lucene usage for non-text indexing, content specific compression algorithms may outperform the default gzip. And in these days of multi-core / multi-threading, perhaps we could convince the Apache Commons Compress team to implement a parallel Java version of bzip2 compression (theirs is single threaded), like pbzip2[3]. -glen [1]http://en.wikipedia.org/wiki/Bzip2 [2]http://commons.apache.org/compress/ [3]http://compression.ca/pbzip2/ On 24 February 2010 08:42, Grant Ingersoll wrote: > What would it be? > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > -- - --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org