Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B5DAD7DA9 for ; Thu, 14 Jul 2011 22:24:48 +0000 (UTC) Received: (qmail 40947 invoked by uid 500); 14 Jul 2011 22:24:45 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 40800 invoked by uid 500); 14 Jul 2011 22:24:45 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 40786 invoked by uid 99); 14 Jul 2011 22:24:44 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 14 Jul 2011 22:24:44 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.210.176] (HELO mail-iy0-f176.google.com) (209.85.210.176) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 14 Jul 2011 22:24:39 +0000 Received: by iyi20 with SMTP id 20so941950iyi.35 for ; Thu, 14 Jul 2011 15:24:18 -0700 (PDT) Received: by 10.42.29.194 with SMTP id s2mr2448394icc.6.1310682258112; Thu, 14 Jul 2011 15:24:18 -0700 (PDT) Received: from bester.local ([65.78.136.75]) by mx.google.com with ESMTPS id v3sm416478ibh.33.2011.07.14.15.24.15 (version=SSLv3 cipher=OTHER); Thu, 14 Jul 2011 15:24:16 -0700 (PDT) Date: Thu, 14 Jul 2011 15:24:14 -0700 (PDT) From: Chris Hostetter To: solr-user@lucene.apache.org Subject: Re: omitTermFreq only? In-Reply-To: Message-ID: References: <85524CC8-3AA8-464A-95E8-119DE74DD8A2@mac.com> <201107132234.29266.markus.jelsma@openindex.io> User-Agent: Alpine 2.00 (DEB 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII : Sorry I should have made the objectives clear. The goal is to reduce the : index size by avoiding TermFrequency stored in the index (in .frq : segment files). Hmmm... why? you're talking about eliminating a single (compressed) int per term, and yet you want positions which take up a lot more space (at a minimum, even if each term only appears once in a single document, that's already as much space as the frequencies) on anything except a toy index, eliminating freq while keeping positions (if it were possible) is unlikely to even noticably affect the index size. what is the motivation for your objective? If your main motivation is to just "to reduce index size", then perhaps tell us more about your configuration/use cases and maybe we can offer alternative suggestions. http://people.apache.org/~hossman/#xyproblem XY Problem Your question appears to be an "XY Problem" ... that is: you are dealing with "X", you are assuming "Y" will help you, and you are asking about "Y" without giving more details about the "X" so that we can understand the full issue. Perhaps the best solution doesn't involve "Y" at all? See Also: http://www.perlmonks.org/index.pl?node_id=542341 -Hoss