From: Erik Hatcher
Subject: Re: MaxFieldLength or MaxFields?
Date: Wed, 26 Oct 2005 06:23:00 -0400
To: java-user@lucene.apache.org

On 26 Oct 2005, at 02:50, Jeff Rodenburg wrote:

> I'm considering building out an index that will flatten a data structure,
> such that some Document "A" will have Fields 1, 2 and 3.
> Fields 1 and 2 are indexed/tokenized fields. Field 3 is indexed, and will
> contain many discrete values (up to possibly 5000).
> Couple of questions:
> 1. Does the DEFAULT_MAX_FIELD_LENGTH on IndexWriter come into play for
> Field 3? Field 3 values are added to the document individually, there's
> just a bunch of them.

No. The max field length only comes into play for tokenized (analyzed) fields.

> 2. Is MaxFieldLength a consideration for this?

The default is 10,000, so you're safe :) From IndexWriter.java:

    public final static int DEFAULT_MAX_FIELD_LENGTH = 10000;

> Any guidance before I go through the trouble of building the index would
> be greatly appreciated.

Never be afraid to just try things out. It's easy enough to whip up a simple Java program of only a few lines that would exercise this sort of thing. A simple for loop adding 5000 identically named non-tokenized, indexed fields to a document would prove this to yourself.

And now with Ferret - http://ferret.davebalmain.com/trac/ - the fun and lightweight nature of Ruby will allow us to experiment with Lucene without even compilation (or suffering Python syntax :).

     Erik