Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 89279 invoked from network); 1 Aug 2007 21:39:31 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 1 Aug 2007 21:39:31 -0000 Received: (qmail 19034 invoked by uid 500); 1 Aug 2007 21:39:27 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 18074 invoked by uid 500); 1 Aug 2007 21:39:25 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 18063 invoked by uid 99); 1 Aug 2007 21:39:25 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Aug 2007 14:39:25 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of erickerickson@gmail.com designates 209.85.134.187 as permitted sender) Received: from [209.85.134.187] (HELO mu-out-0910.google.com) (209.85.134.187) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Aug 2007 21:39:12 +0000 Received: by mu-out-0910.google.com with SMTP id g7so319793muf for ; Wed, 01 Aug 2007 14:38:55 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=HbT9sJPIMYpPEDxvQTltqNoO91+bIJWefXwjZxdYgK7P7i17uWncOo1qmp3j/C7ptikzKsQ+kPB3r5c32FSg3PcXWQkm2njvR6yHGoHDD96flEaF4ktGu1OzZ2MYyyeWgHDsxjxC2wCDe2oIeOuXvnL3+/YibelUax4s1zkpWZU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=KQvc9COzAnwSP/1mjC16q6YcKvecKYr3SGZyiDh6Tjk3mpGFgi1rjob9C341s0a1ZNQucfJ9VQpnsvzgG2ktF/J7JWg8W8DJ4cDNopaeAxSLoRbQgaietCyFY0wam3ro8Sk4utIf4vBahgCElkrgPAeLHytJq+7Cfj+evCNaymo= Received: by 10.82.127.14 with SMTP id z14mr1531708buc.1186004335227; Wed, 01 Aug 2007 14:38:55 -0700 (PDT) Received: by 10.82.190.14 with HTTP; Wed, 1 Aug 2007 14:38:55 -0700 (PDT) Message-ID: <359a92830708011438n7e3c90c1g3feba4087d1cef35@mail.gmail.com> Date: Wed, 1 Aug 2007 17:38:55 -0400 From: "Erick Erickson" To: java-user@lucene.apache.org Subject: Re: Size of field? In-Reply-To: <9708bc630708011312h2fff0c34oc1a0b03a903f5f3c@mail.gmail.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_50715_30057575.1186004335211" References: <9708bc630707270822r541ffdd1t9b9e3732f9afaf80@mail.gmail.com> <359a92830707300727r9e9220euacb499d78038213f@mail.gmail.com> <9708bc630708011312h2fff0c34oc1a0b03a903f5f3c@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_50715_30057575.1186004335211 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Glad it worked out for you.... Did you ever have any insight into what was magical about 87,300? Although now that I re-read your mail, that was the number of characters, so I can imagine that your corpus averaged 8.73 characters/word .... Best Erick On 8/1/07, Eduardo Botelho wrote: > > Hi Erick!! > > You're right, I just use setMaxFieldLength() and all work fine. > > You save my life, thanks! (y) > > On 7/30/07, Erick Erickson wrote: > > > > See IndexWriter.setMaxFieldLength(). 87,300 is odd, since the default > > max field length, last I knew, was 10,000. But this sounds like > > it might relate to your issue. > > > > Best > > Erick > > > > On 7/27/07, Eduardo Botelho wrote: > > > > > > Hi guys, > > > > > > I would like to know if exist some limit of size for the fields of a > > > document. > > > > > > I'm with the following problem: > > > When a term is after a certain amount of characters (approximately > > 87300) > > > in > > > a field, the search does not find de occurrency. > > > If I divide my field in pages, the terms are found normally. > > > This problem occours when I make an exact query (query between quotes) > > > > > > What can be happening? > > > > > > I'm using BrazilianAnalyzer and StandardAnalyzer(for tests only) for > > both, > > > search and indexation. > > > > > > thanks... > > > > > > Sorry for my poor english... > > > > > > ------=_Part_50715_30057575.1186004335211--