From: Dmitry Serebrennikov
Date: Fri, 24 May 2002 12:56:44 -0600
To: Lucene Users List
Subject: Re: Small indexes
Message-ID: <3CEE8CEC.3080803@earthlink.net>
References: <200205241150.17bk9a6ga3NZFji2@eagle>

MAILER-DAEMON@nagoya.betaversion.org wrote:

>>------------------------------------------------------------------------
>>
>>Subject: Small indexes
>>From: "David Elworthy"
>>Date: Thu, 23 May 2002 17:04:58 -0400
>>To:
>>
>>Are there any known problems with indexes over very small numbers of files? I have a program which works fine when it is indexing plenty of documents, but when it only indexes 10 or so, all that gets created is an 8 byte segments file. I build the index in RAM, and then merge it to disc, and I am stabilizing and optimizing in appropriate places.
>>The exact threshold for an index to get created seems to vary with the document size - if they are smaller you need more of them.
>>
>>It's not a big deal, as my actual document collection is not this small. I'm just curious.
>>
>>-- David Elworthy
>>
>There is no known problem, but there is buffering where 10 documents are
>indexed into memory and then are flushed to disk. The docs will also be
>flushed when the IndexWriter is closed. So it looks like you either
>don't close the IndexWriter or something similar is going on.
>
>Good luck.
>Dmitry.
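
To illustrate the buffering described in the reply, here is a minimal toy model in Java. It is not Lucene's code: the class ToyBufferedWriter, its methods, and the in-memory "onDisk" list are hypothetical stand-ins, and the threshold of 10 is simply the figure mentioned in the reply. It only shows why a writer that is never closed can leave fewer than 10 documents unwritten.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of a writer that buffers documents in memory. NOT Lucene's implementation. */
class ToyBufferedWriter {
    private final int bufferSize;                           // assumed flush threshold (10 in the reply)
    private final List<String> buffer = new ArrayList<>();  // documents held only in memory
    private final List<String> onDisk = new ArrayList<>();  // stand-in for data written to disk

    ToyBufferedWriter(int bufferSize) {
        this.bufferSize = bufferSize;
    }

    /** Buffers a document and flushes once the buffer reaches the threshold. */
    void addDocument(String doc) {
        buffer.add(doc);
        if (buffer.size() >= bufferSize) {
            flush();
        }
    }

    /** Closing the writer flushes whatever is still buffered. */
    void close() {
        flush();
    }

    private void flush() {
        onDisk.addAll(buffer);
        buffer.clear();
    }

    int flushedCount() {
        return onDisk.size();
    }

    int bufferedCount() {
        return buffer.size();
    }

    public static void main(String[] args) {
        ToyBufferedWriter writer = new ToyBufferedWriter(10);
        for (int i = 0; i < 7; i++) {
            writer.addDocument("doc" + i);
        }
        // Below the threshold and not yet closed: nothing has been flushed.
        System.out.println("before close: " + writer.flushedCount() + " flushed");
        writer.close();
        System.out.println("after close: " + writer.flushedCount() + " flushed");
    }
}
```

In this sketch, seven documents stay in the buffer until close() is called, which would match the symptom of a nearly empty index when the writer is left open.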