Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: java-dev@lucene.apache.org
Received-SPF: pass (asf.osuosl.org: local policy)
Mime-Version: 1.0 (Apple Message framework v734)
In-Reply-To: <4367AB21.1030204@apache.org>
References: <LMENLAOACIBLMOIILNNNAEKCFGAA.rengels@ix.netcom.com>
 <B9588083-F098-42BD-822F-158FD15558D2@rectangular.com>
 <D0A696EE-92B7-4D0E-A464-82FC846F81CD@rectangular.com>
 <4367AB21.1030204@apache.org>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <778BFD4A-B58B-4041-91BD-32AFFA2FDDD7@rectangular.com>
Content-Transfer-Encoding: 7bit
From: Marvin Humphrey <marvin@rectangular.com>
Subject: Re: bytecount as String and prefix length
Date: Tue, 1 Nov 2005 20:52:28 -0800
To: java-dev@lucene.apache.org


On Nov 1, 2005, at 9:51 AM, Doug Cutting wrote:

> Another approach might be to, instead of converting to UTF-8 to  
> strings right away, change things to convert lazily, if at all.
> During index merging such conversion should never be needed.

!!

There ought to be some gains possible there, then.  No predictions as  
to how much, though.

> You needn't do this systematically throughout Lucene, but only  
> where it makes a big difference.  For example, if you could avoid  
> strings in SegmentMerger.mergeTermInfos() it might make a huge  
> difference.  This might be as simple as changing SegmentMergeInfo  
> to use a TermBuffer instead of a Term.  Does that make sense?

Abundant sense.  I'm not as familiar with SegmentMerger as I am with  
other parts of the org.apache.lucene.index package, because I haven't  
ported it yet.  But conceptually I understand exactly why this should  
require fewer resources.

I'll take a swing at SegmentMerger and submit a comprehensive diff.

Thanks for the suggestions,

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org