lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andi Vajda <va...@osafoundation.org>
Subject Re: inconsistency/performance trap of empty terms
Date Fri, 29 Oct 2010 06:12:51 GMT

On Oct 28, 2010, at 22:32, Robert Muir <rcmuir@gmail.com> wrote:

> On Thu, Oct 28, 2010 at 10:28 PM, Andi Vajda  
> <vajda@osafoundation.org> wrote:
>>
>> I've used this in a URL index. I needed to be able to distinguish  
>> between
>> searching URLs that had, say, no path, from searching URLs without  
>> matching
>> the path component. The absence of path was represented with an  
>> empty token
>> in the path field.
>>
>
> but you didn't really need to use the empty term... you could have
> used something like U+001F INFORMATION SEPARATOR... and your whole
> index would have been an entire byte bigger?

Of course, anything not found in a url path could serve as 'empty'.

Andi..


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message