incubator-lucy-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Karman <pe...@peknet.com>
Subject Re: [lucy-dev] Implementing a tokenizer in core
Date Tue, 29 Nov 2011 03:29:53 GMT
Nathan Kurz wrote on 11/25/11 4:35 PM:
>  I'd like to discourage a quoted search for "Proper Name"
> from matching "is that proper?<br>\nName your price," and I think the
> easiest way to do this is by indexing some things that would normally
> be ignored.

The easiest way is to use the libswish3 parser, which automatically bumps the
token position based on HTML constructs like that.


-- 
Peter Karman  .  http://peknet.com/  .  peter@peknet.com

Mime
View raw message