lucy-dev mailing list archives

From: Peter Karman
Subject: Re: [lucy-dev] Implementing a tokenizer in core
Date: Tue, 29 Nov 2011 03:29:53 GMT
Nathan Kurz wrote on 11/25/11 4:35 PM:
>  I'd like to discourage a quoted search for "Proper Name"
> from matching "is that proper?<br>\nName your price," and I think the
> easiest way to do this is by indexing some things that would normally
> be ignored.

The easiest way is to use the libswish3 parser, which automatically bumps the
token position based on HTML constructs like that.
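
To illustrate the general idea of bumping the token position (this is a
minimal sketch, not libswish3's actual code or API): the tokenizer below
adds a gap to the position counter whenever it sees a break-like tag, so
a phrase search cannot match across that tag. The names POSITION_BUMP,
BREAK_TAGS, tokenize_with_positions and phrase_matches, the gap size, and
the tag list are all assumptions made up for this example.

```python
import re

# Hypothetical gap added to the position counter at each break-like tag.
POSITION_BUMP = 100

# Assumed set of tags that should separate phrases (not taken from libswish3).
BREAK_TAGS = {"br", "p", "div", "li", "td", "h1", "h2", "h3"}

# Matches either an HTML tag (group 1 = tag name) or a word (group 2).
TOKEN_RE = re.compile(r"<\s*/?\s*([a-zA-Z][a-zA-Z0-9]*)[^>]*>|(\w+)")


def tokenize_with_positions(html):
    """Return a list of (term, position) pairs, bumping the position at break tags."""
    tokens = []
    pos = 0
    for match in TOKEN_RE.finditer(html):
        tag, word = match.group(1), match.group(2)
        if tag is not None:
            # Tags are not indexed; break-like tags only widen the position gap.
            if tag.lower() in BREAK_TAGS:
                pos += POSITION_BUMP
        else:
            tokens.append((word.lower(), pos))
            pos += 1
    return tokens


def phrase_matches(tokens, phrase):
    """Return True if the phrase's words occur at consecutive positions."""
    words = phrase.lower().split()
    if not words:
        return False
    positions = {}
    for term, pos in tokens:
        positions.setdefault(term, set()).add(pos)
    return any(
        all(start + i in positions.get(word, ()) for i, word in enumerate(words))
        for start in positions.get(words[0], ())
    )


doc = tokenize_with_positions("is that proper?<br>\nName your price")
print(phrase_matches(doc, "Proper Name"))      # the <br> gap prevents a match
print(phrase_matches(doc, "name your price"))  # words after the tag stay adjacent
```

With this approach nothing extra has to be indexed: the tag itself is
dropped, and only the positions of the terms that follow it change.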

Peter Karman