lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Koji Sekiguchi <k...@r.email.ne.jp>
Subject Re: HTML tags and Lucene highlighting
Date Thu, 05 Apr 2012 21:52:15 GMT
(12/04/06 2:34), okayndc wrote:
> Hello,
>
> I currently use Lucene version 3.0...probably need to upgrade to a more
> current version soon.
> The problem that I have is when I test search for a an HTML tag (ex.
> <strong>), Lucene returns
> the highlighted HTML tag ~ which is what I DO NOT want.  Is there a way to
> "filter" HTML tags?
> I have read up on HTMLStripChar filter (packaged with Solr) and wondered if
> this is the way to go?
>
> Any help will be greatly appreciated,
> Thanks
>

There is a way to encode HTML tags:

https://builds.apache.org/job/Lucene-3.x/javadoc/contrib-highlighter/org/apache/lucene/search/highlight/SimpleHTMLEncoder.html

koji
-- 
Query Log Visualizer for Apache Solr
http://soleami.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message