lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <>
Subject [jira] Commented: (LUCENE-1377) Add HTMLStripReader and WordDelimiterFilter from SOLR
Date Tue, 16 Jun 2009 19:35:07 GMT


Robert Muir commented on LUCENE-1377:

Michael, I only opened one issue: LUCENE-1488...
When I talk about missing unicode support, I'm not really referring to the surrogate pair
I'm talking about support for the unicode standard (yes the stuff like breaking text into
words and erasing case differences and normalization and what not).

separately, maybe your confusion is related to the fact that there are a lot of existing jira
issues whose root cause is the lack of this functionality. I didn't cause this!

I consider the surrogate pair issue a separate java 5 migration issue, and I think it is consolidated:

> Add HTMLStripReader and WordDelimiterFilter from SOLR
> -----------------------------------------------------
>                 Key: LUCENE-1377
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>    Affects Versions: 2.3.2
>            Reporter: Jason Rutherglen
>            Priority: Minor
>   Original Estimate: 24h
>  Remaining Estimate: 24h
> SOLR has two classes HTMLStripReader and WordDelimiterFilter which are very useful for
a wide variety of use cases.  It would be good to place them into core Lucene.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message