lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-1377) Add HTMLStripReader and WordDelimiterFilter from SOLR
Date Tue, 16 Jun 2009 19:25:07 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-1377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720305#action_12720305
] 

Michael McCandless commented on LUCENE-1377:
--------------------------------------------

I agree we'd need a more comprehensive "strategy" to consolidate all
analyzers in one place.  I also think it's important.  But it's a
biggie.

bq. its a pretty good workaround for the missing unicode support in lucene, but hopefully
this won't take much longer to fix.

I'm having trouble keeping track of the various issues to fix the
"missing unicode support in lucene".  Are there issues opened for all?
Should we open a consolidated issue for properly handling surrogate
pairs?

> Add HTMLStripReader and WordDelimiterFilter from SOLR
> -----------------------------------------------------
>
>                 Key: LUCENE-1377
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1377
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>    Affects Versions: 2.3.2
>            Reporter: Jason Rutherglen
>            Priority: Minor
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> SOLR has two classes HTMLStripReader and WordDelimiterFilter which are very useful for
a wide variety of use cases.  It would be good to place them into core Lucene.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message