lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Simon Willnauer (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-1967) make it easier to access default stopwords for language analyzers
Date Fri, 09 Oct 2009 14:00:32 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12764025#action_12764025
] 

Simon Willnauer commented on LUCENE-1967:
-----------------------------------------

Thanks Robert for bringing this up in a general context. I will take care of it soon.

> make it easier to access default stopwords for language analyzers
> -----------------------------------------------------------------
>
>                 Key: LUCENE-1967
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1967
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/analyzers
>            Reporter: Robert Muir
>            Assignee: Simon Willnauer
>            Priority: Minor
>
> DM Smith made the following comment: (sometimes it is hard to dig out the stop set from
the analyzers)
> Looking around, some of these analyzers have very different ways of storing the default
list.
> One idea is to consider generalizing something like what Simon did with LUCENE-1965,
LUCENE-1962,
> and having all stopwords lists stored as .txt files in resources folder.
> {code}
>   /**
>    * Returns an unmodifiable instance of the default stop-words set.
>    * @return an unmodifiable instance of the default stop-words set.
>    */
>   public static Set<String> getDefaultStopSet()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message