lucene-dev mailing list archives

From "Simon Willnauer (JIRA)" <j...@apache.org>
Subject [jira] Assigned: (LUCENE-1967) make it easier to access default stopwords for language analyzers
Date Fri, 09 Oct 2009 14:00:32 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer reassigned LUCENE-1967:
---------------------------------------

    Assignee: Simon Willnauer

> make it easier to access default stopwords for language analyzers
> -----------------------------------------------------------------
>
>                 Key: LUCENE-1967
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1967
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/analyzers
>            Reporter: Robert Muir
>            Assignee: Simon Willnauer
>            Priority: Minor
>
> DM Smith made the following comment: (sometimes it is hard to dig out the stop set from the analyzers)
> Looking around, some of these analyzers have very different ways of storing the default list.
> One idea is to consider generalizing something like what Simon did with LUCENE-1965 and LUCENE-1962,
> and having all stopword lists stored as .txt files in a resources folder.
> {code}
>   /**
>    * Returns an unmodifiable instance of the default stop-words set.
>    * @return an unmodifiable instance of the default stop-words set.
>    */
>   public static Set<String> getDefaultStopSet()
> {code}
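
The accessor proposed above could be backed by a plain word list along the following lines. This is only a rough sketch, not the patch attached to the issue: the class name StopwordSketch, the helper loadStopSet, the holder class, and the "#" comment convention are all assumptions made for illustration, and the inline SAMPLE string stands in for a .txt file that a real analyzer would presumably read from its resources folder. It also uses Java 7+ syntax for brevity.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical analyzer-like class; the name is illustrative only.
final class StopwordSketch {

    // Stand-in for the contents of a stopwords .txt resource (assumed format:
    // one word per line, blank lines and lines starting with '#' ignored).
    private static final String SAMPLE = "# sample stopword list\nthe\nand\n\n of \n";

    // Reads one stopword per line and returns an unmodifiable set.
    static Set<String> loadStopSet(Reader reader) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        try (BufferedReader in = new BufferedReader(reader)) {
            String line;
            while ((line = in.readLine()) != null) {
                String word = line.trim();
                if (word.isEmpty() || word.startsWith("#")) {
                    continue; // skip blanks and comments
                }
                words.add(word);
            }
        }
        return Collections.unmodifiableSet(words);
    }

    // Holder idiom: the default set is loaded once, on first access.
    private static final class DefaultSetHolder {
        static final Set<String> DEFAULT_STOP_SET;
        static {
            try {
                // A real analyzer would open a classpath resource here instead.
                DEFAULT_STOP_SET = loadStopSet(new StringReader(SAMPLE));
            } catch (IOException e) {
                throw new ExceptionInInitializerError(e);
            }
        }
    }

    /**
     * Returns an unmodifiable instance of the default stop-words set.
     * @return an unmodifiable instance of the default stop-words set.
     */
    public static Set<String> getDefaultStopSet() {
        return DefaultSetHolder.DEFAULT_STOP_SET;
    }

    public static void main(String[] args) {
        System.out.println(getDefaultStopSet());
    }
}
```

Returning an unmodifiable view keeps callers from altering the shared default, and giving every analyzer the same static accessor would address the complaint that the stop set is hard to dig out.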

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

