lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-3105) Add analysis configurations for different languages to the example
Date Tue, 07 Feb 2012 16:30:59 GMT

    [ https://issues.apache.org/jira/browse/SOLR-3105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202511#comment-13202511
] 

Robert Muir commented on SOLR-3105:
-----------------------------------

{quote}
Question: you add new contraction lists for three languages to Solr's example, but shouldn't
they go into the common analyzer's resources directory and be copied over by ant sync-analyzers?
{quote}

Maybe, i put a TODO in those lists for that reason (its the first line in each one).
The problem is, in most cases they are tiny short! so a text file is awkward. But, maybe we
should just do this anyway.

{quote}
One other thing (separate issue probly): ElisionFilter is in package o.a.l.analysis.fr, but
you've added example uses with Italian and Catalan - shouldn't this class move up to package
o.a.l.analysis?
{quote}

Yeah its a little awkward: I think maybe it belongs in the .util package?

Also, we don't sync the english stopwords, (though, it does match lucene's). So thats another
improvement we could do, text-file those under .en package
instead of being a hardwired set in StopAnalyzer.

I think maybe we could open issues for all of these? I don't like it either, but i decided
to go with the TODO approach because
I'm not sure it should really block this issue (to the user, it will all be the same, this
is implementation details).

                
> Add analysis configurations for different languages to the example
> ------------------------------------------------------------------
>
>                 Key: SOLR-3105
>                 URL: https://issues.apache.org/jira/browse/SOLR-3105
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Robert Muir
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3105.patch
>
>
> I think we should have good baseline configurations for our supported analyzers
> so that its easy for people to get started.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message