lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (Commented) (JIRA)" <>
Subject [jira] [Commented] (LUCENE-3883) Analysis for Irish
Date Tue, 20 Mar 2012 03:39:40 GMT


Robert Muir commented on LUCENE-3883:

Hmm another downside of #1 is that with a simple stopfilter approach, position increments
won't line up
if we have a phrase query of "n-athair" with indexed nAthair.

So I start to lean towards #2 since it would be a better solution... but I'm going to think
about it
and see if I come up with any other ideas.

Separately, what about h- when succeeded by a vowel? Is there actually usually a hyphen here?
(Wikipedia says no, playing around with GaelSpell seems to agree, but I don't know anything
about this language!)
Would this case be too aggressive to handle?
> Analysis for Irish
> ------------------
>                 Key: LUCENE-3883
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Jim Regan
>            Priority: Trivial
>              Labels: analysis, newbie
>         Attachments: LUCENE-3883.patch, irish.sbl
> Adds analysis for Irish.
> The stemmer is generated from a snowball stemmer. I've sent it to Martin Porter, who
says it will be added during the week.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message