lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-1786) improve performance of contrib/TestCompoundWordTokenFilter
Date Thu, 06 Aug 2009 09:51:15 GMT
improve performance of contrib/TestCompoundWordTokenFilter
----------------------------------------------------------

                 Key: LUCENE-1786
                 URL: https://issues.apache.org/jira/browse/LUCENE-1786
             Project: Lucene - Java
          Issue Type: Test
          Components: contrib/analyzers
            Reporter: Robert Muir
            Priority: Minor


contrib/analyzers/compound has some tests that use a hyphenation grammar file.

The tests are currently for german, and they actually are nice, they show how the combination
of the hyphenation rules and dictionary work in tandem.
The issue is that the german grammar file is not apache licensed: http://offo.sourceforge.net/hyphenation/licenses.html
So the test must download the entire offo zip file from sourceforge to execute.

I happen to think the test is a great example of how this thing works (with a language where
it matters), but we could consider using a different grammar file, for a language that is
apache licensed.
This way it could be included in the source with the test and would be more practical.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message