lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tommaso Teofili (Issue Comment Edited) (JIRA)" <j...@apache.org>
Subject [jira] [Issue Comment Edited] (LUCENE-3731) Create a analysis/uima module for UIMA based tokenizers/analyzers
Date Wed, 15 Feb 2012 00:09:59 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13208145#comment-13208145
] 

Tommaso Teofili edited comment on LUCENE-3731 at 2/15/12 12:08 AM:
-------------------------------------------------------------------

Thank you very much Steven for reporting.

The 
{noformat}
Feb 14, 2012 6:34:18 PM WhitespaceTokenizer initialize
INFO: "Whitespace tokenizer successfully initialized"
Feb 14, 2012 6:34:18 PM WhitespaceTokenizer typeSystemInit
INFO: "Whitespace tokenizer typesystem initialized"
{noformat}

messages are due to UIMA WhitespaceTokenizer Annotator which logs the initialization/processing/etc.
calls.
That is printed out many times because the testRandomStrings test method just does lots of
tricky tests on the UIMABaseAnalyzer which require the above calls to be executed repeatedly.

I'll take a look to the other failures which didn't show up on the tests I had done till now.
                
      was (Author: teofili):
    Thank you very much Steven for reporting.

The 
{noformat}
Feb 14, 2012 6:34:18 PM WhitespaceTokenizer initialize
INFO: "Whitespace tokenizer successfully initialized"
Feb 14, 2012 6:34:18 PM WhitespaceTokenizer typeSystemInit
INFO: "Whitespace tokenizer typesystem initialized"
{noformat}

messages are due to UIMA WhitespaceTokenizer Annotator which logs the initialization/processing/etc.
calls.
That is printed out many times because the testRandomStrings test method just does lots of
tricky tests on the UIMATokenizer which require the above calls to be executed repeatedly.

I'll take a look to the other failures which didn't show up on the tests I had done till now.
                  
> Create a analysis/uima module for UIMA based tokenizers/analyzers
> -----------------------------------------------------------------
>
>                 Key: LUCENE-3731
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3731
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>            Reporter: Tommaso Teofili
>            Assignee: Tommaso Teofili
>             Fix For: 3.6, 4.0
>
>         Attachments: LUCENE-3731.patch, LUCENE-3731_2.patch, LUCENE-3731_3.patch, LUCENE-3731_4.patch
>
>
> As discussed in SOLR-3013 the UIMA Tokenizers/Analyzer should be refactored out in a
separate module (modules/analysis/uima) as they can be used in plain Lucene. Then the solr/contrib/uima
will contain only the related factories.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message