uima-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Philip Ogren <phi...@ogren.info>
Subject Re: performance of JCas.reset()
Date Thu, 17 Jun 2010 21:56:04 GMT
I did this using version 2.2.2.  The JCas is initialized with a very 
simple type system consisting of a single 'Sentence' type which has no 
features.  There are no additional user-defined indexes.


On 6/17/2010 5:06 AM, Marshall Schor wrote:
> What release of UIMA are you testing with?
>
> Are there any UIMA index definitions in your test case instance?
>
> -Marshall
>
> On 6/16/2010 4:23 PM, Philip Ogren wrote:
>    
>> When I run the following loop it takes about 6 seconds on my 2GHz
>> machine:
>>
>> for(int i=0; i<10000; i++) {
>>
>> jCas.reset();
>>
>> }
>>
>> Which comes out to a .6 milliseconds per call. This is pretty slow for
>> cases in which you have many short documents. For example, this would
>> add 10 minutes of processing time for 1M document corpus. Is this a
>> known issue and is there anything that I can do to minimize this impact?
>>
>> Thanks,
>>
>> Philip
>>
>>      
> >
>
>
>
> No virus found in this incoming message.
> Checked by AVG - www.avg.com
> Version: 9.0.829 / Virus Database: 271.1.1/2942 - Release Date: 06/16/10 12:35:00
>
>    

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message