lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stanislaw Osinski (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow
Date Mon, 20 Aug 2007 13:47:30 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521089
] 

Stanislaw Osinski commented on LUCENE-871:
------------------------------------------

One possible (and probably large) speed up for this code would be replacing the switch / case
structure (which for most characters needs to be evaluated down to the "default", which is
_lots_ of comparisons) with a plain static char[65536][] table. The cost for this is roughly
130kB of memory, but the speed-up should be pretty good. If lots of people are using this
filter and the memory cost is acceptable, later this week I can try to prepare another patch.

> ISOLatin1AccentFilter a bit slow
> --------------------------------
>
>                 Key: LUCENE-871
>                 URL: https://issues.apache.org/jira/browse/LUCENE-871
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Analysis
>    Affects Versions: 1.9, 2.0.0, 2.0.1, 2.1, 2.2
>            Reporter: Ian Boston
>            Assignee: Michael McCandless
>             Fix For: 2.3
>
>         Attachments: fasterisoremove1.patch, fasterisoremove2.patch, ISOLatin1AccentFilter.java.patch,
LUCENE-871.take4.patch
>
>
> The ISOLatin1AccentFilter is a bit slow giving 300+ ms responses when used in a highligher
for output responses.
> Patch to follow

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message