nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ferdy Galema (JIRA)" <>
Subject [jira] [Updated] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
Date Mon, 07 May 2012 13:44:50 GMT


Ferdy Galema updated NUTCH-1356:

    Attachment: NUTCH-1356-trunk-v2.patch

It was working though, I guess that is because of a transitive dependancy. Anyway it's best
to declare it as a direct dependancy too. Patch v2 does this. (11.0.2 --> the same as the
already present jar).
> ParseUtil use ExecutorService instead of manually thread handling.
> ------------------------------------------------------------------
>                 Key: NUTCH-1356
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Ferdy Galema
>             Fix For: nutchgora, 1.6
>         Attachments: NUTCH-1356-trunk-v2.patch, NUTCH-1356-trunk.patch, NUTCH-1356.patch
> Because ParseUtil manages it's own parser threads by creating a thread for every parse
it sometimes happens that specific parsers are very expensive. For example, parsers that have
threadlocal fields will initialize them for every item to be parsed.
> By simply introducing a caching ExecutorService the ParseUtil will be able to cache threads
therefore parsing more efficient. See attached patch.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message