nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1028) Log parser keys
Date Wed, 02 Nov 2011 11:19:32 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13142056#comment-13142056
] 

Hudson commented on NUTCH-1028:
-------------------------------

Integrated in Nutch-nutchgora-ant #9 (See [https://builds.apache.org/job/Nutch-nutchgora-ant/9/])
    NUTCH-1187 Port NUTCH-1028 to nutchgora - log parser keys

ferdy : http://svn.apache.org/viewvc/nutch/branches/nutchgora/viewvc/?view=rev&root=&revision=1196514
Files : 
* /nutch/branches/nutchgora/CHANGES.txt
* /nutch/branches/nutchgora/src/java/org/apache/nutch/parse/ParserJob.java

                
> Log parser keys
> ---------------
>
>                 Key: NUTCH-1028
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1028
>             Project: Nutch
>          Issue Type: Task
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Trivial
>             Fix For: 1.4
>
>         Attachments: NUTCH-1028-1.4-1.patch
>
>
> The parser can take ages (many hours) to complete. During this time the only output is
an error or warning if it's unable to parse something (which is very common). Sometimes the
parser can run for several hours without any output: this is scary. I propose to add a LOG.info
to the mapper and write the key when parsing, similar to the fetcher.
> Thoughts?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message