nutch-dev mailing list archives

From "Andrzej Bialecki (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-244) Inconsistent handling of property values boundaries / unable to set db.max.outlinks.per.page to infinite
Date Wed, 05 Apr 2006 16:52:45 GMT
    [ http://issues.apache.org/jira/browse/NUTCH-244?page=comments#action_12373396 ] 

Andrzej Bialecki  commented on NUTCH-244:
-----------------------------------------

We don't pass the Configuration object to the constructor, so the constructor has no way to
read the value of this property. The Configuration is set later, using setConf().

Also, ParseData needs to correctly read serialized instances, which were created with possibly
different values of this parameter, so this piece of code has to be there anyway.

Also, note that we always write out all outlinks. This is to ensure that if you e.g. increase
the parameter value in the future you can still recover as much data as possible from older
segments.
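
The behaviour described above can be illustrated with a minimal sketch. This is not the actual ParseData code: the class OutlinkRecordSketch, the method setMaxOutlinks() (standing in for setConf()), the helper roundTrip() and the default of 100 are all hypothetical, and treating a negative value as "no limit" is an assumption modelled on the file.content.limit convention mentioned in the report. It shows the limit arriving after construction, every outlink being written regardless of the limit, and the limit being applied only when the outlinks are read back.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Arrays;

/** Hypothetical stand-in for a ParseData-like record; not the Nutch class. */
final class OutlinkRecordSketch {
    private String[] outlinks = new String[0]; // always holds the full list
    private int maxOutlinks = 100;             // assumed default, replaced later

    OutlinkRecordSketch() {}

    // No configuration is available here, so the limit cannot be applied yet.
    OutlinkRecordSketch(String[] outlinks) {
        this.outlinks = outlinks.clone();
    }

    // Stands in for setConf(): the limit is supplied after construction.
    void setMaxOutlinks(int max) {
        this.maxOutlinks = max;
    }

    // Write out every outlink, regardless of the current limit.
    void write(DataOutputStream out) throws IOException {
        out.writeInt(outlinks.length);
        for (String link : outlinks) {
            out.writeUTF(link);
        }
    }

    // Read back whatever was stored, however many links that is.
    void readFields(DataInputStream in) throws IOException {
        int total = in.readInt();
        outlinks = new String[total];
        for (int i = 0; i < total; i++) {
            outlinks[i] = in.readUTF();
        }
    }

    // Apply the limit on access; a negative limit is assumed to mean "no limit".
    String[] getOutlinks() {
        if (maxOutlinks < 0 || outlinks.length <= maxOutlinks) {
            return outlinks.clone();
        }
        return Arrays.copyOf(outlinks, maxOutlinks);
    }

    // Serialize the links, read them back, then apply the given limit.
    static String[] roundTrip(String[] links, int limit) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            new OutlinkRecordSketch(links).write(new DataOutputStream(bytes));

            OutlinkRecordSketch copy = new OutlinkRecordSketch();
            copy.readFields(new DataInputStream(
                    new ByteArrayInputStream(bytes.toByteArray())));
            copy.setMaxOutlinks(limit);
            return copy.getOutlinks();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Because the stored data is never truncated in this sketch, calling roundTrip() on the same links with a larger limit, or with a negative one, returns more of them without the segment having to be regenerated.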

> Inconsistent handling of property values boundaries / unable to set db.max.outlinks.per.page to infinite
> --------------------------------------------------------------------------------------------------------
>
>          Key: NUTCH-244
>          URL: http://issues.apache.org/jira/browse/NUTCH-244
>      Project: Nutch
>         Type: Bug
>     Versions: 0.8-dev
>     Reporter: AJ Banck
>
> Some properties like file.content.limit support using negative numbers (-1) to 'disable' a limitation.
> Other properties do not support this. 
> I tried disabling the limit set by db.max.outlinks.per.page, but this isn't possible.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

