nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jasper Kamperman <jasper.kamper...@openwaternet.com>
Subject Re: About search inner links information
Date Tue, 03 Mar 2009 21:57:53 GMT
Oh and the documentation also specifies a depth parameter that says  
how far afield the crawler may go. I think default is 10 but not sure.

Sent from my iPhone

On Mar 3, 2009, at 12:53 PM, Yves Yu <yves0815@gmail.com> wrote:

> you mean, we can do this without additional configuration? how about  
> 10
> depth like this? how can I set it?thanks.
>
> 2009/3/4 Jasper Kamperman <jasper.kamperman@openwaternet.com>
>
>> Could be a lot of reasons. I'd start by investigating the index  
>> with Luke
>> to see if ccc made it into the index and if I can search out the  
>> page with
>> the word "big". From what I find out with Luke I'd work my way back  
>> to the
>> root cause
>>
>> Sent from my iPhone
>>
>>
>> On Mar 3, 2009, at 7:40 AM, Yves Yu <yves0815@gmail.com> wrote:
>>
>> Hi, all,
>>> for example,
>>>
>>> The page www.aaa.com has a link www.bbb.com
>>> www.bbb.com has a link www.ccc.com
>>> www.ccc.com has a word: big
>>>
>>> It seems I cannot find "big" in www.ccc.com, is it possible? How  
>>> can I
>>> set
>>> the configurations?
>>>
>>> Thanks in advance!
>>>
>>> Yves
>>>
>>

Mime
View raw message