nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mohamed Parvez <par...@gmail.com>
Subject Re: Nutch truncating URL to 318 Chars
Date Tue, 01 Sep 2009 21:55:27 GMT
http://business.verizon.net/SMBPortalWeb/appmanager/SMBPortal/smb?_pageLabel=SMBPortal_page_main_marketplace&_nfpb=true&_windowLabel=MarketPlacePFController_1&MarketPlacePFController_1_actionOverride=%252Fpageflows%252Fverizon%252Fsmb%252Fportal%252FmarketPlacePF%252FgetProductDetails&MarketPlacePFController_1productsId=386

Thanks/Regards,
Parvez



On Tue, Sep 1, 2009 at 4:43 PM, Fuad Efendi <fuad@efendi.ca> wrote:

> > I opened the part-00000 file in the dump folder and there, is only ONE
> url
> > and it has been truncated to 318 chars
> > How make Nutch consider URLs with length more than 318 chars
>
> Please provide original (before truncating) sample of such URL
> Thanks
>
>
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message