Message-ID: <4281FC97.4080603@gmail.com>
Date: Wed, 11 May 2005 14:37:43 +0200
From: Piotr Kosiorowski
To: nutch-user@incubator.apache.org
Subject: Re: proxy
In-Reply-To: <966e89870505110521437b3cb@mail.gmail.com>

Hello,

As far as I remember, the current implementation cannot fetch through a proxy that requires authorization. However, Andrzej Bialecki is working on an httpclient-based implementation right now, and since httpclient supports proxies with authorization, it should be no problem to do so in Nutch.

You can check the current status of the httpclient-based patch by looking for the thread "Update: HTTPClient for protocol-http and protocol-https". You can even try the patch yourself.

Regards,
Piotr

k-team wrote:
> Hi all,
> I'm testing nutch on my PC, and need to get through a proxy to crawls pages.
> I've tried to set the "http.proxy.host" property like this:
> user:password@proxyIP
> but I get this error message:
> fetch of http://www.host.com/ failed with:
> net.nutch.protocol.http.HttpException: java.net.UnknownHostException:
> user:password@proxyIP
>
> How can I set my proxy with user/pwd? Is it possible?
>
> thanks,
> Kteam
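
A note on the error quoted above: the UnknownHostException names the whole string "user:password@proxyIP", which suggests the property value is used as a plain host name, so credentials cannot be embedded in it. The sketch below is only an illustration using the standard java.net classes. It is not Nutch code and not the httpclient patch mentioned above; the class ProxySpec and its methods are hypothetical names made up for this example. It separates the credentials from the host and hands them to the JDK as a Proxy and an Authenticator.

```java
import java.net.Authenticator;
import java.net.InetSocketAddress;
import java.net.PasswordAuthentication;
import java.net.Proxy;

/** Hypothetical helper (not part of Nutch): splits "user:password@host" into its parts. */
final class ProxySpec {
    final String user;      // null when the spec carries no credentials
    final String password;  // null when the spec carries no credentials
    final String host;      // bare host name or IP, the only part that can be resolved

    private ProxySpec(String user, String password, String host) {
        this.user = user;
        this.password = password;
        this.host = host;
    }

    /** Parses "user:password@host" or a bare "host". */
    static ProxySpec parse(String spec) {
        // Use the last '@' so that a password containing '@' stays intact.
        int at = spec.lastIndexOf('@');
        if (at < 0) {
            return new ProxySpec(null, null, spec);
        }
        String credentials = spec.substring(0, at);
        String host = spec.substring(at + 1);
        int colon = credentials.indexOf(':');
        if (colon < 0) {
            return new ProxySpec(credentials, "", host);
        }
        return new ProxySpec(credentials.substring(0, colon),
                             credentials.substring(colon + 1), host);
    }

    /** Builds an HTTP proxy for the bare host; the port is supplied separately. */
    Proxy toProxy(int port) {
        // createUnresolved avoids a DNS lookup at construction time.
        return new Proxy(Proxy.Type.HTTP, InetSocketAddress.createUnresolved(host, port));
    }

    /** Authenticator that answers only proxy challenges with the parsed credentials. */
    Authenticator toAuthenticator() {
        return new Authenticator() {
            @Override
            protected PasswordAuthentication getPasswordAuthentication() {
                if (getRequestorType() != RequestorType.PROXY || user == null) {
                    return null; // not a proxy challenge, or nothing to offer
                }
                return new PasswordAuthentication(user, password.toCharArray());
            }
        };
    }

    public static void main(String[] args) {
        ProxySpec spec = ProxySpec.parse("user:password@proxyIP");
        System.out.println("host=" + spec.host + " user=" + spec.user);
    }
}
```

With plain java.net.URLConnection one would call Authenticator.setDefault(spec.toAuthenticator()) and open the connection with url.openConnection(spec.toProxy(port)). Whether the Nutch fetcher can be made to use these classes is a separate question; as said above, the current implementation does not support it.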