Return-Path: X-Original-To: apmail-incubator-connectors-user-archive@minotaur.apache.org Delivered-To: apmail-incubator-connectors-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5504392F8 for ; Thu, 26 Apr 2012 19:15:20 +0000 (UTC) Received: (qmail 30495 invoked by uid 500); 26 Apr 2012 19:15:20 -0000 Delivered-To: apmail-incubator-connectors-user-archive@incubator.apache.org Received: (qmail 30465 invoked by uid 500); 26 Apr 2012 19:15:20 -0000 Mailing-List: contact connectors-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: connectors-user@incubator.apache.org Delivered-To: mailing list connectors-user@incubator.apache.org Received: (qmail 30452 invoked by uid 99); 26 Apr 2012 19:15:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Apr 2012 19:15:20 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of daddywri@gmail.com designates 209.85.217.175 as permitted sender) Received: from [209.85.217.175] (HELO mail-lb0-f175.google.com) (209.85.217.175) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Apr 2012 19:15:13 +0000 Received: by lbbgo4 with SMTP id go4so82592lbb.6 for ; Thu, 26 Apr 2012 12:14:53 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; bh=+726aatBFqSec7ypOw9ZxGWORkEDFx4JWJFfoXVIqvA=; b=h6jk0g5R4kf4HinPrjLlvMkskqlLJhsDVDbrW7S1eyIb+Q/IZ5GxKBfVlb6ZQknL6f bTeohdF18QaOwyZY+RlubKoISgZOjRiYeLQvsyFqvw8eZryemp9xKOVPKNhWKR+nw5OZ BVyv0fCBY/EBVCxGPkSsalb0X8XZSTY65ZhXjuaSceGiva9HD1VFrjzuwWTsISSU4JN2 gBGDYebXD2lCnuEpnkF+zmj2gVQ1jpR2ZyofrWzntweNaif+AyOKYbmZMiC1CqVQLt+M jD4uLvBfoaTSLGlJuONX61jOWTqfxSySwj1VhUaaQy8ilkHQ+H6TX0YxRw9PkVb54rib lzOg== MIME-Version: 1.0 Received: by 10.152.144.101 with SMTP id sl5mr1342179lab.51.1335467693109; Thu, 26 Apr 2012 12:14:53 -0700 (PDT) Received: by 10.112.6.165 with HTTP; Thu, 26 Apr 2012 12:14:53 -0700 (PDT) In-Reply-To: References: <4F995979.5080303@usit.uio.no> Date: Thu, 26 Apr 2012 15:14:53 -0400 Message-ID: Subject: Re: Ingestion API socket timeout exception waiting for response code From: Karl Wright To: connectors-user@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi Erlend, I had some time today and was able to verify that everything worked fine against what I have currently on my laptop, which is Solr 3.2. The second job run looks like this: 04-26-2012 15:11:44.154 job end 1335467343879(test) 0 1 =09 04-26-2012 15:11:34.159 document deletion (solr) file:/C:/testcrawl/there.txt 200 0 117 04-26-2012 15:11:24.690 read document C:\testcrawl OK 0 1 =09 04-26-2012 15:11:24.494 job start 1335467343879(test) 0 1 So it appears that either something changed in Solr, or SSL support is broken, or your network is not permitting a valid HTTP response for some reason. Karl On Thu, Apr 26, 2012 at 11:10 AM, Karl Wright wrote: > Hi Erlend, > > Can you try the following: > > (1) Make a fresh Solr checkout of 3.6 or whatever Solr version you are > using, and build it > (2) Start it > (3) Run a simple filesystem crawl using a Solr connection that is > created with the default values > (4) Delete a file in your filesystem that was crawled > (5) Crawl again > > Does the deletion happen OK? > > AFAIK, nothing has changed in the Solr connector that should affect > the ability to delete. =A0This test will confirm that it is still > working. > > Thanks, > Karl > > > On Thu, Apr 26, 2012 at 10:19 AM, Erlend Gar=E5sen > wrote: >> It seems that MCF cannot delete documents from Solr. A timeout occurs, a= nd >> the job stops after a while. >> >> This is what I can see from the log: >> =A0WARN 2012-04-20 18:24:30,373 (Worker thread '16') - Service interrupt= ion >> reported for job 1327930125433 connection 'Web crawler': Ingestion API >> socket timeout exception waiting for response code: Read timed out; >> ingestion will be retried again later >> >> If I take a further look in Simple History, it seems that this error is >> related to document deletion. >> >> I have tried to delete the document manually by using curl from the same >> server MCF is installed on in case we have some access restrictions, but >> Curr succeeded. >> >> We do not have any problems with adding, the timeout only occurs while >> deleting documents. >> >> I have checked our Solr configuration. MCF does use the correct path for >> document deletion, i.e. /update. >> >> The correct realm, username and password for our Solr server are entered >> correctly and the SSL certificate is valid as well. >> >> Erlend >> >> -- >> Erlend Gar=E5sen >> Center for Information Technology Services >> University of Oslo >> P.O. Box 1086 Blindern, N-0317 OSLO, Norway >> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31= 050