From: Karl Wright
To: user@manifoldcf.apache.org
Date: Tue, 30 Oct 2012 11:03:08 -0400
Subject: Re: Problem with reading files from Sharepoint 2010 to manifldcf 1.0.1

I finally was able to look at the logs.

The exception that stops the job is in fact coming from the GetListItems call:

at org.apache.axis.client.Call.invoke(Call.java:1812)
at com.microsoft.sharepoint.webpartpages.PermissionsSoapStub.getListItems(PermissionsSoapStub.java:234)
at org.apache.manifoldcf.crawler.connectors.sharepoint.SPSProxyHelper.getChildren(SPSProxyHelper.java:619)
at org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.processDocuments(SharePointRepository.java:1303)
at org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:551)

Clearly certain entities are expected to have children, but we're
either not invoking the service correctly for those, OR we're invoking
the service for entities that don't have the ability to get children
at all.

I don't see any evidence in this log that ANY getListItems calls are
succeeding. In fact, it is the first such call that fails. Why do
you think that discovery is working? There seems to be no evidence of
that.

The headers etc. all look good too:

DEBUG 2012-10-30 14:04:35,223 (Thread-439) - HttpConnectionManager.getConnection: config = HostConfiguration[host=http://16.59.60.113], timeout = 0
DEBUG 2012-10-30 14:04:35,223 (Thread-439) - Getting free connection, hostConfig=HostConfiguration[host=http://16.59.60.113]
DEBUG 2012-10-30 14:04:35,224 (Thread-439) - >> "POST /_vti_bin/MCPermissions.asmx HTTP/1.1[\r][\n]"

Karl

On Tue, Oct 30, 2012 at 8:39 AM, Fridler, Oren wrote:
> Hi
>
> I'm using apache-manifoldcf-1.0.1-bin
>
> I installed apache-manifoldcf-sharepoint-2010-plugin-0.1 on top of
> Sharepoint 2010
>
> On mcf I managed to create a Sharepoint repository connection and saw the
> status is "Connection Working"
>
> Also when I create the "Sharepoint to Solr" Job I can see some of the wiki
> libraries that I created on SP are available for selection so I assume MCF
> is getting this data from SP.
>
> But when I start the job it is getting stuck in status "running" forever,
> the mcf UI shows documents are discovered, some are processed and some are
> active, but on Solr side no document is received.
>
> On mcf logs I see the error at the end of this email.
>
> On my browser I can open http://16.59.60.113 - getting to SP site, and also
> http://16.59.60.113/_vti_bin/MCPermissions.asmx - getting to a page that
> lists these 2 services - GetListItems and GetPermissionCollection
>
> Attached are the mcf logs with DEBUG level.
>
> Any help or idea what can I do would be highly appreciated.
>
> Thanks
>
> Oren.
>
> AxisFault
>
> faultCode: {http://schemas.xmlsoap.org/soap/envelope/}Client
>
> faultSubcode:
>
> faultString: The Web application at http://16.59.60.113 could not be found.
> Verify that you have typed the URL correctly. If the URL should be serving
> existing content, the system administrator may need to add a new request URL
> mapping to the intended application.
>
> faultActor: http://16.59.60.113/_vti_bin/MCPermissions.asmx
>
> faultNode:
>
> faultDetail:
>
> {}Error: 1010 The Web application at http://16.59.60.113 could not be found.
> Verify that you have typed the URL correctly. If the URL should be serving
> existing content, the system administrator may need to add a new request URL
> mapping to the intended application. Microsoft.SharePoint
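
A note on checking which web-service calls actually went out on the wire: the reply above reasons from DEBUG wire-log lines of the form `>> "POST /_vti_bin/MCPermissions.asmx HTTP/1.1[\r][\n]"`. The following is a minimal sketch, not part of ManifoldCF, that pulls such request lines out of a log so the first call to a given endpoint can be located. The function names `find_requests` and `first_request_to` are hypothetical, and the regular expression assumes every request line has exactly the layout of the line quoted in this thread.

```python
import re

# Assumed layout, copied from the wire-log line quoted in this thread:
# DEBUG <date> <time> (<thread>) - >> "<METHOD> <path> HTTP/<version>..."
REQUEST_LINE = re.compile(
    r'^DEBUG (?P<timestamp>\S+ \S+) \((?P<thread>[^)]+)\) - >> '
    r'"(?P<method>[A-Z]+) (?P<path>\S+) HTTP/\d\.\d'
)


def find_requests(log_lines):
    """Return (timestamp, thread, method, path) for each outgoing request line."""
    requests = []
    for line in log_lines:
        match = REQUEST_LINE.match(line.strip())
        if match:
            requests.append((
                match.group("timestamp"),
                match.group("thread"),
                match.group("method"),
                match.group("path"),
            ))
    return requests


def first_request_to(log_lines, path_suffix):
    """Return the first request whose path ends with path_suffix, or None."""
    for request in find_requests(log_lines):
        if request[3].endswith(path_suffix):
            return request
    return None


# Two of the lines quoted in the reply, used here as sample input.
sample = [
    r'DEBUG 2012-10-30 14:04:35,223 (Thread-439) - Getting free connection, '
    r'hostConfig=HostConfiguration[host=http://16.59.60.113]',
    r'DEBUG 2012-10-30 14:04:35,224 (Thread-439) - >> '
    r'"POST /_vti_bin/MCPermissions.asmx HTTP/1.1[\r][\n]"',
]

if __name__ == "__main__":
    print(first_request_to(sample, "MCPermissions.asmx"))
```

Running this over the full log and comparing the timestamp and thread of the first `MCPermissions.asmx` request with those of the AxisFault would show whether any getListItems call preceded the failing one.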