manifoldcf-user mailing list archives

From Swapna Vuppala <swapna.kollip...@gmail.com>
Subject Re: Job crawling SharePoint repository does not end
Date Wed, 12 Sep 2012 07:48:41 GMT
Thanks for the clarification, Karl. I now understand the need for the plugin.

We are fine with using the plugin for SharePoint 2010 and we will get back
to you if we come across any issues.

Thanks and Regards,
Swapna.

On Mon, Sep 10, 2012 at 5:59 PM, Karl Wright <daddywri@gmail.com> wrote:

> I should mention that *partial* crawling is technically possible with
> SharePoint 2010, but we'd have to set the connector up with a switch
> that you can set to tell it to do things in ways that don't require
> the plugin.  This basically involves:
>
> (1) No file/folder security (same as SharePoint 2007)
> (2) For large libraries, only the first 2000 documents can be crawled
>
> If this is what you want, please open a ticket and I'll try to work
> this into our September release.
> Karl
>
>
> On Mon, Sep 10, 2012 at 8:05 AM, Karl Wright <daddywri@gmail.com> wrote:
> > The difference is SharePoint 2010, which disabled a number of key features
> > that were necessary for crawling.  For SharePoint 2010, the plugin is indeed
> > mandatory.
> >
> > Karl
> >
> > Sent from my Windows Phone
> > ________________________________
> > From: Swapna Vuppala
> > Sent: 9/10/2012 7:54 AM
> > To: user@manifoldcf.apache.org
> > Subject: Re: Job crawling SharePoint repository does not end
> >
> > Hi Karl,
> >
> > I have got the SharePoint-2010 plugin, but I have a couple of questions
> > before using it.
> >
> > When I was using ManifoldCF version 0.6, I tried defining repository
> > connections and crawling documents on them by running jobs without
> > installing anything on the SharePoint server. I thought I was just using
> > the connector mcf-sharepoint-connector.jar, which is on the machine
> > running ManifoldCF, and I assumed that I would be able to crawl documents
> > on any SharePoint server for which I have access permissions. I was of
> > the opinion that I don't have to be a SharePoint administrator and also
> > that I don't have to install anything on the SharePoint server.
> >
> > But looking at this plug-in, I think I was mistaken. Can you please
> > clarify whether installing these web services on the SharePoint server
> > is mandatory just to be able to crawl documents and index them into Solr?
> > Why is it different from the connector I was using in ManifoldCF 0.6?
> >
> > Thanks and Regards,
> > Swapna.
> >
> > On Thu, Sep 6, 2012 at 7:17 PM, Karl Wright <daddywri@gmail.com> wrote:
> >>
> >> There is a SharePoint-2010 plugin 0.1 release candidate available now
> >> on http://people.apache.org/~kwright .  This might save you some time.
> >>
> >> Karl
> >>
> >>
> >> On Thu, Sep 6, 2012 at 12:47 AM, Swapna Vuppala
> >> <swapna.kollipara@gmail.com> wrote:
> >> > Thanks Karl, I'll try and get the new build and use it shortly.
> >> >
> >> > Thanks and Regards,
> >> > Swapna.
> >> >
> >> > On Wed, Sep 5, 2012 at 11:01 PM, Karl Wright <daddywri@gmail.com> wrote:
> >> >>
> >> >> FWIW, CONNECTORS-492 was just completed, and merged into trunk.
> >> >>
> >> >> You will need a new build of the SharePoint-2010 plugin to use it.
> >> >>
> >> >> Thanks,
> >> >> Karl
> >> >>
> >> >> On Tue, Sep 4, 2012 at 7:34 AM, Swapna Vuppala
> >> >> <swapna.kollipara@gmail.com> wrote:
> >> >> > Hi Karl,
> >> >> >
> >> >> > I'll make sure to look at the things you had mentioned. Thanks again
> >> >> > for the information.
> >> >> >
> >> >> > Thanks and Regards,
> >> >> > Swapna.
> >> >> >
> >> >> >
> >> >> > On Tue, Sep 4, 2012 at 4:19 PM, Karl Wright <daddywri@gmail.com>
> >> >> > wrote:
> >> >> >>
> >> >> >> Also, please be certain to look at CONNECTORS-492, which applies to
> >> >> >> SharePoint 2010.  It may not affect you, but if it does, bear in mind
> >> >> >> we have not completed development on it yet.
> >> >> >>
> >> >> >> Karl
> >> >> >>
> >> >> >> On Tue, Sep 4, 2012 at 6:48 AM, Karl Wright <daddywri@gmail.com>
> >> >> >> wrote:
> >> >> >> > You will need the SharePoint-2010 plugin, also.  You can check that
> >> >> >> > out at:
> >> >> >> >
> >> >> >> > https://svn.apache.org/repos/asf/manifoldcf/integration/sharepoint-2010/trunk
> >> >> >> >
> >> >> >> > ... and follow the README.txt directions.
> >> >> >> >
> >> >> >> > Thanks!
> >> >> >> > Karl
> >> >> >> >
> >> >> >> >
> >> >> >> > On Tue, Sep 4, 2012 at 6:31 AM, Swapna Vuppala
> >> >> >> > <swapna.kollipara@gmail.com> wrote:
> >> >> >> >> Hi Karl,
> >> >> >> >>
> >> >> >> >> Yes, this is SharePoint 2010
> >> >> >> >> OK, then I'll try switching to trunk and start working with it.
> >> >> >> >> Thanks for the information, Karl.
> >> >> >> >>
> >> >> >> >> Thanks and Regards,
> >> >> >> >> Swapna.
> >> >> >> >>
> >> >> >> >>
> >> >> >> >> On Tue, Sep 4, 2012 at 3:44 PM, Karl Wright <daddywri@gmail.com> wrote:
> >> >> >> >>>
> >> >> >> >>> Hi - What version of SharePoint are you trying to crawl?
> >> >> >> >>>
> >> >> >> >>> If this is SharePoint 2010, development is underway and you will
> >> >> >> >>> have to use trunk.
> >> >> >> >>>
> >> >> >> >>> Karl
> >> >> >> >>>
> >> >> >> >>> On Tue, Sep 4, 2012 at 5:26 AM, Swapna Vuppala
> >> >> >> >>> <swapna.kollipara@gmail.com> wrote:
> >> >> >> >>> > Hi,
> >> >> >> >>> >
> >> >> >> >>> > I am trying to use the SharePoint connector of ManifoldCF for
> >> >> >> >>> > the first time and am having a couple of issues. Can someone
> >> >> >> >>> > please help me crawl these repositories successfully?
> >> >> >> >>> >
> >> >> >> >>> > I am using ManifoldCF version 0.6 and I see that the SharePoint
> >> >> >> >>> > connector is readily available for use. I have defined a
> >> >> >> >>> > Repository Connection of SharePoint type for the URL
> >> >> >> >>> > "https://mysite.arup.com/personal/swapna_vuppala/default.aspx"
> >> >> >> >>> > and the connection status shows "Connection working".
> >> >> >> >>> >
> >> >> >> >>> > I have a couple of documents in the libraries "Shared Documents"
> >> >> >> >>> > and "Personal Documents" and am interested in indexing them into
> >> >> >> >>> > Solr. When I define a job using the repository connection created
> >> >> >> >>> > above and a Solr output connection, I am able to add rules to
> >> >> >> >>> > include these libraries. When I start the job, the number listed
> >> >> >> >>> > in the "Documents" column is correct, but the job never ends. It
> >> >> >> >>> > is always in the "Running" state. I cannot see anything in
> >> >> >> >>> > "Simple History" except the "Job Start".
> >> >> >> >>> >
> >> >> >> >>> > The manifoldcf log file shows something like:
> >> >> >> >>> >
> >> >> >> >>> > WARN 2012-09-04 14:39:05,204 (Worker thread '1') - Service interruption reported for job 1346736412103 connection 'Test SharePoint': Remote procedure exception: Request is empty.
> >> >> >> >>> >
> >> >> >> >>> > Can someone please tell me if I am missing some steps or some
> >> >> >> >>> > configuration?
> >> >> >> >>> >
> >> >> >> >>> > Thanks and Regards,
> >> >> >> >>> > Swapna.
> >> >> >> >>
> >> >> >> >>
> >> >> >
> >> >> >
> >> >
> >> >
> >
> >
>
