manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wright <daddy...@gmail.com>
Subject Re: No SharePoint (2010) Site Pages in crawl
Date Wed, 20 Nov 2013 20:56:51 GMT
Right, that makes sense.
Karl



On Wed, Nov 20, 2013 at 3:53 PM, Mark Libucha <mlibucha@gmail.com> wrote:

> A little more information...the mime type in all these cases is null.
>
>
> On Wed, Nov 20, 2013 at 12:26 PM, Mark Libucha <mlibucha@gmail.com> wrote:
>
>> I noticed this in the logs today...so it appears to be a problem with the
>> output connector, not with the SharePoint crawler. Sorry for the confusion.
>>
>> SharePoint: Skipping document '/test3/SitePages//greg data.aspx' because
>> output connector says mime type is not indexable
>>
>> Would be nice if the log message included the mime type.
>>
>>
>> On Tue, Nov 19, 2013 at 8:08 PM, Mark Libucha <mlibucha@gmail.com> wrote:
>>
>>> EnumerateFolder?
>>>
>>> http://msdn.microsoft.com/en-us/library/ms774758(v=office.12).aspx
>>>
>>> I really have no idea, and I'm not trying to add work to your plate.
>>>
>>> I'm just trying to understand what MCF does and does not crawl on a
>>> SharePoint site. I think you answered that.
>>>
>>> Thanks,
>>>
>>> Mark
>>>
>>>
>>> On Tue, Nov 19, 2013 at 5:32 PM, Karl Wright <daddywri@gmail.com> wrote:
>>>
>>>> Hi Mark,
>>>>
>>>> The only content the current SharePoint connector crawls is:
>>>> - Sites and subsites
>>>> - Libraries, and files within
>>>> - Lists, and listitems, and attachments of listitems
>>>>
>>>> The connector does not currently know how to discover SitePages or
>>>> crawl them.  If you would like this functionality, you will need to create
>>>> a ticket and hope that I have time to figure out how to do it using the
>>>> available web services.
>>>>
>>>> :-)
>>>> Karl
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> On Tue, Nov 19, 2013 at 8:27 PM, Mark Libucha <mlibucha@gmail.com>wrote:
>>>>
>>>>> None of my crawls ever turn up SP "site pages". These are just normal
>>>>> .aspx web pages. The URLs look something like this:
>>>>>
>>>>> http://mydomain.com/mysite/SitePages/mark%20data.aspx
>>>>>
>>>>> I've tried tweaking the "Allowed file extensions", but can't get it to
>>>>> work.
>>>>>
>>>>> I'm using MCF 4.1.
>>>>>
>>>>> Any other suggestions?
>>>>>
>>>>> Thanks,
>>>>>
>>>>> Mark
>>>>>
>>>>
>>>>
>>>
>>
>

Mime
View raw message