manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From lalit jangra <lalit.j.jan...@gmail.com>
Subject Re: FW: What all items MCF crawls in Sharepoint Lists
Date Mon, 04 Aug 2014 12:40:55 GMT
Thanks Karl,

Please find below some of URLs from solr logs.

http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D792&literal.Created=2014-01-06+14:00:45&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-01-06+14:00:45&literal.Author=System+Account&literal.User=Geraldine+Davis&literal.WorkflowAssociation={7791ab09-0117-4a78-a14e-e5aee9b40d78}&literal.lcf_metadata_id=792&literal.Occurred=2014-01-06+14:00:44&literal.Event=1&literal.Editor=System+Account&literal.ContentType=Workflow+History}
<http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D792&literal.Created=2014-01-06+14:00:45&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-01-06+14:00:45&literal.Author=System+Account&literal.User=Geraldine+Davis&literal.WorkflowAssociation=%7b7791ab09-0117-4a78-a14e-e5aee9b40d78%7d&literal.lcf_metadata_id=792&literal.Occurred=2014-01-06+14:00:44&literal.Event=1&literal.Editor=System+Account&literal.ContentType=Workflow+History%7d>
{add=[http://testirishwaterportal/kbase/cd/Lists/Workflow
History/DispForm.aspx?ID=792 (1475502859497766912)]} 0 0



http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D561&literal.Created=2014-01-02+17:15:58&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-01-02+17:15:58&literal.Author=System+Account&literal.User=Geraldine+Davis&literal.WorkflowAssociation={7791ab09-0117-4a78-a14e-e5aee9b40d78}&literal.lcf_metadata_id=561&literal.Occurred=2014-01-02+17:15:57&literal.Event=1&literal.Editor=System+Account&literal.ContentType=Workflow+History}
<http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D561&literal.Created=2014-01-02+17:15:58&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-01-02+17:15:58&literal.Author=System+Account&literal.User=Geraldine+Davis&literal.WorkflowAssociation=%7b7791ab09-0117-4a78-a14e-e5aee9b40d78%7d&literal.lcf_metadata_id=561&literal.Occurred=2014-01-02+17:15:57&literal.Event=1&literal.Editor=System+Account&literal.ContentType=Workflow+History%7d>
{add=[http://testirishwaterportal/kbase/cd/Lists/Workflow
History/DispForm.aspx?ID=561 (1475502859497766913)]} 0 0



http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D3270&literal.Created=2014-05-08+17:34:58&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-05-08+17:34:58&literal.Author=System+Account&literal.User=System+Account&literal.WorkflowAssociation={0c553c29-0e6a-4411-b834-8bf1cc02eb2e}&literal.lcf_metadata_id=3270&literal.Occurred=2014-05-08+17:34:57&literal.Event=10&literal.Editor=System+Account&literal.ContentType=Workflow+History}
<http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D3270&literal.Created=2014-05-08+17:34:58&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-05-08+17:34:58&literal.Author=System+Account&literal.User=System+Account&literal.WorkflowAssociation=%7b0c553c29-0e6a-4411-b834-8bf1cc02eb2e%7d&literal.lcf_metadata_id=3270&literal.Occurred=2014-05-08+17:34:57&literal.Event=10&literal.Editor=System+Account&literal.ContentType=Workflow+History%7d>
{add=[http://testirishwaterportal/kbase/cd/Lists/Workflow
History/DispForm.aspx?ID=3270 (1475502859521884160)]} 0 1



http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D613&literal.Created=2014-01-03+11:03:57&literal._UIVersionString=1.0&literal.Outcome=Approved+by+Geraldine+Davis&wt=xml&literal.Group=0&literal.Modified=2014-01-03+11:03:57&literal.Author=System+Account&literal.User=Geraldine+Davis&literal.WorkflowAssociation={919e1558-a8f3-44af-9803-b95a16876236}&literal.lcf_metadata_id=613&literal.Occurred=2014-01-03+11:03:56&literal.Event=6&literal.Editor=System+Account&literal.ContentType=Workflow+History}
<http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D613&literal.Created=2014-01-03+11:03:57&literal._UIVersionString=1.0&literal.Outcome=Approved+by+Geraldine+Davis&wt=xml&literal.Group=0&literal.Modified=2014-01-03+11:03:57&literal.Author=System+Account&literal.User=Geraldine+Davis&literal.WorkflowAssociation=%7b919e1558-a8f3-44af-9803-b95a16876236%7d&literal.lcf_metadata_id=613&literal.Occurred=2014-01-03+11:03:56&literal.Event=6&literal.Editor=System+Account&literal.ContentType=Workflow+History%7d>
{add=[http://testirishwaterportal/kbase/cd/Lists/Workflow
History/DispForm.aspx?ID=613 (1475502859537612800)]} 0 1



http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D940&literal.Created=2014-01-10+12:09:27&literal._UIVersionString=1.0&literal.Outcome=Approval+on+IW-HSQE-SOP-024+has+successfully+completed.+All+participants+have+completed+their+tasks.&wt=xml&literal.Group=0&literal.Modified=2014-01-10+12:09:27&literal.Author=System+Account&literal.User=Julie+Curtin&literal.WorkflowAssociation={1e02d877-6a8d-4d9f-8a3c-7d46b771becb}&literal.lcf_metadata_id=940&literal.Occurred=2014-01-10+12:09:26&literal.Event=2&literal.Editor=System+Account&literal.ContentType=Workflow+History}
<http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D940&literal.Created=2014-01-10+12:09:27&literal._UIVersionString=1.0&literal.Outcome=Approval+on+IW-HSQE-SOP-024+has+successfully+completed.+All+participants+have+completed+their+tasks.&wt=xml&literal.Group=0&literal.Modified=2014-01-10+12:09:27&literal.Author=System+Account&literal.User=Julie+Curtin&literal.WorkflowAssociation=%7b1e02d877-6a8d-4d9f-8a3c-7d46b771becb%7d&literal.lcf_metadata_id=940&literal.Occurred=2014-01-10+12:09:26&literal.Event=2&literal.Editor=System+Account&literal.ContentType=Workflow+History%7d>
{add=[http://testirishwaterportal/kbase/cd/Lists/Workflow
History/DispForm.aspx?ID=940 (1475502859560681472)]} 0 0



http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D3365&literal.Created=2014-05-19+14:19:49&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-05-19+14:19:49&literal.Author=System+Account&literal.User=System+Account&literal.WorkflowAssociation={7791ab09-0117-4a78-a14e-e5aee9b40d78}&literal.lcf_metadata_id=3365&literal.Occurred=2014-05-19+14:19:48&literal.Event=3&literal.Editor=System+Account&literal.ContentType=Workflow+History}
<http://testirishwaterportal/kbase/cd/Lists/Workflow+History/DispForm.aspx?ID%3D3365&literal.Created=2014-05-19+14:19:49&literal._UIVersionString=1.0&literal.Outcome=&wt=xml&literal.Group=0&literal.Modified=2014-05-19+14:19:49&literal.Author=System+Account&literal.User=System+Account&literal.WorkflowAssociation=%7b7791ab09-0117-4a78-a14e-e5aee9b40d78%7d&literal.lcf_metadata_id=3365&literal.Occurred=2014-05-19+14:19:48&literal.Event=3&literal.Editor=System+Account&literal.ContentType=Workflow+History%7d>
{add=[http://testirishwaterportal/kbase/cd/Lists/Workflow
History/DispForm.aspx?ID=3365 (1475502859564875776)]} 0 0



On Mon, Aug 4, 2014 at 6:02 PM, Karl Wright <daddywri@gmail.com> wrote:

> Hi Lalit,
>
> You do not need to try the null output connection; the report indicates
> that your list items are in fact being sent to Solr.  So the next step is
> to look at the Solr log.  There should be an INFO statement in the log for
> every list item, which includes a URL.  The URL should have the metadata
> for the list item in it.  Can you send a couple of those to us please.
>
> Karl
>
>
>
> On Mon, Aug 4, 2014 at 8:22 AM, lalit jangra <lalit.j.jangra@gmail.com>
> wrote:
>
>> Hi Karl,
>>
>> I can see list items getting rendered in simple history report, please
>> see attached screenshot.
>>
>> I have not tried null connection till but planning to do it very soon.
>>
>>
>> On Mon, Aug 4, 2014 at 5:47 PM, Karl Wright <daddywri@gmail.com> wrote:
>>
>>> Hi Lalit,
>>>
>>> From your response it is not clear to me whether you see any list items
>>> appearing in the simple history report.  Nor is it clear whether you see
>>> list items in a simple history report going against a null output
>>> connection.  Can you clarify this please?  If you don't know what a list
>>> item would look like in the history, please just include a screen shot.
>>>
>>> Thanks,
>>> Karl
>>>
>>>
>>>
>>> On Mon, Aug 4, 2014 at 8:13 AM, lalit jangra <lalit.j.jangra@gmail.com>
>>> wrote:
>>>
>>>> Thanks Karl,
>>>>
>>>> I checked simple history to see number of different documents as well
>>>> as content types getting indexed from lists.
>>>>
>>>> Also I have customized SharePointRepository.java to add couple of
>>>> custom metadata such as content url, modifier etc. but i assume these
>>>> metadata will be available for all items including lists. Also i can see
>>>> some items in solr index which do not have all metadata fields attached to
>>>> them. But i am not filtering on basis of extensions &  crawling all items
>>>> needed.
>>>>
>>>> Since start, i could see any content items such as pdf or doc or xls
>>>> attached to any list item getting indexed well but that item itself is
>>>> missing.
>>>>
>>>> Regards.
>>>>
>>>>
>>>>
>>>>
>>>> On Mon, Aug 4, 2014 at 5:07 PM, Karl Wright <daddywri@gmail.com> wrote:
>>>>
>>>>> Hi Lalit,
>>>>>
>>>>> Can you have a look at the Simple History Report?  Do  you see
>>>>> indexing attempts for list items in there?
>>>>>
>>>>> It may be that you've filtered list items out by some other means you
>>>>> aren't expecting, such as extensions.  It might also be worth trying
to
>>>>> index to a null output connection to verify that the filtering hasn't
been
>>>>> done by your output connection configuration.
>>>>>
>>>>> Karl
>>>>>
>>>>>
>>>>>
>>>>> On Mon, Aug 4, 2014 at 7:31 AM, lalit jangra <lalit.j.jangra@gmail.com
>>>>> > wrote:
>>>>>
>>>>>> Thanks Karl,
>>>>>>
>>>>>> I am adding all metadata available in any list item by checking
>>>>>> "Include All Metadata" checkbox with include as selected option.
>>>>>>
>>>>>> I assume this way i should get all metadata & content for a list
item
>>>>>> but i am not getting content for any list item? For some content
i am not
>>>>>> getting metadata such as modified date etc?
>>>>>>
>>>>>> Regards.
>>>>>>
>>>>>>
>>>>>> On Sun, Aug 3, 2014 at 6:34 AM, Karl Wright <daddywri@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Sent from my Windows Phone
>>>>>>> ------------------------------
>>>>>>> From: Wright, Karl
>>>>>>> Sent: 8/2/2014 8:42 PM
>>>>>>> To: Dad
>>>>>>> Subject: FW: What all items MCF crawls in Sharepoint Lists
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Sent from my Windows Phone
>>>>>>>  ------------------------------
>>>>>>> From: Wright, Karl
>>>>>>> Sent: 8/2/2014 1:31 PM
>>>>>>> To: user@manifoldcf.apache.org
>>>>>>> Subject: RE: What all items MCF crawls in Sharepoint Lists
>>>>>>>
>>>>>>>  Hi Lalit,
>>>>>>>
>>>>>>> List items have no content except metadata, so you will need
to
>>>>>>> properly configure your metadata in order for it to be indexed.
>>>>>>>
>>>>>>> Most of what you describe as missing is implemented in SharePoint
as
>>>>>>> a list, so the same issue applies.
>>>>>>>
>>>>>>> Thanks
>>>>>>> Karl
>>>>>>>
>>>>>>> Sent from my Windows Phone
>>>>>>>  ------------------------------
>>>>>>> From: lalit jangra
>>>>>>> Sent: 8/2/2014 11:51 AM
>>>>>>> To: user@manifoldcf.apache.org
>>>>>>> Subject: What all items MCF crawls in Sharepoint Lists
>>>>>>>
>>>>>>>
>>>>>>>   Hello,
>>>>>>>
>>>>>>> I am using MCF 1.5.1 and i have configured a Sharepoint Job to
crawl
>>>>>>> all lists included in a sharepoint site.
>>>>>>>
>>>>>>>
>>>>>>>  I can see that job crawls all attachments such as pdf or doc
>>>>>>> attached to items in lists but i am not able to see the list
items to which
>>>>>>> these documents attached. Also i could not see items such as
FAQs etc.
>>>>>>> which do not have any documents attached to them.
>>>>>>>
>>>>>>>  Is it normal behavior or am i missing anything? By default what
all
>>>>>>> items got indexed in lists?
>>>>>>>
>>>>>>> Also to get all such items such as news items, announcements
without
>>>>>>> attachments, FAQs without any attachments etc., what steps should
i take?
>>>>>>>
>>>>>>>
>>>>>>> Regards,
>>>>>>> Lalit.
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Regards,
>>>>>> Lalit.
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Regards,
>>>> Lalit.
>>>>
>>>
>>>
>>
>>
>> --
>> Regards,
>> Lalit.
>>
>
>


-- 
Regards,
Lalit.

Mime
View raw message