manifoldcf-dev mailing list archives

From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-235) item description element not indexed
Date Wed, 03 Aug 2011 22:51:27 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13079097#comment-13079097 ]

Karl Wright commented on CONNECTORS-235:
----------------------------------------

Hmm, I'm using the very same feed you are, with PostgreSQL, and seeing perfect results.
Can you attach a screenshot of the View Job page for the job in question, along with the
View Connection pages for both the output connection and the repository connection?


> item description element not indexed
> ------------------------------------
>
>                 Key: CONNECTORS-235
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-235
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: RSS connector
>    Affects Versions: ManifoldCF 0.2
>            Reporter: Kate McGonigal
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 0.3
>
>
> The RSS feed's *item* description is not written to any field in the Solr index. 
> I have a typical RSS feed with the general structure:
> <rss>
>     <channel>
>         <title></title>
>         <link></link>
>         <description></description>
>         <item>
>             <title></title>
>             <link></link>
>             <pubDate></pubDate>
>             <description> *** the description I do want *** </description>
>             <author></author>
>             <category></category>
>         </item>
>     </channel>
> </rss>
> Example:
> For the RSS feed: http://www.onemansjazz.ca/component/option,com_rss/feed,RSS2.0/no_html,1/
> the rss/channel/item/description field is not indexed into Solr.
> Example notes:
>   - what does get written to the Solr "description" field is the description metadata
>     from the website, i.e. "Jazz radio show from Winnipeg on CKUW 95.9 FM, hosted by
>     Maurice Hogue." in this case.
>   - on the "Dechromed Content" tab of the job, "No dechromed content" is selected. I'm
>     not sure if that is relevant.
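
For reference, the element the report asks for lives at rss/channel/item/description, which is a different path from the channel-level rss/channel/description. The following is a minimal Python sketch of that distinction, using only the standard library. The sample feed and the function names are invented for illustration; this is not how the ManifoldCF RSS connector is implemented, and it says nothing about where the Solr "description" value in the report actually came from.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed with the same general structure as in the report.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example channel</title>
    <link>http://example.com/</link>
    <description>Channel-level description</description>
    <item>
      <title>First item</title>
      <link>http://example.com/1</link>
      <description>Item-level description</description>
    </item>
  </channel>
</rss>"""


def channel_description(feed_xml):
    """Return the text of rss/channel/description (empty string if absent)."""
    root = ET.fromstring(feed_xml)  # root element is <rss>
    return root.findtext("./channel/description", default="")


def item_descriptions(feed_xml):
    """Return the text of rss/channel/item/description for every item."""
    root = ET.fromstring(feed_xml)
    return [
        item.findtext("description", default="")
        for item in root.findall("./channel/item")
    ]


if __name__ == "__main__":
    print(channel_description(SAMPLE_FEED))
    print(item_descriptions(SAMPLE_FEED))
```

Running a check like this against the real feed is one way to confirm that the item descriptions are present in the source before looking at the job and connection configuration.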

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
