incubator-jspwiki-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Murray Altheim (JIRA)" <j...@apache.org>
Subject [jira] Commented: (JSPWIKI-427) Keywords for Lucene Index
Date Wed, 12 Nov 2008 18:53:47 GMT

    [ https://issues.apache.org/jira/browse/JSPWIKI-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646997#action_12646997
] 

Murray Altheim commented on JSPWIKI-427:
----------------------------------------

Given that 'keywords' and 'tags' might be considered similar, the approach that we've taken
is to use the TagManager to provide Lucene with information about the declared tags for a
given wiki page. This requires a slight modification to the Lucene indexer since the change
is to the Lucene index. 

Another difference between tags and keywords is that in our implementation of tags they are
by definition a 'controlled vocabulary', specifically each tag must reference a wiki page.
This is so that each tag has a definition, which more restrictive reduces spelling errors
and ambiguity of meaning. 

If this proposal moves forward I would hope we store the keywords in a metadata field labeled
'DC.subject' since that's the appropriate field in Dublin Core for keywords, as used internationally.


> Keywords for Lucene Index
> -------------------------
>
>                 Key: JSPWIKI-427
>                 URL: https://issues.apache.org/jira/browse/JSPWIKI-427
>             Project: JSPWiki
>          Issue Type: New Feature
>          Components: Core & storage
>    Affects Versions: 2.6.4
>         Environment: LuceneSearchProvider
>            Reporter: Kurt Stein
>
> It would be nice feature if you can define keywords for wikipages so that the search
results can be made more accurate. 
> Cause:
> Sometime you have a big page where you explain something that is only mentioned in the
headline. So this page contains very good content but lucene does not rank it high enough
because its relation is very low.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message