forrest-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dmitriy Kargapolov (JIRA)" <j...@apache.org>
Subject [jira] Commented: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml
Date Fri, 19 Sep 2008 00:00:44 GMT

    [ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632462#action_12632462
] 

Dmitriy Kargapolov commented on FOR-703:
----------------------------------------

I got similar issue which was fixed by patching book-to-cinclude-lucene.xsl:

--- book-to-cinclude-lucene.xsl.orig    2007-04-14 22:42:01.000000000 -0400
+++ book-to-cinclude-lucene.xsl 2008-09-18 19:37:51.000000000 -0400
@@ -59,6 +59,10 @@
 <!-- Ignore absolute http urls -->
   <xsl:template match="menu-item[starts-with(@href, 'https:')]"/>
 <!-- Ignore absolute https urls -->
+  <xsl:template match="menu-item[starts-with(@href, 'site:')]"/>
+<!-- Ignore internal site: urls -->
+  <xsl:template match="menu-item[starts-with(@href, 'ext:')]"/>
+<!-- Ignore external ext: urls -->
 <!-- For entries whose @href ends in "/", refer to @href/index.lucene -->
   <xsl:template match="menu-item[substring(@href, string-length(@href) - string-length('/')
+ 1) = '/']">
     <cinclude:include>

There is no much sense to try to index site: references (just duplicating info) or external
links...

Regards.


> Our local search with Lucene gets failures at index time when remote files are declared
in site.xml
> ---------------------------------------------------------------------------------------------------
>
>                 Key: FOR-703
>                 URL: https://issues.apache.org/jira/browse/FOR-703
>             Project: Forrest
>          Issue Type: Bug
>          Components: Core operations, Other
>    Affects Versions: 0.8
>            Reporter: David Crossley
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather
the files to be indexed. If the entries include a link to a remote site, then Lucene build
a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message