jackrabbit-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marcel Reutegger (JIRA)" <j...@apache.org>
Subject [jira] Reopened: (JCR-281) textfilters module patch: Support for text extraction for HTML,XML and RTF files
Date Wed, 30 Nov 2005 08:23:31 GMT
     [ http://issues.apache.org/jira/browse/JCR-281?page=all ]
     
Marcel Reutegger reopened JCR-281:
----------------------------------


Hmm, that's too bad.

I'm still a bit confused what kind of libraries we may use for apache projects. Roy, is there
a list of licenses that are compatible with the apache license? Just to make sure we don't
spend too much time in the future for extensions that we cannot include. Thanks.

Would it be ok with you Martin, to remove the HTML filter from the patch? The XML and RTF
filters are still very good contributions that I'd like to include.

> textfilters module patch: Support for text extraction for HTML,XML and RTF files
> --------------------------------------------------------------------------------
>
>          Key: JCR-281
>          URL: http://issues.apache.org/jira/browse/JCR-281
>      Project: Jackrabbit
>         Type: Improvement
>   Components: query
>     Reporter: Martin Perez
>  Attachments: patch.diff
>
> This patch adds text extraction support form XML, RTF and HTML files.
> The unique dependency is htmlparser library for handling HTML text extraction.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


Mime
View raw message