manifoldcf-dev mailing list archives

From Karl Wright
Subject More Tika dependency problems
Date Fri, 20 Mar 2015 08:35:26 GMT
It appears that another of Tika's dependencies is partly LGPL.  From the
Lucene dev list:

Solr's contrib/extraction contains jhighlight-1.0.jar, which declares itself
as dual-licensed under CDDL or LGPL. However, some of its classes are
distributed only under LGPL, e.g.


I downloaded the sources from Maven (
to confirm that, and also found this SVN repo:, though the project's
website seems to not exist anymore (

I didn't find any direct usage of it in our code, so I guess it's probably
needed by a 3rd party dependency, such as Tika. So if we omit it, things
will still compile, but may fail at runtime.

I've created a ticket, TIKA-1581, to make sure that the Tika team looks at
this. (They also looked at netcdf and removed that dependency, based on
Lucene research).

(1) Does anyone know what jHighlight does?  What content types does it
apply to?
(2) Will we have problems if we just remove it?
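
On (2), one rough way to find out is to probe the classpath at runtime once
the jar has been removed. The sketch below is only an illustration: the class
name DependencyProbe is hypothetical, and the default class name it probes
for, com.uwyn.jhighlight.renderer.Renderer, is an assumption about what the
jar contains and should be checked against the actual jar listing.

```java
// Hypothetical helper, not part of ManifoldCF, Solr, or Tika.
class DependencyProbe {

    // Returns true if the named class can be located by this class's loader.
    // The class is not initialized, so no static initializers run.
    static boolean isClassPresent(String className) {
        try {
            Class.forName(className, false, DependencyProbe.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // ClassNotFoundException: the class itself is missing.
            // LinkageError (e.g. NoClassDefFoundError): something it needs is missing.
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed class name; pass another one as the first argument to override.
        String probe = args.length > 0
                ? args[0]
                : "com.uwyn.jhighlight.renderer.Renderer";
        System.out.println(probe + " present: " + isClassPresent(probe));
    }
}
```

A probe like this only shows whether the class is loadable. It does not show
which parser or content type would actually reach for it, so question (1)
still needs an answer from the Tika side.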

