lucene-general mailing list archives

From Jukka Zitting <>
Subject Re: Draft Board Report
Date Thu, 12 Mar 2009 16:02:49 GMT

On Thu, Mar 12, 2009 at 1:25 PM, Grant Ingersoll <> wrote:
> A draft of the board report is available at
> We need Nutch, Lucene.NET, Tika.  If someone from those projects could fill
> them out, I'd appreciate it.

I added the following for Tika based on discussion on tika-dev@.

Apache Tika is a toolkit for detecting and extracting metadata and
structured text content from various documents using existing parser
libraries.

The first candidate for the 0.3 release is already in place and the
release should be pushed out in March.

Metadata handling and metadata frameworks like XMP have been a source
of much discussion, but so far no clear consensus has been reached
on whether or how the metadata features in Tika should be extended.

A wiki was created for Tika.


Jukka Zitting
