lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Goetz <br...@quiotix.com>
Subject Re: Configuration RFC
Date Mon, 15 Jul 2002 04:04:43 GMT
> >Having a framework for dealing with multiple file types (text, HTML,
> >PDF, Word, etc) is critical.  There was a proposal that floated
> >around
> >a few months ago which should be dusted off.
> 
> Indyo, the indexing framework I checked into Sandbox (under the appex 
> project) handles this aspect of it. I need abit more time to get the 
> documentation sorted out, but it'll be real soon now.

I think I submitted a simple framework for plugging in document
converters.  The idea was that converters would digest a document
and produce a list of named fields, and then there was a mapping
to map the document fields to the Lucene field names (which might
be different.)  It was pretty simple.

--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message