lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Goetz <>
Subject Re: Configuration RFC
Date Mon, 15 Jul 2002 04:04:43 GMT
> >Having a framework for dealing with multiple file types (text, HTML,
> >PDF, Word, etc) is critical.  There was a proposal that floated
> >around
> >a few months ago which should be dusted off.
> Indyo, the indexing framework I checked into Sandbox (under the appex 
> project) handles this aspect of it. I need abit more time to get the 
> documentation sorted out, but it'll be real soon now.

I think I submitted a simple framework for plugging in document
converters.  The idea was that converters would digest a document
and produce a list of named fields, and then there was a mapping
to map the document fields to the Lucene field names (which might
be different.)  It was pretty simple.

To unsubscribe, e-mail:   <>
For additional commands, e-mail: <>

View raw message