lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki>
Subject Re: indexing relational table(s)
Date Wed, 11 May 2005 16:50:52 GMT
Dick Hollenbeck wrote:
> As sources of indexable text we always see HTML, XML, PDF, etc. but I 
> have not seen much mention of relational tables as a source.  Anybody 
> know why?

I think no specific reason - Lucene is able to index just pure text, 
anything else must go through format converters first. So, the "plain 
text from a db" case is relatively simple...

> We have a database with 60,000 records in 6 tables and aproximately 15 
> *text* fields per table.  Can we use lucene to index this with JDBC 
> being used in an index gathering for loop?


> Is there a better way?

This is the simplest way for creating an infrequently-updated index. If 
you need to do freqeuent updates, then depending on the size of your 
index you may want to play some tricks with RAMDirectory, temporary 
indexes or somesuch, but for the scenario that you described so far it 
is not needed.

Best regards,
Andrzej Bialecki
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration  Contact: info at sigram dot com

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message