lucene-java-user mailing list archives

From Mike Klaas
Subject Re: index and access to lines of a CSV file
Date Thu, 13 Dec 2007 23:44:06 GMT
On 13-Dec-07, at 3:26 PM, Tobias Rothe wrote:

> I have a quick question.  I am handling huge CSV files. They start
> with a key in the first column, followed by data.
> I need random access to this data based on the key.  So it is
> a kind of search where I give a unique key and ideally get access to
> the right line.
> The file contains about 200,000 lines or more.  I am not sure if
> Lucene can handle things like that, and I did not really find a hint
> on this topic.  So I hope to find help here.

Is that all you're doing?  You could accomplish the above with Lucene,
but it isn't really needed for that.  You need some kind of on-disk
key->value mapper.  Something like a Berkeley DB hashtable or btree
should work (store each line as a key/value pair, with the first
column as the key).
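
A minimal sketch of the key->value idea, under some assumptions: it
uses Python's standard-library dbm module as a stand-in for Berkeley DB
(not Berkeley DB itself), it assumes the keys in the first column are
unique, and the function names build_index and lookup are made up for
illustration.

```python
import csv
import dbm
import json


def build_index(csv_path, db_path):
    """Copy each CSV line into an on-disk key/value store.

    The first column is the key; the remaining columns are the value.
    """
    with open(csv_path, newline="", encoding="utf-8") as src, \
            dbm.open(db_path, "c") as db:
        for row in csv.reader(src):
            if not row:
                continue  # skip blank lines
            key, data = row[0], row[1:]
            # dbm stores bytes, so the data columns are serialized as JSON.
            # A repeated key silently overwrites the earlier line.
            db[key.encode("utf-8")] = json.dumps(data).encode("utf-8")


def lookup(db_path, key):
    """Return the data columns stored for key, or None if the key is absent."""
    k = key.encode("utf-8")
    with dbm.open(db_path, "r") as db:
        raw = db[k] if k in db else None
    return None if raw is None else json.loads(raw.decode("utf-8"))
```

The index is built once with build_index("data.csv", "data.idx") (both
file names are placeholders); after that, each lookup("data.idx", key)
is a single keyed read from disk rather than a scan of the whole file.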
