lucene-dev mailing list archives

From Gregor Heinrich <>
Subject Numerical ids for terms?
Date Tue, 12 Apr 2011 09:41:03 GMT
Hi -- has there been any effort to create a numerical representation of Lucene 
indices, that is, to use the Lucene Directory backend as a large term-document 
matrix at index level? As this would require a bijective mapping between terms 
(per field, as customary in Lucene) and a numerical index (an integer increasing 
monotonically from 0 to numTerms()-1), I guess this requires some special 
modifications to the Lucene core.
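
To make the mapping concrete, here is a minimal sketch in plain Java of what such a bijection could look like for the terms of a single field. It does not use any Lucene API: the class name TermIdMap and its methods are hypothetical, and assigning ids by Java's natural String order is an assumption made only for illustration.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

/** Hypothetical helper: a bijection between the terms of one field and the ids 0..numTerms()-1. */
final class TermIdMap {
    private final List<String> idToTerm;          // id -> term, in sorted order
    private final Map<String, Integer> termToId;  // term -> id

    TermIdMap(Collection<String> terms) {
        // Sort and deduplicate so that ids follow the term order
        // (assumption: Java's natural String ordering).
        idToTerm = Collections.unmodifiableList(new ArrayList<>(new TreeSet<>(terms)));
        termToId = new HashMap<>();
        for (int id = 0; id < idToTerm.size(); id++) {
            termToId.put(idToTerm.get(id), id);
        }
    }

    /** Number of distinct terms; valid ids are 0..numTerms()-1. */
    int numTerms() {
        return idToTerm.size();
    }

    /** Returns the id of the term, or -1 if the term is unknown. */
    int idOf(String term) {
        Integer id = termToId.get(term);
        return id == null ? -1 : id;
    }

    /** Returns the term for an id; throws IndexOutOfBoundsException for invalid ids. */
    String termOf(int id) {
        return idToTerm.get(id);
    }
}
```

One such map per field (for example in a Map keyed by field name) would give the per-field numbering described above. Note that in this sketch the ids are only stable while the set of terms is unchanged: adding a term that sorts before existing ones shifts their ids.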

Another interesting feature would be to use Lucene's Directory backend for 
storage of large dense matrices, for instance to data-mining tasks from within 

Any suggestions?

Best regards and thanks

