You need to maintain a huge number of distinct indexes.

Are we talking about secondary indexes? If so, this sounds like exactly my problem. There is very little documentation, but I think that if I read everything there is on GitHub, I can probably start using it.

Check out Solandra: http://github.com/tjake/Solandra

I need to store roughly 10M-100M documents, each with about 100 fields (author, creation date, access date, etc.), and then I want to ask questions like:

Give me all documents whose author is like abc**, whose creation date is any time in 2010, whose access date is in 2010-2011, and so on, with perhaps 10-20 such conditions, and which also match a list of keywords.

What's best: Lucene, Katta, a Cassandra CF with secondary indices, or a plain scan and compare of every record?
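
To make the question concrete, here is roughly what I mean by the plain scan option, as a minimal Python sketch. The field names (author, created, accessed, keywords), the sample records and the helper names are all made up for illustration, and I am assuming that "matching a list of keywords" means at least one keyword matches. Every condition is checked against every record, so the cost grows linearly with the number of documents.

```python
from datetime import date
from fnmatch import fnmatchcase

# Hypothetical sample records; the real data would have ~100 fields each.
docs = [
    {"id": 1, "author": "abcdef", "created": date(2010, 3, 14),
     "accessed": date(2011, 1, 5), "keywords": {"cassandra", "index"}},
    {"id": 2, "author": "abcxyz", "created": date(2009, 7, 1),
     "accessed": date(2010, 6, 30), "keywords": {"lucene"}},
    {"id": 3, "author": "zed", "created": date(2010, 11, 2),
     "accessed": date(2010, 12, 1), "keywords": {"index"}},
]


def matches(doc, author_pattern, created_range, accessed_range, keywords):
    """Return True only if the document satisfies every condition (AND)."""
    return (
        fnmatchcase(doc["author"], author_pattern)                       # wildcard on author
        and created_range[0] <= doc["created"] <= created_range[1]       # inclusive date range
        and accessed_range[0] <= doc["accessed"] <= accessed_range[1]    # inclusive date range
        and bool(doc["keywords"] & keywords)                             # at least one keyword (assumption)
    )


def scan(docs, **conditions):
    """Brute-force scan: test every record, return the ids that match."""
    return [d["id"] for d in docs if matches(d, **conditions)]


hits = scan(
    docs,
    author_pattern="abc*",
    created_range=(date(2010, 1, 1), date(2010, 12, 31)),
    accessed_range=(date(2010, 1, 1), date(2011, 12, 31)),
    keywords={"index", "lucene"},
)
print(hits)
```

With 10M-100M records and 10-20 conditions per query, this is the baseline that any of the index-based options would have to beat.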

Thanks a bunch!